Picture for Siyuan Wang

Siyuan Wang

Fudan University

Activating Distributed Visual Region within LLMs for Efficient and Effective Vision-Language Training and Inference

Add code
Dec 17, 2024
Figure 1 for Activating Distributed Visual Region within LLMs for Efficient and Effective Vision-Language Training and Inference
Figure 2 for Activating Distributed Visual Region within LLMs for Efficient and Effective Vision-Language Training and Inference
Figure 3 for Activating Distributed Visual Region within LLMs for Efficient and Effective Vision-Language Training and Inference
Figure 4 for Activating Distributed Visual Region within LLMs for Efficient and Effective Vision-Language Training and Inference
Viaarxiv icon

Medical Multimodal Foundation Models in Clinical Diagnosis and Treatment: Applications, Challenges, and Future Directions

Add code
Dec 03, 2024
Viaarxiv icon

Explanation for Trajectory Planning using Multi-modal Large Language Model for Autonomous Driving

Add code
Nov 15, 2024
Viaarxiv icon

Ubiquitous Field Transportation Robots with Robust Wheel-Leg Transformable Modules

Add code
Oct 24, 2024
Figure 1 for Ubiquitous Field Transportation Robots with Robust Wheel-Leg Transformable Modules
Figure 2 for Ubiquitous Field Transportation Robots with Robust Wheel-Leg Transformable Modules
Figure 3 for Ubiquitous Field Transportation Robots with Robust Wheel-Leg Transformable Modules
Figure 4 for Ubiquitous Field Transportation Robots with Robust Wheel-Leg Transformable Modules
Viaarxiv icon

Symbolic Working Memory Enhances Language Models for Complex Rule Application

Add code
Aug 24, 2024
Viaarxiv icon

Identity-Driven Hierarchical Role-Playing Agents

Add code
Jul 28, 2024
Viaarxiv icon

Synergistic Multi-Agent Framework with Trajectory Learning for Knowledge-Intensive Tasks

Add code
Jul 13, 2024
Viaarxiv icon

HAF-RM: A Hybrid Alignment Framework for Reward Model Training

Add code
Jul 04, 2024
Viaarxiv icon

From LLMs to MLLMs: Exploring the Landscape of Multimodal Jailbreaking

Add code
Jun 21, 2024
Viaarxiv icon

ALaRM: Align Language Models via Hierarchical Rewards Modeling

Add code
Mar 16, 2024
Viaarxiv icon