Picture for Lanqing Hong

Lanqing Hong

Dual Risk Minimization: Towards Next-Level Robustness in Fine-tuning Zero-Shot Models

Add code
Nov 29, 2024
Viaarxiv icon

MagicDriveDiT: High-Resolution Long Video Generation for Autonomous Driving with Adaptive Control

Add code
Nov 21, 2024
Viaarxiv icon

AtomThink: A Slow Thinking Framework for Multimodal Mathematical Reasoning

Add code
Nov 18, 2024
Viaarxiv icon

LLMs Can Evolve Continually on Modality for X-Modal Reasoning

Add code
Oct 26, 2024
Viaarxiv icon

EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions

Add code
Sep 26, 2024
Figure 1 for EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions
Figure 2 for EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions
Figure 3 for EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions
Figure 4 for EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions
Viaarxiv icon

CoCA: Regaining Safety-awareness of Multimodal Large Language Models with Constitutional Calibration

Add code
Sep 17, 2024
Figure 1 for CoCA: Regaining Safety-awareness of Multimodal Large Language Models with Constitutional Calibration
Figure 2 for CoCA: Regaining Safety-awareness of Multimodal Large Language Models with Constitutional Calibration
Figure 3 for CoCA: Regaining Safety-awareness of Multimodal Large Language Models with Constitutional Calibration
Figure 4 for CoCA: Regaining Safety-awareness of Multimodal Large Language Models with Constitutional Calibration
Viaarxiv icon

CoSafe: Evaluating Large Language Model Safety in Multi-Turn Dialogue Coreference

Add code
Jun 25, 2024
Viaarxiv icon

MagicDrive3D: Controllable 3D Generation for Any-View Rendering in Street Scenes

Add code
May 23, 2024
Figure 1 for MagicDrive3D: Controllable 3D Generation for Any-View Rendering in Street Scenes
Figure 2 for MagicDrive3D: Controllable 3D Generation for Any-View Rendering in Street Scenes
Figure 3 for MagicDrive3D: Controllable 3D Generation for Any-View Rendering in Street Scenes
Figure 4 for MagicDrive3D: Controllable 3D Generation for Any-View Rendering in Street Scenes
Viaarxiv icon

Mixture of insighTful Experts : The Synergy of Thought Chains and Expert Mixtures in Self-Alignment

Add code
May 01, 2024
Figure 1 for Mixture of insighTful Experts : The Synergy of Thought Chains and Expert Mixtures in Self-Alignment
Figure 2 for Mixture of insighTful Experts : The Synergy of Thought Chains and Expert Mixtures in Self-Alignment
Figure 3 for Mixture of insighTful Experts : The Synergy of Thought Chains and Expert Mixtures in Self-Alignment
Figure 4 for Mixture of insighTful Experts : The Synergy of Thought Chains and Expert Mixtures in Self-Alignment
Viaarxiv icon

Automated Evaluation of Large Vision-Language Models on Self-driving Corner Cases

Add code
Apr 16, 2024
Figure 1 for Automated Evaluation of Large Vision-Language Models on Self-driving Corner Cases
Figure 2 for Automated Evaluation of Large Vision-Language Models on Self-driving Corner Cases
Figure 3 for Automated Evaluation of Large Vision-Language Models on Self-driving Corner Cases
Figure 4 for Automated Evaluation of Large Vision-Language Models on Self-driving Corner Cases
Viaarxiv icon