Lanqing Hong

Can Atomic Step Decomposition Enhance the Self-structured Reasoning of Multimodal Large Models?

Mar 08, 2025
Via arXiv

Taming Video Diffusion Prior with Scene-Grounding Guidance for 3D Gaussian Splatting from Sparse Inputs

Mar 07, 2025
Via arXiv

Empowering Sparse-Input Neural Radiance Fields with Dual-Level Semantic Guidance from Dense Novel Views

Mar 04, 2025
Via arXiv

Corrupted but Not Broken: Rethinking the Impact of Corrupted Data in Visual Instruction Tuning

Feb 18, 2025
Via arXiv

Certifying Language Model Robustness with Fuzzed Randomized Smoothing: An Efficient Defense Against Backdoor Attacks

Feb 09, 2025
Via arXiv

Effective Black-Box Multi-Faceted Attacks Breach Vision Large Language Model Guardrails

Feb 09, 2025
Via arXiv

Dual Risk Minimization: Towards Next-Level Robustness in Fine-tuning Zero-Shot Models

Nov 29, 2024
Via arXiv

MagicDriveDiT: High-Resolution Long Video Generation for Autonomous Driving with Adaptive Control

Nov 21, 2024
Via arXiv

AtomThink: A Slow Thinking Framework for Multimodal Mathematical Reasoning

Nov 18, 2024
Via arXiv

LLMs Can Evolve Continually on Modality for X-Modal Reasoning

Oct 26, 2024
Via arXiv