Picture for Xun Huang

Xun Huang

AW-MoE: All-Weather Mixture of Experts for Robust Multi-Modal 3D Object Detection

Add code
Mar 17, 2026
Viaarxiv icon

Memory-Guided View Refinement for Dynamic Human-in-the-loop EQA

Add code
Mar 10, 2026
Viaarxiv icon

Obscure but Effective: Classical Chinese Jailbreak Prompt Optimization via Bio-Inspired Search

Add code
Feb 26, 2026
Viaarxiv icon

MonarchRT: Efficient Attention for Real-Time Video Generation

Add code
Feb 12, 2026
Viaarxiv icon

Causality in Video Diffusers is Separable from Denoising

Add code
Feb 10, 2026
Viaarxiv icon

InternAgent-1.5: A Unified Agentic Framework for Long-Horizon Autonomous Scientific Discovery

Add code
Feb 09, 2026
Viaarxiv icon

V2VLoc: Robust GNSS-Free Collaborative Perception via LiDAR Localization

Add code
Nov 18, 2025
Figure 1 for V2VLoc: Robust GNSS-Free Collaborative Perception via LiDAR Localization
Figure 2 for V2VLoc: Robust GNSS-Free Collaborative Perception via LiDAR Localization
Figure 3 for V2VLoc: Robust GNSS-Free Collaborative Perception via LiDAR Localization
Figure 4 for V2VLoc: Robust GNSS-Free Collaborative Perception via LiDAR Localization
Viaarxiv icon

MSGNav: Unleashing the Power of Multi-modal 3D Scene Graph for Zero-Shot Embodied Navigation

Add code
Nov 14, 2025
Figure 1 for MSGNav: Unleashing the Power of Multi-modal 3D Scene Graph for Zero-Shot Embodied Navigation
Figure 2 for MSGNav: Unleashing the Power of Multi-modal 3D Scene Graph for Zero-Shot Embodied Navigation
Figure 3 for MSGNav: Unleashing the Power of Multi-modal 3D Scene Graph for Zero-Shot Embodied Navigation
Figure 4 for MSGNav: Unleashing the Power of Multi-modal 3D Scene Graph for Zero-Shot Embodied Navigation
Viaarxiv icon

Learning an Image Editing Model without Image Editing Pairs

Add code
Oct 16, 2025
Figure 1 for Learning an Image Editing Model without Image Editing Pairs
Figure 2 for Learning an Image Editing Model without Image Editing Pairs
Figure 3 for Learning an Image Editing Model without Image Editing Pairs
Figure 4 for Learning an Image Editing Model without Image Editing Pairs
Viaarxiv icon

Self Forcing: Bridging the Train-Test Gap in Autoregressive Video Diffusion

Add code
Jun 09, 2025
Figure 1 for Self Forcing: Bridging the Train-Test Gap in Autoregressive Video Diffusion
Figure 2 for Self Forcing: Bridging the Train-Test Gap in Autoregressive Video Diffusion
Figure 3 for Self Forcing: Bridging the Train-Test Gap in Autoregressive Video Diffusion
Figure 4 for Self Forcing: Bridging the Train-Test Gap in Autoregressive Video Diffusion
Viaarxiv icon