Xin Huang

Hefei National Laboratory for Physical Sciences at Microscale and Department of Modern Physics, University of Science and Technology of China, Hefei, China, Shanghai Branch, CAS Center for Excellence in Quantum Information and Quantum Physics, University of Science and Technology of China, Shanghai, China, Shanghai Research Center for Quantum Sciences, Shanghai, China

Investigating and Scaling up Code-Switching for Multilingual Language Model Pre-Training

Apr 02, 2025

M-DocSum: Do LVLMs Genuinely Comprehend Interleaved Image-Text in Document Summarization?

Mar 27, 2025

R-PRM: Reasoning-Driven Process Reward Modeling

Mar 27, 2025

Step-Audio: Unified Understanding and Generation in Intelligent Speech Interaction

Feb 18, 2025

Enhancing Web Service Anomaly Detection via Fine-grained Multi-modal Association and Frequency Domain Analysis

Jan 28, 2025

AdaCoT: Rethinking Cross-Lingual Factual Reasoning through Adaptive Chain-of-Thought

Jan 27, 2025

Causal Composition Diffusion Model for Closed-loop Traffic Generation

Dec 23, 2024

DriveGPT: Scaling Autoregressive Behavior Models for Driving

Dec 19, 2024

VLM-AD: End-to-End Autonomous Driving through Vision-Language Model Supervision

Dec 19, 2024

Bright-NeRF: Brightening Neural Radiance Field with Color Restoration from Low-light Raw Images

Dec 19, 2024