Yuxiao Chen

Accelerating Structured Chain-of-Thought in Autonomous Vehicles

Feb 02, 2026
Via arXiv

Text-Guided Layer Fusion Mitigates Hallucination in Multimodal LLMs

Jan 06, 2026
Via arXiv

Counterfactual VLA: Self-Reflective Vision-Language-Action Model with Adaptive Reasoning

Dec 30, 2025
Via arXiv

Towards Efficient and Effective Multi-Camera Encoding for End-to-End Driving

Dec 12, 2025
Via arXiv

Latent Chain-of-Thought World Modeling for End-to-End Driving

Dec 11, 2025
Via arXiv

RealDrive: Retrieval-Augmented Driving with Diffusion Models

May 30, 2025
Via arXiv

Deployable and Generalizable Motion Prediction: Taxonomy, Open Challenges and Future Directions

May 14, 2025
Via arXiv

LED: LLM Enhanced Open-Vocabulary Object Detection without Human Curated Data Generation

Mar 18, 2025
Via arXiv

STRIDE: Automating Reward Design, Deep Reinforcement Learning Training and Feedback Optimization in Humanoid Robotics Locomotion

Feb 10, 2025
Via arXiv

The Hidden Life of Tokens: Reducing Hallucination of Large Vision-Language Models via Visual Information Steering

Feb 05, 2025
Via arXiv