Picture for Yurong You

Yurong You

Cornell University

Accelerating Structured Chain-of-Thought in Autonomous Vehicles

Add code
Feb 02, 2026
Viaarxiv icon

Counterfactual VLA: Self-Reflective Vision-Language-Action Model with Adaptive Reasoning

Add code
Dec 30, 2025
Viaarxiv icon

Towards Efficient and Effective Multi-Camera Encoding for End-to-End Driving

Add code
Dec 12, 2025
Viaarxiv icon

Latent Chain-of-Thought World Modeling for End-to-End Driving

Add code
Dec 11, 2025
Figure 1 for Latent Chain-of-Thought World Modeling for End-to-End Driving
Figure 2 for Latent Chain-of-Thought World Modeling for End-to-End Driving
Figure 3 for Latent Chain-of-Thought World Modeling for End-to-End Driving
Figure 4 for Latent Chain-of-Thought World Modeling for End-to-End Driving
Viaarxiv icon

Efficient Multi-Camera Tokenization with Triplanes for End-to-End Driving

Add code
Jun 13, 2025
Figure 1 for Efficient Multi-Camera Tokenization with Triplanes for End-to-End Driving
Figure 2 for Efficient Multi-Camera Tokenization with Triplanes for End-to-End Driving
Figure 3 for Efficient Multi-Camera Tokenization with Triplanes for End-to-End Driving
Figure 4 for Efficient Multi-Camera Tokenization with Triplanes for End-to-End Driving
Viaarxiv icon

DreamDrive: Generative 4D Scene Modeling from Street View Images

Add code
Jan 03, 2025
Figure 1 for DreamDrive: Generative 4D Scene Modeling from Street View Images
Figure 2 for DreamDrive: Generative 4D Scene Modeling from Street View Images
Figure 3 for DreamDrive: Generative 4D Scene Modeling from Street View Images
Figure 4 for DreamDrive: Generative 4D Scene Modeling from Street View Images
Viaarxiv icon

STORM: Spatio-Temporal Reconstruction Model for Large-Scale Outdoor Scenes

Add code
Dec 31, 2024
Figure 1 for STORM: Spatio-Temporal Reconstruction Model for Large-Scale Outdoor Scenes
Figure 2 for STORM: Spatio-Temporal Reconstruction Model for Large-Scale Outdoor Scenes
Figure 3 for STORM: Spatio-Temporal Reconstruction Model for Large-Scale Outdoor Scenes
Figure 4 for STORM: Spatio-Temporal Reconstruction Model for Large-Scale Outdoor Scenes
Viaarxiv icon

Extrapolated Urban View Synthesis Benchmark

Add code
Dec 10, 2024
Figure 1 for Extrapolated Urban View Synthesis Benchmark
Figure 2 for Extrapolated Urban View Synthesis Benchmark
Figure 3 for Extrapolated Urban View Synthesis Benchmark
Figure 4 for Extrapolated Urban View Synthesis Benchmark
Viaarxiv icon

DiffuBox: Refining 3D Object Detection with Point Diffusion

Add code
May 25, 2024
Figure 1 for DiffuBox: Refining 3D Object Detection with Point Diffusion
Figure 2 for DiffuBox: Refining 3D Object Detection with Point Diffusion
Figure 3 for DiffuBox: Refining 3D Object Detection with Point Diffusion
Figure 4 for DiffuBox: Refining 3D Object Detection with Point Diffusion
Viaarxiv icon

Language-Image Models with 3D Understanding

Add code
May 06, 2024
Viaarxiv icon