Zhengzhong Tu

Ben

NuScenes-SpatialQA: A Spatial Understanding and Reasoning Benchmark for Vision-Language Models in Autonomous Driving

Apr 07, 2025
Via arXiv

UniOcc: A Unified Benchmark for Occupancy Forecasting and Prediction in Autonomous Driving

Mar 31, 2025
Via arXiv

Can Large Vision Language Models Read Maps Like a Human?

Mar 18, 2025
Via arXiv

PANDORA: Diffusion Policy Learning for Dexterous Robotic Piano Playing

Mar 17, 2025
Via arXiv

DecAlign: Hierarchical Cross-Modal Alignment for Decoupled Multimodal Representation Learning

Mar 14, 2025
Via arXiv

Generative AI in Transportation Planning: A Survey

Mar 10, 2025
Via arXiv

Secure On-Device Video OOD Detection Without Backpropagation

Mar 08, 2025
Via arXiv

V2X-LLM: Enhancing V2X Integration and Understanding in Connected Vehicle Corridors

Mar 04, 2025
Via arXiv

Complex LLM Planning via Automated Heuristics Discovery

Feb 26, 2025
Via arXiv

Re-Align: Aligning Vision Language Models via Retrieval-Augmented Direct Preference Optimization

Feb 18, 2025
Via arXiv