Zhengzhong Tu

Ben

Can Large Vision Language Models Read Maps Like a Human?

Mar 18, 2025

PANDORA: Diffusion Policy Learning for Dexterous Robotic Piano Playing

Mar 17, 2025

DecAlign: Hierarchical Cross-Modal Alignment for Decoupled Multimodal Representation Learning

Mar 14, 2025

Generative AI in Transportation Planning: A Survey

Mar 10, 2025

Secure On-Device Video OOD Detection Without Backpropagation

Mar 08, 2025

V2X-LLM: Enhancing V2X Integration and Understanding in Connected Vehicle Corridors

Mar 04, 2025

Complex LLM Planning via Automated Heuristics Discovery

Feb 26, 2025

Re-Align: Aligning Vision Language Models via Retrieval-Augmented Direct Preference Optimization

Feb 18, 2025

HFMF: Hierarchical Fusion Meets Multi-Stream Models for Deepfake Detection

Jan 10, 2025

AutoTrust: Benchmarking Trustworthiness in Large Vision Language Models for Autonomous Driving

Dec 19, 2024