Picture for Xinhu Zheng

Xinhu Zheng

Fast-dVLA: Accelerating Discrete Diffusion VLA to Real-Time Performance

Add code
Mar 27, 2026
Viaarxiv icon

Synergistic Perception and Generative Recomposition: A Multi-Agent Orchestration for Expert-Level Building Inspection

Add code
Mar 20, 2026
Viaarxiv icon

Can Large Multimodal Models Inspect Buildings? A Hierarchical Benchmark for Structural Pathology Reasoning

Add code
Mar 20, 2026
Viaarxiv icon

P$^{3}$Nav: End-to-End Perception, Prediction and Planning for Vision-and-Language Navigation

Add code
Mar 18, 2026
Viaarxiv icon

RL-ScanIQA: Reinforcement-Learned Scanpaths for Blind 360°Image Quality Assessment

Add code
Mar 15, 2026
Viaarxiv icon

CoIn3D: Revisiting Configuration-Invariant Multi-Camera 3D Object Detection

Add code
Mar 05, 2026
Viaarxiv icon

Object-Scene-Camera Decomposition and Recomposition for Data-Efficient Monocular 3D Object Detection

Add code
Feb 24, 2026
Viaarxiv icon

Coordinated Pandemic Control with Large Language Model Agents as Policymaking Assistants

Add code
Jan 14, 2026
Viaarxiv icon

Relaying Signal When Monitoring Traffic: Double Use of Aerial Vehicles Towards Intelligent Low-Altitude Networking

Add code
Dec 16, 2025
Viaarxiv icon

An Intention-driven Lane Change Framework Considering Heterogeneous Dynamic Cooperation in Mixed-traffic Environment

Add code
Sep 26, 2025
Figure 1 for An Intention-driven Lane Change Framework Considering Heterogeneous Dynamic Cooperation in Mixed-traffic Environment
Figure 2 for An Intention-driven Lane Change Framework Considering Heterogeneous Dynamic Cooperation in Mixed-traffic Environment
Figure 3 for An Intention-driven Lane Change Framework Considering Heterogeneous Dynamic Cooperation in Mixed-traffic Environment
Figure 4 for An Intention-driven Lane Change Framework Considering Heterogeneous Dynamic Cooperation in Mixed-traffic Environment
Viaarxiv icon