Dinesh Manocha

University of Maryland

VEGA: Learning Navigation VLAs from In-the-Wild Egocentric Video with Geometric Trajectory Supervision

Jun 16, 2026
Via arXiv

A Closer Look at Failure Modes in Temporal Understanding of Large Audio-Language Models

Jun 16, 2026
Via arXiv

VISA: VLM-Guided Instance Semantic Auditing for 3D Occupancy World Models

Jun 11, 2026
Via arXiv

Act on What You See: Unlocking Safe Social Navigation in Vision-Language-Action Models

Jun 09, 2026
Via arXiv

FIGMA: Towards FIne-Grained Music retrievAl

Jun 04, 2026
Via arXiv

Video2LoRA: Parametric Video Internalization for Vision-Language Models

Jun 03, 2026
Via arXiv

Minimizing the Hidden Cost of Scales: Graph-Guided Ultra-Low-Bit Quantization for Large Language Models

Jun 03, 2026
Via arXiv

VLM-Based Advanced Rider Assistance System for Motorcycle Safety

May 27, 2026
Via arXiv

Uncovering the Representation Geometry of Minimal Cores in Overcomplete Reasoning Traces

May 14, 2026
Via arXiv

PanoPlane: Plane-Aware Panoramic Completion for Sparse-View Indoor 3D Gaussian Splatting

May 13, 2026
Via arXiv