Picture for Mohit Bansal

Mohit Bansal

Shammie

4D-LRM: Large Space-Time Reconstruction Model From and To Any View at Any Time

Add code
Jun 23, 2025
Viaarxiv icon

Context-Informed Grounding Supervision

Add code
Jun 18, 2025
Viaarxiv icon

GenerationPrograms: Fine-grained Attribution with Executable Programs

Add code
Jun 17, 2025
Viaarxiv icon

Movie Facts and Fibs (MF$^2$): A Benchmark for Long Movie Understanding

Add code
Jun 06, 2025
Viaarxiv icon

CLaMR: Contextualized Late-Interaction for Multimodal Content Retrieval

Add code
Jun 06, 2025
Viaarxiv icon

CLATTER: Comprehensive Entailment Reasoning for Hallucination Detection

Add code
Jun 05, 2025
Viaarxiv icon

OpenThoughts: Data Recipes for Reasoning Models

Add code
Jun 05, 2025
Viaarxiv icon

SiLVR: A Simple Language-based Video Reasoning Framework

Add code
May 30, 2025
Viaarxiv icon

EPiC: Efficient Video Camera Control Learning with Precise Anchor-Video Guidance

Add code
May 28, 2025
Viaarxiv icon

Unlearning Sensitive Information in Multimodal LLMs: Benchmark and Attack-Defense Evaluation

Add code
May 01, 2025
Viaarxiv icon