Picture for Ranjay Krishna

Ranjay Krishna

VideoNet: A Large-Scale Dataset for Domain-Specific Action Recognition

Add code
May 05, 2026
Viaarxiv icon

MolmoAct2: Action Reasoning Models for Real-world Deployment

Add code
May 04, 2026
Viaarxiv icon

You Only Judge Once: Multi-response Reward Modeling in a Single Forward Pass

Add code
Apr 13, 2026
Viaarxiv icon

MolmoWeb: Open Visual Web Agent and Open Data for the Open Web

Add code
Apr 09, 2026
Viaarxiv icon

WildDet3D: Scaling Promptable 3D Detection in the Wild

Add code
Apr 09, 2026
Viaarxiv icon

MolmoPoint: Better Pointing for VLMs with Grounding Tokens

Add code
Mar 30, 2026
Viaarxiv icon

PerceptionComp: A Video Benchmark for Complex Perception-Centric Reasoning

Add code
Mar 27, 2026
Viaarxiv icon

VFIG: Vectorizing Complex Figures in SVG with Vision-Language Models

Add code
Mar 25, 2026
Viaarxiv icon

Counting Circuits: Mechanistic Interpretability of Visual Reasoning in Large Vision-Language Models

Add code
Mar 19, 2026
Viaarxiv icon

Unified Spatio-Temporal Token Scoring for Efficient Video VLMs

Add code
Mar 18, 2026
Viaarxiv icon