Ranjay Krishna

Counting Circuits: Mechanistic Interpretability of Visual Reasoning in Large Vision-Language Models

Mar 19, 2026

Unified Spatio-Temporal Token Scoring for Efficient Video VLMs

Mar 18, 2026

MolmoB0T: Large-Scale Simulation Enables Zero-Shot Manipulation

Mar 17, 2026

vla-eval: A Unified Evaluation Harness for Vision-Language-Action Models

Mar 14, 2026

Video-Based Reward Modeling for Computer-Use Agents

Mar 10, 2026

Scale Can't Overcome Pragmatics: The Impact of Reporting Bias on Vision-Language Reasoning

Feb 26, 2026

TrajTok: Learning Trajectory Tokens enables better Video Understanding

Feb 26, 2026

TOPReward: Token Probabilities as Hidden Zero-Shot Rewards for Robotics

Feb 22, 2026

MolmoSpaces: A Large-Scale Open Ecosystem for Robot Navigation and Manipulation

Feb 11, 2026

Recurrent-Depth VLA: Implicit Test-Time Compute Scaling of Vision-Language-Action Models via Latent Iterative Reasoning

Feb 08, 2026