Trevor Darrell

Attend Before Attention: Efficient and Scalable Video Understanding via Autoregressive Gazing

Mar 12, 2026
Via arXiv

LoGeR: Long-Context Geometric Reconstruction with Hybrid Memory

Mar 03, 2026
Via arXiv

EgoScale: Scaling Dexterous Manipulation with Diverse Egocentric Human Data

Feb 18, 2026
Via arXiv

Learning a Generative Meta-Model of LLM Activations

Feb 06, 2026
Via arXiv

Vision-as-Inverse-Graphics Agent via Interleaved Multimodal Reasoning

Jan 16, 2026
Via arXiv

It's Never Too Late: Noise Optimization for Collapse Recovery in Trained Diffusion Models

Dec 31, 2025
Via arXiv

Latent Implicit Visual Reasoning

Dec 24, 2025
Via arXiv

Visually Prompted Benchmarks Are Surprisingly Fragile

Dec 19, 2025
Via arXiv

DAVE: A VLM Vision Encoder for Document Understanding and Web Agents

Dec 19, 2025
Via arXiv

FoundationMotion: Auto-Labeling and Reasoning about Spatial Movement in Videos

Dec 11, 2025
Via arXiv