Picture for Trevor Darrell

Trevor Darrell

Attend Before Attention: Efficient and Scalable Video Understanding via Autoregressive Gazing

Add code
Mar 12, 2026
Viaarxiv icon

LoGeR: Long-Context Geometric Reconstruction with Hybrid Memory

Add code
Mar 03, 2026
Viaarxiv icon

EgoScale: Scaling Dexterous Manipulation with Diverse Egocentric Human Data

Add code
Feb 18, 2026
Viaarxiv icon

Learning a Generative Meta-Model of LLM Activations

Add code
Feb 06, 2026
Viaarxiv icon

Vision-as-Inverse-Graphics Agent via Interleaved Multimodal Reasoning

Add code
Jan 16, 2026
Viaarxiv icon

It's Never Too Late: Noise Optimization for Collapse Recovery in Trained Diffusion Models

Add code
Dec 31, 2025
Viaarxiv icon

Latent Implicit Visual Reasoning

Add code
Dec 24, 2025
Viaarxiv icon

DAVE: A VLM Vision Encoder for Document Understanding and Web Agents

Add code
Dec 19, 2025
Viaarxiv icon

Visually Prompted Benchmarks Are Surprisingly Fragile

Add code
Dec 19, 2025
Figure 1 for Visually Prompted Benchmarks Are Surprisingly Fragile
Figure 2 for Visually Prompted Benchmarks Are Surprisingly Fragile
Figure 3 for Visually Prompted Benchmarks Are Surprisingly Fragile
Figure 4 for Visually Prompted Benchmarks Are Surprisingly Fragile
Viaarxiv icon

FoundationMotion: Auto-Labeling and Reasoning about Spatial Movement in Videos

Add code
Dec 11, 2025
Figure 1 for FoundationMotion: Auto-Labeling and Reasoning about Spatial Movement in Videos
Figure 2 for FoundationMotion: Auto-Labeling and Reasoning about Spatial Movement in Videos
Figure 3 for FoundationMotion: Auto-Labeling and Reasoning about Spatial Movement in Videos
Figure 4 for FoundationMotion: Auto-Labeling and Reasoning about Spatial Movement in Videos
Viaarxiv icon