Picture for Trevor Darrell

Trevor Darrell

Risks and Opportunities of Open-Source Generative AI

Add code
May 14, 2024
Viaarxiv icon

Pose Priors from Language Models

Add code
May 06, 2024
Figure 1 for Pose Priors from Language Models
Figure 2 for Pose Priors from Language Models
Figure 3 for Pose Priors from Language Models
Figure 4 for Pose Priors from Language Models
Viaarxiv icon

Near to Mid-term Risks and Opportunities of Open Source Generative AI

Add code
Apr 25, 2024
Viaarxiv icon

EgoPet: Egomotion and Interaction Data from an Animal's Perspective

Add code
Apr 15, 2024
Viaarxiv icon

Finding Visual Task Vectors

Add code
Apr 08, 2024
Viaarxiv icon

ALOHa: A New Measure for Hallucination in Captioning Models

Add code
Apr 03, 2024
Viaarxiv icon

TraveLER: A Multi-LMM Agent Framework for Video Question-Answering

Add code
Apr 01, 2024
Figure 1 for TraveLER: A Multi-LMM Agent Framework for Video Question-Answering
Figure 2 for TraveLER: A Multi-LMM Agent Framework for Video Question-Answering
Figure 3 for TraveLER: A Multi-LMM Agent Framework for Video Question-Answering
Figure 4 for TraveLER: A Multi-LMM Agent Framework for Video Question-Answering
Viaarxiv icon

When Do We Not Need Larger Vision Models?

Add code
Mar 19, 2024
Figure 1 for When Do We Not Need Larger Vision Models?
Figure 2 for When Do We Not Need Larger Vision Models?
Figure 3 for When Do We Not Need Larger Vision Models?
Figure 4 for When Do We Not Need Larger Vision Models?
Viaarxiv icon

xT: Nested Tokenization for Larger Context in Large Images

Add code
Mar 04, 2024
Figure 1 for xT: Nested Tokenization for Larger Context in Large Images
Figure 2 for xT: Nested Tokenization for Larger Context in Large Images
Figure 3 for xT: Nested Tokenization for Larger Context in Large Images
Figure 4 for xT: Nested Tokenization for Larger Context in Large Images
Viaarxiv icon

Humanoid Locomotion as Next Token Prediction

Add code
Feb 29, 2024
Viaarxiv icon