Bryan Russell

Video-Guided Foley Sound Generation with Multimodal Controls

Nov 26, 2024

Generative Timelines for Instructed Visual Assembly

Nov 19, 2024

Adapting Dual-encoder Vision-language Models for Paraphrased Retrieval

May 06, 2024

Koala: Key frame-conditioned long video-LLM

Apr 05, 2024

Customizing Motion in Text-to-Video Diffusion Models

Dec 07, 2023

Meta-Personalizing Vision-Language Models to Find Named Instances in Video

Jun 16, 2023

Language-Guided Music Recommendation for Video via Prompt Analogies

Jun 15, 2023

Conditional Generation of Audio from Video via Foley Analogies

Apr 17, 2023

Language-Guided Audio-Visual Source Separation via Trimodal Consistency

Mar 28, 2023

Monocular Dynamic View Synthesis: A Reality Check

Oct 24, 2022