Picture for Vicente Ordonez

Vicente Ordonez

AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation

Add code
Dec 19, 2024
Figure 1 for AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation
Figure 2 for AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation
Figure 3 for AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation
Figure 4 for AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation
Viaarxiv icon

ParallelSpec: Parallel Drafter for Efficient Speculative Decoding

Add code
Oct 08, 2024
Viaarxiv icon

Fairness and Bias Mitigation in Computer Vision: A Survey

Add code
Aug 05, 2024
Figure 1 for Fairness and Bias Mitigation in Computer Vision: A Survey
Figure 2 for Fairness and Bias Mitigation in Computer Vision: A Survey
Figure 3 for Fairness and Bias Mitigation in Computer Vision: A Survey
Figure 4 for Fairness and Bias Mitigation in Computer Vision: A Survey
Viaarxiv icon

Taming Data and Transformers for Audio Generation

Add code
Jun 27, 2024
Viaarxiv icon

Generative Visual Instruction Tuning

Add code
Jun 17, 2024
Figure 1 for Generative Visual Instruction Tuning
Figure 2 for Generative Visual Instruction Tuning
Figure 3 for Generative Visual Instruction Tuning
Figure 4 for Generative Visual Instruction Tuning
Viaarxiv icon

FlexEControl: Flexible and Efficient Multimodal Control for Text-to-Image Generation

Add code
May 08, 2024
Viaarxiv icon

PropTest: Automatic Property Testing for Improved Visual Programming

Add code
Mar 25, 2024
Figure 1 for PropTest: Automatic Property Testing for Improved Visual Programming
Figure 2 for PropTest: Automatic Property Testing for Improved Visual Programming
Figure 3 for PropTest: Automatic Property Testing for Improved Visual Programming
Figure 4 for PropTest: Automatic Property Testing for Improved Visual Programming
Viaarxiv icon

Learning from Models and Data for Visual Grounding

Add code
Mar 20, 2024
Figure 1 for Learning from Models and Data for Visual Grounding
Figure 2 for Learning from Models and Data for Visual Grounding
Figure 3 for Learning from Models and Data for Visual Grounding
Figure 4 for Learning from Models and Data for Visual Grounding
Viaarxiv icon

Grounding Language Models for Visual Entity Recognition

Add code
Feb 28, 2024
Viaarxiv icon

Improved Visual Grounding through Self-Consistent Explanations

Add code
Dec 07, 2023
Viaarxiv icon