Picture for Yaser Yacoob

Yaser Yacoob

Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders

Add code
Aug 28, 2024
Figure 1 for Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders
Figure 2 for Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders
Figure 3 for Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders
Figure 4 for Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders
Viaarxiv icon

AVFF: Audio-Visual Feature Fusion for Video Deepfake Detection

Add code
Jun 05, 2024
Figure 1 for AVFF: Audio-Visual Feature Fusion for Video Deepfake Detection
Figure 2 for AVFF: Audio-Visual Feature Fusion for Video Deepfake Detection
Figure 3 for AVFF: Audio-Visual Feature Fusion for Video Deepfake Detection
Figure 4 for AVFF: Audio-Visual Feature Fusion for Video Deepfake Detection
Viaarxiv icon

MMC: Advancing Multimodal Chart Understanding with Large-scale Instruction Tuning

Add code
Nov 15, 2023
Viaarxiv icon

HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V, LLaVA-1.5, and Other Multi-modality Models

Add code
Oct 23, 2023
Viaarxiv icon

Aligning Large Multi-Modal Model with Robust Instruction Tuning

Add code
Jun 26, 2023
Viaarxiv icon

One-Shot Face Video Re-enactment using Hybrid Latent Spaces of StyleGAN2

Add code
Feb 15, 2023
Viaarxiv icon

COVID-VTS: Fact Extraction and Verification on Short Video Platforms

Add code
Feb 15, 2023
Viaarxiv icon

Encode-in-Style: Latent-based Video Encoding using StyleGAN2

Add code
Mar 28, 2022
Viaarxiv icon

Label Denoising Adversarial Network (LDAN) for Inverse Lighting of Face Images

Add code
Sep 06, 2017
Figure 1 for Label Denoising Adversarial Network (LDAN) for Inverse Lighting of Face Images
Figure 2 for Label Denoising Adversarial Network (LDAN) for Inverse Lighting of Face Images
Figure 3 for Label Denoising Adversarial Network (LDAN) for Inverse Lighting of Face Images
Figure 4 for Label Denoising Adversarial Network (LDAN) for Inverse Lighting of Face Images
Viaarxiv icon

Modeling Colors of Single Attribute Variations with Application to Food Appearance

Add code
Dec 18, 2015
Figure 1 for Modeling Colors of Single Attribute Variations with Application to Food Appearance
Figure 2 for Modeling Colors of Single Attribute Variations with Application to Food Appearance
Figure 3 for Modeling Colors of Single Attribute Variations with Application to Food Appearance
Figure 4 for Modeling Colors of Single Attribute Variations with Application to Food Appearance
Viaarxiv icon