
Yaser Yacoob

Mitigating Hallucinations in Diffusion Models through Adaptive Attention Modulation

Feb 24, 2025

Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders

Aug 28, 2024

AVFF: Audio-Visual Feature Fusion for Video Deepfake Detection

Jun 05, 2024

MMC: Advancing Multimodal Chart Understanding with Large-scale Instruction Tuning

Nov 15, 2023

HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V, LLaVA-1.5, and Other Multi-modality Models

Oct 23, 2023

Aligning Large Multi-Modal Model with Robust Instruction Tuning

Jun 26, 2023

COVID-VTS: Fact Extraction and Verification on Short Video Platforms

Feb 15, 2023

One-Shot Face Video Re-enactment using Hybrid Latent Spaces of StyleGAN2

Feb 15, 2023

Encode-in-Style: Latent-based Video Encoding using StyleGAN2

Mar 28, 2022

Label Denoising Adversarial Network (LDAN) for Inverse Lighting of Face Images

Sep 06, 2017