Aman Chadha

Exploring the Abilities of Large Language Models to Solve Proportional Analogies via Knowledge-Enhanced Prompting

Dec 01, 2024
Via arXiv

Improving speaker verification robustness with synthetic emotional utterances

Nov 30, 2024
Via arXiv

All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages

Nov 25, 2024
Via arXiv

Visual Counter Turing Test (VCT^2): Discovering the Challenges for AI-Generated Image Detection and Introducing Visual AI Index (V_AI)

Nov 24, 2024
Via arXiv

ViBe: A Text-to-Video Benchmark for Evaluating Hallucination in Large Multimodal Models

Nov 16, 2024
Via arXiv

DM-Codec: Distilling Multimodal Representations for Speech Tokenization

Oct 19, 2024
Via arXiv

Overview of Factify5WQA: Fact Verification through 5W Question-Answering

Oct 05, 2024
Via arXiv

MedVisionLlama: Leveraging Pre-Trained Large Language Model Layers to Enhance Medical Image Segmentation

Oct 03, 2024
Via arXiv

Guiding Vision-Language Model Selection for Visual Question-Answering Across Tasks, Domains, and Knowledge Types

Sep 14, 2024
Via arXiv

Density Adaptive Attention-based Speech Network: Enhancing Feature Understanding for Mental Health Disorders

Aug 31, 2024
Via arXiv