
Mubarak Shah

SB-Bench: Stereotype Bias Benchmark for Large Multimodal Models

Feb 12, 2025

TimeLogic: A Temporal Logic Benchmark for Video QA

Jan 13, 2025

LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs

Jan 10, 2025

LIAR: Leveraging Alignment (Best-of-N) to Jailbreak LLMs in Seconds

Dec 06, 2024

Deepfake Media Generation and Detection in the Generative AI Era: A Survey and Outlook

Nov 29, 2024

All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages

Nov 25, 2024

DLCR: A Generative Data Expansion Framework via Diffusion for Clothes-Changing Person Re-ID

Nov 11, 2024

CityGuessr: City-Level Video Geo-Localization on a Global Scale

Nov 10, 2024

Hierarchical Preference Optimization: Learning to achieve goals via feasible subgoals prediction

Nov 01, 2024

Investigating Memorization in Video Diffusion Models

Oct 29, 2024