Picture for Mubarak Shah

Mubarak Shah

TimeLogic: A Temporal Logic Benchmark for Video QA

Add code
Jan 13, 2025
Viaarxiv icon

LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs

Add code
Jan 10, 2025
Figure 1 for LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs
Figure 2 for LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs
Figure 3 for LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs
Figure 4 for LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs
Viaarxiv icon

LIAR: Leveraging Alignment (Best-of-N) to Jailbreak LLMs in Seconds

Add code
Dec 06, 2024
Viaarxiv icon

Deepfake Media Generation and Detection in the Generative AI Era: A Survey and Outlook

Add code
Nov 29, 2024
Figure 1 for Deepfake Media Generation and Detection in the Generative AI Era: A Survey and Outlook
Figure 2 for Deepfake Media Generation and Detection in the Generative AI Era: A Survey and Outlook
Figure 3 for Deepfake Media Generation and Detection in the Generative AI Era: A Survey and Outlook
Figure 4 for Deepfake Media Generation and Detection in the Generative AI Era: A Survey and Outlook
Viaarxiv icon

All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages

Add code
Nov 25, 2024
Figure 1 for All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages
Figure 2 for All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages
Figure 3 for All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages
Figure 4 for All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages
Viaarxiv icon

DLCR: A Generative Data Expansion Framework via Diffusion for Clothes-Changing Person Re-ID

Add code
Nov 11, 2024
Figure 1 for DLCR: A Generative Data Expansion Framework via Diffusion for Clothes-Changing Person Re-ID
Figure 2 for DLCR: A Generative Data Expansion Framework via Diffusion for Clothes-Changing Person Re-ID
Figure 3 for DLCR: A Generative Data Expansion Framework via Diffusion for Clothes-Changing Person Re-ID
Figure 4 for DLCR: A Generative Data Expansion Framework via Diffusion for Clothes-Changing Person Re-ID
Viaarxiv icon

CityGuessr: City-Level Video Geo-Localization on a Global Scale

Add code
Nov 10, 2024
Figure 1 for CityGuessr: City-Level Video Geo-Localization on a Global Scale
Figure 2 for CityGuessr: City-Level Video Geo-Localization on a Global Scale
Figure 3 for CityGuessr: City-Level Video Geo-Localization on a Global Scale
Figure 4 for CityGuessr: City-Level Video Geo-Localization on a Global Scale
Viaarxiv icon

Hierarchical Preference Optimization: Learning to achieve goals via feasible subgoals prediction

Add code
Nov 01, 2024
Viaarxiv icon

Investigating Memorization in Video Diffusion Models

Add code
Oct 29, 2024
Viaarxiv icon

Exploring Local Memorization in Diffusion Models via Bright Ending Attention

Add code
Oct 29, 2024
Viaarxiv icon