Picture for Mohan Kankanhalli

Mohan Kankanhalli

VidHal: Benchmarking Temporal Hallucinations in Vision LLMs

Add code
Nov 25, 2024
Viaarxiv icon

VideoAutoArena: An Automated Arena for Evaluating Large Multimodal Models in Video Analysis through User Simulation

Add code
Nov 20, 2024
Figure 1 for VideoAutoArena: An Automated Arena for Evaluating Large Multimodal Models in Video Analysis through User Simulation
Figure 2 for VideoAutoArena: An Automated Arena for Evaluating Large Multimodal Models in Video Analysis through User Simulation
Figure 3 for VideoAutoArena: An Automated Arena for Evaluating Large Multimodal Models in Video Analysis through User Simulation
Figure 4 for VideoAutoArena: An Automated Arena for Evaluating Large Multimodal Models in Video Analysis through User Simulation
Viaarxiv icon

Joint Vision-Language Social Bias Removal for CLIP

Add code
Nov 19, 2024
Viaarxiv icon

SCAN: Bootstrapping Contrastive Pre-training for Data Efficiency

Add code
Nov 14, 2024
Viaarxiv icon

The VLLM Safety Paradox: Dual Ease in Jailbreak Attack and Defense

Add code
Nov 13, 2024
Viaarxiv icon

UnStar: Unlearning with Self-Taught Anti-Sample Reasoning for LLMs

Add code
Oct 22, 2024
Viaarxiv icon

Strong Preferences Affect the Robustness of Value Alignment

Add code
Oct 03, 2024
Viaarxiv icon

STAR: Skeleton-aware Text-based 4D Avatar Generation with In-Network Motion Retargeting

Add code
Jun 07, 2024
Viaarxiv icon

Do Vision-Language Transformers Exhibit Visual Commonsense? An Empirical Study of VCR

Add code
May 27, 2024
Viaarxiv icon

Multi-Modal Recommendation Unlearning

Add code
May 24, 2024
Figure 1 for Multi-Modal Recommendation Unlearning
Figure 2 for Multi-Modal Recommendation Unlearning
Figure 3 for Multi-Modal Recommendation Unlearning
Figure 4 for Multi-Modal Recommendation Unlearning
Viaarxiv icon