Picture for Mahyar Khayatkhoei

Mahyar Khayatkhoei

MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs

Add code
Feb 24, 2025
Viaarxiv icon

Look, Learn and Leverage (L$^3$): Mitigating Visual-Domain Shift and Discovering Intrinsic Relations via Symbolic Alignment

Add code
Aug 30, 2024
Viaarxiv icon

An Investigation on The Position Encoding in Vision-Based Dynamics Prediction

Add code
Aug 27, 2024
Viaarxiv icon

ManiFPT: Defining and Analyzing Fingerprints of Generative Models

Add code
Feb 29, 2024
Viaarxiv icon

Exploring Perceptual Limitation of Multimodal Large Language Models

Add code
Feb 12, 2024
Figure 1 for Exploring Perceptual Limitation of Multimodal Large Language Models
Figure 2 for Exploring Perceptual Limitation of Multimodal Large Language Models
Figure 3 for Exploring Perceptual Limitation of Multimodal Large Language Models
Figure 4 for Exploring Perceptual Limitation of Multimodal Large Language Models
Viaarxiv icon

Unsupervised Multimodal Deepfake Detection Using Intra- and Cross-Modal Inconsistencies

Add code
Nov 28, 2023
Viaarxiv icon

SABAF: Removing Strong Attribute Bias from Neural Networks with Adversarial Filtering

Add code
Nov 16, 2023
Viaarxiv icon

Visual Cropping Improves Zero-Shot Question Answering of Multimodal Large Language Models

Add code
Oct 24, 2023
Viaarxiv icon

Information-Theoretic Bounds on The Removal of Attribute-Specific Bias From Neural Networks

Add code
Oct 08, 2023
Viaarxiv icon

Shadow Datasets, New challenging datasets for Causal Representation Learning

Add code
Aug 11, 2023
Viaarxiv icon