Picture for Deepti Ghadiyaram

Deepti Ghadiyaram

Facebook AI

A Systematic Study of Cross-Modal Typographic Attacks on Audio-Visual Reasoning

Add code
Apr 05, 2026
Viaarxiv icon

Semantic Richness or Geometric Reasoning? The Fragility of VLM's Visual Invariance

Add code
Apr 02, 2026
Viaarxiv icon

Seeing Isn't Orienting: A Cognitively Grounded Benchmark Reveals Systematic Orientation Failures in MLLMs Supplementary

Add code
Mar 12, 2026
Viaarxiv icon

DDiT: Dynamic Patch Scheduling for Efficient Diffusion Transformers

Add code
Feb 19, 2026
Viaarxiv icon

Right Side Up? Disentangling Orientation Understanding in MLLMs with Fine-grained Multi-axis Perception Tasks

Add code
May 29, 2025
Viaarxiv icon

Improving Physical Object State Representation in Text-to-Image Generative Systems

Add code
May 04, 2025
Figure 1 for Improving Physical Object State Representation in Text-to-Image Generative Systems
Figure 2 for Improving Physical Object State Representation in Text-to-Image Generative Systems
Figure 3 for Improving Physical Object State Representation in Text-to-Image Generative Systems
Figure 4 for Improving Physical Object State Representation in Text-to-Image Generative Systems
Viaarxiv icon

What's in a Latent? Leveraging Diffusion Latent Space for Domain Generalization

Add code
Mar 09, 2025
Figure 1 for What's in a Latent? Leveraging Diffusion Latent Space for Domain Generalization
Figure 2 for What's in a Latent? Leveraging Diffusion Latent Space for Domain Generalization
Figure 3 for What's in a Latent? Leveraging Diffusion Latent Space for Domain Generalization
Figure 4 for What's in a Latent? Leveraging Diffusion Latent Space for Domain Generalization
Viaarxiv icon

Concept Steerers: Leveraging K-Sparse Autoencoders for Controllable Generations

Add code
Jan 31, 2025
Figure 1 for Concept Steerers: Leveraging K-Sparse Autoencoders for Controllable Generations
Figure 2 for Concept Steerers: Leveraging K-Sparse Autoencoders for Controllable Generations
Figure 3 for Concept Steerers: Leveraging K-Sparse Autoencoders for Controllable Generations
Figure 4 for Concept Steerers: Leveraging K-Sparse Autoencoders for Controllable Generations
Viaarxiv icon

DebiasPI: Inference-time Debiasing by Prompt Iteration of a Text-to-Image Generative Model

Add code
Jan 28, 2025
Figure 1 for DebiasPI: Inference-time Debiasing by Prompt Iteration of a Text-to-Image Generative Model
Figure 2 for DebiasPI: Inference-time Debiasing by Prompt Iteration of a Text-to-Image Generative Model
Figure 3 for DebiasPI: Inference-time Debiasing by Prompt Iteration of a Text-to-Image Generative Model
Figure 4 for DebiasPI: Inference-time Debiasing by Prompt Iteration of a Text-to-Image Generative Model
Viaarxiv icon

$\textit{Revelio}$: Interpreting and leveraging semantic information in diffusion models

Add code
Nov 23, 2024
Figure 1 for $\textit{Revelio}$: Interpreting and leveraging semantic information in diffusion models
Figure 2 for $\textit{Revelio}$: Interpreting and leveraging semantic information in diffusion models
Figure 3 for $\textit{Revelio}$: Interpreting and leveraging semantic information in diffusion models
Figure 4 for $\textit{Revelio}$: Interpreting and leveraging semantic information in diffusion models
Viaarxiv icon