Picture for Maitreya Patel

Maitreya Patel

Steering Rectified Flow Models in the Vector Field for Controlled Image Generation

Add code
Nov 27, 2024
Viaarxiv icon

Precision or Recall? An Analysis of Image Captions for Training Text-to-Image Generation Model

Add code
Nov 07, 2024
Figure 1 for Precision or Recall? An Analysis of Image Captions for Training Text-to-Image Generation Model
Figure 2 for Precision or Recall? An Analysis of Image Captions for Training Text-to-Image Generation Model
Figure 3 for Precision or Recall? An Analysis of Image Captions for Training Text-to-Image Generation Model
Figure 4 for Precision or Recall? An Analysis of Image Captions for Training Text-to-Image Generation Model
Viaarxiv icon

TripletCLIP: Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives

Add code
Nov 04, 2024
Figure 1 for TripletCLIP: Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives
Figure 2 for TripletCLIP: Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives
Figure 3 for TripletCLIP: Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives
Figure 4 for TripletCLIP: Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives
Viaarxiv icon

Help Me Identify: Is an LLM+VQA System All We Need to Identify Visual Concepts?

Add code
Oct 17, 2024
Figure 1 for Help Me Identify: Is an LLM+VQA System All We Need to Identify Visual Concepts?
Figure 2 for Help Me Identify: Is an LLM+VQA System All We Need to Identify Visual Concepts?
Figure 3 for Help Me Identify: Is an LLM+VQA System All We Need to Identify Visual Concepts?
Figure 4 for Help Me Identify: Is an LLM+VQA System All We Need to Identify Visual Concepts?
Viaarxiv icon

$λ$-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent Space

Add code
Feb 07, 2024
Figure 1 for $λ$-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent Space
Figure 2 for $λ$-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent Space
Figure 3 for $λ$-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent Space
Figure 4 for $λ$-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent Space
Viaarxiv icon

ECLIPSE: A Resource-Efficient Text-to-Image Prior for Image Generations

Add code
Dec 07, 2023
Figure 1 for ECLIPSE: A Resource-Efficient Text-to-Image Prior for Image Generations
Figure 2 for ECLIPSE: A Resource-Efficient Text-to-Image Prior for Image Generations
Figure 3 for ECLIPSE: A Resource-Efficient Text-to-Image Prior for Image Generations
Figure 4 for ECLIPSE: A Resource-Efficient Text-to-Image Prior for Image Generations
Viaarxiv icon

WOUAF: Weight Modulation for User Attribution and Fingerprinting in Text-to-Image Diffusion Models

Add code
Jun 07, 2023
Figure 1 for WOUAF: Weight Modulation for User Attribution and Fingerprinting in Text-to-Image Diffusion Models
Figure 2 for WOUAF: Weight Modulation for User Attribution and Fingerprinting in Text-to-Image Diffusion Models
Figure 3 for WOUAF: Weight Modulation for User Attribution and Fingerprinting in Text-to-Image Diffusion Models
Figure 4 for WOUAF: Weight Modulation for User Attribution and Fingerprinting in Text-to-Image Diffusion Models
Viaarxiv icon

ConceptBed: Evaluating Concept Learning Abilities of Text-to-Image Diffusion Models

Add code
Jun 07, 2023
Viaarxiv icon

CRIPP-VQA: Counterfactual Reasoning about Implicit Physical Properties via Video Question Answering

Add code
Nov 07, 2022
Viaarxiv icon

Reasoning about Actions over Visual and Linguistic Modalities: A Survey

Add code
Jul 15, 2022
Figure 1 for Reasoning about Actions over Visual and Linguistic Modalities: A Survey
Figure 2 for Reasoning about Actions over Visual and Linguistic Modalities: A Survey
Figure 3 for Reasoning about Actions over Visual and Linguistic Modalities: A Survey
Figure 4 for Reasoning about Actions over Visual and Linguistic Modalities: A Survey
Viaarxiv icon