Picture for Anjan Dutta

Anjan Dutta

RespoDiff: Dual-Module Bottleneck Transformation for Responsible & Faithful T2I Generation

Add code
Sep 18, 2025
Viaarxiv icon

TaleDiffusion: Multi-Character Story Generation with Dialogue Rendering

Add code
Sep 04, 2025
Viaarxiv icon

A Closer Look at Multimodal Representation Collapse

Add code
May 28, 2025
Viaarxiv icon

SVGCraft: Beyond Single Object Text-to-SVG Synthesis with Comprehensive Canvas Layout

Add code
Mar 30, 2024
Figure 1 for SVGCraft: Beyond Single Object Text-to-SVG Synthesis with Comprehensive Canvas Layout
Figure 2 for SVGCraft: Beyond Single Object Text-to-SVG Synthesis with Comprehensive Canvas Layout
Figure 3 for SVGCraft: Beyond Single Object Text-to-SVG Synthesis with Comprehensive Canvas Layout
Figure 4 for SVGCraft: Beyond Single Object Text-to-SVG Synthesis with Comprehensive Canvas Layout
Viaarxiv icon

DeNetDM: Debiasing by Network Depth Modulation

Add code
Mar 28, 2024
Viaarxiv icon

OmniCount: Multi-label Object Counting with Semantic-Geometric Priors

Add code
Mar 14, 2024
Viaarxiv icon

Learning Conditional Invariances through Non-Commutativity

Add code
Feb 18, 2024
Viaarxiv icon

CLIPDrawX: Primitive-based Explanations for Text Guided Sketch Synthesis

Add code
Dec 04, 2023
Figure 1 for CLIPDrawX: Primitive-based Explanations for Text Guided Sketch Synthesis
Figure 2 for CLIPDrawX: Primitive-based Explanations for Text Guided Sketch Synthesis
Figure 3 for CLIPDrawX: Primitive-based Explanations for Text Guided Sketch Synthesis
Figure 4 for CLIPDrawX: Primitive-based Explanations for Text Guided Sketch Synthesis
Viaarxiv icon

Transitivity Recovering Decompositions: Interpretable and Robust Fine-Grained Relationships

Add code
Oct 24, 2023
Viaarxiv icon

Sarcasm in Sight and Sound: Benchmarking and Expansion to Improve Multimodal Sarcasm Detection

Add code
Sep 29, 2023
Figure 1 for Sarcasm in Sight and Sound: Benchmarking and Expansion to Improve Multimodal Sarcasm Detection
Figure 2 for Sarcasm in Sight and Sound: Benchmarking and Expansion to Improve Multimodal Sarcasm Detection
Figure 3 for Sarcasm in Sight and Sound: Benchmarking and Expansion to Improve Multimodal Sarcasm Detection
Figure 4 for Sarcasm in Sight and Sound: Benchmarking and Expansion to Improve Multimodal Sarcasm Detection
Viaarxiv icon