Picture for Animesh Sinha

Animesh Sinha

Movie Gen: A Cast of Media Foundation Models

Add code
Oct 17, 2024
Figure 1 for Movie Gen: A Cast of Media Foundation Models
Figure 2 for Movie Gen: A Cast of Media Foundation Models
Figure 3 for Movie Gen: A Cast of Media Foundation Models
Figure 4 for Movie Gen: A Cast of Media Foundation Models
Viaarxiv icon

GenTron: Delving Deep into Diffusion Transformers for Image and Video Generation

Add code
Dec 07, 2023
Figure 1 for GenTron: Delving Deep into Diffusion Transformers for Image and Video Generation
Figure 2 for GenTron: Delving Deep into Diffusion Transformers for Image and Video Generation
Figure 3 for GenTron: Delving Deep into Diffusion Transformers for Image and Video Generation
Figure 4 for GenTron: Delving Deep into Diffusion Transformers for Image and Video Generation
Viaarxiv icon

Gen2Det: Generate to Detect

Add code
Dec 07, 2023
Viaarxiv icon

Context Diffusion: In-Context Aware Image Generation

Add code
Dec 06, 2023
Viaarxiv icon

Text-to-Sticker: Style Tailoring Latent Diffusion Models for Human Expression

Add code
Nov 17, 2023
Figure 1 for Text-to-Sticker: Style Tailoring Latent Diffusion Models for Human Expression
Figure 2 for Text-to-Sticker: Style Tailoring Latent Diffusion Models for Human Expression
Figure 3 for Text-to-Sticker: Style Tailoring Latent Diffusion Models for Human Expression
Figure 4 for Text-to-Sticker: Style Tailoring Latent Diffusion Models for Human Expression
Viaarxiv icon

FaD-VLP: Fashion Vision-and-Language Pre-training towards Unified Retrieval and Captioning

Add code
Oct 26, 2022
Viaarxiv icon

CommerceMM: Large-Scale Commerce MultiModal Representation Learning with Omni Retrieval

Add code
Feb 15, 2022
Figure 1 for CommerceMM: Large-Scale Commerce MultiModal Representation Learning with Omni Retrieval
Figure 2 for CommerceMM: Large-Scale Commerce MultiModal Representation Learning with Omni Retrieval
Figure 3 for CommerceMM: Large-Scale Commerce MultiModal Representation Learning with Omni Retrieval
Figure 4 for CommerceMM: Large-Scale Commerce MultiModal Representation Learning with Omni Retrieval
Viaarxiv icon

Large-Scale Attribute-Object Compositions

Add code
May 24, 2021
Figure 1 for Large-Scale Attribute-Object Compositions
Figure 2 for Large-Scale Attribute-Object Compositions
Figure 3 for Large-Scale Attribute-Object Compositions
Figure 4 for Large-Scale Attribute-Object Compositions
Viaarxiv icon

Qubit Routing using Graph Neural Network aided Monte Carlo Tree Search

Add code
Apr 01, 2021
Figure 1 for Qubit Routing using Graph Neural Network aided Monte Carlo Tree Search
Figure 2 for Qubit Routing using Graph Neural Network aided Monte Carlo Tree Search
Figure 3 for Qubit Routing using Graph Neural Network aided Monte Carlo Tree Search
Figure 4 for Qubit Routing using Graph Neural Network aided Monte Carlo Tree Search
Viaarxiv icon