Mengmeng Xu

OpenTAD: A Unified Framework and Comprehensive Study of Temporal Action Detection

Feb 27, 2025
Via arXiv

Learning Flow Fields in Attention for Controllable Person Image Generation

Dec 12, 2024
Via arXiv

MarDini: Masked Autoregressive Diffusion for Video Generation at Scale

Oct 26, 2024
Via arXiv

Move Anything with Layered Scene Diffusion

Apr 10, 2024
Via arXiv

Hyper-VolTran: Fast and Generalizable One-Shot Image to 3D Object Structure via HyperNetworks

Jan 05, 2024
Via arXiv

GenTron: Delving Deep into Diffusion Transformers for Image and Video Generation

Dec 07, 2023
Via arXiv

FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing

Oct 09, 2023
Via arXiv

Mindstorms in Natural Language-Based Societies of Mind

May 26, 2023
Via arXiv

Boundary-Denoising for Video Activity Localization

Apr 06, 2023
Via arXiv

Multi-Modal Few-Shot Temporal Action Detection via Vision-Language Meta-Adaptation

Nov 27, 2022
Via arXiv