Long Mai

MotionCanvas: Cinematic Shot Design with Controllable Image-to-Video Generation

Feb 06, 2025

Pushing the Boundaries of State Space Models for Image and Video Generation

Feb 03, 2025

Progressive Growing of Video Tokenizers for Highly Compressed Latent Spaces

Jan 09, 2025

GaussianVideo: Efficient Video Representation via Hierarchical Gaussian Splatting

Jan 08, 2025

Real-Time Textless Dialogue Generation

Jan 08, 2025

TAB: Transformer Attention Bottlenecks enable User Intervention and Debugging in Vision-Language Models

Dec 24, 2024

Improving Linguistic Diversity of Large Language Models with Possibility Exploration Fine-Tuning

Dec 04, 2024

Empowering Visual Creativity: A Vision-Language Assistant to Image Editing Recommendations

May 31, 2024

SPICED: News Similarity Detection Dataset with Multiple Topics and Complexity Levels

Sep 21, 2023

MagicProp: Diffusion-based Video Editing via Motion-aware Appearance Propagation

Sep 02, 2023