Picture for Long Mai

Long Mai

REGEN: Learning Compact Video Embedding with (Re-)Generative Decoder

Add code
Mar 11, 2025
Viaarxiv icon

MotionCanvas: Cinematic Shot Design with Controllable Image-to-Video Generation

Add code
Feb 06, 2025
Figure 1 for MotionCanvas: Cinematic Shot Design with Controllable Image-to-Video Generation
Figure 2 for MotionCanvas: Cinematic Shot Design with Controllable Image-to-Video Generation
Figure 3 for MotionCanvas: Cinematic Shot Design with Controllable Image-to-Video Generation
Figure 4 for MotionCanvas: Cinematic Shot Design with Controllable Image-to-Video Generation
Viaarxiv icon

Pushing the Boundaries of State Space Models for Image and Video Generation

Add code
Feb 03, 2025
Figure 1 for Pushing the Boundaries of State Space Models for Image and Video Generation
Figure 2 for Pushing the Boundaries of State Space Models for Image and Video Generation
Figure 3 for Pushing the Boundaries of State Space Models for Image and Video Generation
Figure 4 for Pushing the Boundaries of State Space Models for Image and Video Generation
Viaarxiv icon

Progressive Growing of Video Tokenizers for Highly Compressed Latent Spaces

Add code
Jan 09, 2025
Viaarxiv icon

Real-Time Textless Dialogue Generation

Add code
Jan 08, 2025
Viaarxiv icon

GaussianVideo: Efficient Video Representation via Hierarchical Gaussian Splatting

Add code
Jan 08, 2025
Viaarxiv icon

TAB: Transformer Attention Bottlenecks enable User Intervention and Debugging in Vision-Language Models

Add code
Dec 24, 2024
Viaarxiv icon

Improving Linguistic Diversity of Large Language Models with Possibility Exploration Fine-Tuning

Add code
Dec 04, 2024
Figure 1 for Improving Linguistic Diversity of Large Language Models with Possibility Exploration Fine-Tuning
Figure 2 for Improving Linguistic Diversity of Large Language Models with Possibility Exploration Fine-Tuning
Figure 3 for Improving Linguistic Diversity of Large Language Models with Possibility Exploration Fine-Tuning
Figure 4 for Improving Linguistic Diversity of Large Language Models with Possibility Exploration Fine-Tuning
Viaarxiv icon

Empowering Visual Creativity: A Vision-Language Assistant to Image Editing Recommendations

Add code
May 31, 2024
Figure 1 for Empowering Visual Creativity: A Vision-Language Assistant to Image Editing Recommendations
Figure 2 for Empowering Visual Creativity: A Vision-Language Assistant to Image Editing Recommendations
Figure 3 for Empowering Visual Creativity: A Vision-Language Assistant to Image Editing Recommendations
Figure 4 for Empowering Visual Creativity: A Vision-Language Assistant to Image Editing Recommendations
Viaarxiv icon

SPICED: News Similarity Detection Dataset with Multiple Topics and Complexity Levels

Add code
Sep 21, 2023
Viaarxiv icon