Picture for Yiyang Ma

Yiyang Ma

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Add code
Dec 13, 2024
Viaarxiv icon

JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and Generation

Add code
Nov 12, 2024
Viaarxiv icon

Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation

Add code
Oct 17, 2024
Viaarxiv icon

MacDiff: Unified Skeleton Modeling with Masked Conditional Diffusion

Add code
Sep 16, 2024
Viaarxiv icon

Correcting Diffusion-Based Perceptual Image Compression with Privileged End-to-End Decoder

Add code
Apr 07, 2024
Viaarxiv icon

Diffusion Enhancement for Cloud Removal in Ultra-Resolution Remote Sensing Imagery

Add code
Jan 25, 2024
Viaarxiv icon

Solving Diffusion ODEs with Optimal Boundary Conditions for Better Image Super-Resolution

Add code
May 24, 2023
Viaarxiv icon

Unified Multi-Modal Latent Diffusion for Joint Subject and Text Conditional Image Generation

Add code
Mar 16, 2023
Viaarxiv icon

MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation

Add code
Dec 19, 2022
Viaarxiv icon

AI Illustrator: Translating Raw Descriptions into Images by Prompt-based Cross-Modal Generation

Add code
Sep 08, 2022
Figure 1 for AI Illustrator: Translating Raw Descriptions into Images by Prompt-based Cross-Modal Generation
Figure 2 for AI Illustrator: Translating Raw Descriptions into Images by Prompt-based Cross-Modal Generation
Figure 3 for AI Illustrator: Translating Raw Descriptions into Images by Prompt-based Cross-Modal Generation
Figure 4 for AI Illustrator: Translating Raw Descriptions into Images by Prompt-based Cross-Modal Generation
Viaarxiv icon