Yingda Chen

MinMo: A Multimodal Large Language Model for Seamless Voice Interaction

Jan 10, 2025
Via arXiv

EliGen: Entity-Level Controlled Image Generation with Regional Attention

Jan 02, 2025
Via arXiv

ArtAug: Enhancing Text-to-Image Generation through Synthesis-Understanding Interaction

Dec 18, 2024
Via arXiv

Minimum Tuning to Unlock Long Output from LLMs with High Quality Data as the Key

Oct 15, 2024
Via arXiv

SWIFT: A Scalable lightWeight Infrastructure for Fine-Tuning

Aug 13, 2024
Via arXiv

ModelScope-Agent: Building Your Customizable Agent System with Open-source Large Language Models

Sep 02, 2023
Via arXiv

FaceChain: A Playground for Identity-Preserving Portrait Generation

Aug 28, 2023
Via arXiv