Yongqi Wang

Design of an Expression Recognition Solution Employing the Global Channel-Spatial Attention Mechanism

Mar 15, 2025
Via arXiv

Solution for 8th Competition on Affective & Behavior Analysis in-the-wild

Mar 14, 2025
Via arXiv

Dual-Stage Cross-Modal Network with Dynamic Feature Fusion for Emotional Mimicry Intensity Estimation

Mar 13, 2025
Via arXiv

Interactive Multimodal Fusion with Temporal Modeling

Mar 13, 2025
Via arXiv

MoMu-Diffusion: On Learning Long-Term Motion-Music Synchronization and Correspondence

Nov 04, 2024
Via arXiv

Accompanied Singing Voice Synthesis with Fully Text-controlled Melody

Jul 02, 2024
Via arXiv

Self-Supervised Singing Voice Pre-Training towards Speech-to-Singing Conversion

Jun 04, 2024
Via arXiv

Frieren: Efficient Video-to-Audio Generation with Rectified Flow Matching

Jun 01, 2024
Via arXiv

Robust Singing Voice Transcription Serves Synthesis

May 16, 2024
Via arXiv

Text-to-Song: Towards Controllable Music Generation Incorporating Vocals and Accompaniment

Apr 16, 2024
Via arXiv