Picture for Yongqi Wang

Yongqi Wang

Design of an Expression Recognition Solution Employing the Global Channel-Spatial Attention Mechanism

Add code
Mar 15, 2025
Viaarxiv icon

Solution for 8th Competition on Affective & Behavior Analysis in-the-wild

Add code
Mar 14, 2025
Viaarxiv icon

Interactive Multimodal Fusion with Temporal Modeling

Add code
Mar 13, 2025
Viaarxiv icon

Dual-Stage Cross-Modal Network with Dynamic Feature Fusion for Emotional Mimicry Intensity Estimation

Add code
Mar 13, 2025
Viaarxiv icon

MoMu-Diffusion: On Learning Long-Term Motion-Music Synchronization and Correspondence

Add code
Nov 04, 2024
Viaarxiv icon

Accompanied Singing Voice Synthesis with Fully Text-controlled Melody

Add code
Jul 02, 2024
Viaarxiv icon

Self-Supervised Singing Voice Pre-Training towards Speech-to-Singing Conversion

Add code
Jun 04, 2024
Viaarxiv icon

Frieren: Efficient Video-to-Audio Generation with Rectified Flow Matching

Add code
Jun 01, 2024
Figure 1 for Frieren: Efficient Video-to-Audio Generation with Rectified Flow Matching
Figure 2 for Frieren: Efficient Video-to-Audio Generation with Rectified Flow Matching
Figure 3 for Frieren: Efficient Video-to-Audio Generation with Rectified Flow Matching
Figure 4 for Frieren: Efficient Video-to-Audio Generation with Rectified Flow Matching
Viaarxiv icon

Robust Singing Voice Transcription Serves Synthesis

Add code
May 16, 2024
Viaarxiv icon

Text-to-Song: Towards Controllable Music Generation Incorporating Vocals and Accompaniment

Add code
Apr 16, 2024
Viaarxiv icon