Picture for Yongqi Wang

Yongqi Wang

MoMu-Diffusion: On Learning Long-Term Motion-Music Synchronization and Correspondence

Add code
Nov 04, 2024
Viaarxiv icon

Accompanied Singing Voice Synthesis with Fully Text-controlled Melody

Add code
Jul 02, 2024
Viaarxiv icon

Self-Supervised Singing Voice Pre-Training towards Speech-to-Singing Conversion

Add code
Jun 04, 2024
Viaarxiv icon

Frieren: Efficient Video-to-Audio Generation with Rectified Flow Matching

Add code
Jun 01, 2024
Viaarxiv icon

Robust Singing Voice Transcription Serves Synthesis

Add code
May 16, 2024
Viaarxiv icon

Text-to-Song: Towards Controllable Music Generation Incorporating Vocals and Accompaniment

Add code
Apr 16, 2024
Viaarxiv icon

Multimodal Fusion Method with Spatiotemporal Sequences and Relationship Learning for Valence-Arousal Estimation

Add code
Mar 20, 2024
Viaarxiv icon

AUD-TGN: Advancing Action Unit Detection with Temporal Convolution and GPT-2 in Wild Audiovisual Contexts

Add code
Mar 20, 2024
Viaarxiv icon

Exploring Facial Expression Recognition through Semi-Supervised Pretraining and Temporal Modeling

Add code
Mar 19, 2024
Viaarxiv icon

Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt

Add code
Mar 18, 2024
Viaarxiv icon