Picture for Xihua Wang

Xihua Wang

Enhancing Audiovisual Speech Recognition through Bifocal Preference Optimization

Add code
Dec 26, 2024
Viaarxiv icon

Two-in-One: Unified Multi-Person Interactive Motion Generation by Latent Diffusion Transformer

Add code
Dec 21, 2024
Viaarxiv icon

SpeechComposer: Unifying Multiple Speech Tasks with Prompt Composition

Add code
Jan 31, 2024
Viaarxiv icon