Mingze Li

A MEMS-based terahertz broadband beam steering technique

Sep 06, 2024
Via arXiv

MSceneSpeech: A Multi-Scene Speech Dataset For Expressive Speech Synthesis

Jul 19, 2024
Via arXiv

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

Apr 25, 2023
Via arXiv

Robust Table Structure Recognition with Dynamic Queries Enhanced Detection Transformer

Mar 21, 2023
Via arXiv

Make-An-Audio: Text-To-Audio Generation with Prompt-Enhanced Diffusion Models

Jan 30, 2023
Via arXiv

TSRFormer: Table Structure Recognition with Transformers

Aug 09, 2022
Via arXiv