Picture for Shengqiang Li

Shengqiang Li

TouchTTS: An Embarrassingly Simple TTS Framework that Everyone Can Touch

Add code
Dec 12, 2024
Viaarxiv icon

The GUA-Speech System Description for CNVSRC Challenge 2023

Add code
Dec 12, 2023
Viaarxiv icon

Fast-U2++: Fast and Accurate End-to-End Speech Recognition in Joint CTC/Attention Frames

Add code
Nov 02, 2022
Viaarxiv icon

Conformer-based End-to-end Speech Recognition With Rotary Position Embedding

Add code
Jul 13, 2021
Figure 1 for Conformer-based End-to-end Speech Recognition With Rotary Position Embedding
Figure 2 for Conformer-based End-to-end Speech Recognition With Rotary Position Embedding
Figure 3 for Conformer-based End-to-end Speech Recognition With Rotary Position Embedding
Figure 4 for Conformer-based End-to-end Speech Recognition With Rotary Position Embedding
Viaarxiv icon

AUC Optimization for Robust Small-footprint Keyword Spotting with Limited Training Data

Add code
Jul 13, 2021
Figure 1 for AUC Optimization for Robust Small-footprint Keyword Spotting with Limited Training Data
Figure 2 for AUC Optimization for Robust Small-footprint Keyword Spotting with Limited Training Data
Figure 3 for AUC Optimization for Robust Small-footprint Keyword Spotting with Limited Training Data
Figure 4 for AUC Optimization for Robust Small-footprint Keyword Spotting with Limited Training Data
Viaarxiv icon

Efficient conformer-based speech recognition with linear attention

Add code
Apr 14, 2021
Figure 1 for Efficient conformer-based speech recognition with linear attention
Figure 2 for Efficient conformer-based speech recognition with linear attention
Figure 3 for Efficient conformer-based speech recognition with linear attention
Figure 4 for Efficient conformer-based speech recognition with linear attention
Viaarxiv icon

Libri-adhoc40: A dataset collected from synchronized ad-hoc microphone arrays

Add code
Apr 07, 2021
Figure 1 for Libri-adhoc40: A dataset collected from synchronized ad-hoc microphone arrays
Figure 2 for Libri-adhoc40: A dataset collected from synchronized ad-hoc microphone arrays
Figure 3 for Libri-adhoc40: A dataset collected from synchronized ad-hoc microphone arrays
Figure 4 for Libri-adhoc40: A dataset collected from synchronized ad-hoc microphone arrays
Viaarxiv icon

Transformer-based End-to-End Speech Recognition with Local Dense Synthesizer Attention

Add code
Oct 23, 2020
Figure 1 for Transformer-based End-to-End Speech Recognition with Local Dense Synthesizer Attention
Figure 2 for Transformer-based End-to-End Speech Recognition with Local Dense Synthesizer Attention
Figure 3 for Transformer-based End-to-End Speech Recognition with Local Dense Synthesizer Attention
Figure 4 for Transformer-based End-to-End Speech Recognition with Local Dense Synthesizer Attention
Viaarxiv icon