Picture for Rong Gong

Rong Gong

Findings of the 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge

Add code
Sep 09, 2024
Figure 1 for Findings of the 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge
Figure 2 for Findings of the 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge
Figure 3 for Findings of the 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge
Figure 4 for Findings of the 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge
Viaarxiv icon

AS-70: A Mandarin stuttered speech dataset for automatic speech recognition and stuttering event detection

Add code
Jun 11, 2024
Figure 1 for AS-70: A Mandarin stuttered speech dataset for automatic speech recognition and stuttering event detection
Figure 2 for AS-70: A Mandarin stuttered speech dataset for automatic speech recognition and stuttering event detection
Figure 3 for AS-70: A Mandarin stuttered speech dataset for automatic speech recognition and stuttering event detection
Viaarxiv icon

Spatial Processing Front-End For Distant ASR Exploiting Self-Attention Channel Combinator

Add code
Mar 25, 2022
Figure 1 for Spatial Processing Front-End For Distant ASR Exploiting Self-Attention Channel Combinator
Figure 2 for Spatial Processing Front-End For Distant ASR Exploiting Self-Attention Channel Combinator
Figure 3 for Spatial Processing Front-End For Distant ASR Exploiting Self-Attention Channel Combinator
Figure 4 for Spatial Processing Front-End For Distant ASR Exploiting Self-Attention Channel Combinator
Viaarxiv icon

Self-Attention Channel Combinator Frontend for End-to-End Multichannel Far-field Speech Recognition

Add code
Sep 10, 2021
Figure 1 for Self-Attention Channel Combinator Frontend for End-to-End Multichannel Far-field Speech Recognition
Figure 2 for Self-Attention Channel Combinator Frontend for End-to-End Multichannel Far-field Speech Recognition
Figure 3 for Self-Attention Channel Combinator Frontend for End-to-End Multichannel Far-field Speech Recognition
Figure 4 for Self-Attention Channel Combinator Frontend for End-to-End Multichannel Far-field Speech Recognition
Viaarxiv icon

A Simple Fusion of Deep and Shallow Learning for Acoustic Scene Classification

Add code
Jun 27, 2018
Figure 1 for A Simple Fusion of Deep and Shallow Learning for Acoustic Scene Classification
Figure 2 for A Simple Fusion of Deep and Shallow Learning for Acoustic Scene Classification
Figure 3 for A Simple Fusion of Deep and Shallow Learning for Acoustic Scene Classification
Figure 4 for A Simple Fusion of Deep and Shallow Learning for Acoustic Scene Classification
Viaarxiv icon