Picture for Gaofeng Cheng

Gaofeng Cheng

Alternative Pseudo-Labeling for Semi-Supervised Automatic Speech Recognition

Add code
Aug 12, 2023
Figure 1 for Alternative Pseudo-Labeling for Semi-Supervised Automatic Speech Recognition
Figure 2 for Alternative Pseudo-Labeling for Semi-Supervised Automatic Speech Recognition
Figure 3 for Alternative Pseudo-Labeling for Semi-Supervised Automatic Speech Recognition
Figure 4 for Alternative Pseudo-Labeling for Semi-Supervised Automatic Speech Recognition
Viaarxiv icon

Online Hybrid CTC/Attention End-to-End Automatic Speech Recognition Architecture

Add code
Jul 05, 2023
Viaarxiv icon

Speech Corpora Divergence Based Unsupervised Data Selection for ASR

Add code
Feb 26, 2023
Viaarxiv icon

Summary on the ISCSLP 2022 Chinese-English Code-Switching ASR Challenge

Add code
Oct 13, 2022
Figure 1 for Summary on the ISCSLP 2022 Chinese-English Code-Switching ASR Challenge
Viaarxiv icon

The Conversational Short-phrase Speaker Diarization (CSSD) Task: Dataset, Evaluation Metric and Baselines

Add code
Aug 17, 2022
Figure 1 for The Conversational Short-phrase Speaker Diarization (CSSD) Task: Dataset, Evaluation Metric and Baselines
Figure 2 for The Conversational Short-phrase Speaker Diarization (CSSD) Task: Dataset, Evaluation Metric and Baselines
Figure 3 for The Conversational Short-phrase Speaker Diarization (CSSD) Task: Dataset, Evaluation Metric and Baselines
Figure 4 for The Conversational Short-phrase Speaker Diarization (CSSD) Task: Dataset, Evaluation Metric and Baselines
Viaarxiv icon

Improving Streaming End-to-End ASR on Transformer-based Causal Models with Encoder States Revision Strategies

Add code
Jul 06, 2022
Figure 1 for Improving Streaming End-to-End ASR on Transformer-based Causal Models with Encoder States Revision Strategies
Figure 2 for Improving Streaming End-to-End ASR on Transformer-based Causal Models with Encoder States Revision Strategies
Figure 3 for Improving Streaming End-to-End ASR on Transformer-based Causal Models with Encoder States Revision Strategies
Viaarxiv icon

Interrelate Training and Searching: A Unified Online Clustering Framework for Speaker Diarization

Add code
Jun 28, 2022
Figure 1 for Interrelate Training and Searching: A Unified Online Clustering Framework for Speaker Diarization
Figure 2 for Interrelate Training and Searching: A Unified Online Clustering Framework for Speaker Diarization
Figure 3 for Interrelate Training and Searching: A Unified Online Clustering Framework for Speaker Diarization
Viaarxiv icon

Boosting Cross-Domain Speech Recognition with Self-Supervision

Add code
Jun 20, 2022
Figure 1 for Boosting Cross-Domain Speech Recognition with Self-Supervision
Figure 2 for Boosting Cross-Domain Speech Recognition with Self-Supervision
Figure 3 for Boosting Cross-Domain Speech Recognition with Self-Supervision
Figure 4 for Boosting Cross-Domain Speech Recognition with Self-Supervision
Viaarxiv icon

Decoupled Federated Learning for ASR with Non-IID Data

Add code
Jun 18, 2022
Figure 1 for Decoupled Federated Learning for ASR with Non-IID Data
Figure 2 for Decoupled Federated Learning for ASR with Non-IID Data
Figure 3 for Decoupled Federated Learning for ASR with Non-IID Data
Figure 4 for Decoupled Federated Learning for ASR with Non-IID Data
Viaarxiv icon

Open Source MagicData-RAMC: A Rich Annotated Mandarin Conversational Speech Dataset

Add code
Mar 31, 2022
Figure 1 for Open Source MagicData-RAMC: A Rich Annotated Mandarin Conversational Speech Dataset
Figure 2 for Open Source MagicData-RAMC: A Rich Annotated Mandarin Conversational Speech Dataset
Figure 3 for Open Source MagicData-RAMC: A Rich Annotated Mandarin Conversational Speech Dataset
Figure 4 for Open Source MagicData-RAMC: A Rich Annotated Mandarin Conversational Speech Dataset
Viaarxiv icon