Picture for Fangjun Kuang

Fangjun Kuang

k2SSL: A Faster and Better Framework for Self-Supervised Speech Representation Learning

Add code
Nov 26, 2024
Viaarxiv icon

CR-CTC: Consistency regularization on CTC for improved speech recognition

Add code
Oct 07, 2024
Viaarxiv icon

LibriheavyMix: A 20,000-Hour Dataset for Single-Channel Reverberant Multi-Talker Speech Separation, ASR and Speaker Diarization

Add code
Sep 01, 2024
Figure 1 for LibriheavyMix: A 20,000-Hour Dataset for Single-Channel Reverberant Multi-Talker Speech Separation, ASR and Speaker Diarization
Figure 2 for LibriheavyMix: A 20,000-Hour Dataset for Single-Channel Reverberant Multi-Talker Speech Separation, ASR and Speaker Diarization
Figure 3 for LibriheavyMix: A 20,000-Hour Dataset for Single-Channel Reverberant Multi-Talker Speech Separation, ASR and Speaker Diarization
Figure 4 for LibriheavyMix: A 20,000-Hour Dataset for Single-Channel Reverberant Multi-Talker Speech Separation, ASR and Speaker Diarization
Viaarxiv icon

Zipformer: A faster and better encoder for automatic speech recognition

Add code
Oct 17, 2023
Figure 1 for Zipformer: A faster and better encoder for automatic speech recognition
Figure 2 for Zipformer: A faster and better encoder for automatic speech recognition
Figure 3 for Zipformer: A faster and better encoder for automatic speech recognition
Figure 4 for Zipformer: A faster and better encoder for automatic speech recognition
Viaarxiv icon

PromptASR for contextualized ASR with controllable style

Add code
Sep 20, 2023
Viaarxiv icon

Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context

Add code
Sep 15, 2023
Figure 1 for Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context
Figure 2 for Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context
Figure 3 for Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context
Figure 4 for Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context
Viaarxiv icon

Blank-regularized CTC for Frame Skipping in Neural Transducer

Add code
May 19, 2023
Viaarxiv icon

Delay-penalized CTC implemented based on Finite State Transducer

Add code
May 19, 2023
Figure 1 for Delay-penalized CTC implemented based on Finite State Transducer
Figure 2 for Delay-penalized CTC implemented based on Finite State Transducer
Figure 3 for Delay-penalized CTC implemented based on Finite State Transducer
Figure 4 for Delay-penalized CTC implemented based on Finite State Transducer
Viaarxiv icon

Predicting Multi-Codebook Vector Quantization Indexes for Knowledge Distillation

Add code
Oct 31, 2022
Figure 1 for Predicting Multi-Codebook Vector Quantization Indexes for Knowledge Distillation
Figure 2 for Predicting Multi-Codebook Vector Quantization Indexes for Knowledge Distillation
Figure 3 for Predicting Multi-Codebook Vector Quantization Indexes for Knowledge Distillation
Figure 4 for Predicting Multi-Codebook Vector Quantization Indexes for Knowledge Distillation
Viaarxiv icon

Fast and parallel decoding for transducer

Add code
Oct 31, 2022
Viaarxiv icon