Picture for Daniel Povey

Daniel Povey

k2SSL: A Faster and Better Framework for Self-Supervised Speech Representation Learning

Add code
Nov 26, 2024
Viaarxiv icon

CR-CTC: Consistency regularization on CTC for improved speech recognition

Add code
Oct 07, 2024
Viaarxiv icon

LibriheavyMix: A 20,000-Hour Dataset for Single-Channel Reverberant Multi-Talker Speech Separation, ASR and Speaker Diarization

Add code
Sep 01, 2024
Figure 1 for LibriheavyMix: A 20,000-Hour Dataset for Single-Channel Reverberant Multi-Talker Speech Separation, ASR and Speaker Diarization
Figure 2 for LibriheavyMix: A 20,000-Hour Dataset for Single-Channel Reverberant Multi-Talker Speech Separation, ASR and Speaker Diarization
Figure 3 for LibriheavyMix: A 20,000-Hour Dataset for Single-Channel Reverberant Multi-Talker Speech Separation, ASR and Speaker Diarization
Figure 4 for LibriheavyMix: A 20,000-Hour Dataset for Single-Channel Reverberant Multi-Talker Speech Separation, ASR and Speaker Diarization
Viaarxiv icon

Improving Neural Biasing for Contextual Speech Recognition by Early Context Injection and Text Perturbation

Add code
Jul 14, 2024
Viaarxiv icon

Multi-Channel Multi-Speaker ASR Using Target Speaker's Solo Segment

Add code
Jun 17, 2024
Viaarxiv icon

SUBLLM: A Novel Efficient Architecture with Token Sequence Subsampling for LLM

Add code
Jun 03, 2024
Viaarxiv icon

On Speaker Attribution with SURT

Add code
Jan 28, 2024
Figure 1 for On Speaker Attribution with SURT
Figure 2 for On Speaker Attribution with SURT
Figure 3 for On Speaker Attribution with SURT
Figure 4 for On Speaker Attribution with SURT
Viaarxiv icon

Zipformer: A faster and better encoder for automatic speech recognition

Add code
Oct 17, 2023
Figure 1 for Zipformer: A faster and better encoder for automatic speech recognition
Figure 2 for Zipformer: A faster and better encoder for automatic speech recognition
Figure 3 for Zipformer: A faster and better encoder for automatic speech recognition
Figure 4 for Zipformer: A faster and better encoder for automatic speech recognition
Viaarxiv icon

Learning from Flawed Data: Weakly Supervised Automatic Speech Recognition

Add code
Sep 26, 2023
Viaarxiv icon

PromptASR for contextualized ASR with controllable style

Add code
Sep 20, 2023
Viaarxiv icon