Brian King

Lookahead When It Matters: Adaptive Non-causal Transformers for Streaming Neural Transducers

May 09, 2023
Via arXiv

Leveraging Redundancy in Multiple Audio Signals for Far-Field Speech Recognition

Mar 01, 2023
Via arXiv

Toward Fairness in Speech Recognition: Discovery and mitigation of performance disparities

Jul 22, 2022
Via arXiv

Reducing Geographic Disparities in Automatic Speech Recognition via Elastic Weight Consolidation

Jul 16, 2022
Via arXiv

Compute Cost Amortized Transformer for Streaming ASR

Jul 05, 2022
Via arXiv

Investigation of Training Label Error Impact on RNN-T

Dec 01, 2021
Via arXiv

Warped Dynamic Linear Models for Time Series of Counts

Oct 27, 2021
Via arXiv

Do You Listen with One or Two Microphones? A Unified ASR Model for Single and Multi-Channel Audio

Jun 28, 2021
Via arXiv

CoDERT: Distilling Encoder Representations with Co-learning for Transducer-based Speech Recognition

Jun 14, 2021
Via arXiv

Attention-based Neural Beamforming Layers for Multi-channel Speech Recognition

May 14, 2021
Via arXiv