Picture for Chunxi Liu

Chunxi Liu

TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorch

Add code
Oct 27, 2023
Viaarxiv icon

Multi-Head State Space Model for Speech Recognition

Add code
May 25, 2023
Figure 1 for Multi-Head State Space Model for Speech Recognition
Figure 2 for Multi-Head State Space Model for Speech Recognition
Figure 3 for Multi-Head State Space Model for Speech Recognition
Figure 4 for Multi-Head State Space Model for Speech Recognition
Viaarxiv icon

Learning ASR pathways: A sparse multilingual ASR model

Add code
Sep 13, 2022
Figure 1 for Learning ASR pathways: A sparse multilingual ASR model
Figure 2 for Learning ASR pathways: A sparse multilingual ASR model
Figure 3 for Learning ASR pathways: A sparse multilingual ASR model
Figure 4 for Learning ASR pathways: A sparse multilingual ASR model
Viaarxiv icon

Learning a Dual-Mode Speech Recognition Model via Self-Pruning

Add code
Jul 25, 2022
Figure 1 for Learning a Dual-Mode Speech Recognition Model via Self-Pruning
Figure 2 for Learning a Dual-Mode Speech Recognition Model via Self-Pruning
Figure 3 for Learning a Dual-Mode Speech Recognition Model via Self-Pruning
Figure 4 for Learning a Dual-Mode Speech Recognition Model via Self-Pruning
Viaarxiv icon

Towards Measuring Fairness in Speech Recognition: Casual Conversations Dataset Transcriptions

Add code
Nov 18, 2021
Figure 1 for Towards Measuring Fairness in Speech Recognition: Casual Conversations Dataset Transcriptions
Figure 2 for Towards Measuring Fairness in Speech Recognition: Casual Conversations Dataset Transcriptions
Viaarxiv icon

Conformer-Based Self-Supervised Learning for Non-Speech Audio Tasks

Add code
Nov 10, 2021
Figure 1 for Conformer-Based Self-Supervised Learning for Non-Speech Audio Tasks
Figure 2 for Conformer-Based Self-Supervised Learning for Non-Speech Audio Tasks
Figure 3 for Conformer-Based Self-Supervised Learning for Non-Speech Audio Tasks
Figure 4 for Conformer-Based Self-Supervised Learning for Non-Speech Audio Tasks
Viaarxiv icon

Streaming Transformer Transducer Based Speech Recognition Using Non-Causal Convolution

Add code
Oct 07, 2021
Figure 1 for Streaming Transformer Transducer Based Speech Recognition Using Non-Causal Convolution
Figure 2 for Streaming Transformer Transducer Based Speech Recognition Using Non-Causal Convolution
Figure 3 for Streaming Transformer Transducer Based Speech Recognition Using Non-Causal Convolution
Figure 4 for Streaming Transformer Transducer Based Speech Recognition Using Non-Causal Convolution
Viaarxiv icon

Improving RNN Transducer Based ASR with Auxiliary Tasks

Add code
Nov 09, 2020
Figure 1 for Improving RNN Transducer Based ASR with Auxiliary Tasks
Figure 2 for Improving RNN Transducer Based ASR with Auxiliary Tasks
Figure 3 for Improving RNN Transducer Based ASR with Auxiliary Tasks
Figure 4 for Improving RNN Transducer Based ASR with Auxiliary Tasks
Viaarxiv icon

Fast, Simpler and More Accurate Hybrid ASR Systems Using Wordpieces

Add code
May 19, 2020
Figure 1 for Fast, Simpler and More Accurate Hybrid ASR Systems Using Wordpieces
Figure 2 for Fast, Simpler and More Accurate Hybrid ASR Systems Using Wordpieces
Figure 3 for Fast, Simpler and More Accurate Hybrid ASR Systems Using Wordpieces
Figure 4 for Fast, Simpler and More Accurate Hybrid ASR Systems Using Wordpieces
Viaarxiv icon

Contextualizing ASR Lattice Rescoring with Hybrid Pointer Network Language Model

Add code
May 15, 2020
Figure 1 for Contextualizing ASR Lattice Rescoring with Hybrid Pointer Network Language Model
Figure 2 for Contextualizing ASR Lattice Rescoring with Hybrid Pointer Network Language Model
Figure 3 for Contextualizing ASR Lattice Rescoring with Hybrid Pointer Network Language Model
Figure 4 for Contextualizing ASR Lattice Rescoring with Hybrid Pointer Network Language Model
Viaarxiv icon