Picture for Martin Radfar

Martin Radfar

Lookahead When It Matters: Adaptive Non-causal Transformers for Streaming Neural Transducers

Add code
May 09, 2023
Viaarxiv icon

End-to-end spoken language understanding using joint CTC loss and self-supervised, pretrained acoustic encoders

Add code
May 04, 2023
Viaarxiv icon

Leveraging Redundancy in Multiple Audio Signals for Far-Field Speech Recognition

Add code
Mar 01, 2023
Viaarxiv icon

Sub-8-bit quantization for on-device speech recognition: a regularization-free approach

Add code
Oct 17, 2022
Figure 1 for Sub-8-bit quantization for on-device speech recognition: a regularization-free approach
Figure 2 for Sub-8-bit quantization for on-device speech recognition: a regularization-free approach
Figure 3 for Sub-8-bit quantization for on-device speech recognition: a regularization-free approach
Figure 4 for Sub-8-bit quantization for on-device speech recognition: a regularization-free approach
Viaarxiv icon

ConvRNN-T: Convolutional Augmented Recurrent Neural Network Transducers for Streaming Speech Recognition

Add code
Sep 29, 2022
Figure 1 for ConvRNN-T: Convolutional Augmented Recurrent Neural Network Transducers for Streaming Speech Recognition
Figure 2 for ConvRNN-T: Convolutional Augmented Recurrent Neural Network Transducers for Streaming Speech Recognition
Figure 3 for ConvRNN-T: Convolutional Augmented Recurrent Neural Network Transducers for Streaming Speech Recognition
Figure 4 for ConvRNN-T: Convolutional Augmented Recurrent Neural Network Transducers for Streaming Speech Recognition
Viaarxiv icon

Compute Cost Amortized Transformer for Streaming ASR

Add code
Jul 05, 2022
Figure 1 for Compute Cost Amortized Transformer for Streaming ASR
Figure 2 for Compute Cost Amortized Transformer for Streaming ASR
Figure 3 for Compute Cost Amortized Transformer for Streaming ASR
Figure 4 for Compute Cost Amortized Transformer for Streaming ASR
Viaarxiv icon

A neural prosody encoder for end-ro-end dialogue act classification

Add code
May 11, 2022
Figure 1 for A neural prosody encoder for end-ro-end dialogue act classification
Figure 2 for A neural prosody encoder for end-ro-end dialogue act classification
Figure 3 for A neural prosody encoder for end-ro-end dialogue act classification
Figure 4 for A neural prosody encoder for end-ro-end dialogue act classification
Viaarxiv icon

Multi-task RNN-T with Semantic Decoder for Streamable Spoken Language Understanding

Add code
Apr 01, 2022
Figure 1 for Multi-task RNN-T with Semantic Decoder for Streamable Spoken Language Understanding
Figure 2 for Multi-task RNN-T with Semantic Decoder for Streamable Spoken Language Understanding
Figure 3 for Multi-task RNN-T with Semantic Decoder for Streamable Spoken Language Understanding
Figure 4 for Multi-task RNN-T with Semantic Decoder for Streamable Spoken Language Understanding
Viaarxiv icon

Context-Aware Transformer Transducer for Speech Recognition

Add code
Nov 05, 2021
Figure 1 for Context-Aware Transformer Transducer for Speech Recognition
Figure 2 for Context-Aware Transformer Transducer for Speech Recognition
Figure 3 for Context-Aware Transformer Transducer for Speech Recognition
Figure 4 for Context-Aware Transformer Transducer for Speech Recognition
Viaarxiv icon

Speech Emotion Recognition Using Quaternion Convolutional Neural Networks

Add code
Oct 31, 2021
Figure 1 for Speech Emotion Recognition Using Quaternion Convolutional Neural Networks
Figure 2 for Speech Emotion Recognition Using Quaternion Convolutional Neural Networks
Figure 3 for Speech Emotion Recognition Using Quaternion Convolutional Neural Networks
Figure 4 for Speech Emotion Recognition Using Quaternion Convolutional Neural Networks
Viaarxiv icon