Picture for Michael L. Seltzer

Michael L. Seltzer

Sid

The Llama 3 Herd of Models

Add code
Jul 31, 2024
Viaarxiv icon

End-to-End Speech Recognition Contextualization with Large Language Models

Add code
Sep 19, 2023
Figure 1 for End-to-End Speech Recognition Contextualization with Large Language Models
Figure 2 for End-to-End Speech Recognition Contextualization with Large Language Models
Figure 3 for End-to-End Speech Recognition Contextualization with Large Language Models
Figure 4 for End-to-End Speech Recognition Contextualization with Large Language Models
Viaarxiv icon

Modality Confidence Aware Training for Robust End-to-End Spoken Language Understanding

Add code
Jul 22, 2023
Viaarxiv icon

Improving Fast-slow Encoder based Transducer with Streaming Deliberation

Add code
Dec 15, 2022
Viaarxiv icon

Massively Multilingual ASR on 70 Languages: Tokenization, Architecture, and Generalization Capabilities

Add code
Nov 10, 2022
Viaarxiv icon

Factorized Blank Thresholding for Improved Runtime Efficiency of Neural Transducers

Add code
Nov 02, 2022
Viaarxiv icon

Deliberation Model for On-Device Spoken Language Understanding

Add code
Apr 04, 2022
Figure 1 for Deliberation Model for On-Device Spoken Language Understanding
Figure 2 for Deliberation Model for On-Device Spoken Language Understanding
Figure 3 for Deliberation Model for On-Device Spoken Language Understanding
Figure 4 for Deliberation Model for On-Device Spoken Language Understanding
Viaarxiv icon

Neural-FST Class Language Model for End-to-End Speech Recognition

Add code
Jan 31, 2022
Figure 1 for Neural-FST Class Language Model for End-to-End Speech Recognition
Figure 2 for Neural-FST Class Language Model for End-to-End Speech Recognition
Figure 3 for Neural-FST Class Language Model for End-to-End Speech Recognition
Viaarxiv icon

Evaluating User Perception of Speech Recognition System Quality with Semantic Distance Metric

Add code
Oct 11, 2021
Figure 1 for Evaluating User Perception of Speech Recognition System Quality with Semantic Distance Metric
Figure 2 for Evaluating User Perception of Speech Recognition System Quality with Semantic Distance Metric
Figure 3 for Evaluating User Perception of Speech Recognition System Quality with Semantic Distance Metric
Figure 4 for Evaluating User Perception of Speech Recognition System Quality with Semantic Distance Metric
Viaarxiv icon

Collaborative Training of Acoustic Encoders for Speech Recognition

Add code
Jul 13, 2021
Figure 1 for Collaborative Training of Acoustic Encoders for Speech Recognition
Figure 2 for Collaborative Training of Acoustic Encoders for Speech Recognition
Figure 3 for Collaborative Training of Acoustic Encoders for Speech Recognition
Viaarxiv icon