Shuangyu Chang

External Language Model Integration for Factorized Neural Transducers

May 26, 2023
Via arXiv

Streaming Punctuation: A Novel Punctuation Technique Leveraging Bidirectional Context for Continuous Speech Recognition

Jan 10, 2023
Via arXiv

Smart Speech Segmentation using Acousto-Linguistic Features with look-ahead

Oct 27, 2022
Via arXiv

TRScore: A Novel GPT-based Readability Scorer for ASR Segmentation and Punctuation model evaluation and selection

Oct 27, 2022
Via arXiv

Four-in-One: A Joint Approach to Inverse Text Normalization, Punctuation, Capitalization, and Disfluency for Automatic Speech Recognition

Oct 26, 2022
Via arXiv

Streaming Punctuation for Long-form Dictation with Transformers

Oct 11, 2022
Via arXiv

Multilingual Transformer Language Model for Speech Recognition in Low-resource Languages

Sep 08, 2022
Via arXiv

LSTM-LM with Long-Term History for First-Pass Decoding in Conversational Speech Recognition

Oct 21, 2020
Via arXiv

Long-span language modeling for speech recognition

Nov 11, 2019
Via arXiv