Picture for Gakuto Kurata

Gakuto Kurata

Robust ASR Error Correction with Conservative Data Filtering

Add code
Jul 18, 2024
Viaarxiv icon

Multiple Representation Transfer from Large Language Models to End-to-End ASR Systems

Add code
Sep 07, 2023
Viaarxiv icon

Effect and Analysis of Large-scale Language Model Rescoring on Competitive ASR Systems

Add code
Apr 01, 2022
Figure 1 for Effect and Analysis of Large-scale Language Model Rescoring on Competitive ASR Systems
Figure 2 for Effect and Analysis of Large-scale Language Model Rescoring on Competitive ASR Systems
Figure 3 for Effect and Analysis of Large-scale Language Model Rescoring on Competitive ASR Systems
Figure 4 for Effect and Analysis of Large-scale Language Model Rescoring on Competitive ASR Systems
Viaarxiv icon

Improving Generalization of Deep Neural Network Acoustic Models with Length Perturbation and N-best Based Label Smoothing

Add code
Mar 29, 2022
Figure 1 for Improving Generalization of Deep Neural Network Acoustic Models with Length Perturbation and N-best Based Label Smoothing
Figure 2 for Improving Generalization of Deep Neural Network Acoustic Models with Length Perturbation and N-best Based Label Smoothing
Figure 3 for Improving Generalization of Deep Neural Network Acoustic Models with Length Perturbation and N-best Based Label Smoothing
Figure 4 for Improving Generalization of Deep Neural Network Acoustic Models with Length Perturbation and N-best Based Label Smoothing
Viaarxiv icon

Knowledge Distillation Leveraging Alternative Soft Targets from Non-Parallel Qualified Speech Data

Add code
Dec 16, 2021
Figure 1 for Knowledge Distillation Leveraging Alternative Soft Targets from Non-Parallel Qualified Speech Data
Figure 2 for Knowledge Distillation Leveraging Alternative Soft Targets from Non-Parallel Qualified Speech Data
Figure 3 for Knowledge Distillation Leveraging Alternative Soft Targets from Non-Parallel Qualified Speech Data
Viaarxiv icon

RNN Transducer Models For Spoken Language Understanding

Add code
Apr 08, 2021
Figure 1 for RNN Transducer Models For Spoken Language Understanding
Figure 2 for RNN Transducer Models For Spoken Language Understanding
Figure 3 for RNN Transducer Models For Spoken Language Understanding
Figure 4 for RNN Transducer Models For Spoken Language Understanding
Viaarxiv icon

End-to-End Spoken Language Understanding Without Full Transcripts

Add code
Sep 30, 2020
Figure 1 for End-to-End Spoken Language Understanding Without Full Transcripts
Figure 2 for End-to-End Spoken Language Understanding Without Full Transcripts
Figure 3 for End-to-End Spoken Language Understanding Without Full Transcripts
Figure 4 for End-to-End Spoken Language Understanding Without Full Transcripts
Viaarxiv icon

English Broadcast News Speech Recognition by Humans and Machines

Add code
Apr 30, 2019
Figure 1 for English Broadcast News Speech Recognition by Humans and Machines
Figure 2 for English Broadcast News Speech Recognition by Humans and Machines
Figure 3 for English Broadcast News Speech Recognition by Humans and Machines
Figure 4 for English Broadcast News Speech Recognition by Humans and Machines
Viaarxiv icon

Guiding CTC Posterior Spike Timings for Improved Posterior Fusion and Knowledge Distillation

Add code
Apr 17, 2019
Figure 1 for Guiding CTC Posterior Spike Timings for Improved Posterior Fusion and Knowledge Distillation
Figure 2 for Guiding CTC Posterior Spike Timings for Improved Posterior Fusion and Knowledge Distillation
Figure 3 for Guiding CTC Posterior Spike Timings for Improved Posterior Fusion and Knowledge Distillation
Figure 4 for Guiding CTC Posterior Spike Timings for Improved Posterior Fusion and Knowledge Distillation
Viaarxiv icon

Language Modeling with Highway LSTM

Add code
Sep 19, 2017
Figure 1 for Language Modeling with Highway LSTM
Figure 2 for Language Modeling with Highway LSTM
Figure 3 for Language Modeling with Highway LSTM
Figure 4 for Language Modeling with Highway LSTM
Viaarxiv icon