Picture for Jaesong Lee

Jaesong Lee

Lightweight Audio Segmentation for Long-form Speech Translation

Add code
Jun 15, 2024
Viaarxiv icon

I3D: Transformer architectures with input-dependent dynamic depth for speech recognition

Add code
Mar 14, 2023
Viaarxiv icon

Better Intermediates Improve CTC Inference

Add code
Apr 01, 2022
Figure 1 for Better Intermediates Improve CTC Inference
Figure 2 for Better Intermediates Improve CTC Inference
Figure 3 for Better Intermediates Improve CTC Inference
Viaarxiv icon

Memory-Efficient Training of RNN-Transducer with Sampled Softmax

Add code
Mar 31, 2022
Figure 1 for Memory-Efficient Training of RNN-Transducer with Sampled Softmax
Figure 2 for Memory-Efficient Training of RNN-Transducer with Sampled Softmax
Figure 3 for Memory-Efficient Training of RNN-Transducer with Sampled Softmax
Viaarxiv icon

A Comparative Study on Non-Autoregressive Modelings for Speech-to-Text Generation

Add code
Oct 11, 2021
Figure 1 for A Comparative Study on Non-Autoregressive Modelings for Speech-to-Text Generation
Figure 2 for A Comparative Study on Non-Autoregressive Modelings for Speech-to-Text Generation
Figure 3 for A Comparative Study on Non-Autoregressive Modelings for Speech-to-Text Generation
Figure 4 for A Comparative Study on Non-Autoregressive Modelings for Speech-to-Text Generation
Viaarxiv icon

Layer Pruning on Demand with Intermediate CTC

Add code
Jun 17, 2021
Figure 1 for Layer Pruning on Demand with Intermediate CTC
Figure 2 for Layer Pruning on Demand with Intermediate CTC
Figure 3 for Layer Pruning on Demand with Intermediate CTC
Figure 4 for Layer Pruning on Demand with Intermediate CTC
Viaarxiv icon

Intermediate Loss Regularization for CTC-based Speech Recognition

Add code
Feb 05, 2021
Figure 1 for Intermediate Loss Regularization for CTC-based Speech Recognition
Figure 2 for Intermediate Loss Regularization for CTC-based Speech Recognition
Figure 3 for Intermediate Loss Regularization for CTC-based Speech Recognition
Figure 4 for Intermediate Loss Regularization for CTC-based Speech Recognition
Viaarxiv icon