Picture for Sungsoo Kim

Sungsoo Kim

Towards Maximum Likelihood Training for Transducer-based Streaming Speech Recognition

Add code
Nov 26, 2024
Figure 1 for Towards Maximum Likelihood Training for Transducer-based Streaming Speech Recognition
Figure 2 for Towards Maximum Likelihood Training for Transducer-based Streaming Speech Recognition
Viaarxiv icon

TiVaT: Joint-Axis Attention for Time Series Forecasting with Lead-Lag Dynamics

Add code
Oct 02, 2024
Viaarxiv icon

Two-Pass End-to-End ASR Model Compression

Add code
Jan 08, 2022
Figure 1 for Two-Pass End-to-End ASR Model Compression
Figure 2 for Two-Pass End-to-End ASR Model Compression
Figure 3 for Two-Pass End-to-End ASR Model Compression
Figure 4 for Two-Pass End-to-End ASR Model Compression
Viaarxiv icon

A review of on-device fully neural end-to-end automatic speech recognition algorithms

Add code
Dec 19, 2020
Figure 1 for A review of on-device fully neural end-to-end automatic speech recognition algorithms
Figure 2 for A review of on-device fully neural end-to-end automatic speech recognition algorithms
Figure 3 for A review of on-device fully neural end-to-end automatic speech recognition algorithms
Figure 4 for A review of on-device fully neural end-to-end automatic speech recognition algorithms
Viaarxiv icon

Sequential Routing Framework: Fully Capsule Network-based Speech Recognition

Add code
Jul 23, 2020
Figure 1 for Sequential Routing Framework: Fully Capsule Network-based Speech Recognition
Figure 2 for Sequential Routing Framework: Fully Capsule Network-based Speech Recognition
Figure 3 for Sequential Routing Framework: Fully Capsule Network-based Speech Recognition
Figure 4 for Sequential Routing Framework: Fully Capsule Network-based Speech Recognition
Viaarxiv icon

Attention based on-device streaming speech recognition with large speech corpus

Add code
Jan 02, 2020
Figure 1 for Attention based on-device streaming speech recognition with large speech corpus
Figure 2 for Attention based on-device streaming speech recognition with large speech corpus
Figure 3 for Attention based on-device streaming speech recognition with large speech corpus
Figure 4 for Attention based on-device streaming speech recognition with large speech corpus
Viaarxiv icon

end-to-end training of a large vocabulary end-to-end speech recognition system

Add code
Dec 22, 2019
Figure 1 for end-to-end training of a large vocabulary end-to-end speech recognition system
Figure 2 for end-to-end training of a large vocabulary end-to-end speech recognition system
Figure 3 for end-to-end training of a large vocabulary end-to-end speech recognition system
Figure 4 for end-to-end training of a large vocabulary end-to-end speech recognition system
Viaarxiv icon

Adversarial Video Compression Guided by Soft Edge Detection

Add code
Nov 26, 2018
Figure 1 for Adversarial Video Compression Guided by Soft Edge Detection
Figure 2 for Adversarial Video Compression Guided by Soft Edge Detection
Figure 3 for Adversarial Video Compression Guided by Soft Edge Detection
Figure 4 for Adversarial Video Compression Guided by Soft Edge Detection
Viaarxiv icon