Yusuke Kida

Mask-CTC-based Encoder Pre-training for Streaming End-to-End Speech Recognition

Sep 09, 2023
Via arXiv

Neural Diarization with Non-autoregressive Intermediate Attractors

Mar 13, 2023
Via arXiv

Conversation-oriented ASR with multi-look-ahead CBS architecture

Nov 02, 2022
Via arXiv

Tourist Guidance Robot Based on HyperCLOVA

Oct 19, 2022
Via arXiv

Better Intermediates Improve CTC Inference

Apr 01, 2022
Via arXiv

Multi-sequence Intermediate Conditioning for CTC-based ASR

Apr 01, 2022
Via arXiv

InterAug: Augmenting Noisy Intermediate Predictions for CTC-based ASR

Apr 01, 2022
Via arXiv

Label-Synchronous Speech-to-Text Alignment for ASR Using Forward and Backward Transformers

Apr 21, 2021
Via arXiv

Speaker Selective Beamformer with Keyword Mask Estimation

Oct 25, 2018
Via arXiv