Yusuke Fujita

Song Data Cleansing for End-to-End Neural Singer Diarization Using Neural Analysis and Synthesis Framework

Jun 24, 2024

Audio Fingerprinting with Holographic Reduced Representations

Jun 19, 2024

Universal Score-based Speech Enhancement with High Content Preservation

Jun 18, 2024

Acoustic modeling for Overlapping Speech Recognition: JHU Chime-5 Challenge System

May 17, 2024

Keep Decoding Parallel with Effective Knowledge Distillation from Language Models to End-to-end Speech Recognisers

Jan 22, 2024

Audio Difference Learning for Audio Captioning

Sep 15, 2023

Neural Diarization with Non-autoregressive Intermediate Attractors

Mar 13, 2023

Better Intermediates Improve CTC Inference

Apr 01, 2022

Multi-sequence Intermediate Conditioning for CTC-based ASR

Apr 01, 2022

InterAug: Augmenting Noisy Intermediate Predictions for CTC-based ASR

Apr 01, 2022