Picture for Yuya Fujita

Yuya Fujita

LV-CTC: Non-autoregressive ASR with CTC and latent variable models

Add code
Mar 28, 2024
Viaarxiv icon

HuBERTopic: Enhancing Semantic Representation of HuBERT through Self-supervision Utilizing Topic Model

Add code
Oct 06, 2023
Viaarxiv icon

Cross-Modal Multi-Tasking for Speech-to-Text Translation via Hard Parameter Sharing

Add code
Sep 27, 2023
Viaarxiv icon

Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study

Add code
Sep 27, 2023
Figure 1 for Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study
Figure 2 for Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study
Figure 3 for Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study
Figure 4 for Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study
Viaarxiv icon

Exploration of Efficient End-to-End ASR using Discretized Input from Self-Supervised Learning

Add code
May 29, 2023
Viaarxiv icon

Align, Write, Re-order: Explainable End-to-End Speech Translation via Operation Sequence Generation

Add code
Nov 11, 2022
Viaarxiv icon

End-to-End Integration of Speech Recognition, Speech Enhancement, and Self-Supervised Learning Representation

Add code
Apr 01, 2022
Figure 1 for End-to-End Integration of Speech Recognition, Speech Enhancement, and Self-Supervised Learning Representation
Figure 2 for End-to-End Integration of Speech Recognition, Speech Enhancement, and Self-Supervised Learning Representation
Figure 3 for End-to-End Integration of Speech Recognition, Speech Enhancement, and Self-Supervised Learning Representation
Figure 4 for End-to-End Integration of Speech Recognition, Speech Enhancement, and Self-Supervised Learning Representation
Viaarxiv icon

A Comparative Study on Non-Autoregressive Modelings for Speech-to-Text Generation

Add code
Oct 11, 2021
Figure 1 for A Comparative Study on Non-Autoregressive Modelings for Speech-to-Text Generation
Figure 2 for A Comparative Study on Non-Autoregressive Modelings for Speech-to-Text Generation
Figure 3 for A Comparative Study on Non-Autoregressive Modelings for Speech-to-Text Generation
Figure 4 for A Comparative Study on Non-Autoregressive Modelings for Speech-to-Text Generation
Viaarxiv icon

Streaming End-to-End ASR based on Blockwise Non-Autoregressive Models

Add code
Jul 20, 2021
Figure 1 for Streaming End-to-End ASR based on Blockwise Non-Autoregressive Models
Figure 2 for Streaming End-to-End ASR based on Blockwise Non-Autoregressive Models
Figure 3 for Streaming End-to-End ASR based on Blockwise Non-Autoregressive Models
Figure 4 for Streaming End-to-End ASR based on Blockwise Non-Autoregressive Models
Viaarxiv icon

Speech Representation Learning Combining Conformer CPC with Deep Cluster for the ZeroSpeech Challenge 2021

Add code
Jul 13, 2021
Figure 1 for Speech Representation Learning Combining Conformer CPC with Deep Cluster for the ZeroSpeech Challenge 2021
Figure 2 for Speech Representation Learning Combining Conformer CPC with Deep Cluster for the ZeroSpeech Challenge 2021
Figure 3 for Speech Representation Learning Combining Conformer CPC with Deep Cluster for the ZeroSpeech Challenge 2021
Figure 4 for Speech Representation Learning Combining Conformer CPC with Deep Cluster for the ZeroSpeech Challenge 2021
Viaarxiv icon