Picture for Shuichiro Shimizu

Shuichiro Shimizu

ESPnet-SDS: Unified Toolkit and Demo for Spoken Dialogue Systems

Add code
Mar 11, 2025
Viaarxiv icon

When Large Language Models Meet Speech: A Survey on Integration Approaches

Add code
Feb 26, 2025
Viaarxiv icon

MELD-ST: An Emotion-aware Speech Translation Dataset

Add code
May 21, 2024
Viaarxiv icon

SlideAVSR: A Dataset of Paper Explanation Videos for Audio-Visual Speech Recognition

Add code
Jan 18, 2024
Viaarxiv icon

Video-Helpful Multimodal Machine Translation

Add code
Oct 31, 2023
Viaarxiv icon

Towards Speech Dialogue Translation Mediating Speakers of Different Languages

Add code
May 22, 2023
Figure 1 for Towards Speech Dialogue Translation Mediating Speakers of Different Languages
Figure 2 for Towards Speech Dialogue Translation Mediating Speakers of Different Languages
Figure 3 for Towards Speech Dialogue Translation Mediating Speakers of Different Languages
Figure 4 for Towards Speech Dialogue Translation Mediating Speakers of Different Languages
Viaarxiv icon

VISA: An Ambiguous Subtitles Dataset for Visual Scene-Aware Machine Translation

Add code
Jan 21, 2022
Figure 1 for VISA: An Ambiguous Subtitles Dataset for Visual Scene-Aware Machine Translation
Figure 2 for VISA: An Ambiguous Subtitles Dataset for Visual Scene-Aware Machine Translation
Figure 3 for VISA: An Ambiguous Subtitles Dataset for Visual Scene-Aware Machine Translation
Figure 4 for VISA: An Ambiguous Subtitles Dataset for Visual Scene-Aware Machine Translation
Viaarxiv icon