Qianqian Dong

Findings of the IWSLT 2024 Evaluation Campaign

Nov 07, 2024

Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition

Jul 05, 2024

Revisiting Interpolation Augmentation for Speech-to-Text Generation

Jun 22, 2024

Speech Translation with Large Language Models: An Industrial Practice

Dec 21, 2023

Bridging the Gaps of Both Modality and Language: Synchronous Bilingual CTC for Speech Translation and Speech Recognition

Sep 21, 2023

Recent Advances in Direct Speech-to-text Translation

Jun 20, 2023

MOSPC: MOS Prediction Based on Pairwise Comparison

Jun 18, 2023

PolyVoice: Language Models for Speech to Speech Translation

Jun 13, 2023

CTC-based Non-autoregressive Speech Translation

May 27, 2023

M3ST: Mix at Three Levels for Speech Translation

Dec 07, 2022