Picture for Keqi Deng

Keqi Deng

F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching

Add code
Oct 09, 2024
Viaarxiv icon

CoT-ST: Enhancing LLM-based Speech Translation with Multimodal Chain-of-Thought

Add code
Sep 29, 2024
Figure 1 for CoT-ST: Enhancing LLM-based Speech Translation with Multimodal Chain-of-Thought
Figure 2 for CoT-ST: Enhancing LLM-based Speech Translation with Multimodal Chain-of-Thought
Figure 3 for CoT-ST: Enhancing LLM-based Speech Translation with Multimodal Chain-of-Thought
Figure 4 for CoT-ST: Enhancing LLM-based Speech Translation with Multimodal Chain-of-Thought
Viaarxiv icon

Label-Synchronous Neural Transducer for E2E Simultaneous Speech Translation

Add code
Jun 06, 2024
Viaarxiv icon

Wav2Prompt: End-to-End Speech Prompt Generation and Tuning For LLM in Zero and Few-shot Learning

Add code
Jun 01, 2024
Viaarxiv icon

FastInject: Injecting Unpaired Text Data into CTC-based ASR training

Add code
Dec 14, 2023
Viaarxiv icon

Label-Synchronous Neural Transducer for Adaptable Online E2E Speech Recognition

Add code
Nov 19, 2023
Viaarxiv icon

Decoupled Structure for Improved Adaptability of End-to-End Models

Add code
Aug 25, 2023
Viaarxiv icon

Label-Synchronous Neural Transducer for End-to-End ASR

Add code
Jul 06, 2023
Viaarxiv icon

Adaptable End-to-End ASR Models using Replaceable Internal LMs and Residual Softmax

Add code
Feb 16, 2023
Viaarxiv icon

Improving Streaming End-to-End ASR on Transformer-based Causal Models with Encoder States Revision Strategies

Add code
Jul 06, 2022
Figure 1 for Improving Streaming End-to-End ASR on Transformer-based Causal Models with Encoder States Revision Strategies
Figure 2 for Improving Streaming End-to-End ASR on Transformer-based Causal Models with Encoder States Revision Strategies
Figure 3 for Improving Streaming End-to-End ASR on Transformer-based Causal Models with Encoder States Revision Strategies
Viaarxiv icon