Picture for Ming Tu

Ming Tu

Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition

Add code
Jul 05, 2024
Figure 1 for Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition
Figure 2 for Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition
Figure 3 for Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition
Figure 4 for Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition
Viaarxiv icon

VoiceShop: A Unified Speech-to-Speech Framework for Identity-Preserving Zero-Shot Voice Editing

Add code
Apr 11, 2024
Viaarxiv icon

Efficient Neural Music Generation

Add code
May 25, 2023
Figure 1 for Efficient Neural Music Generation
Figure 2 for Efficient Neural Music Generation
Figure 3 for Efficient Neural Music Generation
Figure 4 for Efficient Neural Music Generation
Viaarxiv icon

Language-universal phonetic encoder for low-resource speech recognition

Add code
May 19, 2023
Figure 1 for Language-universal phonetic encoder for low-resource speech recognition
Figure 2 for Language-universal phonetic encoder for low-resource speech recognition
Figure 3 for Language-universal phonetic encoder for low-resource speech recognition
Figure 4 for Language-universal phonetic encoder for low-resource speech recognition
Viaarxiv icon

Language-Universal Phonetic Representation in Multilingual Speech Pretraining for Low-Resource Speech Recognition

Add code
May 19, 2023
Figure 1 for Language-Universal Phonetic Representation in Multilingual Speech Pretraining for Low-Resource Speech Recognition
Figure 2 for Language-Universal Phonetic Representation in Multilingual Speech Pretraining for Low-Resource Speech Recognition
Figure 3 for Language-Universal Phonetic Representation in Multilingual Speech Pretraining for Low-Resource Speech Recognition
Figure 4 for Language-Universal Phonetic Representation in Multilingual Speech Pretraining for Low-Resource Speech Recognition
Viaarxiv icon

Memory Augmented Lookup Dictionary based Language Modeling for Automatic Speech Recognition

Add code
Dec 30, 2022
Figure 1 for Memory Augmented Lookup Dictionary based Language Modeling for Automatic Speech Recognition
Figure 2 for Memory Augmented Lookup Dictionary based Language Modeling for Automatic Speech Recognition
Figure 3 for Memory Augmented Lookup Dictionary based Language Modeling for Automatic Speech Recognition
Figure 4 for Memory Augmented Lookup Dictionary based Language Modeling for Automatic Speech Recognition
Viaarxiv icon

Streaming Voice Conversion Via Intermediate Bottleneck Features And Non-streaming Teacher Guidance

Add code
Oct 27, 2022
Figure 1 for Streaming Voice Conversion Via Intermediate Bottleneck Features And Non-streaming Teacher Guidance
Figure 2 for Streaming Voice Conversion Via Intermediate Bottleneck Features And Non-streaming Teacher Guidance
Figure 3 for Streaming Voice Conversion Via Intermediate Bottleneck Features And Non-streaming Teacher Guidance
Figure 4 for Streaming Voice Conversion Via Intermediate Bottleneck Features And Non-streaming Teacher Guidance
Viaarxiv icon

Cloning one's voice using very limited data in the wild

Add code
Oct 08, 2021
Figure 1 for Cloning one's voice using very limited data in the wild
Figure 2 for Cloning one's voice using very limited data in the wild
Figure 3 for Cloning one's voice using very limited data in the wild
Figure 4 for Cloning one's voice using very limited data in the wild
Viaarxiv icon

Graph Sequential Network for Reasoning over Sequences

Add code
Apr 04, 2020
Figure 1 for Graph Sequential Network for Reasoning over Sequences
Figure 2 for Graph Sequential Network for Reasoning over Sequences
Figure 3 for Graph Sequential Network for Reasoning over Sequences
Figure 4 for Graph Sequential Network for Reasoning over Sequences
Viaarxiv icon

Select, Answer and Explain: Interpretable Multi-hop Reading Comprehension over Multiple Documents

Add code
Nov 22, 2019
Figure 1 for Select, Answer and Explain: Interpretable Multi-hop Reading Comprehension over Multiple Documents
Figure 2 for Select, Answer and Explain: Interpretable Multi-hop Reading Comprehension over Multiple Documents
Figure 3 for Select, Answer and Explain: Interpretable Multi-hop Reading Comprehension over Multiple Documents
Figure 4 for Select, Answer and Explain: Interpretable Multi-hop Reading Comprehension over Multiple Documents
Viaarxiv icon