Picture for Mu Yang

Mu Yang

UniScene: Unified Occupancy-centric Driving Scene Generation

Add code
Dec 06, 2024
Viaarxiv icon

Audiobox TTA-RAG: Improving Zero-Shot and Few-Shot Text-To-Audio with Retrieval-Augmented Generation

Add code
Nov 07, 2024
Viaarxiv icon

DiariST: Streaming Speech Translation with Speaker Diarization

Add code
Sep 14, 2023
Viaarxiv icon

What Can an Accent Identifier Learn? Probing Phonetic and Prosodic Information in a Wav2vec2-based Accent Identification Model

Add code
Jun 10, 2023
Viaarxiv icon

Learning ASR pathways: A sparse multilingual ASR model

Add code
Sep 13, 2022
Figure 1 for Learning ASR pathways: A sparse multilingual ASR model
Figure 2 for Learning ASR pathways: A sparse multilingual ASR model
Figure 3 for Learning ASR pathways: A sparse multilingual ASR model
Figure 4 for Learning ASR pathways: A sparse multilingual ASR model
Viaarxiv icon

Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Assessment

Add code
Apr 07, 2022
Figure 1 for Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Assessment
Figure 2 for Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Assessment
Figure 3 for Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Assessment
Figure 4 for Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Assessment
Viaarxiv icon

InverseMV: Composing Piano Scores with a Convolutional Video-Music Transformer

Add code
Dec 31, 2021
Figure 1 for InverseMV: Composing Piano Scores with a Convolutional Video-Music Transformer
Figure 2 for InverseMV: Composing Piano Scores with a Convolutional Video-Music Transformer
Figure 3 for InverseMV: Composing Piano Scores with a Convolutional Video-Music Transformer
Figure 4 for InverseMV: Composing Piano Scores with a Convolutional Video-Music Transformer
Viaarxiv icon

Towards Lifelong Learning of Multilingual Text-To-Speech Synthesis

Add code
Oct 09, 2021
Figure 1 for Towards Lifelong Learning of Multilingual Text-To-Speech Synthesis
Figure 2 for Towards Lifelong Learning of Multilingual Text-To-Speech Synthesis
Figure 3 for Towards Lifelong Learning of Multilingual Text-To-Speech Synthesis
Figure 4 for Towards Lifelong Learning of Multilingual Text-To-Speech Synthesis
Viaarxiv icon

EventPlus: A Temporal Event Understanding Pipeline

Add code
Jan 13, 2021
Figure 1 for EventPlus: A Temporal Event Understanding Pipeline
Figure 2 for EventPlus: A Temporal Event Understanding Pipeline
Figure 3 for EventPlus: A Temporal Event Understanding Pipeline
Figure 4 for EventPlus: A Temporal Event Understanding Pipeline
Viaarxiv icon

Biomedical Event Extraction with Hierarchical Knowledge Graphs

Add code
Oct 12, 2020
Figure 1 for Biomedical Event Extraction with Hierarchical Knowledge Graphs
Figure 2 for Biomedical Event Extraction with Hierarchical Knowledge Graphs
Figure 3 for Biomedical Event Extraction with Hierarchical Knowledge Graphs
Figure 4 for Biomedical Event Extraction with Hierarchical Knowledge Graphs
Viaarxiv icon