Chien-yu Huang

Fusion of Discrete Representations and Self-Augmented Representations for Multilingual Automatic Speech Recognition

Nov 27, 2024
Via arXiv

Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks

Nov 08, 2024
Via arXiv

SpeechCaps: Advancing Instruction-Based Universal Speech Models with Multi-Talker Speaking Style Captioning

Aug 25, 2024
Via arXiv

Prompting and Adapter Tuning for Self-supervised Encoder-Decoder Speech Model

Oct 04, 2023
Via arXiv

Dynamic-SUPERB: Towards A Dynamic, Collaborative, and Comprehensive Instruction-Tuning Benchmark for Speech

Sep 18, 2023
Via arXiv

Toward Degradation-Robust Voice Conversion

Oct 14, 2021
Via arXiv

Improving Cross-Lingual Reading Comprehension with Self-Training

May 08, 2021
Via arXiv

Utilizing Self-supervised Representations for MOS Prediction

Apr 21, 2021
Via arXiv

Investigating on Incorporating Pretrained and Learnable Speaker Representations for Multi-Speaker Multi-Style Text-to-Speech

Mar 20, 2021
Via arXiv

Defending Your Voice: Adversarial Attack on Voice Conversion

May 18, 2020
Via arXiv