Ying Qin

Sparsely Shared LoRA on Whisper for Child Speech Recognition

Sep 21, 2023
Via arXiv

KG-BERTScore: Incorporating Knowledge Graph into BERTScore for Reference-Free Machine Translation Evaluation

Jan 30, 2023
Via arXiv

Explicitly Increasing Input Information Density for Vision Transformers on Small Datasets

Oct 25, 2022
Via arXiv

iEmoTTS: Toward Robust Cross-Speaker Emotion Transfer and Control for Speech Synthesis based on Disentanglement between Prosody and Timbre

Jun 29, 2022
Via arXiv

Neighbors Are Not Strangers: Improving Non-Autoregressive Translation under Low-Frequency Lexical Constraints

Apr 28, 2022
Via arXiv

A study on the efficacy of model pre-training in developing neural text-to-speech system

Oct 08, 2021
Via arXiv

Exploiting Pre-Trained ASR Models for Alzheimer's Disease Recognition Through Spontaneous Speech

Oct 04, 2021
Via arXiv

The HW-TSC's Offline Speech Translation Systems for IWSLT 2021 Evaluation

Aug 09, 2021
Via arXiv

Applying the Information Bottleneck Principle to Prosodic Representation Learning

Aug 05, 2021
Via arXiv