Picture for Yeunju Choi

Yeunju Choi

VoxSim: A perceptual voice similarity dataset

Add code
Jul 26, 2024
Viaarxiv icon

Perceptually Guided End-to-End Text-to-Speech

Add code
Nov 02, 2020
Figure 1 for Perceptually Guided End-to-End Text-to-Speech
Figure 2 for Perceptually Guided End-to-End Text-to-Speech
Figure 3 for Perceptually Guided End-to-End Text-to-Speech
Viaarxiv icon

A Unified Deep Learning Framework for Short-Duration Speaker Verification in Adverse Environments

Add code
Oct 06, 2020
Figure 1 for A Unified Deep Learning Framework for Short-Duration Speaker Verification in Adverse Environments
Figure 2 for A Unified Deep Learning Framework for Short-Duration Speaker Verification in Adverse Environments
Figure 3 for A Unified Deep Learning Framework for Short-Duration Speaker Verification in Adverse Environments
Figure 4 for A Unified Deep Learning Framework for Short-Duration Speaker Verification in Adverse Environments
Viaarxiv icon

Deep MOS Predictor for Synthetic Speech Using Cluster-Based Modeling

Add code
Aug 09, 2020
Figure 1 for Deep MOS Predictor for Synthetic Speech Using Cluster-Based Modeling
Figure 2 for Deep MOS Predictor for Synthetic Speech Using Cluster-Based Modeling
Figure 3 for Deep MOS Predictor for Synthetic Speech Using Cluster-Based Modeling
Figure 4 for Deep MOS Predictor for Synthetic Speech Using Cluster-Based Modeling
Viaarxiv icon

Neural MOS Prediction for Synthesized Speech Using Multi-Task Learning With Spoofing Detection and Spoofing Type Classification

Add code
Jul 16, 2020
Figure 1 for Neural MOS Prediction for Synthesized Speech Using Multi-Task Learning With Spoofing Detection and Spoofing Type Classification
Figure 2 for Neural MOS Prediction for Synthesized Speech Using Multi-Task Learning With Spoofing Detection and Spoofing Type Classification
Figure 3 for Neural MOS Prediction for Synthesized Speech Using Multi-Task Learning With Spoofing Detection and Spoofing Type Classification
Figure 4 for Neural MOS Prediction for Synthesized Speech Using Multi-Task Learning With Spoofing Detection and Spoofing Type Classification
Viaarxiv icon

Multi-Scale Aggregation Using Feature Pyramid Module for Text-Independent Speaker Verification

Add code
Apr 14, 2020
Figure 1 for Multi-Scale Aggregation Using Feature Pyramid Module for Text-Independent Speaker Verification
Figure 2 for Multi-Scale Aggregation Using Feature Pyramid Module for Text-Independent Speaker Verification
Figure 3 for Multi-Scale Aggregation Using Feature Pyramid Module for Text-Independent Speaker Verification
Figure 4 for Multi-Scale Aggregation Using Feature Pyramid Module for Text-Independent Speaker Verification
Viaarxiv icon

Self-Adaptive Soft Voice Activity Detection using Deep Neural Networks for Robust Speaker Verification

Add code
Sep 26, 2019
Figure 1 for Self-Adaptive Soft Voice Activity Detection using Deep Neural Networks for Robust Speaker Verification
Figure 2 for Self-Adaptive Soft Voice Activity Detection using Deep Neural Networks for Robust Speaker Verification
Figure 3 for Self-Adaptive Soft Voice Activity Detection using Deep Neural Networks for Robust Speaker Verification
Figure 4 for Self-Adaptive Soft Voice Activity Detection using Deep Neural Networks for Robust Speaker Verification
Viaarxiv icon

Spatial Pyramid Encoding with Convex Length Normalization for Text-Independent Speaker Verification

Add code
Jun 19, 2019
Figure 1 for Spatial Pyramid Encoding with Convex Length Normalization for Text-Independent Speaker Verification
Figure 2 for Spatial Pyramid Encoding with Convex Length Normalization for Text-Independent Speaker Verification
Figure 3 for Spatial Pyramid Encoding with Convex Length Normalization for Text-Independent Speaker Verification
Figure 4 for Spatial Pyramid Encoding with Convex Length Normalization for Text-Independent Speaker Verification
Viaarxiv icon