Picture for Yu-Huai Peng

Yu-Huai Peng

Generation of Speaker Representations Using Heterogeneous Training Batch Assembly

Add code
Mar 30, 2022
Figure 1 for Generation of Speaker Representations Using Heterogeneous Training Batch Assembly
Figure 2 for Generation of Speaker Representations Using Heterogeneous Training Batch Assembly
Figure 3 for Generation of Speaker Representations Using Heterogeneous Training Batch Assembly
Figure 4 for Generation of Speaker Representations Using Heterogeneous Training Batch Assembly
Viaarxiv icon

Time Alignment using Lip Images for Frame-based Electrolaryngeal Voice Conversion

Add code
Sep 08, 2021
Figure 1 for Time Alignment using Lip Images for Frame-based Electrolaryngeal Voice Conversion
Figure 2 for Time Alignment using Lip Images for Frame-based Electrolaryngeal Voice Conversion
Figure 3 for Time Alignment using Lip Images for Frame-based Electrolaryngeal Voice Conversion
Figure 4 for Time Alignment using Lip Images for Frame-based Electrolaryngeal Voice Conversion
Viaarxiv icon

SVSNet: An End-to-end Speaker Voice Similarity Assessment Model

Add code
Jul 20, 2021
Figure 1 for SVSNet: An End-to-end Speaker Voice Similarity Assessment Model
Figure 2 for SVSNet: An End-to-end Speaker Voice Similarity Assessment Model
Figure 3 for SVSNet: An End-to-end Speaker Voice Similarity Assessment Model
Figure 4 for SVSNet: An End-to-end Speaker Voice Similarity Assessment Model
Viaarxiv icon

Dual-Path Filter Network: Speaker-Aware Modeling for Speech Separation

Add code
Jun 14, 2021
Figure 1 for Dual-Path Filter Network: Speaker-Aware Modeling for Speech Separation
Figure 2 for Dual-Path Filter Network: Speaker-Aware Modeling for Speech Separation
Figure 3 for Dual-Path Filter Network: Speaker-Aware Modeling for Speech Separation
Figure 4 for Dual-Path Filter Network: Speaker-Aware Modeling for Speech Separation
Viaarxiv icon

Relational Data Selection for Data Augmentation of Speaker-dependent Multi-band MelGAN Vocoder

Add code
Jun 10, 2021
Figure 1 for Relational Data Selection for Data Augmentation of Speaker-dependent Multi-band MelGAN Vocoder
Figure 2 for Relational Data Selection for Data Augmentation of Speaker-dependent Multi-band MelGAN Vocoder
Figure 3 for Relational Data Selection for Data Augmentation of Speaker-dependent Multi-band MelGAN Vocoder
Viaarxiv icon

A Preliminary Study of a Two-Stage Paradigm for Preserving Speaker Identity in Dysarthric Voice Conversion

Add code
Jun 02, 2021
Figure 1 for A Preliminary Study of a Two-Stage Paradigm for Preserving Speaker Identity in Dysarthric Voice Conversion
Figure 2 for A Preliminary Study of a Two-Stage Paradigm for Preserving Speaker Identity in Dysarthric Voice Conversion
Figure 3 for A Preliminary Study of a Two-Stage Paradigm for Preserving Speaker Identity in Dysarthric Voice Conversion
Figure 4 for A Preliminary Study of a Two-Stage Paradigm for Preserving Speaker Identity in Dysarthric Voice Conversion
Viaarxiv icon

The AS-NU System for the M2VoC Challenge

Add code
Apr 07, 2021
Figure 1 for The AS-NU System for the M2VoC Challenge
Figure 2 for The AS-NU System for the M2VoC Challenge
Figure 3 for The AS-NU System for the M2VoC Challenge
Figure 4 for The AS-NU System for the M2VoC Challenge
Viaarxiv icon

Unsupervised Representation Disentanglement using Cross Domain Features and Adversarial Learning in Variational Autoencoder based Voice Conversion

Add code
Feb 07, 2020
Figure 1 for Unsupervised Representation Disentanglement using Cross Domain Features and Adversarial Learning in Variational Autoencoder based Voice Conversion
Figure 2 for Unsupervised Representation Disentanglement using Cross Domain Features and Adversarial Learning in Variational Autoencoder based Voice Conversion
Figure 3 for Unsupervised Representation Disentanglement using Cross Domain Features and Adversarial Learning in Variational Autoencoder based Voice Conversion
Figure 4 for Unsupervised Representation Disentanglement using Cross Domain Features and Adversarial Learning in Variational Autoencoder based Voice Conversion
Viaarxiv icon

Voice Conversion Based on Cross-Domain Features Using Variational Auto Encoders

Add code
Aug 29, 2018
Figure 1 for Voice Conversion Based on Cross-Domain Features Using Variational Auto Encoders
Figure 2 for Voice Conversion Based on Cross-Domain Features Using Variational Auto Encoders
Figure 3 for Voice Conversion Based on Cross-Domain Features Using Variational Auto Encoders
Figure 4 for Voice Conversion Based on Cross-Domain Features Using Variational Auto Encoders
Viaarxiv icon