Picture for Hao Che

Hao Che

VQ-CTAP: Cross-Modal Fine-Grained Sequence Representation Learning for Speech Processing

Add code
Aug 11, 2024
Viaarxiv icon

Improving Prosody for Cross-Speaker Style Transfer by Semi-Supervised Style Extractor and Hierarchical Modeling in Speech Synthesis

Add code
Mar 14, 2023
Figure 1 for Improving Prosody for Cross-Speaker Style Transfer by Semi-Supervised Style Extractor and Hierarchical Modeling in Speech Synthesis
Figure 2 for Improving Prosody for Cross-Speaker Style Transfer by Semi-Supervised Style Extractor and Hierarchical Modeling in Speech Synthesis
Figure 3 for Improving Prosody for Cross-Speaker Style Transfer by Semi-Supervised Style Extractor and Hierarchical Modeling in Speech Synthesis
Figure 4 for Improving Prosody for Cross-Speaker Style Transfer by Semi-Supervised Style Extractor and Hierarchical Modeling in Speech Synthesis
Viaarxiv icon

Style-Label-Free: Cross-Speaker Style Transfer by Quantized VAE and Speaker-wise Normalization in Speech Synthesis

Add code
Dec 13, 2022
Figure 1 for Style-Label-Free: Cross-Speaker Style Transfer by Quantized VAE and Speaker-wise Normalization in Speech Synthesis
Figure 2 for Style-Label-Free: Cross-Speaker Style Transfer by Quantized VAE and Speaker-wise Normalization in Speech Synthesis
Figure 3 for Style-Label-Free: Cross-Speaker Style Transfer by Quantized VAE and Speaker-wise Normalization in Speech Synthesis
Figure 4 for Style-Label-Free: Cross-Speaker Style Transfer by Quantized VAE and Speaker-wise Normalization in Speech Synthesis
Viaarxiv icon

Back-Translation-Style Data Augmentation for Mandarin Chinese Polyphone Disambiguation

Add code
Nov 17, 2022
Viaarxiv icon

Multi-layered Semantic Representation Network for Multi-label Image Classification

Add code
Jun 22, 2021
Figure 1 for Multi-layered Semantic Representation Network for Multi-label Image Classification
Figure 2 for Multi-layered Semantic Representation Network for Multi-label Image Classification
Figure 3 for Multi-layered Semantic Representation Network for Multi-label Image Classification
Figure 4 for Multi-layered Semantic Representation Network for Multi-label Image Classification
Viaarxiv icon

a novel cross-lingual voice cloning approach with a few text-free samples

Add code
Oct 30, 2019
Figure 1 for a novel cross-lingual voice cloning approach with a few text-free samples
Figure 2 for a novel cross-lingual voice cloning approach with a few text-free samples
Figure 3 for a novel cross-lingual voice cloning approach with a few text-free samples
Viaarxiv icon