Picture for Po-chun Hsu

Po-chun Hsu

Building a Taiwanese Mandarin Spoken Language Model: A First Attempt

Add code
Nov 11, 2024
Viaarxiv icon

Low-Resource Self-Supervised Learning with SSL-Enhanced TTS

Add code
Sep 29, 2023
Viaarxiv icon

Learning Phone Recognition from Unpaired Audio and Phone Sequences Based on Generative Adversarial Network

Add code
Jul 29, 2022
Figure 1 for Learning Phone Recognition from Unpaired Audio and Phone Sequences Based on Generative Adversarial Network
Figure 2 for Learning Phone Recognition from Unpaired Audio and Phone Sequences Based on Generative Adversarial Network
Figure 3 for Learning Phone Recognition from Unpaired Audio and Phone Sequences Based on Generative Adversarial Network
Figure 4 for Learning Phone Recognition from Unpaired Audio and Phone Sequences Based on Generative Adversarial Network
Viaarxiv icon

Silence is Sweeter Than Speech: Self-Supervised Model Using Silence to Store Speaker Information

Add code
May 08, 2022
Figure 1 for Silence is Sweeter Than Speech: Self-Supervised Model Using Silence to Store Speaker Information
Figure 2 for Silence is Sweeter Than Speech: Self-Supervised Model Using Silence to Store Speaker Information
Figure 3 for Silence is Sweeter Than Speech: Self-Supervised Model Using Silence to Store Speaker Information
Figure 4 for Silence is Sweeter Than Speech: Self-Supervised Model Using Silence to Store Speaker Information
Viaarxiv icon

Parallel Synthesis for Autoregressive Speech Generation

Add code
Apr 25, 2022
Figure 1 for Parallel Synthesis for Autoregressive Speech Generation
Figure 2 for Parallel Synthesis for Autoregressive Speech Generation
Figure 3 for Parallel Synthesis for Autoregressive Speech Generation
Figure 4 for Parallel Synthesis for Autoregressive Speech Generation
Viaarxiv icon

Universal Adaptor: Converting Mel-Spectrograms Between Different Configurations for Speech Synthesis

Add code
Apr 01, 2022
Figure 1 for Universal Adaptor: Converting Mel-Spectrograms Between Different Configurations for Speech Synthesis
Figure 2 for Universal Adaptor: Converting Mel-Spectrograms Between Different Configurations for Speech Synthesis
Figure 3 for Universal Adaptor: Converting Mel-Spectrograms Between Different Configurations for Speech Synthesis
Figure 4 for Universal Adaptor: Converting Mel-Spectrograms Between Different Configurations for Speech Synthesis
Viaarxiv icon

Spotting adversarial samples for speaker verification by neural vocoders

Add code
Jul 02, 2021
Figure 1 for Spotting adversarial samples for speaker verification by neural vocoders
Figure 2 for Spotting adversarial samples for speaker verification by neural vocoders
Figure 3 for Spotting adversarial samples for speaker verification by neural vocoders
Figure 4 for Spotting adversarial samples for speaker verification by neural vocoders
Viaarxiv icon

Investigating on Incorporating Pretrained and Learnable Speaker Representations for Multi-Speaker Multi-Style Text-to-Speech

Add code
Mar 20, 2021
Figure 1 for Investigating on Incorporating Pretrained and Learnable Speaker Representations for Multi-Speaker Multi-Style Text-to-Speech
Figure 2 for Investigating on Incorporating Pretrained and Learnable Speaker Representations for Multi-Speaker Multi-Style Text-to-Speech
Figure 3 for Investigating on Incorporating Pretrained and Learnable Speaker Representations for Multi-Speaker Multi-Style Text-to-Speech
Figure 4 for Investigating on Incorporating Pretrained and Learnable Speaker Representations for Multi-Speaker Multi-Style Text-to-Speech
Viaarxiv icon

Towards Robust Neural Vocoding for Speech Generation: A Survey

Add code
Dec 05, 2019
Figure 1 for Towards Robust Neural Vocoding for Speech Generation: A Survey
Figure 2 for Towards Robust Neural Vocoding for Speech Generation: A Survey
Figure 3 for Towards Robust Neural Vocoding for Speech Generation: A Survey
Figure 4 for Towards Robust Neural Vocoding for Speech Generation: A Survey
Viaarxiv icon

Mockingjay: Unsupervised Speech Representation Learning with Deep Bidirectional Transformer Encoders

Add code
Oct 25, 2019
Figure 1 for Mockingjay: Unsupervised Speech Representation Learning with Deep Bidirectional Transformer Encoders
Figure 2 for Mockingjay: Unsupervised Speech Representation Learning with Deep Bidirectional Transformer Encoders
Figure 3 for Mockingjay: Unsupervised Speech Representation Learning with Deep Bidirectional Transformer Encoders
Figure 4 for Mockingjay: Unsupervised Speech Representation Learning with Deep Bidirectional Transformer Encoders
Viaarxiv icon