Picture for Syu-Siang Wang

Syu-Siang Wang

Unsupervised Face-Mask Speech Enhancement Using Generative Adversarial Networks with Human-in-the-Loop Assessment Metrics

Add code
Jul 02, 2024
Viaarxiv icon

IANS: Intelligibility-aware Null-steering Beamforming for Dual-Microphone Arrays

Add code
Jul 09, 2023
Viaarxiv icon

Continuous Speech for Improved Learning Pathological Voice Disorders

Add code
Feb 22, 2022
Figure 1 for Continuous Speech for Improved Learning Pathological Voice Disorders
Figure 2 for Continuous Speech for Improved Learning Pathological Voice Disorders
Figure 3 for Continuous Speech for Improved Learning Pathological Voice Disorders
Figure 4 for Continuous Speech for Improved Learning Pathological Voice Disorders
Viaarxiv icon

Speech Enhancement Based on Cyclegan with Noise-informed Training

Add code
Oct 19, 2021
Figure 1 for Speech Enhancement Based on Cyclegan with Noise-informed Training
Figure 2 for Speech Enhancement Based on Cyclegan with Noise-informed Training
Figure 3 for Speech Enhancement Based on Cyclegan with Noise-informed Training
Figure 4 for Speech Enhancement Based on Cyclegan with Noise-informed Training
Viaarxiv icon

Speech Enhancement-assisted Stargan Voice Conversion in Noisy Environments

Add code
Oct 19, 2021
Figure 1 for Speech Enhancement-assisted Stargan Voice Conversion in Noisy Environments
Figure 2 for Speech Enhancement-assisted Stargan Voice Conversion in Noisy Environments
Figure 3 for Speech Enhancement-assisted Stargan Voice Conversion in Noisy Environments
Figure 4 for Speech Enhancement-assisted Stargan Voice Conversion in Noisy Environments
Viaarxiv icon

Attention-based multi-task learning for speech-enhancement and speaker-identification in multi-speaker dialogue scenario

Add code
Jan 07, 2021
Figure 1 for Attention-based multi-task learning for speech-enhancement and speaker-identification in multi-speaker dialogue scenario
Figure 2 for Attention-based multi-task learning for speech-enhancement and speaker-identification in multi-speaker dialogue scenario
Figure 3 for Attention-based multi-task learning for speech-enhancement and speaker-identification in multi-speaker dialogue scenario
Figure 4 for Attention-based multi-task learning for speech-enhancement and speaker-identification in multi-speaker dialogue scenario
Viaarxiv icon

CITISEN: A Deep Learning-Based Speech Signal-Processing Mobile Application

Add code
Aug 21, 2020
Figure 1 for CITISEN: A Deep Learning-Based Speech Signal-Processing Mobile Application
Figure 2 for CITISEN: A Deep Learning-Based Speech Signal-Processing Mobile Application
Figure 3 for CITISEN: A Deep Learning-Based Speech Signal-Processing Mobile Application
Figure 4 for CITISEN: A Deep Learning-Based Speech Signal-Processing Mobile Application
Viaarxiv icon

Boosting Objective Scores of Speech Enhancement Model through MetricGAN Post-Processing

Add code
Jun 18, 2020
Figure 1 for Boosting Objective Scores of Speech Enhancement Model through MetricGAN Post-Processing
Figure 2 for Boosting Objective Scores of Speech Enhancement Model through MetricGAN Post-Processing
Figure 3 for Boosting Objective Scores of Speech Enhancement Model through MetricGAN Post-Processing
Figure 4 for Boosting Objective Scores of Speech Enhancement Model through MetricGAN Post-Processing
Viaarxiv icon

Distributed Microphone Speech Enhancement based on Deep Learning

Add code
Nov 22, 2019
Figure 1 for Distributed Microphone Speech Enhancement based on Deep Learning
Figure 2 for Distributed Microphone Speech Enhancement based on Deep Learning
Figure 3 for Distributed Microphone Speech Enhancement based on Deep Learning
Figure 4 for Distributed Microphone Speech Enhancement based on Deep Learning
Viaarxiv icon

Audio-Visual Speech Enhancement Using Multimodal Deep Convolutional Neural Networks

Add code
Jan 24, 2018
Figure 1 for Audio-Visual Speech Enhancement Using Multimodal Deep Convolutional Neural Networks
Figure 2 for Audio-Visual Speech Enhancement Using Multimodal Deep Convolutional Neural Networks
Figure 3 for Audio-Visual Speech Enhancement Using Multimodal Deep Convolutional Neural Networks
Figure 4 for Audio-Visual Speech Enhancement Using Multimodal Deep Convolutional Neural Networks
Viaarxiv icon