Picture for Yicheng Hsu

Yicheng Hsu

A tunable binaural audio telepresence system capable of balancing immersive and enhanced modes

Add code
May 14, 2024
Viaarxiv icon

Spatial-Temporal Activity-Informed Diarization and Separation

Add code
Jan 30, 2024
Viaarxiv icon

Learning-based Array Configuration-Independent Binaural Audio Telepresence with Scalable Signal Enhancement and Ambience Preservation

Add code
Nov 21, 2023
Viaarxiv icon

Deep Beamforming for Speech Enhancement and Speaker Localization with an Array Response-Aware Loss Function

Add code
Oct 22, 2023
Viaarxiv icon

Array Configuration-Agnostic Personal Voice Activity Detection Based on Spatial Coherence

Add code
Apr 18, 2023
Viaarxiv icon

Learning-based Robust Speaker Counting and Separation with the Aid of Spatial Coherence

Add code
Mar 13, 2023
Viaarxiv icon

Array Configuration-Agnostic Personalized Speech Enhancement using Long-Short-Term Spatial Coherence

Add code
Nov 16, 2022
Viaarxiv icon

Multi-channel target speech enhancement based on ERB-scaled spatial coherence features

Add code
Jul 17, 2022
Figure 1 for Multi-channel target speech enhancement based on ERB-scaled spatial coherence features
Figure 2 for Multi-channel target speech enhancement based on ERB-scaled spatial coherence features
Figure 3 for Multi-channel target speech enhancement based on ERB-scaled spatial coherence features
Figure 4 for Multi-channel target speech enhancement based on ERB-scaled spatial coherence features
Viaarxiv icon

Multi-channel end-to-end neural network for speech enhancement, source localization, and voice activity detection

Add code
Jun 20, 2022
Figure 1 for Multi-channel end-to-end neural network for speech enhancement, source localization, and voice activity detection
Figure 2 for Multi-channel end-to-end neural network for speech enhancement, source localization, and voice activity detection
Figure 3 for Multi-channel end-to-end neural network for speech enhancement, source localization, and voice activity detection
Figure 4 for Multi-channel end-to-end neural network for speech enhancement, source localization, and voice activity detection
Viaarxiv icon

Acoustic echo suppression using a learning-based multi-frame minimum variance distortionless response filter

Add code
May 07, 2022
Figure 1 for Acoustic echo suppression using a learning-based multi-frame minimum variance distortionless response filter
Figure 2 for Acoustic echo suppression using a learning-based multi-frame minimum variance distortionless response filter
Figure 3 for Acoustic echo suppression using a learning-based multi-frame minimum variance distortionless response filter
Figure 4 for Acoustic echo suppression using a learning-based multi-frame minimum variance distortionless response filter
Viaarxiv icon