Zhong-Qiu Wang

ctPuLSE: Close-Talk, and Pseudo-Label Based Far-Field, Speech Enhancement

Jul 28, 2024

Evolutionary Prompt Design for LLM-Based Post-ASR Error Correction

Jul 23, 2024

Cross-Talk Reduction

May 30, 2024

SuperME: Supervised and Mixture-to-Mixture Co-Learning for Speech Enhancement and Robust ASR

Mar 15, 2024

Mixture to Mixture: Leveraging Close-talk Mixtures as Weak-supervision for Speech Separation

Feb 14, 2024

USDnet: Unsupervised Speech Dereverberation via Neural Forward Filtering

Feb 01, 2024

Boosting Unknown-number Speaker Separation with Transformer Decoder-based Attractor

Jan 23, 2024

A Single Speech Enhancement Model Unifying Dereverberation, Denoising, Speaker Counting, Separation, and Extraction

Oct 12, 2023

Toward Universal Speech Enhancement for Diverse Input Conditions

Sep 29, 2023

The Multimodal Information Based Speech Processing (MISP) 2023 Challenge: Audio-Visual Target Speaker Extraction

Sep 15, 2023