Zexu Pan

Conditional Latent Diffusion-Based Speech Enhancement Via Dual Context Learning

Jan 17, 2025

HiFi-SR: A Unified Generative Transformer-Convolutional Adversarial Network for High-Fidelity Speech Super-Resolution

Jan 17, 2025

Improved Feature Extraction Network for Neuro-Oriented Target Speaker Extraction

Jan 03, 2025

Speech Separation with Pretrained Frontend to Minimize Domain Mismatch

Nov 05, 2024

pTSE-T: Presentation Target Speaker Extraction using Unaligned Text Cues

Nov 05, 2024

Emotional Dimension Control in Language Model-Based Text-to-Speech: Spanning a Broad Spectrum of Human Emotions

Sep 25, 2024

Enhanced Reverberation as Supervision for Unsupervised Speech Separation

Aug 06, 2024

TF-Locoformer: Transformer with Local Modeling by Convolution for Speech Separation and Enhancement

Aug 06, 2024

NIIRF: Neural IIR Filter Field for HRTF Upsampling and Personalization

Feb 27, 2024

NeuroHeed+: Improving Neuro-steered Speaker Extraction with Joint Auditory Attention Detection

Dec 12, 2023