Picture for Wenze Ren

Wenze Ren

MC-SEMamba: A Simple Multi-channel Extension of SEMamba

Add code
Sep 26, 2024
Figure 1 for MC-SEMamba: A Simple Multi-channel Extension of SEMamba
Figure 2 for MC-SEMamba: A Simple Multi-channel Extension of SEMamba
Figure 3 for MC-SEMamba: A Simple Multi-channel Extension of SEMamba
Figure 4 for MC-SEMamba: A Simple Multi-channel Extension of SEMamba
Viaarxiv icon

Robust Audio-Visual Speech Enhancement: Correcting Misassignments in Complex Environments with Advanced Post-Processing

Add code
Sep 22, 2024
Viaarxiv icon

Leveraging Joint Spectral and Spatial Learning with MAMBA for Multichannel Speech Enhancement

Add code
Sep 16, 2024
Figure 1 for Leveraging Joint Spectral and Spatial Learning with MAMBA for Multichannel Speech Enhancement
Figure 2 for Leveraging Joint Spectral and Spatial Learning with MAMBA for Multichannel Speech Enhancement
Figure 3 for Leveraging Joint Spectral and Spatial Learning with MAMBA for Multichannel Speech Enhancement
Figure 4 for Leveraging Joint Spectral and Spatial Learning with MAMBA for Multichannel Speech Enhancement
Viaarxiv icon

DFADD: The Diffusion and Flow-Matching Based Audio Deepfake Dataset

Add code
Sep 13, 2024
Figure 1 for DFADD: The Diffusion and Flow-Matching Based Audio Deepfake Dataset
Figure 2 for DFADD: The Diffusion and Flow-Matching Based Audio Deepfake Dataset
Figure 3 for DFADD: The Diffusion and Flow-Matching Based Audio Deepfake Dataset
Figure 4 for DFADD: The Diffusion and Flow-Matching Based Audio Deepfake Dataset
Viaarxiv icon

EMO-Codec: An In-Depth Look at Emotion Preservation capacity of Legacy and Neural Codec Models With Subjective and Objective Evaluations

Add code
Jul 30, 2024
Viaarxiv icon

EMO-Codec: A Depth Look at Emotion Preservation Capacity of Legacy and Neural Codec Models With Subjective and Objective Evaluations

Add code
Jul 24, 2024
Viaarxiv icon

Deep Complex U-Net with Conformer for Audio-Visual Speech Enhancement

Add code
Sep 20, 2023
Viaarxiv icon