Picture for Chengshi Zheng

Chengshi Zheng

Array2BR: An End-to-End Noise-immune Binaural Audio Synthesis from Microphone-array Signals

Add code
Oct 08, 2024
Figure 1 for Array2BR: An End-to-End Noise-immune Binaural Audio Synthesis from Microphone-array Signals
Figure 2 for Array2BR: An End-to-End Noise-immune Binaural Audio Synthesis from Microphone-array Signals
Figure 3 for Array2BR: An End-to-End Noise-immune Binaural Audio Synthesis from Microphone-array Signals
Figure 4 for Array2BR: An End-to-End Noise-immune Binaural Audio Synthesis from Microphone-array Signals
Viaarxiv icon

BAE-Net: A Low complexity and high fidelity Bandwidth-Adaptive neural network for speech super-resolution

Add code
Dec 21, 2023
Viaarxiv icon

Spatial Reconstructed Local Attention Res2Net with F0 Subband for Fake Speech Detection

Add code
Aug 19, 2023
Viaarxiv icon

A General Deep Learning Speech Enhancement Framework Motivated by Taylor's Theorem

Add code
Nov 30, 2022
Viaarxiv icon

TaylorBeamixer: Learning Taylor-Inspired All-Neural Multi-Channel Speech Enhancement from Beam-Space Dictionary Perspective

Add code
Nov 30, 2022
Viaarxiv icon

Audio Deepfake Detection Based on a Combination of F0 Information and Real Plus Imaginary Spectrogram Features

Add code
Aug 02, 2022
Figure 1 for Audio Deepfake Detection Based on a Combination of F0 Information and Real Plus Imaginary Spectrogram Features
Figure 2 for Audio Deepfake Detection Based on a Combination of F0 Information and Real Plus Imaginary Spectrogram Features
Figure 3 for Audio Deepfake Detection Based on a Combination of F0 Information and Real Plus Imaginary Spectrogram Features
Figure 4 for Audio Deepfake Detection Based on a Combination of F0 Information and Real Plus Imaginary Spectrogram Features
Viaarxiv icon

TMGAN-PLC: Audio Packet Loss Concealment using Temporal Memory Generative Adversarial Network

Add code
Jul 04, 2022
Figure 1 for TMGAN-PLC: Audio Packet Loss Concealment using Temporal Memory Generative Adversarial Network
Figure 2 for TMGAN-PLC: Audio Packet Loss Concealment using Temporal Memory Generative Adversarial Network
Figure 3 for TMGAN-PLC: Audio Packet Loss Concealment using Temporal Memory Generative Adversarial Network
Viaarxiv icon

Taylor, Can You Hear Me Now? A Taylor-Unfolding Framework for Monaural Speech Enhancement

Add code
Apr 30, 2022
Figure 1 for Taylor, Can You Hear Me Now? A Taylor-Unfolding Framework for Monaural Speech Enhancement
Figure 2 for Taylor, Can You Hear Me Now? A Taylor-Unfolding Framework for Monaural Speech Enhancement
Figure 3 for Taylor, Can You Hear Me Now? A Taylor-Unfolding Framework for Monaural Speech Enhancement
Figure 4 for Taylor, Can You Hear Me Now? A Taylor-Unfolding Framework for Monaural Speech Enhancement
Viaarxiv icon

Optimizing Shoulder to Shoulder: A Coordinated Sub-Band Fusion Model for Real-Time Full-Band Speech Enhancement

Add code
Mar 30, 2022
Figure 1 for Optimizing Shoulder to Shoulder: A Coordinated Sub-Band Fusion Model for Real-Time Full-Band Speech Enhancement
Figure 2 for Optimizing Shoulder to Shoulder: A Coordinated Sub-Band Fusion Model for Real-Time Full-Band Speech Enhancement
Figure 3 for Optimizing Shoulder to Shoulder: A Coordinated Sub-Band Fusion Model for Real-Time Full-Band Speech Enhancement
Figure 4 for Optimizing Shoulder to Shoulder: A Coordinated Sub-Band Fusion Model for Real-Time Full-Band Speech Enhancement
Viaarxiv icon

TaylorBeamformer: Learning All-Neural Beamformer for Multi-Channel Speech Enhancement from Taylor's Approximation Theory

Add code
Mar 16, 2022
Figure 1 for TaylorBeamformer: Learning All-Neural Beamformer for Multi-Channel Speech Enhancement from Taylor's Approximation Theory
Figure 2 for TaylorBeamformer: Learning All-Neural Beamformer for Multi-Channel Speech Enhancement from Taylor's Approximation Theory
Figure 3 for TaylorBeamformer: Learning All-Neural Beamformer for Multi-Channel Speech Enhancement from Taylor's Approximation Theory
Viaarxiv icon