Shubo Lv

MBTFNet: Multi-Band Temporal-Frequency Neural Network For Singing Voice Enhancement

Oct 06, 2023

DCCRN-KWS: an audio bias based model for noise robust small-footprint keyword spotting

May 23, 2023

Two-stage Neural Network for ICASSP 2023 Speech Signal Improvement Challenge

Mar 14, 2023

Spatial-DCCRN: DCCRN equipped with frame-level angle feature and hybrid filtering for multi-channel speech enhancement

Oct 17, 2022

S-DCCRN: Super Wide Band DCCRN with learnable complex feature for speech enhancement

Nov 16, 2021

Uformer: A Unet based dilated complex & real dual-path conformer network for simultaneous speech enhancement and dereverberation

Nov 11, 2021

DCCRN+: Channel-wise Subband DCCRN with SNR Estimation for Speech Enhancement

Jun 16, 2021

F-T-LSTM based Complex Network for Joint Acoustic Echo Cancellation and Speech Enhancement

Jun 16, 2021

AISHELL-4: An Open Source Dataset for Speech Enhancement, Separation, Recognition and Speaker Diarization in Conference Scenario

Apr 08, 2021