Picture for Jiangyu Han

Jiangyu Han

DiCoW: Diarization-Conditioned Whisper for Target Speaker Automatic Speech Recognition

Add code
Dec 30, 2024
Viaarxiv icon

Leveraging Self-Supervised Learning for Speaker Diarization

Add code
Sep 14, 2024
Figure 1 for Leveraging Self-Supervised Learning for Speaker Diarization
Figure 2 for Leveraging Self-Supervised Learning for Speaker Diarization
Figure 3 for Leveraging Self-Supervised Learning for Speaker Diarization
Figure 4 for Leveraging Self-Supervised Learning for Speaker Diarization
Viaarxiv icon

DiaCorrect: Error Correction Back-end For Speaker Diarization

Add code
Sep 15, 2023
Viaarxiv icon

HYBRIDFORMER: improving SqueezeFormer with hybrid attention and NSR mechanism

Add code
Mar 15, 2023
Figure 1 for HYBRIDFORMER: improving SqueezeFormer with hybrid attention and NSR mechanism
Figure 2 for HYBRIDFORMER: improving SqueezeFormer with hybrid attention and NSR mechanism
Figure 3 for HYBRIDFORMER: improving SqueezeFormer with hybrid attention and NSR mechanism
Figure 4 for HYBRIDFORMER: improving SqueezeFormer with hybrid attention and NSR mechanism
Viaarxiv icon

Dynamic Acoustic Compensation and Adaptive Focal Training for Personalized Speech Enhancement

Add code
Nov 22, 2022
Figure 1 for Dynamic Acoustic Compensation and Adaptive Focal Training for Personalized Speech Enhancement
Figure 2 for Dynamic Acoustic Compensation and Adaptive Focal Training for Personalized Speech Enhancement
Figure 3 for Dynamic Acoustic Compensation and Adaptive Focal Training for Personalized Speech Enhancement
Figure 4 for Dynamic Acoustic Compensation and Adaptive Focal Training for Personalized Speech Enhancement
Viaarxiv icon

DiaCorrect: End-to-end error correction for speaker diarization

Add code
Oct 31, 2022
Figure 1 for DiaCorrect: End-to-end error correction for speaker diarization
Figure 2 for DiaCorrect: End-to-end error correction for speaker diarization
Figure 3 for DiaCorrect: End-to-end error correction for speaker diarization
Figure 4 for DiaCorrect: End-to-end error correction for speaker diarization
Viaarxiv icon

Heterogeneous Separation Consistency Training for Adaptation of Unsupervised Speech Separation

Add code
Apr 23, 2022
Figure 1 for Heterogeneous Separation Consistency Training for Adaptation of Unsupervised Speech Separation
Figure 2 for Heterogeneous Separation Consistency Training for Adaptation of Unsupervised Speech Separation
Figure 3 for Heterogeneous Separation Consistency Training for Adaptation of Unsupervised Speech Separation
Figure 4 for Heterogeneous Separation Consistency Training for Adaptation of Unsupervised Speech Separation
Viaarxiv icon

PercepNet+: A Phase and SNR Aware PercepNet for Real-Time Speech Enhancement

Add code
Mar 04, 2022
Figure 1 for PercepNet+: A Phase and SNR Aware PercepNet for Real-Time Speech Enhancement
Figure 2 for PercepNet+: A Phase and SNR Aware PercepNet for Real-Time Speech Enhancement
Figure 3 for PercepNet+: A Phase and SNR Aware PercepNet for Real-Time Speech Enhancement
Figure 4 for PercepNet+: A Phase and SNR Aware PercepNet for Real-Time Speech Enhancement
Viaarxiv icon

DPCCN: Densely-Connected Pyramid Complex Convolutional Network for Robust Speech Separation And Extraction

Add code
Dec 27, 2021
Figure 1 for DPCCN: Densely-Connected Pyramid Complex Convolutional Network for Robust Speech Separation And Extraction
Figure 2 for DPCCN: Densely-Connected Pyramid Complex Convolutional Network for Robust Speech Separation And Extraction
Figure 3 for DPCCN: Densely-Connected Pyramid Complex Convolutional Network for Robust Speech Separation And Extraction
Figure 4 for DPCCN: Densely-Connected Pyramid Complex Convolutional Network for Robust Speech Separation And Extraction
Viaarxiv icon

Improving Channel Decorrelation for Multi-Channel Target Speech Extraction

Add code
Jun 06, 2021
Figure 1 for Improving Channel Decorrelation for Multi-Channel Target Speech Extraction
Figure 2 for Improving Channel Decorrelation for Multi-Channel Target Speech Extraction
Figure 3 for Improving Channel Decorrelation for Multi-Channel Target Speech Extraction
Figure 4 for Improving Channel Decorrelation for Multi-Channel Target Speech Extraction
Viaarxiv icon