Picture for Maokui He

Maokui He

Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding with Sequence-to-Sequence Architecture

Add code
Sep 17, 2023
Viaarxiv icon

The USTC-NERCSLIP Systems for the CHiME-7 DASR Challenge

Add code
Aug 28, 2023
Figure 1 for The USTC-NERCSLIP Systems for the CHiME-7 DASR Challenge
Figure 2 for The USTC-NERCSLIP Systems for the CHiME-7 DASR Challenge
Figure 3 for The USTC-NERCSLIP Systems for the CHiME-7 DASR Challenge
Figure 4 for The USTC-NERCSLIP Systems for the CHiME-7 DASR Challenge
Viaarxiv icon

Semi-supervised multi-channel speaker diarization with cross-channel attention

Add code
Jul 17, 2023
Figure 1 for Semi-supervised multi-channel speaker diarization with cross-channel attention
Figure 2 for Semi-supervised multi-channel speaker diarization with cross-channel attention
Figure 3 for Semi-supervised multi-channel speaker diarization with cross-channel attention
Figure 4 for Semi-supervised multi-channel speaker diarization with cross-channel attention
Viaarxiv icon

The USTC-Ximalaya system for the ICASSP 2022 multi-channel multi-party meeting transcription challenge

Add code
Feb 10, 2022
Figure 1 for The USTC-Ximalaya system for the ICASSP 2022 multi-channel multi-party meeting transcription  challenge
Figure 2 for The USTC-Ximalaya system for the ICASSP 2022 multi-channel multi-party meeting transcription  challenge
Figure 3 for The USTC-Ximalaya system for the ICASSP 2022 multi-channel multi-party meeting transcription  challenge
Figure 4 for The USTC-Ximalaya system for the ICASSP 2022 multi-channel multi-party meeting transcription  challenge
Viaarxiv icon

Target-speaker Voice Activity Detection with Improved I-Vector Estimation for Unknown Number of Speaker

Add code
Aug 07, 2021
Figure 1 for Target-speaker Voice Activity Detection with Improved I-Vector Estimation for Unknown Number of Speaker
Figure 2 for Target-speaker Voice Activity Detection with Improved I-Vector Estimation for Unknown Number of Speaker
Figure 3 for Target-speaker Voice Activity Detection with Improved I-Vector Estimation for Unknown Number of Speaker
Figure 4 for Target-speaker Voice Activity Detection with Improved I-Vector Estimation for Unknown Number of Speaker
Viaarxiv icon

USTC-NELSLIP System Description for DIHARD-III Challenge

Add code
Mar 19, 2021
Figure 1 for USTC-NELSLIP System Description for DIHARD-III Challenge
Figure 2 for USTC-NELSLIP System Description for DIHARD-III Challenge
Figure 3 for USTC-NELSLIP System Description for DIHARD-III Challenge
Figure 4 for USTC-NELSLIP System Description for DIHARD-III Challenge
Viaarxiv icon