Picture for Nian Shao

Nian Shao

RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization

Add code
Jun 28, 2024
Figure 1 for RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization
Figure 2 for RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization
Figure 3 for RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization
Figure 4 for RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization
Viaarxiv icon

Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors

Add code
Sep 25, 2023
Viaarxiv icon

Fine-tune the pretrained ATST model for sound event detection

Add code
Sep 15, 2023
Figure 1 for Fine-tune the pretrained ATST model for sound event detection
Figure 2 for Fine-tune the pretrained ATST model for sound event detection
Figure 3 for Fine-tune the pretrained ATST model for sound event detection
Figure 4 for Fine-tune the pretrained ATST model for sound event detection
Viaarxiv icon

Self-supervised Audio Teacher-Student Transformer for Both Clip-level and Frame-level Tasks

Add code
Jun 07, 2023
Viaarxiv icon

RCT: Random Consistency Training for Semi-supervised Sound Event Detection

Add code
Nov 04, 2021
Figure 1 for RCT: Random Consistency Training for Semi-supervised Sound Event Detection
Figure 2 for RCT: Random Consistency Training for Semi-supervised Sound Event Detection
Figure 3 for RCT: Random Consistency Training for Semi-supervised Sound Event Detection
Figure 4 for RCT: Random Consistency Training for Semi-supervised Sound Event Detection
Viaarxiv icon