Xiaofei Li

LS-EEND: Long-Form Streaming End-to-End Neural Diarization with Online Attractor Extraction

Oct 09, 2024
Via arXiv

RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization

Jun 28, 2024
Via arXiv

Reference Channel Selection by Multi-Channel Masking for End-to-End Multi-Channel Speech Enhancement

Jun 05, 2024
Via arXiv

DA-HFNet: Progressive Fine-Grained Forgery Image Detection and Localization Based on Dual Attention

Jun 04, 2024
Via arXiv

IPDnet: A Universal Direct-Path IPD Estimation Network for Sound Source Localization

May 11, 2024
Via arXiv

Multichannel Long-Term Streaming Neural Speech Enhancement for Static and Moving Speakers

Mar 12, 2024
Via arXiv

Mel-FullSubNet: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR

Feb 22, 2024
Via arXiv

Deep learning and random light structuring ensure robust free-space communications

Jan 18, 2024
Via arXiv

Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Multi-Channel Conformer

Dec 01, 2023
Via arXiv

Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors

Sep 25, 2023
Via arXiv