Picture for Yanhua Long

Yanhua Long

ICSD: An Open-source Dataset for Infant Cry and Snoring Detection

Add code
Aug 20, 2024
Viaarxiv icon

Autoencoder with Group-based Decoder and Multi-task Optimization for Anomalous Sound Detection

Add code
Nov 15, 2023
Viaarxiv icon

UNISOUND System for VoxCeleb Speaker Recognition Challenge 2023

Add code
Aug 24, 2023
Figure 1 for UNISOUND System for VoxCeleb Speaker Recognition Challenge 2023
Figure 2 for UNISOUND System for VoxCeleb Speaker Recognition Challenge 2023
Figure 3 for UNISOUND System for VoxCeleb Speaker Recognition Challenge 2023
Figure 4 for UNISOUND System for VoxCeleb Speaker Recognition Challenge 2023
Viaarxiv icon

Multi-pass Training and Cross-information Fusion for Low-resource End-to-end Accented Speech Recognition

Add code
Jun 20, 2023
Figure 1 for Multi-pass Training and Cross-information Fusion for Low-resource End-to-end Accented Speech Recognition
Figure 2 for Multi-pass Training and Cross-information Fusion for Low-resource End-to-end Accented Speech Recognition
Figure 3 for Multi-pass Training and Cross-information Fusion for Low-resource End-to-end Accented Speech Recognition
Figure 4 for Multi-pass Training and Cross-information Fusion for Low-resource End-to-end Accented Speech Recognition
Viaarxiv icon

Dynamic Acoustic Compensation and Adaptive Focal Training for Personalized Speech Enhancement

Add code
Nov 22, 2022
Figure 1 for Dynamic Acoustic Compensation and Adaptive Focal Training for Personalized Speech Enhancement
Figure 2 for Dynamic Acoustic Compensation and Adaptive Focal Training for Personalized Speech Enhancement
Figure 3 for Dynamic Acoustic Compensation and Adaptive Focal Training for Personalized Speech Enhancement
Figure 4 for Dynamic Acoustic Compensation and Adaptive Focal Training for Personalized Speech Enhancement
Viaarxiv icon

Phonetic-assisted Multi-Target Units Modeling for Improving Conformer-Transducer ASR system

Add code
Nov 03, 2022
Viaarxiv icon

DiaCorrect: End-to-end error correction for speaker diarization

Add code
Oct 31, 2022
Figure 1 for DiaCorrect: End-to-end error correction for speaker diarization
Figure 2 for DiaCorrect: End-to-end error correction for speaker diarization
Figure 3 for DiaCorrect: End-to-end error correction for speaker diarization
Figure 4 for DiaCorrect: End-to-end error correction for speaker diarization
Viaarxiv icon

Heterogeneous Separation Consistency Training for Adaptation of Unsupervised Speech Separation

Add code
Apr 23, 2022
Figure 1 for Heterogeneous Separation Consistency Training for Adaptation of Unsupervised Speech Separation
Figure 2 for Heterogeneous Separation Consistency Training for Adaptation of Unsupervised Speech Separation
Figure 3 for Heterogeneous Separation Consistency Training for Adaptation of Unsupervised Speech Separation
Figure 4 for Heterogeneous Separation Consistency Training for Adaptation of Unsupervised Speech Separation
Viaarxiv icon

PercepNet+: A Phase and SNR Aware PercepNet for Real-Time Speech Enhancement

Add code
Mar 04, 2022
Figure 1 for PercepNet+: A Phase and SNR Aware PercepNet for Real-Time Speech Enhancement
Figure 2 for PercepNet+: A Phase and SNR Aware PercepNet for Real-Time Speech Enhancement
Figure 3 for PercepNet+: A Phase and SNR Aware PercepNet for Real-Time Speech Enhancement
Figure 4 for PercepNet+: A Phase and SNR Aware PercepNet for Real-Time Speech Enhancement
Viaarxiv icon

Selective Pseudo-labeling and Class-wise Discriminative Fusion for Sound Event Detection

Add code
Mar 04, 2022
Figure 1 for Selective Pseudo-labeling and Class-wise Discriminative Fusion for Sound Event Detection
Figure 2 for Selective Pseudo-labeling and Class-wise Discriminative Fusion for Sound Event Detection
Figure 3 for Selective Pseudo-labeling and Class-wise Discriminative Fusion for Sound Event Detection
Figure 4 for Selective Pseudo-labeling and Class-wise Discriminative Fusion for Sound Event Detection
Viaarxiv icon