Wei-Qiang Zhang

Integrating Pause Information with Word Embeddings in Language Models for Alzheimer's Disease Detection from Spontaneous Speech

Jan 12, 2025

Metadata-Enhanced Speech Emotion Recognition: Augmented Residual Integration and Co-Attention in Two-Stage Fine-Tuning

Dec 30, 2024

Improving Acoustic Scene Classification in Low-Resource Conditions

Dec 30, 2024

Improving Anomalous Sound Detection via Low-Rank Adaptation Fine-Tuning of Pre-Trained Audio Models

Sep 11, 2024

CoopASD: Cooperative Machine Anomalous Sound Detection with Privacy Concerns

Aug 27, 2024

Improving Whisper's Recognition Performance for Under-Represented Language Kazakh Leveraging Unpaired Speech and Text

Aug 10, 2024

AnoPatch: Towards Better Consistency in Machine Anomalous Sound Detection

Jun 17, 2024

GigaSpeech 2: An Evolving, Large-Scale and Multi-domain ASR Corpus for Low-Resource Languages with Automated Crawling, Transcription and Refinement

Jun 17, 2024

Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection

Jun 14, 2024

SpeechColab Leaderboard: An Open-Source Platform for Automatic Speech Recognition Evaluation

Mar 13, 2024