Picture for Beena Ahmed

Beena Ahmed

Auto-Landmark: Acoustic Landmark Dataset and Open-Source Toolkit for Landmark Extraction

Add code
Sep 12, 2024
Figure 1 for Auto-Landmark: Acoustic Landmark Dataset and Open-Source Toolkit for Landmark Extraction
Figure 2 for Auto-Landmark: Acoustic Landmark Dataset and Open-Source Toolkit for Landmark Extraction
Figure 3 for Auto-Landmark: Acoustic Landmark Dataset and Open-Source Toolkit for Landmark Extraction
Figure 4 for Auto-Landmark: Acoustic Landmark Dataset and Open-Source Toolkit for Landmark Extraction
Viaarxiv icon

Rethinking Mamba in Speech Processing by Self-Supervised Models

Add code
Sep 11, 2024
Figure 1 for Rethinking Mamba in Speech Processing by Self-Supervised Models
Figure 2 for Rethinking Mamba in Speech Processing by Self-Supervised Models
Figure 3 for Rethinking Mamba in Speech Processing by Self-Supervised Models
Figure 4 for Rethinking Mamba in Speech Processing by Self-Supervised Models
Viaarxiv icon

Mamba in Speech: Towards an Alternative to Self-Attention

Add code
May 22, 2024
Viaarxiv icon

When LLMs Meets Acoustic Landmarks: An Efficient Approach to Integrate Speech into Large Language Models for Depression Detection

Add code
Feb 17, 2024
Viaarxiv icon

Phonological Level wav2vec2-based Mispronunciation Detection and Diagnosis Method

Add code
Nov 13, 2023
Figure 1 for Phonological Level wav2vec2-based Mispronunciation Detection and Diagnosis Method
Figure 2 for Phonological Level wav2vec2-based Mispronunciation Detection and Diagnosis Method
Figure 3 for Phonological Level wav2vec2-based Mispronunciation Detection and Diagnosis Method
Figure 4 for Phonological Level wav2vec2-based Mispronunciation Detection and Diagnosis Method
Viaarxiv icon

Spatial HuBERT: Self-supervised Spatial Speech Representation Learning for a Single Talker from Multi-channel Audio

Add code
Oct 17, 2023
Viaarxiv icon

Variational Connectionist Temporal Classification for Order-Preserving Sequence Modeling

Add code
Sep 21, 2023
Viaarxiv icon

Improving Children's Speech Recognition by Fine-tuning Self-supervised Adult Speech Representations

Add code
Nov 14, 2022
Viaarxiv icon

Speaker- and Age-Invariant Training for Child Acoustic Modeling Using Adversarial Multi-Task Learning

Add code
Oct 19, 2022
Figure 1 for Speaker- and Age-Invariant Training for Child Acoustic Modeling Using Adversarial Multi-Task Learning
Figure 2 for Speaker- and Age-Invariant Training for Child Acoustic Modeling Using Adversarial Multi-Task Learning
Figure 3 for Speaker- and Age-Invariant Training for Child Acoustic Modeling Using Adversarial Multi-Task Learning
Figure 4 for Speaker- and Age-Invariant Training for Child Acoustic Modeling Using Adversarial Multi-Task Learning
Viaarxiv icon