Julien Epps

Auto-Landmark: Acoustic Landmark Dataset and Open-Source Toolkit for Landmark Extraction

Sep 12, 2024

Rethinking Mamba in Speech Processing by Self-Supervised Models

Sep 11, 2024

Mamba in Speech: Towards an Alternative to Self-Attention

May 22, 2024

When LLMs Meets Acoustic Landmarks: An Efficient Approach to Integrate Speech into Large Language Models for Depression Detection

Feb 17, 2024

Phonological Level wav2vec2-based Mispronunciation Detection and Diagnosis Method

Nov 13, 2023

Speaker- and Age-Invariant Training for Child Acoustic Modeling Using Adversarial Multi-Task Learning

Oct 19, 2022

The Ambiguous World of Emotion Representation

Sep 01, 2019

Transfer Learning for Improving Speech Emotion Classification Accuracy

Mar 26, 2018

Variational Autoencoders for Learning Latent Representations of Speech Emotion: A Preliminary Study

Mar 26, 2018