Picture for Catherine Lai

Catherine Lai

Semi-Supervised Cognitive State Classification from Speech with Multi-View Pseudo-Labeling

Add code
Sep 25, 2024
Figure 1 for Semi-Supervised Cognitive State Classification from Speech with Multi-View Pseudo-Labeling
Figure 2 for Semi-Supervised Cognitive State Classification from Speech with Multi-View Pseudo-Labeling
Figure 3 for Semi-Supervised Cognitive State Classification from Speech with Multi-View Pseudo-Labeling
Figure 4 for Semi-Supervised Cognitive State Classification from Speech with Multi-View Pseudo-Labeling
Viaarxiv icon

Large Language Model Based Generative Error Correction: A Challenge and Baselines for Speech Recognition, Speaker Tagging, and Emotion Recognition

Add code
Sep 17, 2024
Figure 1 for Large Language Model Based Generative Error Correction: A Challenge and Baselines for Speech Recognition, Speaker Tagging, and Emotion Recognition
Figure 2 for Large Language Model Based Generative Error Correction: A Challenge and Baselines for Speech Recognition, Speaker Tagging, and Emotion Recognition
Figure 3 for Large Language Model Based Generative Error Correction: A Challenge and Baselines for Speech Recognition, Speaker Tagging, and Emotion Recognition
Figure 4 for Large Language Model Based Generative Error Correction: A Challenge and Baselines for Speech Recognition, Speaker Tagging, and Emotion Recognition
Viaarxiv icon

Speech Emotion Recognition with ASR Transcripts: A Comprehensive Study on Word Error Rate and Fusion Techniques

Add code
Jun 12, 2024
Figure 1 for Speech Emotion Recognition with ASR Transcripts: A Comprehensive Study on Word Error Rate and Fusion Techniques
Figure 2 for Speech Emotion Recognition with ASR Transcripts: A Comprehensive Study on Word Error Rate and Fusion Techniques
Figure 3 for Speech Emotion Recognition with ASR Transcripts: A Comprehensive Study on Word Error Rate and Fusion Techniques
Figure 4 for Speech Emotion Recognition with ASR Transcripts: A Comprehensive Study on Word Error Rate and Fusion Techniques
Viaarxiv icon

1st Place Solution to Odyssey Emotion Recognition Challenge Task1: Tackling Class Imbalance Problem

Add code
May 30, 2024
Figure 1 for 1st Place Solution to Odyssey Emotion Recognition Challenge Task1: Tackling Class Imbalance Problem
Figure 2 for 1st Place Solution to Odyssey Emotion Recognition Challenge Task1: Tackling Class Imbalance Problem
Figure 3 for 1st Place Solution to Odyssey Emotion Recognition Challenge Task1: Tackling Class Imbalance Problem
Figure 4 for 1st Place Solution to Odyssey Emotion Recognition Challenge Task1: Tackling Class Imbalance Problem
Viaarxiv icon

Crossmodal ASR Error Correction with Discrete Speech Units

Add code
May 26, 2024
Figure 1 for Crossmodal ASR Error Correction with Discrete Speech Units
Figure 2 for Crossmodal ASR Error Correction with Discrete Speech Units
Figure 3 for Crossmodal ASR Error Correction with Discrete Speech Units
Figure 4 for Crossmodal ASR Error Correction with Discrete Speech Units
Viaarxiv icon

Layer-Wise Analysis of Self-Supervised Acoustic Word Embeddings: A Study on Speech Emotion Recognition

Add code
Feb 04, 2024
Viaarxiv icon

Quantifying the perceptual value of lexical and non-lexical channels in speech

Add code
Jul 07, 2023
Viaarxiv icon

Transfer Learning for Personality Perception via Speech Emotion Recognition

Add code
May 25, 2023
Viaarxiv icon

ASR and Emotional Speech: A Word-Level Investigation of the Mutual Impact of Speech and Emotion Recognition

Add code
May 25, 2023
Viaarxiv icon

Cross-Attention is Not Enough: Incongruity-Aware Multimodal Sentiment Analysis and Emotion Recognition

Add code
May 23, 2023
Viaarxiv icon