Javier Hernando

How Attention Shapes Emotion: A Comparative Study of Attention Mechanisms for Speech Emotion Recognition

Mar 16, 2026
Via arXiv

Quantifying Cross-Lingual Transfer in Paralinguistic Speech Tasks

Mar 09, 2026
Via arXiv

Bootstrapping Audiovisual Speech Recognition in Zero-AV-Resource Scenarios with Synthetic Visual Data

Mar 09, 2026
Via arXiv

Speech-to-Text Translation with Phoneme-Augmented CoT: Enhancing Cross-Lingual Transfer in Low-Resource Scenarios

May 30, 2025
Via arXiv

Breaking Language Barriers in Visual Language Models via Multilingual Textual Regularization

Mar 28, 2025
Via arXiv

Mass-Editing Memory with Attention in Transformers: A cross-lingual exploration of knowledge

Feb 04, 2025
Via arXiv

Language Modelling for Speaker Diarization in Telephonic Interviews

Jan 28, 2025
Via arXiv

On the Use of Audio to Improve Dialogue Policies

Oct 17, 2024
Via arXiv

BSC-UPC at EmoSPeech-IberLEF2024: Attention Pooling for Emotion Recognition

Jul 17, 2024
Via arXiv

Double Multi-Head Attention Multimodal System for Odyssey 2024 Speech Emotion Recognition Challenge

Jun 15, 2024
Via arXiv