Picture for David Gimeno-Gómez

David Gimeno-Gómez

Unveiling Interpretability in Self-Supervised Speech Representations for Parkinson's Diagnosis

Add code
Dec 02, 2024
Figure 1 for Unveiling Interpretability in Self-Supervised Speech Representations for Parkinson's Diagnosis
Figure 2 for Unveiling Interpretability in Self-Supervised Speech Representations for Parkinson's Diagnosis
Figure 3 for Unveiling Interpretability in Self-Supervised Speech Representations for Parkinson's Diagnosis
Figure 4 for Unveiling Interpretability in Self-Supervised Speech Representations for Parkinson's Diagnosis
Viaarxiv icon

Tailored Design of Audio-Visual Speech Recognition Models using Branchformers

Add code
Jul 09, 2024
Viaarxiv icon

AnnoTheia: A Semi-Automatic Annotation Toolkit for Audio-Visual Speech Technologies

Add code
Feb 20, 2024
Viaarxiv icon

Comparison of Conventional Hybrid and CTC/Attention Decoders for Continuous Visual Speech Recognition

Add code
Feb 20, 2024
Viaarxiv icon

Reading Between the Frames: Multi-Modal Depression Detection in Videos from Non-Verbal Cues

Add code
Jan 05, 2024
Figure 1 for Reading Between the Frames: Multi-Modal Depression Detection in Videos from Non-Verbal Cues
Figure 2 for Reading Between the Frames: Multi-Modal Depression Detection in Videos from Non-Verbal Cues
Figure 3 for Reading Between the Frames: Multi-Modal Depression Detection in Videos from Non-Verbal Cues
Figure 4 for Reading Between the Frames: Multi-Modal Depression Detection in Videos from Non-Verbal Cues
Viaarxiv icon

Speaker-Adapted End-to-End Visual Speech Recognition for Continuous Spanish

Add code
Nov 21, 2023
Viaarxiv icon

Analysis of Visual Features for Continuous Lipreading in Spanish

Add code
Nov 21, 2023
Viaarxiv icon

LIP-RTVE: An Audiovisual Database for Continuous Spanish in the Wild

Add code
Nov 21, 2023
Viaarxiv icon