Picture for David Gimeno-Gómez

David Gimeno-Gómez

Tailored Design of Audio-Visual Speech Recognition Models using Branchformers

Add code
Jul 09, 2024
Viaarxiv icon

Comparison of Conventional Hybrid and CTC/Attention Decoders for Continuous Visual Speech Recognition

Add code
Feb 20, 2024
Viaarxiv icon

AnnoTheia: A Semi-Automatic Annotation Toolkit for Audio-Visual Speech Technologies

Add code
Feb 20, 2024
Viaarxiv icon

Reading Between the Frames: Multi-Modal Depression Detection in Videos from Non-Verbal Cues

Add code
Jan 05, 2024
Viaarxiv icon

Analysis of Visual Features for Continuous Lipreading in Spanish

Add code
Nov 21, 2023
Viaarxiv icon

LIP-RTVE: An Audiovisual Database for Continuous Spanish in the Wild

Add code
Nov 21, 2023
Viaarxiv icon

Speaker-Adapted End-to-End Visual Speech Recognition for Continuous Spanish

Add code
Nov 21, 2023
Viaarxiv icon