
George Sterpu

Data Center Audio/Video Intelligence on Device (DAVID) -- An Edge-AI Platform for Smart-Toys

Nov 18, 2023

AV Taris: Online Audio-Visual Speech Recognition

Dec 14, 2020

Learning to Count Words in Fluent Speech enables Online Speech Recognition

Jun 11, 2020

Should we hard-code the recurrence concept or learn it instead ? Exploring the Transformer architecture for Audio-Visual Speech Recognition

May 19, 2020

How to Teach DNNs to Pay Attention to the Visual Modality in Speech Recognition

Apr 17, 2020

Attention-based Audio-Visual Fusion for Robust Automatic Speech Recognition

Sep 05, 2018

Can DNNs Learn to Lipread Full Sentences?

May 29, 2018