Abhinav Thanda

Audio-Visual Decision Fusion for WFST-based and seq2seq Models

Jan 29, 2020

LipReading with 3D-2D-CNN BLSTM-HMM and word-CTC models

Jun 25, 2019

Multi-task Learning Of Deep Neural Networks For Audio Visual Automatic Speech Recognition

Jan 10, 2017

Audio Visual Speech Recognition using Deep Recurrent Neural Networks

Nov 09, 2016