Picture for Shiva Sundaram

Shiva Sundaram

Disentanglement for audio-visual emotion recognition using multitask setup

Add code
Feb 11, 2021
Figure 1 for Disentanglement for audio-visual emotion recognition using multitask setup
Figure 2 for Disentanglement for audio-visual emotion recognition using multitask setup
Figure 3 for Disentanglement for audio-visual emotion recognition using multitask setup
Figure 4 for Disentanglement for audio-visual emotion recognition using multitask setup
Viaarxiv icon

Audiovisual Highlight Detection in Videos

Add code
Feb 11, 2021
Figure 1 for Audiovisual Highlight Detection in Videos
Figure 2 for Audiovisual Highlight Detection in Videos
Figure 3 for Audiovisual Highlight Detection in Videos
Figure 4 for Audiovisual Highlight Detection in Videos
Viaarxiv icon

Self-Supervised learning with cross-modal transformers for emotion recognition

Add code
Nov 20, 2020
Figure 1 for Self-Supervised learning with cross-modal transformers for emotion recognition
Figure 2 for Self-Supervised learning with cross-modal transformers for emotion recognition
Figure 3 for Self-Supervised learning with cross-modal transformers for emotion recognition
Viaarxiv icon

Multi-modal embeddings using multi-task learning for emotion recognition

Add code
Sep 10, 2020
Figure 1 for Multi-modal embeddings using multi-task learning for emotion recognition
Figure 2 for Multi-modal embeddings using multi-task learning for emotion recognition
Figure 3 for Multi-modal embeddings using multi-task learning for emotion recognition
Viaarxiv icon

Multiresolution and Multimodal Speech Recognition with Transformers

Add code
Apr 29, 2020
Figure 1 for Multiresolution and Multimodal Speech Recognition with Transformers
Figure 2 for Multiresolution and Multimodal Speech Recognition with Transformers
Figure 3 for Multiresolution and Multimodal Speech Recognition with Transformers
Figure 4 for Multiresolution and Multimodal Speech Recognition with Transformers
Viaarxiv icon

Robust Multi-channel Speech Recognition using Frequency Aligned Network

Add code
Feb 06, 2020
Figure 1 for Robust Multi-channel Speech Recognition using Frequency Aligned Network
Figure 2 for Robust Multi-channel Speech Recognition using Frequency Aligned Network
Figure 3 for Robust Multi-channel Speech Recognition using Frequency Aligned Network
Figure 4 for Robust Multi-channel Speech Recognition using Frequency Aligned Network
Viaarxiv icon

Fully Learnable Front-End for Multi-Channel Acoustic Modeling using Semi-Supervised Learning

Add code
Feb 01, 2020
Figure 1 for Fully Learnable Front-End for Multi-Channel Acoustic Modeling using Semi-Supervised Learning
Figure 2 for Fully Learnable Front-End for Multi-Channel Acoustic Modeling using Semi-Supervised Learning
Figure 3 for Fully Learnable Front-End for Multi-Channel Acoustic Modeling using Semi-Supervised Learning
Figure 4 for Fully Learnable Front-End for Multi-Channel Acoustic Modeling using Semi-Supervised Learning
Viaarxiv icon

Improving noise robustness of automatic speech recognition via parallel data and teacher-student learning

Add code
Jan 11, 2019
Figure 1 for Improving noise robustness of automatic speech recognition via parallel data and teacher-student learning
Figure 2 for Improving noise robustness of automatic speech recognition via parallel data and teacher-student learning
Figure 3 for Improving noise robustness of automatic speech recognition via parallel data and teacher-student learning
Figure 4 for Improving noise robustness of automatic speech recognition via parallel data and teacher-student learning
Viaarxiv icon