Picture for Kranti Kumar Parida

Kranti Kumar Parida

Beyond Mono to Binaural: Generating Binaural Audio from Mono Audio with Depth and Cross Modal Attention

Add code
Nov 15, 2021
Figure 1 for Beyond Mono to Binaural: Generating Binaural Audio from Mono Audio with Depth and Cross Modal Attention
Figure 2 for Beyond Mono to Binaural: Generating Binaural Audio from Mono Audio with Depth and Cross Modal Attention
Figure 3 for Beyond Mono to Binaural: Generating Binaural Audio from Mono Audio with Depth and Cross Modal Attention
Figure 4 for Beyond Mono to Binaural: Generating Binaural Audio from Mono Audio with Depth and Cross Modal Attention
Viaarxiv icon

Depth Infused Binaural Audio Generation using Hierarchical Cross-Modal Attention

Add code
Aug 10, 2021
Figure 1 for Depth Infused Binaural Audio Generation using Hierarchical Cross-Modal Attention
Figure 2 for Depth Infused Binaural Audio Generation using Hierarchical Cross-Modal Attention
Figure 3 for Depth Infused Binaural Audio Generation using Hierarchical Cross-Modal Attention
Figure 4 for Depth Infused Binaural Audio Generation using Hierarchical Cross-Modal Attention
Viaarxiv icon

Beyond Image to Depth: Improving Depth Prediction using Echoes

Add code
Apr 03, 2021
Figure 1 for Beyond Image to Depth: Improving Depth Prediction using Echoes
Figure 2 for Beyond Image to Depth: Improving Depth Prediction using Echoes
Figure 3 for Beyond Image to Depth: Improving Depth Prediction using Echoes
Figure 4 for Beyond Image to Depth: Improving Depth Prediction using Echoes
Viaarxiv icon

Discriminative Semantic Transitive Consistency for Cross-Modal Learning

Add code
Mar 25, 2021
Figure 1 for Discriminative Semantic Transitive Consistency for Cross-Modal Learning
Figure 2 for Discriminative Semantic Transitive Consistency for Cross-Modal Learning
Figure 3 for Discriminative Semantic Transitive Consistency for Cross-Modal Learning
Figure 4 for Discriminative Semantic Transitive Consistency for Cross-Modal Learning
Viaarxiv icon

AVGZSLNet: Audio-Visual Generalized Zero-Shot Learning by Reconstructing Label Features from Multi-Modal Embeddings

Add code
May 27, 2020
Figure 1 for AVGZSLNet: Audio-Visual Generalized Zero-Shot Learning by Reconstructing Label Features from Multi-Modal Embeddings
Figure 2 for AVGZSLNet: Audio-Visual Generalized Zero-Shot Learning by Reconstructing Label Features from Multi-Modal Embeddings
Figure 3 for AVGZSLNet: Audio-Visual Generalized Zero-Shot Learning by Reconstructing Label Features from Multi-Modal Embeddings
Figure 4 for AVGZSLNet: Audio-Visual Generalized Zero-Shot Learning by Reconstructing Label Features from Multi-Modal Embeddings
Viaarxiv icon

Coordinated Joint Multimodal Embeddings for Generalized Audio-Visual Zeroshot Classification and Retrieval of Videos

Add code
Oct 19, 2019
Figure 1 for Coordinated Joint Multimodal Embeddings for Generalized Audio-Visual Zeroshot Classification and Retrieval of Videos
Figure 2 for Coordinated Joint Multimodal Embeddings for Generalized Audio-Visual Zeroshot Classification and Retrieval of Videos
Figure 3 for Coordinated Joint Multimodal Embeddings for Generalized Audio-Visual Zeroshot Classification and Retrieval of Videos
Figure 4 for Coordinated Joint Multimodal Embeddings for Generalized Audio-Visual Zeroshot Classification and Retrieval of Videos
Viaarxiv icon