Picture for Vimal Bhat

Vimal Bhat

NowYouSee Me: Context-Aware Automatic Audio Description

Add code
Dec 13, 2024
Viaarxiv icon

GEXIA: Granularity Expansion and Iterative Approximation for Scalable Multi-grained Video-language Learning

Add code
Dec 10, 2024
Viaarxiv icon

DiffSign: AI-Assisted Generation of Customizable Sign Language Videos With Enhanced Realism

Add code
Dec 05, 2024
Viaarxiv icon

Text-Guided Video Masked Autoencoder

Add code
Aug 01, 2024
Figure 1 for Text-Guided Video Masked Autoencoder
Figure 2 for Text-Guided Video Masked Autoencoder
Figure 3 for Text-Guided Video Masked Autoencoder
Figure 4 for Text-Guided Video Masked Autoencoder
Viaarxiv icon

Motion-Guided Masking for Spatiotemporal Representation Learning

Add code
Aug 24, 2023
Viaarxiv icon

MEGA: Multimodal Alignment Aggregation and Distillation For Cinematic Video Segmentation

Add code
Aug 22, 2023
Viaarxiv icon

Nearest-Neighbor Inter-Intra Contrastive Learning from Unlabeled Videos

Add code
Mar 13, 2023
Viaarxiv icon

Shot Contrastive Self-Supervised Learning for Scene Boundary Detection

Add code
Apr 28, 2021
Figure 1 for Shot Contrastive Self-Supervised Learning for Scene Boundary Detection
Figure 2 for Shot Contrastive Self-Supervised Learning for Scene Boundary Detection
Figure 3 for Shot Contrastive Self-Supervised Learning for Scene Boundary Detection
Figure 4 for Shot Contrastive Self-Supervised Learning for Scene Boundary Detection
Viaarxiv icon