Picture for Vineet Gandhi

Vineet Gandhi

CVIT, IIIT Hyderabad

IdentifyMe: A Challenging Long-Context Mention Resolution Benchmark

Add code
Nov 12, 2024
Viaarxiv icon

Towards Improving NAM-to-Speech Synthesis Intelligibility using Self-Supervised Speech Models

Add code
Jul 26, 2024
Viaarxiv icon

Major Entity Identification: A Generalizable Alternative to Coreference Resolution

Add code
Jun 20, 2024
Viaarxiv icon

VELOCITI: Can Video-Language Models Bind Semantic Concepts through Time?

Add code
Jun 16, 2024
Viaarxiv icon

SARI: Simplistic Average and Robust Identification based Noisy Partial Label Learning

Add code
Feb 07, 2024
Viaarxiv icon

Real Time GAZED: Online Shot Selection and Editing of Virtual Cameras from Wide-Angle Monocular Video Recordings

Add code
Nov 27, 2023
Viaarxiv icon

RobustL2S: Speaker-Specific Lip-to-Speech Synthesis exploiting Self-Supervised Representations

Add code
Jul 03, 2023
Viaarxiv icon

Instance-Level Semantic Maps for Vision Language Navigation

Add code
May 23, 2023
Viaarxiv icon

MParrotTTS: Multilingual Multi-speaker Text to Speech Synthesis in Low Resource Setting

Add code
May 19, 2023
Figure 1 for MParrotTTS: Multilingual Multi-speaker Text to Speech Synthesis in Low Resource Setting
Figure 2 for MParrotTTS: Multilingual Multi-speaker Text to Speech Synthesis in Low Resource Setting
Figure 3 for MParrotTTS: Multilingual Multi-speaker Text to Speech Synthesis in Low Resource Setting
Figure 4 for MParrotTTS: Multilingual Multi-speaker Text to Speech Synthesis in Low Resource Setting
Viaarxiv icon

ParrotTTS: Text-to-Speech synthesis by exploiting self-supervised representations

Add code
Mar 01, 2023
Viaarxiv icon