Picture for Vineet Gandhi

Vineet Gandhi

CVIT, IIIT Hyderabad

TIDE: Training Locally Interpretable Domain Generalization Models Enables Test-time Correction

Add code
Nov 25, 2024
Viaarxiv icon

IdentifyMe: A Challenging Long-Context Mention Resolution Benchmark

Add code
Nov 12, 2024
Figure 1 for IdentifyMe: A Challenging Long-Context Mention Resolution Benchmark
Figure 2 for IdentifyMe: A Challenging Long-Context Mention Resolution Benchmark
Figure 3 for IdentifyMe: A Challenging Long-Context Mention Resolution Benchmark
Figure 4 for IdentifyMe: A Challenging Long-Context Mention Resolution Benchmark
Viaarxiv icon

Towards Improving NAM-to-Speech Synthesis Intelligibility using Self-Supervised Speech Models

Add code
Jul 26, 2024
Figure 1 for Towards Improving NAM-to-Speech Synthesis Intelligibility using Self-Supervised Speech Models
Figure 2 for Towards Improving NAM-to-Speech Synthesis Intelligibility using Self-Supervised Speech Models
Figure 3 for Towards Improving NAM-to-Speech Synthesis Intelligibility using Self-Supervised Speech Models
Figure 4 for Towards Improving NAM-to-Speech Synthesis Intelligibility using Self-Supervised Speech Models
Viaarxiv icon

Major Entity Identification: A Generalizable Alternative to Coreference Resolution

Add code
Jun 20, 2024
Viaarxiv icon

VELOCITI: Can Video-Language Models Bind Semantic Concepts through Time?

Add code
Jun 16, 2024
Viaarxiv icon

SARI: Simplistic Average and Robust Identification based Noisy Partial Label Learning

Add code
Feb 07, 2024
Viaarxiv icon

Real Time GAZED: Online Shot Selection and Editing of Virtual Cameras from Wide-Angle Monocular Video Recordings

Add code
Nov 27, 2023
Viaarxiv icon

RobustL2S: Speaker-Specific Lip-to-Speech Synthesis exploiting Self-Supervised Representations

Add code
Jul 03, 2023
Viaarxiv icon

Instance-Level Semantic Maps for Vision Language Navigation

Add code
May 23, 2023
Viaarxiv icon

MParrotTTS: Multilingual Multi-speaker Text to Speech Synthesis in Low Resource Setting

Add code
May 19, 2023
Figure 1 for MParrotTTS: Multilingual Multi-speaker Text to Speech Synthesis in Low Resource Setting
Figure 2 for MParrotTTS: Multilingual Multi-speaker Text to Speech Synthesis in Low Resource Setting
Figure 3 for MParrotTTS: Multilingual Multi-speaker Text to Speech Synthesis in Low Resource Setting
Figure 4 for MParrotTTS: Multilingual Multi-speaker Text to Speech Synthesis in Low Resource Setting
Viaarxiv icon