Picture for Vineet Gandhi

Vineet Gandhi

CVIT, IIIT Hyderabad

Advancing NAM-to-Speech Conversion with Novel Methods and the MultiNAM Dataset

Add code
Dec 25, 2024
Viaarxiv icon

MRI2Speech: Speech Synthesis from Articulatory Movements Recorded by Real-time MRI

Add code
Dec 25, 2024
Viaarxiv icon

TIDE: Training Locally Interpretable Domain Generalization Models Enables Test-time Correction

Add code
Nov 25, 2024
Figure 1 for TIDE: Training Locally Interpretable Domain Generalization Models Enables Test-time Correction
Viaarxiv icon

IdentifyMe: A Challenging Long-Context Mention Resolution Benchmark

Add code
Nov 12, 2024
Figure 1 for IdentifyMe: A Challenging Long-Context Mention Resolution Benchmark
Figure 2 for IdentifyMe: A Challenging Long-Context Mention Resolution Benchmark
Figure 3 for IdentifyMe: A Challenging Long-Context Mention Resolution Benchmark
Figure 4 for IdentifyMe: A Challenging Long-Context Mention Resolution Benchmark
Viaarxiv icon

Towards Improving NAM-to-Speech Synthesis Intelligibility using Self-Supervised Speech Models

Add code
Jul 26, 2024
Figure 1 for Towards Improving NAM-to-Speech Synthesis Intelligibility using Self-Supervised Speech Models
Figure 2 for Towards Improving NAM-to-Speech Synthesis Intelligibility using Self-Supervised Speech Models
Figure 3 for Towards Improving NAM-to-Speech Synthesis Intelligibility using Self-Supervised Speech Models
Figure 4 for Towards Improving NAM-to-Speech Synthesis Intelligibility using Self-Supervised Speech Models
Viaarxiv icon

Major Entity Identification: A Generalizable Alternative to Coreference Resolution

Add code
Jun 20, 2024
Viaarxiv icon

VELOCITI: Can Video-Language Models Bind Semantic Concepts through Time?

Add code
Jun 16, 2024
Viaarxiv icon

SARI: Simplistic Average and Robust Identification based Noisy Partial Label Learning

Add code
Feb 07, 2024
Figure 1 for SARI: Simplistic Average and Robust Identification based Noisy Partial Label Learning
Figure 2 for SARI: Simplistic Average and Robust Identification based Noisy Partial Label Learning
Figure 3 for SARI: Simplistic Average and Robust Identification based Noisy Partial Label Learning
Figure 4 for SARI: Simplistic Average and Robust Identification based Noisy Partial Label Learning
Viaarxiv icon

Real Time GAZED: Online Shot Selection and Editing of Virtual Cameras from Wide-Angle Monocular Video Recordings

Add code
Nov 27, 2023
Figure 1 for Real Time GAZED: Online Shot Selection and Editing of Virtual Cameras from Wide-Angle Monocular Video Recordings
Figure 2 for Real Time GAZED: Online Shot Selection and Editing of Virtual Cameras from Wide-Angle Monocular Video Recordings
Figure 3 for Real Time GAZED: Online Shot Selection and Editing of Virtual Cameras from Wide-Angle Monocular Video Recordings
Figure 4 for Real Time GAZED: Online Shot Selection and Editing of Virtual Cameras from Wide-Angle Monocular Video Recordings
Viaarxiv icon

RobustL2S: Speaker-Specific Lip-to-Speech Synthesis exploiting Self-Supervised Representations

Add code
Jul 03, 2023
Viaarxiv icon