Picture for Nabarun Goswami

Nabarun Goswami

HyperVQ: MLR-based Vector Quantization in Hyperbolic Space

Add code
Mar 18, 2024
Viaarxiv icon

Advancing Large Multi-modal Models with Explicit Chain-of-Reasoning and Visual Question Generation

Add code
Jan 18, 2024
Figure 1 for Advancing Large Multi-modal Models with Explicit Chain-of-Reasoning and Visual Question Generation
Figure 2 for Advancing Large Multi-modal Models with Explicit Chain-of-Reasoning and Visual Question Generation
Figure 3 for Advancing Large Multi-modal Models with Explicit Chain-of-Reasoning and Visual Question Generation
Figure 4 for Advancing Large Multi-modal Models with Explicit Chain-of-Reasoning and Visual Question Generation
Viaarxiv icon

The Sound Demixing Challenge 2023 $\unicode{x2013}$ Music Demixing Track

Add code
Aug 14, 2023
Viaarxiv icon

SATTS: Speaker Attractor Text to Speech, Learning to Speak by Learning to Separate

Add code
Jul 13, 2022
Figure 1 for SATTS: Speaker Attractor Text to Speech, Learning to Speak by Learning to Separate
Figure 2 for SATTS: Speaker Attractor Text to Speech, Learning to Speak by Learning to Separate
Figure 3 for SATTS: Speaker Attractor Text to Speech, Learning to Speak by Learning to Separate
Figure 4 for SATTS: Speaker Attractor Text to Speech, Learning to Speak by Learning to Separate
Viaarxiv icon