Nimrod Shabtay

Teaching VLMs to Localize Specific Objects from In-context Examples

Nov 20, 2024

Continuous Speech Synthesis using per-token Latent Diffusion

Oct 21, 2024

LiveXiv -- A Multi-Modal Live Benchmark Based on Arxiv Papers Content

Oct 15, 2024

Deep Phase Coded Image Prior

Apr 05, 2024

PIP: Positional-encoding Image Prior

Nov 25, 2022