Idan Schwartz

Iterative Object Count Optimization for Text-to-image Diffusion Models

Aug 21, 2024
Via arXiv

Improving Visual Commonsense in Language Models via Multiple Image Generation

Jun 19, 2024
Via arXiv

Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation

Sep 28, 2023
Via arXiv

AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image Generation

May 22, 2023
Via arXiv

Discriminative Class Tokens for Text-to-Image Diffusion Models

Mar 30, 2023
Via arXiv

Zero-Shot Video Captioning with Evolving Pseudo-Tokens

Jul 27, 2022
Via arXiv

Optimizing Relevance Maps of Vision Transformers Improves Robustness

Jun 02, 2022
Via arXiv

Latent Space Explanation by Intervention

Dec 09, 2021
Via arXiv

Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic

Nov 29, 2021
Via arXiv

Perceptual Score: What Data Modalities Does Your Model Perceive?

Oct 27, 2021
Via arXiv