Picture for Ajinkya Kale

Ajinkya Kale

Prompt-Guided Mask Proposal for Two-Stage Open-Vocabulary Segmentation

Add code
Dec 13, 2024
Viaarxiv icon

Efficient Self-Improvement in Multimodal Large Language Models: A Model-Level Judge-Free Approach

Add code
Nov 26, 2024
Viaarxiv icon

Quadratic Is Not What You Need For Multimodal Large Language Models

Add code
Oct 08, 2024
Viaarxiv icon

Towards Enhanced Controllability of Diffusion Models

Add code
Mar 15, 2023
Viaarxiv icon

Controlled and Conditional Text to Image Generation with Diffusion Prior

Add code
Feb 23, 2023
Viaarxiv icon

PRedItOR: Text Guided Image Editing with Diffusion Prior

Add code
Feb 15, 2023
Figure 1 for PRedItOR: Text Guided Image Editing with Diffusion Prior
Figure 2 for PRedItOR: Text Guided Image Editing with Diffusion Prior
Figure 3 for PRedItOR: Text Guided Image Editing with Diffusion Prior
Figure 4 for PRedItOR: Text Guided Image Editing with Diffusion Prior
Viaarxiv icon

Uncovering the Disentanglement Capability in Text-to-Image Diffusion Models

Add code
Dec 16, 2022
Viaarxiv icon

Fine-grained Image Captioning with CLIP Reward

Add code
May 26, 2022
Figure 1 for Fine-grained Image Captioning with CLIP Reward
Figure 2 for Fine-grained Image Captioning with CLIP Reward
Figure 3 for Fine-grained Image Captioning with CLIP Reward
Figure 4 for Fine-grained Image Captioning with CLIP Reward
Viaarxiv icon

StyleBabel: Artistic Style Tagging and Captioning

Add code
Mar 11, 2022
Figure 1 for StyleBabel: Artistic Style Tagging and Captioning
Figure 2 for StyleBabel: Artistic Style Tagging and Captioning
Figure 3 for StyleBabel: Artistic Style Tagging and Captioning
Figure 4 for StyleBabel: Artistic Style Tagging and Captioning
Viaarxiv icon

Towards Zero-shot Cross-lingual Image Retrieval and Tagging

Add code
Sep 15, 2021
Figure 1 for Towards Zero-shot Cross-lingual Image Retrieval and Tagging
Figure 2 for Towards Zero-shot Cross-lingual Image Retrieval and Tagging
Figure 3 for Towards Zero-shot Cross-lingual Image Retrieval and Tagging
Figure 4 for Towards Zero-shot Cross-lingual Image Retrieval and Tagging
Viaarxiv icon