Picture for Adrian Bulat

Adrian Bulat

Discriminative Fine-tuning of LVLMs

Add code
Dec 05, 2024
Viaarxiv icon

FAM Diffusion: Frequency and Attention Modulation for High-Resolution Image Generation with Stable Diffusion

Add code
Nov 27, 2024
Viaarxiv icon

CLIP-DPO: Vision-Language Models as a Source of Preference for Fixing Hallucinations in LVLMs

Add code
Aug 19, 2024
Viaarxiv icon

FFF: Fixing Flawed Foundations in contrastive pre-training results in very strong Vision-Language models

Add code
May 16, 2024
Figure 1 for FFF: Fixing Flawed Foundations in contrastive pre-training results in very strong Vision-Language models
Figure 2 for FFF: Fixing Flawed Foundations in contrastive pre-training results in very strong Vision-Language models
Figure 3 for FFF: Fixing Flawed Foundations in contrastive pre-training results in very strong Vision-Language models
Figure 4 for FFF: Fixing Flawed Foundations in contrastive pre-training results in very strong Vision-Language models
Viaarxiv icon

You Only Need One Step: Fast Super-Resolution with Stable Diffusion via Scale Distillation

Add code
Jan 30, 2024
Figure 1 for You Only Need One Step: Fast Super-Resolution with Stable Diffusion via Scale Distillation
Figure 2 for You Only Need One Step: Fast Super-Resolution with Stable Diffusion via Scale Distillation
Figure 3 for You Only Need One Step: Fast Super-Resolution with Stable Diffusion via Scale Distillation
Figure 4 for You Only Need One Step: Fast Super-Resolution with Stable Diffusion via Scale Distillation
Viaarxiv icon

SimDETR: Simplifying self-supervised pretraining for DETR

Add code
Jul 28, 2023
Viaarxiv icon

Black Box Few-Shot Adaptation for Vision-Language models

Add code
Apr 04, 2023
Viaarxiv icon

FS-DETR: Few-Shot DEtection TRansformer with prompting and without re-training

Add code
Oct 10, 2022
Figure 1 for FS-DETR: Few-Shot DEtection TRansformer with prompting and without re-training
Figure 2 for FS-DETR: Few-Shot DEtection TRansformer with prompting and without re-training
Figure 3 for FS-DETR: Few-Shot DEtection TRansformer with prompting and without re-training
Figure 4 for FS-DETR: Few-Shot DEtection TRansformer with prompting and without re-training
Viaarxiv icon

Variational prompt tuning improves generalization of vision-language models

Add code
Oct 05, 2022
Figure 1 for Variational prompt tuning improves generalization of vision-language models
Figure 2 for Variational prompt tuning improves generalization of vision-language models
Figure 3 for Variational prompt tuning improves generalization of vision-language models
Figure 4 for Variational prompt tuning improves generalization of vision-language models
Viaarxiv icon

Language-Aware Soft Prompting for Vision & Language Foundation Models

Add code
Oct 03, 2022
Figure 1 for Language-Aware Soft Prompting for Vision & Language Foundation Models
Figure 2 for Language-Aware Soft Prompting for Vision & Language Foundation Models
Figure 3 for Language-Aware Soft Prompting for Vision & Language Foundation Models
Figure 4 for Language-Aware Soft Prompting for Vision & Language Foundation Models
Viaarxiv icon