Picture for Gabriela Ben Melech Stan

Gabriela Ben Melech Stan

FiVL: A Framework for Improved Vision-Language Alignment

Add code
Dec 19, 2024
Figure 1 for FiVL: A Framework for Improved Vision-Language Alignment
Figure 2 for FiVL: A Framework for Improved Vision-Language Alignment
Figure 3 for FiVL: A Framework for Improved Vision-Language Alignment
Figure 4 for FiVL: A Framework for Improved Vision-Language Alignment
Viaarxiv icon

LVLM-Intrepret: An Interpretability Tool for Large Vision-Language Models

Add code
Apr 03, 2024
Viaarxiv icon

Getting it Right: Improving Spatial Consistency in Text-to-Image Models

Add code
Apr 01, 2024
Viaarxiv icon

LDM3D-VR: Latent Diffusion Model for 3D VR

Add code
Nov 06, 2023
Viaarxiv icon

LDM3D: Latent Diffusion Model for 3D

Add code
May 21, 2023
Viaarxiv icon

Improving video retrieval using multilingual knowledge transfer

Add code
Aug 28, 2022
Figure 1 for Improving video retrieval using multilingual knowledge transfer
Figure 2 for Improving video retrieval using multilingual knowledge transfer
Figure 3 for Improving video retrieval using multilingual knowledge transfer
Figure 4 for Improving video retrieval using multilingual knowledge transfer
Viaarxiv icon