Picture for Stephan Alaniz

Stephan Alaniz

FLAIR: VLM with Fine-grained Language-informed Image Representations

Add code
Dec 04, 2024
Figure 1 for FLAIR: VLM with Fine-grained Language-informed Image Representations
Figure 2 for FLAIR: VLM with Fine-grained Language-informed Image Representations
Figure 3 for FLAIR: VLM with Fine-grained Language-informed Image Representations
Figure 4 for FLAIR: VLM with Fine-grained Language-informed Image Representations
Viaarxiv icon

COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training

Add code
Dec 02, 2024
Viaarxiv icon

Revealing and Reducing Gender Biases in Vision and Language Assistants (VLAs)

Add code
Oct 25, 2024
Figure 1 for Revealing and Reducing Gender Biases in Vision and Language Assistants (VLAs)
Figure 2 for Revealing and Reducing Gender Biases in Vision and Language Assistants (VLAs)
Figure 3 for Revealing and Reducing Gender Biases in Vision and Language Assistants (VLAs)
Figure 4 for Revealing and Reducing Gender Biases in Vision and Language Assistants (VLAs)
Viaarxiv icon

DataDream: Few-shot Guided Dataset Generation

Add code
Jul 16, 2024
Figure 1 for DataDream: Few-shot Guided Dataset Generation
Viaarxiv icon

How should the advent of large language models affect the practice of science?

Add code
Dec 05, 2023
Viaarxiv icon

PDiscoNet: Semantically consistent part discovery for fine-grained recognition

Add code
Sep 06, 2023
Viaarxiv icon

Iterative Superquadric Recomposition of 3D Objects from Multiple Views

Add code
Sep 05, 2023
Figure 1 for Iterative Superquadric Recomposition of 3D Objects from Multiple Views
Figure 2 for Iterative Superquadric Recomposition of 3D Objects from Multiple Views
Figure 3 for Iterative Superquadric Recomposition of 3D Objects from Multiple Views
Figure 4 for Iterative Superquadric Recomposition of 3D Objects from Multiple Views
Viaarxiv icon

DeViL: Decoding Vision features into Language

Add code
Sep 04, 2023
Figure 1 for DeViL: Decoding Vision features into Language
Figure 2 for DeViL: Decoding Vision features into Language
Figure 3 for DeViL: Decoding Vision features into Language
Figure 4 for DeViL: Decoding Vision features into Language
Viaarxiv icon

In-Context Impersonation Reveals Large Language Models' Strengths and Biases

Add code
May 24, 2023
Figure 1 for In-Context Impersonation Reveals Large Language Models' Strengths and Biases
Figure 2 for In-Context Impersonation Reveals Large Language Models' Strengths and Biases
Figure 3 for In-Context Impersonation Reveals Large Language Models' Strengths and Biases
Figure 4 for In-Context Impersonation Reveals Large Language Models' Strengths and Biases
Viaarxiv icon

Semantic Image Synthesis with Semantically Coupled VQ-Model

Add code
Sep 06, 2022
Figure 1 for Semantic Image Synthesis with Semantically Coupled VQ-Model
Figure 2 for Semantic Image Synthesis with Semantically Coupled VQ-Model
Figure 3 for Semantic Image Synthesis with Semantically Coupled VQ-Model
Figure 4 for Semantic Image Synthesis with Semantically Coupled VQ-Model
Viaarxiv icon