Picture for Bingqi Ma

Bingqi Ma

EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM

Add code
Dec 12, 2024
Viaarxiv icon

Pretrained Reversible Generation as Unsupervised Visual Representation Learning

Add code
Nov 29, 2024
Viaarxiv icon

Exploring the Role of Large Language Models in Prompt Encoding for Diffusion Models

Add code
Jun 17, 2024
Viaarxiv icon

MoVA: Adapting Mixture of Vision Experts to Multimodal Context

Add code
Apr 19, 2024
Figure 1 for MoVA: Adapting Mixture of Vision Experts to Multimodal Context
Figure 2 for MoVA: Adapting Mixture of Vision Experts to Multimodal Context
Figure 3 for MoVA: Adapting Mixture of Vision Experts to Multimodal Context
Figure 4 for MoVA: Adapting Mixture of Vision Experts to Multimodal Context
Viaarxiv icon

MedFLIP: Medical Vision-and-Language Self-supervised Fast Pre-Training with Masked Autoencoder

Add code
Mar 07, 2024
Viaarxiv icon

Towards Large-scale Masked Face Recognition

Add code
Oct 25, 2023
Viaarxiv icon

Rethinking Robust Representation Learning Under Fine-grained Noisy Faces

Add code
Aug 08, 2022
Figure 1 for Rethinking Robust Representation Learning Under Fine-grained Noisy Faces
Figure 2 for Rethinking Robust Representation Learning Under Fine-grained Noisy Faces
Figure 3 for Rethinking Robust Representation Learning Under Fine-grained Noisy Faces
Figure 4 for Rethinking Robust Representation Learning Under Fine-grained Noisy Faces
Viaarxiv icon

Target-Relevant Knowledge Preservation for Multi-Source Domain Adaptive Object Detection

Add code
Apr 17, 2022
Figure 1 for Target-Relevant Knowledge Preservation for Multi-Source Domain Adaptive Object Detection
Figure 2 for Target-Relevant Knowledge Preservation for Multi-Source Domain Adaptive Object Detection
Figure 3 for Target-Relevant Knowledge Preservation for Multi-Source Domain Adaptive Object Detection
Figure 4 for Target-Relevant Knowledge Preservation for Multi-Source Domain Adaptive Object Detection
Viaarxiv icon