Qi Qian

Customized Multiple Clustering via Multi-Modal Subspace Proxy Learning

Nov 06, 2024
Via arXiv

SimInversion: A Simple Framework for Inversion-Based Text-to-Image Editing

Sep 16, 2024
Via arXiv

Text-Guided Mixup Towards Long-Tailed Image Categorization

Sep 05, 2024
Via arXiv

SeA: Semantic Adversarial Augmentation for Last Layer Features from Unsupervised Representation Learning

Aug 23, 2024
Via arXiv

Online Zero-Shot Classification with CLIP

Aug 23, 2024
Via arXiv

mPLUG-Owl3: Towards Long Image-Sequence Understanding in Multi-Modal Large Language Models

Aug 09, 2024
Via arXiv

Searching for Best Practices in Retrieval-Augmented Generation

Jul 01, 2024
Via arXiv

Efficient Personalized Text-to-image Generation by Leveraging Textual Subspace

Jun 30, 2024
Via arXiv

Multi-Modal Proxy Learning Towards Personalized Visual Multiple Clustering

Apr 24, 2024
Via arXiv

mPLUG-PaperOwl: Scientific Diagram Analysis with the Multimodal Large Language Model

Nov 30, 2023
Via arXiv