Picture for Longhui Wei

Longhui Wei

ViMoE: An Empirical Study of Designing Vision Mixture-of-Experts

Add code
Oct 21, 2024
Viaarxiv icon

OVMR: Open-Vocabulary Recognition with Multi-Modal References

Add code
Jun 07, 2024
Viaarxiv icon

Enhance Image Classification via Inter-Class Image Mixup with Diffusion Model

Add code
Mar 28, 2024
Viaarxiv icon

Incorporating Visual Experts to Resolve the Information Loss in Multimodal Large Language Models

Add code
Jan 13, 2024
Viaarxiv icon

Boosting Segment Anything Model Towards Open-Vocabulary Learning

Add code
Dec 06, 2023
Viaarxiv icon

HalluciDoctor: Mitigating Hallucinatory Toxicity in Visual Instruction Data

Add code
Nov 22, 2023
Viaarxiv icon

Degeneration-Tuning: Using Scrambled Grid shield Unwanted Concepts from Stable Diffusion

Add code
Aug 08, 2023
Viaarxiv icon

SDDM: Score-Decomposed Diffusion Models on Manifolds for Unpaired Image-to-Image Translation

Add code
Aug 04, 2023
Viaarxiv icon

Towards AGI in Computer Vision: Lessons Learned from GPT and Large Language Models

Add code
Jun 14, 2023
Figure 1 for Towards AGI in Computer Vision: Lessons Learned from GPT and Large Language Models
Figure 2 for Towards AGI in Computer Vision: Lessons Learned from GPT and Large Language Models
Figure 3 for Towards AGI in Computer Vision: Lessons Learned from GPT and Large Language Models
Figure 4 for Towards AGI in Computer Vision: Lessons Learned from GPT and Large Language Models
Viaarxiv icon

Continual Vision-Language Representation Learning with Off-Diagonal Information

Add code
May 17, 2023
Viaarxiv icon