Picture for Ye Fang

Ye Fang

Make-it-Real: Unleashing Large Multimodal Model's Ability for Painting 3D Objects with Realistic Materials

Add code
Apr 29, 2024
Viaarxiv icon

Gemini vs GPT-4V: A Preliminary Comparison and Combination of Vision-Language Models Through Qualitative Cases

Add code
Dec 22, 2023
Viaarxiv icon

Alpha-CLIP: A CLIP Model Focusing on Wherever You Want

Add code
Dec 13, 2023
Viaarxiv icon

GPT4Point: A Unified Framework for Point-Language Understanding and Generation

Add code
Dec 05, 2023
Viaarxiv icon