Zeyi Sun

X-Prompt: Towards Universal In-Context Image Generation in Auto-Regressive Vision Language Foundation Models

Dec 02, 2024

Pre-trained Graphformer-based Ranking at Web-scale Search (Extended Abstract)

Sep 25, 2024

V3Det Challenge 2024 on Vast Vocabulary and Open Vocabulary Object Detection: Methods and Results

Jun 17, 2024

Bootstrap3D: Improving 3D Content Creation with Synthetic Data

May 31, 2024

Make-it-Real: Unleashing Large Multimodal Model's Ability for Painting 3D Objects with Realistic Materials

Apr 29, 2024

RAR: Retrieving And Ranking Augmented MLLMs for Visual Recognition

Mar 20, 2024

Towards Explainable Artificial Intelligence (XAI): A Data Mining Perspective

Jan 13, 2024

Gemini vs GPT-4V: A Preliminary Comparison and Combination of Vision-Language Models Through Qualitative Cases

Dec 22, 2023

Alpha-CLIP: A CLIP Model Focusing on Wherever You Want

Dec 13, 2023

GPT4Point: A Unified Framework for Point-Language Understanding and Generation

Dec 05, 2023