Picture for Chengyao Wang

Chengyao Wang

Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models

Add code
Mar 27, 2024
Figure 1 for Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models
Figure 2 for Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models
Figure 3 for Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models
Figure 4 for Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models
Viaarxiv icon

GroupContrast: Semantic-aware Self-supervised Representation Learning for 3D Understanding

Add code
Mar 14, 2024
Viaarxiv icon

LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models

Add code
Nov 28, 2023
Viaarxiv icon

Hierarchical Dense Correlation Distillation for Few-Shot Segmentation-Extended Abstract

Add code
Jun 27, 2023
Viaarxiv icon