Picture for Chengyao Wang

Chengyao Wang

Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition

Add code
Dec 12, 2024
Viaarxiv icon

VisionZip: Longer is Better but Not Necessary in Vision Language Models

Add code
Dec 05, 2024
Viaarxiv icon

Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models

Add code
Mar 27, 2024
Figure 1 for Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models
Figure 2 for Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models
Figure 3 for Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models
Figure 4 for Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models
Viaarxiv icon

GroupContrast: Semantic-aware Self-supervised Representation Learning for 3D Understanding

Add code
Mar 14, 2024
Viaarxiv icon

LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models

Add code
Nov 28, 2023
Viaarxiv icon

Hierarchical Dense Correlation Distillation for Few-Shot Segmentation-Extended Abstract

Add code
Jun 27, 2023
Figure 1 for Hierarchical Dense Correlation Distillation for Few-Shot Segmentation-Extended Abstract
Figure 2 for Hierarchical Dense Correlation Distillation for Few-Shot Segmentation-Extended Abstract
Figure 3 for Hierarchical Dense Correlation Distillation for Few-Shot Segmentation-Extended Abstract
Figure 4 for Hierarchical Dense Correlation Distillation for Few-Shot Segmentation-Extended Abstract
Viaarxiv icon