Chongyi Wang

MiniCPM-V: A GPT-4V Level MLLM on Your Phone

Aug 03, 2024
Via arXiv

GUICourse: From General Vision Language Models to Versatile GUI Agents

Jun 17, 2024
Via arXiv

MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies

Apr 09, 2024
Via arXiv

Reformulating Vision-Language Foundation Models and Datasets Towards Universal Multimodal Assistants

Oct 01, 2023
Via arXiv

Large Multilingual Models Pivot Zero-Shot Multimodal Learning across Languages

Aug 23, 2023
Via arXiv