Picture for Bei Chen

Bei Chen

HumanEval-V: Evaluating Visual Understanding and Reasoning Abilities of Large Multimodal Models Through Coding Tasks

Add code
Oct 16, 2024
Figure 1 for HumanEval-V: Evaluating Visual Understanding and Reasoning Abilities of Large Multimodal Models Through Coding Tasks
Figure 2 for HumanEval-V: Evaluating Visual Understanding and Reasoning Abilities of Large Multimodal Models Through Coding Tasks
Figure 3 for HumanEval-V: Evaluating Visual Understanding and Reasoning Abilities of Large Multimodal Models Through Coding Tasks
Figure 4 for HumanEval-V: Evaluating Visual Understanding and Reasoning Abilities of Large Multimodal Models Through Coding Tasks
Viaarxiv icon

Aria: An Open Multimodal Native Mixture-of-Experts Model

Add code
Oct 08, 2024
Viaarxiv icon

LongVideoBench: A Benchmark for Long-context Interleaved Video-Language Understanding

Add code
Jul 22, 2024
Viaarxiv icon

PIN: A Knowledge-Intensive Dataset for Paired and Interleaved Multimodal Documents

Add code
Jun 20, 2024
Figure 1 for PIN: A Knowledge-Intensive Dataset for Paired and Interleaved Multimodal Documents
Figure 2 for PIN: A Knowledge-Intensive Dataset for Paired and Interleaved Multimodal Documents
Figure 3 for PIN: A Knowledge-Intensive Dataset for Paired and Interleaved Multimodal Documents
Figure 4 for PIN: A Knowledge-Intensive Dataset for Paired and Interleaved Multimodal Documents
Viaarxiv icon

Yi: Open Foundation Models by 01.AI

Add code
Mar 07, 2024
Figure 1 for Yi: Open Foundation Models by 01.AI
Figure 2 for Yi: Open Foundation Models by 01.AI
Figure 3 for Yi: Open Foundation Models by 01.AI
Figure 4 for Yi: Open Foundation Models by 01.AI
Viaarxiv icon

CMMMU: A Chinese Massive Multi-discipline Multimodal Understanding Benchmark

Add code
Jan 22, 2024
Viaarxiv icon

Can Programming Languages Boost Each Other via Instruction Tuning?

Add code
Sep 03, 2023
Figure 1 for Can Programming Languages Boost Each Other via Instruction Tuning?
Figure 2 for Can Programming Languages Boost Each Other via Instruction Tuning?
Figure 3 for Can Programming Languages Boost Each Other via Instruction Tuning?
Figure 4 for Can Programming Languages Boost Each Other via Instruction Tuning?
Viaarxiv icon

SoTaNa: The Open-Source Software Development Assistant

Add code
Aug 25, 2023
Figure 1 for SoTaNa: The Open-Source Software Development Assistant
Figure 2 for SoTaNa: The Open-Source Software Development Assistant
Figure 3 for SoTaNa: The Open-Source Software Development Assistant
Figure 4 for SoTaNa: The Open-Source Software Development Assistant
Viaarxiv icon

Skill-Based Few-Shot Selection for In-Context Learning

Add code
May 23, 2023
Figure 1 for Skill-Based Few-Shot Selection for In-Context Learning
Figure 2 for Skill-Based Few-Shot Selection for In-Context Learning
Figure 3 for Skill-Based Few-Shot Selection for In-Context Learning
Figure 4 for Skill-Based Few-Shot Selection for In-Context Learning
Viaarxiv icon

Question Answering as Programming for Solving Time-Sensitive Questions

Add code
May 23, 2023
Viaarxiv icon