Picture for Shuyue Guo

Shuyue Guo

MdEval: Massively Multilingual Code Debugging

Add code
Nov 04, 2024
Figure 1 for MdEval: Massively Multilingual Code Debugging
Figure 2 for MdEval: Massively Multilingual Code Debugging
Figure 3 for MdEval: Massively Multilingual Code Debugging
Figure 4 for MdEval: Massively Multilingual Code Debugging
Viaarxiv icon

ING-VP: MLLMs cannot Play Easy Vision-based Games Yet

Add code
Oct 09, 2024
Figure 1 for ING-VP: MLLMs cannot Play Easy Vision-based Games Yet
Figure 2 for ING-VP: MLLMs cannot Play Easy Vision-based Games Yet
Figure 3 for ING-VP: MLLMs cannot Play Easy Vision-based Games Yet
Figure 4 for ING-VP: MLLMs cannot Play Easy Vision-based Games Yet
Viaarxiv icon

LIME-M: Less Is More for Evaluation of MLLMs

Add code
Sep 10, 2024
Figure 1 for LIME-M: Less Is More for Evaluation of MLLMs
Figure 2 for LIME-M: Less Is More for Evaluation of MLLMs
Figure 3 for LIME-M: Less Is More for Evaluation of MLLMs
Figure 4 for LIME-M: Less Is More for Evaluation of MLLMs
Viaarxiv icon

MuPT: A Generative Symbolic Music Pretrained Transformer

Add code
Apr 10, 2024
Figure 1 for MuPT: A Generative Symbolic Music Pretrained Transformer
Figure 2 for MuPT: A Generative Symbolic Music Pretrained Transformer
Figure 3 for MuPT: A Generative Symbolic Music Pretrained Transformer
Figure 4 for MuPT: A Generative Symbolic Music Pretrained Transformer
Viaarxiv icon

CodeEditorBench: Evaluating Code Editing Capability of Large Language Models

Add code
Apr 06, 2024
Figure 1 for CodeEditorBench: Evaluating Code Editing Capability of Large Language Models
Figure 2 for CodeEditorBench: Evaluating Code Editing Capability of Large Language Models
Figure 3 for CodeEditorBench: Evaluating Code Editing Capability of Large Language Models
Figure 4 for CodeEditorBench: Evaluating Code Editing Capability of Large Language Models
Viaarxiv icon

CMMMU: A Chinese Massive Multi-discipline Multimodal Understanding Benchmark

Add code
Jan 22, 2024
Viaarxiv icon

Kun: Answer Polishment for Chinese Self-Alignment with Instruction Back-Translation

Add code
Jan 12, 2024
Viaarxiv icon