Title: ChartCoder: Advancing Multimodal Large Language Model for Chart-to-Code Generation

Jan 11, 2025

Xuanle Zhao, Xianzhen Luo, Qi Shi, Chi Chen, Shuo Wang, Wanxiang Che, Zhiyuan Liu, Maosong Sun

Figures 1 to 4 for ChartCoder: Advancing Multimodal Large Language Model for Chart-to-Code Generation

Abstract: Multimodal Large Language Models (MLLMs) have demonstrated remarkable capabilities in chart understanding tasks. However, interpreting charts with textual descriptions often leads to information loss, as text fails to fully capture the dense information embedded in charts. In contrast, parsing charts into code provides lossless representations that can effectively contain all critical details. Although existing open-source MLLMs have achieved success in chart understanding tasks, they still face two major challenges when applied to chart-to-code tasks: (1) low executability and poor restoration of chart details in the generated code, and (2) a lack of large-scale and diverse training data. To address these challenges, we propose ChartCoder, the first dedicated chart-to-code MLLM, which leverages Code LLMs as the language backbone to enhance the executability of the generated code. Furthermore, we introduce Chart2Code-160k, the first large-scale and diverse dataset for chart-to-code generation, and propose the Snippet-of-Thought (SoT) method, which transforms direct chart-to-code generation data into step-by-step generation. Experiments demonstrate that ChartCoder, with only 7B parameters, surpasses existing open-source MLLMs on chart-to-code benchmarks, achieving superior chart restoration and code executability. Our code will be available at https://github.com/thunlp/ChartCoder.

* 13 pages, 6 figures
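
The abstract names code executability as one of the two axes on which generated chart code is judged. As a rough illustration of what such a measurement could look like, the sketch below runs each generated snippet in a separate Python process and reports the fraction that exit cleanly. This is not the paper's evaluation protocol: the function names `is_executable` and `execution_rate`, the 10-second timeout, and the "exit status 0" criterion are all assumptions made for illustration.

```python
import subprocess
import sys
import tempfile
from pathlib import Path


def is_executable(code: str, timeout_s: float = 10.0) -> bool:
    """Return True if `code` runs to completion with exit status 0.

    Hypothetical helper: the snippet is written to a temporary file and
    run in a fresh interpreter so that a crash cannot affect the caller.
    """
    with tempfile.TemporaryDirectory() as workdir:
        script = Path(workdir) / "snippet.py"
        script.write_text(code, encoding="utf-8")
        try:
            result = subprocess.run(
                [sys.executable, str(script)],
                cwd=workdir,           # any files the snippet saves land here
                capture_output=True,   # keep snippet output off our stdout
                timeout=timeout_s,
            )
        except subprocess.TimeoutExpired:
            # Assumption: a snippet that hangs counts as non-executable.
            return False
        return result.returncode == 0


def execution_rate(snippets: list[str]) -> float:
    """Fraction of snippets that execute without error (0.0 if empty)."""
    if not snippets:
        return 0.0
    passed = sum(is_executable(snippet) for snippet in snippets)
    return passed / len(snippets)
```

A check like this only captures whether the code runs. Judging how well the rendered chart restores the original, the second axis the abstract mentions, would require comparing the produced figure against the input image, which this sketch does not attempt.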
