Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:TableLLM: Enabling Tabular Data Manipulation by LLMs in Real Office Usage Scenarios

Apr 01, 2024

Xiaokang Zhang, Jing Zhang, Zeyao Ma, Yang Li, Bohan Zhang, Guanlin Li, Zijun Yao, Kangli Xu, Jinchang Zhou, Daniel Zhang-Li(+4 more)

Figure 1 for TableLLM: Enabling Tabular Data Manipulation by LLMs in Real Office Usage Scenarios

Figure 2 for TableLLM: Enabling Tabular Data Manipulation by LLMs in Real Office Usage Scenarios

Figure 3 for TableLLM: Enabling Tabular Data Manipulation by LLMs in Real Office Usage Scenarios

Figure 4 for TableLLM: Enabling Tabular Data Manipulation by LLMs in Real Office Usage Scenarios

Share this with someone who'll enjoy it:

Abstract:We introduce TableLLM, a robust large language model (LLM) with 13 billion parameters, purpose-built for proficiently handling tabular data manipulation tasks, whether they are embedded within documents or spreadsheets, catering to real-world office scenarios. We propose a distant supervision method for training, which comprises a reasoning process extension strategy, aiding in training LLMs to understand reasoning patterns more effectively as well as a cross-way validation strategy, ensuring the quality of the automatically generated data. To evaluate the performance of TableLLM, we have crafted a benchmark tailored to address both document and spreadsheet formats as well as constructed a well-organized evaluation pipeline capable of handling both scenarios. Thorough evaluations underscore the advantages of TableLLM when compared to various existing general-purpose and tabular data-focused LLMs. We have publicly released the model checkpoint, source code, benchmarks, and a web application for user interaction.Our codes and data are publicly available at https://github.com/TableLLM/TableLLM.

* https://tablellm.github.io/

View paper on

Share this with someone who'll enjoy it:

Title:TableLLM: Enabling Tabular Data Manipulation by LLMs in Real Office Usage Scenarios

Paper and Code