Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Brij Chavda

An Automatic Prompt Generation System for Tabular Data Tasks

May 09, 2024

Ashlesha Akella, Abhijit Manatkar, Brij Chavda, Hima Patel

Figure 1 for An Automatic Prompt Generation System for Tabular Data Tasks

Figure 2 for An Automatic Prompt Generation System for Tabular Data Tasks

Figure 3 for An Automatic Prompt Generation System for Tabular Data Tasks

Figure 4 for An Automatic Prompt Generation System for Tabular Data Tasks

Abstract:Efficient processing of tabular data is important in various industries, especially when working with datasets containing a large number of columns. Large language models (LLMs) have demonstrated their ability on several tasks through carefully crafted prompts. However, creating effective prompts for tabular datasets is challenging due to the structured nature of the data and the need to manage numerous columns. This paper presents an innovative auto-prompt generation system suitable for multiple LLMs, with minimal training. It proposes two novel methods; 1) A Reinforcement Learning-based algorithm for identifying and sequencing task-relevant columns 2) Cell-level similarity-based approach for enhancing few-shot example selection. Our approach has been extensively tested across 66 datasets, demonstrating improved performance in three downstream tasks: data imputation, error detection, and entity matching using two distinct LLMs; Google flan-t5-xxl and Mixtral 8x7B.

* Accepted to NAACL 2024 Industry Track

Via

Access Paper or Ask Questions