Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Benchmarking the Text-to-SQL Capability of Large Language Models: A Comprehensive Evaluation

Mar 06, 2024

Bin Zhang, Yuxiao Ye, Guoqing Du, Xiaoru Hu, Zhishuai Li, Sun Yang, Chi Harold Liu, Rui Zhao, Ziyue Li, Hangyu Mao

Figure 1 for Benchmarking the Text-to-SQL Capability of Large Language Models: A Comprehensive Evaluation

Figure 2 for Benchmarking the Text-to-SQL Capability of Large Language Models: A Comprehensive Evaluation

Figure 3 for Benchmarking the Text-to-SQL Capability of Large Language Models: A Comprehensive Evaluation

Figure 4 for Benchmarking the Text-to-SQL Capability of Large Language Models: A Comprehensive Evaluation

Share this with someone who'll enjoy it:

Abstract:Large Language Models (LLMs) have emerged as a powerful tool in advancing the Text-to-SQL task, significantly outperforming traditional methods. Nevertheless, as a nascent research field, there is still no consensus on the optimal prompt templates and design frameworks. Additionally, existing benchmarks inadequately explore the performance of LLMs across the various sub-tasks of the Text-to-SQL process, which hinders the assessment of LLMs' cognitive capabilities and the optimization of LLM-based solutions. To address the aforementioned issues, we firstly construct a new dataset designed to mitigate the risk of overfitting in LLMs. Then we formulate five evaluation tasks to comprehensively assess the performance of diverse methods across various LLMs throughout the Text-to-SQL process.Our study highlights the performance disparities among LLMs and proposes optimal in-context learning solutions tailored to each task. These findings offer valuable insights for enhancing the development of LLM-based Text-to-SQL systems.

* 26pages, 6figures, 14tables

View paper on

Share this with someone who'll enjoy it:

Title:Benchmarking the Text-to-SQL Capability of Large Language Models: A Comprehensive Evaluation

Paper and Code