Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Hideo Kobayashi

Towards Better Understanding Table Instruction Tuning: Decoupling the Effects from Data versus Models

Jan 24, 2025

Naihao Deng, Sheng Zhang, Henghui Zhu, Shuaichen Chang, Jiani Zhang, Alexander Hanbo Li, Chung-Wei Hang, Hideo Kobayashi, Yiqun Hu, Patrick Ng

Figure 1 for Towards Better Understanding Table Instruction Tuning: Decoupling the Effects from Data versus Models

Figure 2 for Towards Better Understanding Table Instruction Tuning: Decoupling the Effects from Data versus Models

Figure 3 for Towards Better Understanding Table Instruction Tuning: Decoupling the Effects from Data versus Models

Figure 4 for Towards Better Understanding Table Instruction Tuning: Decoupling the Effects from Data versus Models

Abstract:Recent advances in natural language processing have leveraged instruction tuning to enhance Large Language Models (LLMs) for table-related tasks. However, previous works train different base models with different training data, lacking an apples-to-apples comparison across the result table LLMs. To address this, we fine-tune base models from the Mistral, OLMo, and Phi families on existing public training datasets. Our replication achieves performance on par with or surpassing existing table LLMs, establishing new state-of-the-art performance on Hitab, a table question-answering dataset. More importantly, through systematic out-of-domain evaluation, we decouple the contributions of training data and the base model, providing insight into their individual impacts. In addition, we assess the effects of table-specific instruction tuning on general-purpose benchmarks, revealing trade-offs between specialization and generalization.

Via

Access Paper or Ask Questions

You Only Read Once (YORO): Learning to Internalize Database Knowledge for Text-to-SQL

Sep 18, 2024

Hideo Kobayashi, Wuwei Lan, Peng Shi, Shuaichen Chang, Jiang Guo, Henghui Zhu, Zhiguo Wang, Patrick Ng

Figure 1 for You Only Read Once (YORO): Learning to Internalize Database Knowledge for Text-to-SQL

Figure 2 for You Only Read Once (YORO): Learning to Internalize Database Knowledge for Text-to-SQL

Figure 3 for You Only Read Once (YORO): Learning to Internalize Database Knowledge for Text-to-SQL

Figure 4 for You Only Read Once (YORO): Learning to Internalize Database Knowledge for Text-to-SQL

Abstract:While significant progress has been made on the text-to-SQL task, recent solutions repeatedly encode the same database schema for every question, resulting in unnecessary high inference cost and often overlooking crucial database knowledge. To address these issues, we propose You Only Read Once (YORO), a novel paradigm that directly internalizes database knowledge into the parametric knowledge of a text-to-SQL model during training and eliminates the need for schema encoding during inference. YORO significantly reduces the input token length by 66%-98%. Despite its shorter inputs, our empirical results demonstrate YORO's competitive performances with traditional systems on three benchmarks as well as its significant outperformance on large databases. Furthermore, YORO excels in handling questions with challenging value retrievals such as abbreviation.

Via

Access Paper or Ask Questions