Abstract: The recently introduced TabPFN pretrains an In-Context Learning (ICL) transformer on synthetic data to perform tabular data classification. Because the synthetic data shares neither features nor labels with real-world data, the underlying mechanism behind the success of this method remains unclear. This study provides an explanation by demonstrating that ICL-transformers acquire the ability to create complex decision boundaries during pretraining. To validate this claim, we develop a novel forest dataset generator that creates unrealistic datasets with complex decision boundaries. Our experiments confirm the effectiveness of ICL-transformers pretrained on this data. Furthermore, we create TabForestPFN, an ICL-transformer pretrained on both the original TabPFN synthetic dataset generator and our forest dataset generator. By fine-tuning this model, we reach the current state-of-the-art on tabular data classification. Code is available at https://github.com/FelixdenBreejen/TabForestPFN.
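The abstract does not specify how the forest dataset generator works; the following is a minimal sketch of one plausible realization, assuming random features are labeled by a decision tree fitted on random data, which yields unrealistic datasets with complex, axis-aligned decision boundaries. The function name `generate_forest_dataset` and all parameters are hypothetical, not taken from the paper.

```python
# Hypothetical sketch of a "forest dataset generator": random features are
# labeled by a randomly fitted decision tree, so the resulting dataset is
# unrealistic but has a complex, axis-aligned decision boundary.
import numpy as np
from sklearn.tree import DecisionTreeClassifier

def generate_forest_dataset(n_samples=1024, n_features=10, n_classes=2,
                            max_depth=8, seed=0):
    rng = np.random.default_rng(seed)
    X = rng.normal(size=(n_samples, n_features))
    # Fit a tree on a small random subset with random labels; its
    # predictions on the full sample define the dataset's labels.
    X_seed = rng.normal(size=(64, n_features))
    y_seed = rng.integers(0, n_classes, size=64)
    tree = DecisionTreeClassifier(max_depth=max_depth,
                                  random_state=seed).fit(X_seed, y_seed)
    y = tree.predict(X)
    return X, y

X, y = generate_forest_dataset()  # one synthetic pretraining dataset
```

Varying the depth, seed, and number of classes across calls would produce a stream of structurally diverse pretraining tasks, which is the property the abstract attributes to the generator.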
Abstract: While interest in tabular deep learning has grown significantly, conventional tree-based models still outperform deep learning methods. To narrow this performance gap, we explore the retrieval mechanism, a methodology that allows neural networks to refer to other data points when making predictions. Our experiments reveal that retrieval-based training, especially when fine-tuning the pretrained TabPFN model, notably surpasses existing methods. Moreover, extensive pretraining plays a crucial role in enhancing model performance. These insights suggest that combining the retrieval mechanism with pretraining and transfer learning schemes offers considerable potential for advancing tabular deep learning.
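The abstract describes the retrieval mechanism only at a high level; below is a minimal sketch, assuming each query retrieves its nearest training neighbors to serve as the in-context set of a TabPFN-style predictor. The `icl_model.predict(X_ctx, y_ctx, x)` interface is an assumption for illustration, not the actual TabPFN API.

```python
# Hypothetical sketch of retrieval-based prediction: for each query point,
# its k nearest training points are retrieved and passed as the in-context
# set that a TabPFN-style ICL model conditions on.
import numpy as np
from sklearn.neighbors import NearestNeighbors

def predict_with_retrieval(icl_model, X_train, y_train, X_query, k=128):
    """icl_model.predict(X_ctx, y_ctx, x) is an assumed ICL interface."""
    nn = NearestNeighbors(n_neighbors=k).fit(X_train)
    _, idx = nn.kneighbors(X_query)  # (n_query, k) neighbor indices
    preds = []
    for x, neighbors in zip(X_query, idx):
        # The retrieved neighbors form the per-query context set.
        preds.append(icl_model.predict(X_train[neighbors],
                                       y_train[neighbors],
                                       x[None, :]))
    return np.concatenate(preds)
```

Under this reading, "retrieval-based training" would fine-tune the model on such retrieved contexts rather than on the full training set, letting each prediction depend on locally relevant examples.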