Rob Davis

LaTable: Towards Large Tabular Models

Jun 25, 2024

Boris van Breugel, Jonathan Crabbé, Rob Davis, Mihaela van der Schaar

Figures 1-4 for LaTable: Towards Large Tabular Models

Abstract: Tabular data is one of the most ubiquitous modalities, yet the literature on tabular generative foundation models lags far behind its text and vision counterparts. Creating such a model is hard because different tabular datasets have heterogeneous feature spaces, come with tabular metadata (e.g. dataset description and feature headers), and lack prior knowledge (e.g. feature order). In this work we propose LaTable: a novel tabular diffusion model that addresses these challenges and can be trained across different datasets. Through extensive experiments we find that LaTable outperforms baselines on in-distribution generation, and that a finetuned LaTable generates out-of-distribution datasets better with fewer samples. On the other hand, we explore the poor zero-shot performance of LaTable, and what it may teach us about building generative tabular foundation models with better zero- and few-shot generation capabilities.
