Le-Ye Wang

Unlocking the Transferability of Tokens in Deep Models for Tabular Data

Oct 23, 2023

Qi-Le Zhou, Han-Jia Ye, Le-Ye Wang, De-Chuan Zhan

Abstract: Fine-tuning a pre-trained deep neural network has become a successful paradigm in various machine learning tasks. However, this paradigm becomes particularly challenging with tabular data when the feature sets of the pre-trained model and the target task differ. In this paper, we propose TabToken, a method that aims to enhance the quality of feature tokens (i.e., embeddings of tabular features). TabToken allows pre-trained models to be used when the upstream and downstream tasks share overlapping features, facilitating model fine-tuning even with limited training examples. Specifically, we introduce a contrastive objective that regularizes the tokens, capturing the semantics within and across features. During the pre-training stage, the tokens are learned jointly with top-layer deep models such as a transformer. In the downstream task, tokens of the shared features are kept fixed while TabToken efficiently fine-tunes the remaining parts of the model. TabToken not only enables knowledge transfer from a pre-trained model to tasks with heterogeneous features, but also enhances the discriminative ability of deep tabular models in standard classification and regression tasks.
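The transfer step described in the abstract (reuse the tokens of shared features, keep them fixed, train the rest) can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation: the function names (`transfer_tokens`, `sgd_step`), the feature names, the one-vector-per-feature simplification, and the plain SGD update are all assumptions made for illustration. The contrastive objective and the top-layer model are omitted.

```python
import random


def init_token(dim, rng):
    """Randomly initialise one feature token (a plain list of floats)."""
    return [rng.gauss(0.0, 0.1) for _ in range(dim)]


def transfer_tokens(pretrained_tokens, downstream_features, dim, seed=0):
    """Build the downstream token table (hypothetical helper).

    Tokens of features shared with the pre-trained model are copied and
    marked as frozen; tokens of unseen features are freshly initialised.
    Returns (tokens, frozen).
    """
    rng = random.Random(seed)
    tokens, frozen = {}, set()
    for name in downstream_features:
        if name in pretrained_tokens:
            tokens[name] = list(pretrained_tokens[name])  # reuse, copy by value
            frozen.add(name)
        else:
            tokens[name] = init_token(dim, rng)  # new feature, trainable
    return tokens, frozen


def sgd_step(tokens, grads, frozen, lr=0.1):
    """Apply one gradient step, skipping the frozen (shared) tokens."""
    for name, grad in grads.items():
        if name in frozen:
            continue
        tokens[name] = [w - lr * g for w, g in zip(tokens[name], grad)]


# Hypothetical upstream and downstream feature sets with partial overlap.
pretrained = {"age": [0.5, -0.2], "income": [0.1, 0.9], "zip_code": [0.3, 0.3]}
downstream = ["age", "income", "credit_score"]

tokens, frozen = transfer_tokens(pretrained, downstream, dim=2)
before = {name: list(vec) for name, vec in tokens.items()}

# Dummy gradients stand in for those of a real downstream loss.
grads = {name: [1.0, 1.0] for name in downstream}
sgd_step(tokens, grads, frozen)
```

After the step, the tokens for `age` and `income` are unchanged, while the token for `credit_score` has moved. In a real deep tabular model the same idea would typically be expressed by excluding the shared token parameters from the optimizer.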
