Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Ping-Han Chiang

DOFEN: Deep Oblivious Forest ENsemble

Dec 24, 2024

Kuan-Yu Chen, Ping-Han Chiang, Hsin-Rung Chou, Chih-Sheng Chen, Tien-Hao Chang

Figure 1 for DOFEN: Deep Oblivious Forest ENsemble

Figure 2 for DOFEN: Deep Oblivious Forest ENsemble

Figure 3 for DOFEN: Deep Oblivious Forest ENsemble

Figure 4 for DOFEN: Deep Oblivious Forest ENsemble

Abstract:Deep Neural Networks (DNNs) have revolutionized artificial intelligence, achieving impressive results on diverse data types, including images, videos, and texts. However, DNNs still lag behind Gradient Boosting Decision Trees (GBDT) on tabular data, a format extensively utilized across various domains. In this paper, we propose DOFEN, short for \textbf{D}eep \textbf{O}blivious \textbf{F}orest \textbf{EN}semble, a novel DNN architecture inspired by oblivious decision trees. DOFEN constructs relaxed oblivious decision trees (rODTs) by randomly combining conditions for each column and further enhances performance with a two-level rODT forest ensembling process. By employing this approach, DOFEN achieves state-of-the-art results among DNNs and further narrows the gap between DNNs and tree-based models on the well-recognized benchmark: Tabular Benchmark \citep{grinsztajn2022tree}, which includes 73 total datasets spanning a wide array of domains. The code of DOFEN is available at: \url{https://github.com/Sinopac-Digital-Technology-Division/DOFEN}.

* NeurIPS 2024 (poster); (v2: modify and rearrange sections, propose multihead extension of DOFEN, include new results on tabular benchmark and other benchmarks)

Via

Access Paper or Ask Questions

Trompt: Towards a Better Deep Neural Network for Tabular Data

May 31, 2023

Kuan-Yu Chen, Ping-Han Chiang, Hsin-Rung Chou, Ting-Wei Chen, Tien-Hao Chang

Figure 1 for Trompt: Towards a Better Deep Neural Network for Tabular Data

Figure 2 for Trompt: Towards a Better Deep Neural Network for Tabular Data

Figure 3 for Trompt: Towards a Better Deep Neural Network for Tabular Data

Figure 4 for Trompt: Towards a Better Deep Neural Network for Tabular Data

Abstract:Tabular data is arguably one of the most commonly used data structures in various practical domains, including finance, healthcare and e-commerce. The inherent heterogeneity allows tabular data to store rich information. However, based on a recently published tabular benchmark, we can see deep neural networks still fall behind tree-based models on tabular datasets. In this paper, we propose Trompt--which stands for Tabular Prompt--a novel architecture inspired by prompt learning of language models. The essence of prompt learning is to adjust a large pre-trained model through a set of prompts outside the model without directly modifying the model. Based on this idea, Trompt separates the learning strategy of tabular data into two parts. The first part, analogous to pre-trained models, focus on learning the intrinsic information of a table. The second part, analogous to prompts, focus on learning the variations among samples. Trompt is evaluated with the benchmark mentioned above. The experimental results demonstrate that Trompt outperforms state-of-the-art deep neural networks and is comparable to tree-based models.

* ICML'23 (poster)

Via

Access Paper or Ask Questions