Title: Revisiting column-generation-based matheuristic for learning classification trees

Aug 22, 2023

Krunal Kishor Patel, Guy Desaulniers, Andrea Lodi

Abstract: Decision trees are highly interpretable models for solving classification problems in machine learning (ML). The standard ML algorithms for training decision trees are fast but generate suboptimal trees in terms of accuracy. Other discrete optimization models in the literature address the optimality problem but only work well on relatively small datasets. Firat et al. (2020) proposed a column-generation-based heuristic approach for learning decision trees. This approach improves scalability and can work with large datasets. In this paper, we describe improvements to this column generation approach. First, we modify the subproblem model to significantly reduce the number of subproblems in multiclass classification instances. Next, we show that the data-dependent constraints in the master problem are implied, and use them as cutting planes. Furthermore, we describe a separation model to generate data points for which the linear programming relaxation solution violates their corresponding constraints. We conclude by presenting computational results that show that these modifications result in better scalability.

* Submitted to Computers and Operations Research journal
