Abstract: An information-theoretic approach to learning the complexity of classification and regression trees and the number of trees in gradient tree boosting is proposed. The optimism (test loss minus training loss) of the greedy leaf-splitting procedure is shown to be the maximum of a Cox-Ingersoll-Ross process, from which a generalization-error-based information criterion is formed. The proposed procedure allows fast, local model selection without cross-validation-based hyperparameter tuning, and hence enables efficient and automatic comparison among the large number of candidate models evaluated during each boosting iteration. Relative to xgboost, speedups in numerical experiments range from around 10 to about 1400, at similar predictive power measured in terms of test loss.