Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Franck Sabatié

Embedded Constrained Feature Construction for High-Energy Physics Data Classification

Dec 17, 2019

Noëlie Cherrier, Maxime Defurne, Jean-Philippe Poli, Franck Sabatié

Figure 1 for Embedded Constrained Feature Construction for High-Energy Physics Data Classification

Figure 2 for Embedded Constrained Feature Construction for High-Energy Physics Data Classification

Abstract:Before any publication, data analysis of high-energy physics experiments must be validated. This validation is granted only if a perfect understanding of the data and the analysis process is demonstrated. Therefore, physicists prefer using transparent machine learning algorithms whose performances highly rely on the suitability of the provided input features. To transform the feature space, feature construction aims at automatically generating new relevant features. Whereas most of previous works in this area perform the feature construction prior to the model training, we propose here a general framework to embed a feature construction technique adapted to the constraints of high-energy physics in the induction of tree-based models. Experiments on two high-energy physics datasets confirm that a significant gain is obtained on the classification scores, while limiting the number of built features. Since the features are built to be interpretable, the whole model is transparent and readable.

* Accepted at the NeurIPS 2019 workshop on Machine Learning for the Physical Sciences (https://ml4physicalsciences.github.io)

Via

Access Paper or Ask Questions

Consistent Feature Construction with Constrained Genetic Programming for Experimental Physics

Aug 17, 2019

Noëlie Cherrier, Jean-Philippe Poli, Maxime Defurne, Franck Sabatié

Figure 1 for Consistent Feature Construction with Constrained Genetic Programming for Experimental Physics

Figure 2 for Consistent Feature Construction with Constrained Genetic Programming for Experimental Physics

Figure 3 for Consistent Feature Construction with Constrained Genetic Programming for Experimental Physics

Figure 4 for Consistent Feature Construction with Constrained Genetic Programming for Experimental Physics

Abstract:A good feature representation is a determinant factor to achieve high performance for many machine learning algorithms in terms of classification. This is especially true for techniques that do not build complex internal representations of data (e.g. decision trees, in contrast to deep neural networks). To transform the feature space, feature construction techniques build new high-level features from the original ones. Among these techniques, Genetic Programming is a good candidate to provide interpretable features required for data analysis in high energy physics. Classically, original features or higher-level features based on physics first principles are used as inputs for training. However, physicists would benefit from an automatic and interpretable feature construction for the classification of particle collision events. Our main contribution consists in combining different aspects of Genetic Programming and applying them to feature construction for experimental physics. In particular, to be applicable to physics, dimensional consistency is enforced using grammars. Results of experiments on three physics datasets show that the constructed features can bring a significant gain to the classification accuracy. To the best of our knowledge, it is the first time a method is proposed for interpretable feature construction with units of measurement, and that experts in high-energy physics validate the overall approach as well as the interpretability of the built features.

* Proceedings of 2019 IEEE Congress on Evolutionary Computation (CEC), Wellington, New Zealand, 2019, pp. 1650-1658
* Accepted in this version to CEC 2019

Via

Access Paper or Ask Questions