Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Masaaki Kotera

Fast Estimation Method for the Stability of Ensemble Feature Selectors

Aug 03, 2021

Rina Onda, Zhengyan Gao, Masaaki Kotera, Kenta Oono

Figure 1 for Fast Estimation Method for the Stability of Ensemble Feature Selectors

Figure 2 for Fast Estimation Method for the Stability of Ensemble Feature Selectors

Figure 3 for Fast Estimation Method for the Stability of Ensemble Feature Selectors

Figure 4 for Fast Estimation Method for the Stability of Ensemble Feature Selectors

Abstract:It is preferred that feature selectors be \textit{stable} for better interpretabity and robust prediction. Ensembling is known to be effective for improving the stability of feature selectors. Since ensembling is time-consuming, it is desirable to reduce the computational cost to estimate the stability of the ensemble feature selectors. We propose a simulator of a feature selector, and apply it to a fast estimation of the stability of ensemble feature selectors. To the best of our knowledge, this is the first study that estimates the stability of ensemble feature selectors and reduces the computation time theoretically and empirically.

* 7 pages. Supplementary material 9 pages. Accepted in ICML2021 Workshop, Subset Selection in Machine Learning: From Theory to Practice (SubSetML) URL: https://sites.google.com/view/icml-2021-subsetml

Via

Access Paper or Ask Questions

Data Transfer Approaches to Improve Seq-to-Seq Retrosynthesis

Oct 02, 2020

Katsuhiko Ishiguro, Kazuya Ujihara, Ryohto Sawada, Hirotaka Akita, Masaaki Kotera

Figure 1 for Data Transfer Approaches to Improve Seq-to-Seq Retrosynthesis

Figure 2 for Data Transfer Approaches to Improve Seq-to-Seq Retrosynthesis

Figure 3 for Data Transfer Approaches to Improve Seq-to-Seq Retrosynthesis

Figure 4 for Data Transfer Approaches to Improve Seq-to-Seq Retrosynthesis

Abstract:Retrosynthesis is a problem to infer reactant compounds to synthesize a given product compound through chemical reactions. Recent studies on retrosynthesis focus on proposing more sophisticated prediction models, but the dataset to feed the models also plays an essential role in achieving the best generalizing models. Generally, a dataset that is best suited for a specific task tends to be small. In such a case, it is the standard solution to transfer knowledge from a large or clean dataset in the same domain. In this paper, we conduct a systematic and intensive examination of data transfer approaches on end-to-end generative models, in application to retrosynthesis. Experimental results show that typical data transfer methods can improve test prediction scores of an off-the-shelf Transformer baseline model. Especially, the pre-training plus fine-tuning approach boosts the accuracy scores of the baseline, achieving the new state-of-the-art. In addition, we conduct a manual inspection for the erroneous prediction results. The inspection shows that the pre-training plus fine-tuning models can generate chemically appropriate or sensible proposals in almost all cases.

Via

Access Paper or Ask Questions