Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Ryan Henderson

Improving Molecular Graph Neural Network Explainability with Orthonormalization and Induced Sparsity

May 31, 2021

Ryan Henderson, Djork-Arné Clevert, Floriane Montanari

Figure 1 for Improving Molecular Graph Neural Network Explainability with Orthonormalization and Induced Sparsity

Figure 2 for Improving Molecular Graph Neural Network Explainability with Orthonormalization and Induced Sparsity

Figure 3 for Improving Molecular Graph Neural Network Explainability with Orthonormalization and Induced Sparsity

Figure 4 for Improving Molecular Graph Neural Network Explainability with Orthonormalization and Induced Sparsity

Abstract:Rationalizing which parts of a molecule drive the predictions of a molecular graph convolutional neural network (GCNN) can be difficult. To help, we propose two simple regularization techniques to apply during the training of GCNNs: Batch Representation Orthonormalization (BRO) and Gini regularization. BRO, inspired by molecular orbital theory, encourages graph convolution operations to generate orthonormal node embeddings. Gini regularization is applied to the weights of the output layer and constrains the number of dimensions the model can use to make predictions. We show that Gini and BRO regularization can improve the accuracy of state-of-the-art GCNN attribution methods on artificial benchmark datasets. In a real-world setting, we demonstrate that medicinal chemists significantly prefer explanations extracted from regularized models. While we only study these regularizers in the context of GCNNs, both can be applied to other types of neural networks

* Accepted to ICML 2021

Via

Access Paper or Ask Questions

Gini in a Bottleneck: Gotta Train Me the Right Way

Oct 09, 2020

Ryan Henderson, Djork-Arné Clevert, Floriane Montanari

Figure 1 for Gini in a Bottleneck: Gotta Train Me the Right Way

Figure 2 for Gini in a Bottleneck: Gotta Train Me the Right Way

Figure 3 for Gini in a Bottleneck: Gotta Train Me the Right Way

Figure 4 for Gini in a Bottleneck: Gotta Train Me the Right Way

Abstract:Due to the nature of deep learning approaches, it is inherently difficult to understand which aspects of a molecular graph drive the predictions of the network. As a mitigation strategy, we constrain certain weights in a multi-task graph convolutional neural network according to the Gini index to maximize the "inequality" of the learned representations. We show that this constraint does not degrade evaluation metrics for some targets, and allows us to combine the outputs of the graph convolutional operation in a visually interpretable way. We then perform a proof-of-concept experiment on quantum chemistry targets on the public QM9 dataset, and a larger experiment on ADMET targets on proprietary drug-like molecules. Since a benchmark of explainability in the latter case is difficult, we informally surveyed medicinal chemists within our organization to check for agreement between regions of the molecule they and the model identified as relevant to the properties in question.

* submitted to Machine Learning for Molecules Workshop @ NeurIPS 2020

Via

Access Paper or Ask Questions

Picasso: A Modular Framework for Visualizing the Learning Process of Neural Network Image Classifiers

Sep 11, 2017

Ryan Henderson, Rasmus Rothe

Figure 1 for Picasso: A Modular Framework for Visualizing the Learning Process of Neural Network Image Classifiers

Figure 2 for Picasso: A Modular Framework for Visualizing the Learning Process of Neural Network Image Classifiers

Figure 3 for Picasso: A Modular Framework for Visualizing the Learning Process of Neural Network Image Classifiers

Abstract:Picasso is a free open-source (Eclipse Public License) web application written in Python for rendering standard visualizations useful for analyzing convolutional neural networks. Picasso ships with occlusion maps and saliency maps, two visualizations which help reveal issues that evaluation metrics like loss and accuracy might hide: for example, learning a proxy classification task. Picasso works with the Tensorflow deep learning framework, and Keras (when the model can be loaded into the Tensorflow backend). Picasso can be used with minimal configuration by deep learning researchers and engineers alike across various neural network architectures. Adding new visualizations is simple: the user can specify their visualization code and HTML template separately from the application code.

* Journal of Open Research Software. 5(1), p.22 (2017)
* 9 pages, submission to the Journal of Open Research Software, github.com/merantix/picasso

Via

Access Paper or Ask Questions