Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Julien Horwood

Feature Attribution with Necessity and Sufficiency via Dual-stage Perturbation Test for Causal Explanation

Feb 13, 2024

Xuexin Chen, Ruichu Cai, Zhengting Huang, Yuxuan Zhu, Julien Horwood, Zhifeng Hao, Zijian Li, Jose Miguel Hernandez-Lobato

Abstract:We investigate the problem of explainability in machine learning.To address this problem, Feature Attribution Methods (FAMs) measure the contribution of each feature through a perturbation test, where the difference in prediction is compared under different perturbations.However, such perturbation tests may not accurately distinguish the contributions of different features, when their change in prediction is the same after perturbation.In order to enhance the ability of FAMs to distinguish different features' contributions in this challenging setting, we propose to utilize the probability (PNS) that perturbing a feature is a necessary and sufficient cause for the prediction to change as a measure of feature importance.Our approach, Feature Attribution with Necessity and Sufficiency (FANS), computes the PNS via a perturbation test involving two stages (factual and interventional).In practice, to generate counterfactual samples, we use a resampling-based approach on the observed samples to approximate the required conditional distribution.Finally, we combine FANS and gradient-based optimization to extract the subset with the largest PNS.We demonstrate that FANS outperforms existing feature attribution methods on six benchmarks.

Via

Access Paper or Ask Questions

Leveraging Task Structures for Improved Identifiability in Neural Network Representations

Jun 26, 2023

Wenlin Chen, Julien Horwood, Juyeon Heo, José Miguel Hernández-Lobato

Abstract:This work extends the theory of identifiability in supervised learning by considering the consequences of having access to a distribution of tasks. In such cases, we show that identifiability is achievable even in the case of regression, extending prior work restricted to the single-task classification case. Furthermore, we show that the existence of a task distribution which defines a conditional prior over latent variables reduces the equivalence class for identifiability to permutations and scaling, a much stronger and more useful result. When we further assume a causal structure over these tasks, our approach enables simple maximum marginal likelihood optimization together with downstream applicability to causal representation learning. Empirically, we validate that our model outperforms more general unsupervised models in recovering canonical representations for synthetic and real-world data.

* 11 pages, 5 figures, 2 tables

Via

Access Paper or Ask Questions

Molecular Design in Synthetically Accessible Chemical Space via Deep Reinforcement Learning

Apr 29, 2020

Julien Horwood, Emmanuel Noutahi

Figure 1 for Molecular Design in Synthetically Accessible Chemical Space via Deep Reinforcement Learning

Figure 2 for Molecular Design in Synthetically Accessible Chemical Space via Deep Reinforcement Learning

Figure 3 for Molecular Design in Synthetically Accessible Chemical Space via Deep Reinforcement Learning

Figure 4 for Molecular Design in Synthetically Accessible Chemical Space via Deep Reinforcement Learning

Abstract:The fundamental goal of generative drug design is to propose optimized molecules that meet predefined activity, selectivity, and pharmacokinetic criteria. Despite recent progress, we argue that existing generative methods are limited in their ability to favourably shift the distributions of molecular properties during optimization. We instead propose a novel Reinforcement Learning framework for molecular design in which an agent learns to directly optimize through a space of synthetically-accessible drug-like molecules. This becomes possible by defining transitions in our Markov Decision Process as chemical reactions, and allows us to leverage synthetic routes as an inductive bias. We validate our method by demonstrating that it outperforms existing state-of the art approaches in the optimization of pharmacologically-relevant objectives, while results on multi-objective optimization tasks suggest increased scalability to realistic pharmaceutical design problems.

Via

Access Paper or Ask Questions

Towards Interpretable Sparse Graph Representation Learning with Laplacian Pooling

May 28, 2019

Emmanuel Noutahi, Dominique Beani, Julien Horwood, Prudencio Tossou

Figure 1 for Towards Interpretable Sparse Graph Representation Learning with Laplacian Pooling

Figure 2 for Towards Interpretable Sparse Graph Representation Learning with Laplacian Pooling

Figure 3 for Towards Interpretable Sparse Graph Representation Learning with Laplacian Pooling

Figure 4 for Towards Interpretable Sparse Graph Representation Learning with Laplacian Pooling

Abstract:Recent work in graph neural networks (GNNs) has lead to improvements in molecular activity and property prediction tasks. However, GNNs lack interpretability as they fail to capture the relative importance of various molecular substructures due to the absence of efficient intermediate pooling steps for sparse graphs. To address this issue, we propose LaPool (Laplacian Pooling), a novel, data-driven, and interpretable graph pooling method that takes into account the node features and graph structure to improve molecular understanding. Inspired by theories in graph signal processing, LaPool performs a feature-driven hierarchical segmentation of molecules by selecting a set of centroid nodes from a graph as cluster representatives. It then learns a sparse assignment of remaining nodes into these clusters using an attention mechanism. We benchmark our model by showing that it outperforms recent graph pooling layers on molecular graph understanding and prediction tasks. We then demonstrate improved interpretability by identifying important molecular substructures and generating novel and valid molecules, with important applications in drug discovery and pharmacology.

* 8 pages, with Appendices

Via

Access Paper or Ask Questions