Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Joseph Ramsey

Fast Scalable and Accurate Discovery of DAGs Using the Best Order Score Search and Grow-Shrink Trees

Oct 26, 2023

Bryan Andrews, Joseph Ramsey, Ruben Sanchez-Romero, Jazmin Camchong, Erich Kummerfeld

Abstract:Learning graphical conditional independence structures is an important machine learning problem and a cornerstone of causal discovery. However, the accuracy and execution time of learning algorithms generally struggle to scale to problems with hundreds of highly connected variables -- for instance, recovering brain networks from fMRI data. We introduce the best order score search (BOSS) and grow-shrink trees (GSTs) for learning directed acyclic graphs (DAGs) in this paradigm. BOSS greedily searches over permutations of variables, using GSTs to construct and score DAGs from permutations. GSTs efficiently cache scores to eliminate redundant calculations. BOSS achieves state-of-the-art performance in accuracy and execution time, comparing favorably to a variety of combinatorial and gradient-based learning algorithms under a broad range of conditions. To demonstrate its practicality, we apply BOSS to two sets of resting-state fMRI data: simulated data with pseudo-empirical noise distributions derived from randomized empirical fMRI cortical signals and clinical data from 3T fMRI scans processed into cortical parcels. BOSS is available for use within the TETRAD project which includes Python and R wrappers.

Via

Access Paper or Ask Questions

Causal-learn: Causal Discovery in Python

Jul 31, 2023

Yujia Zheng, Biwei Huang, Wei Chen, Joseph Ramsey, Mingming Gong, Ruichu Cai, Shohei Shimizu, Peter Spirtes, Kun Zhang

Abstract:Causal discovery aims at revealing causal relations from observational data, which is a fundamental task in science and engineering. We describe $\textit{causal-learn}$, an open-source Python library for causal discovery. This library focuses on bringing a comprehensive collection of causal discovery methods to both practitioners and researchers. It provides easy-to-use APIs for non-specialists, modular building blocks for developers, detailed documentation for learners, and comprehensive methods for all. Different from previous packages in R or Java, $\textit{causal-learn}$ is fully developed in Python, which could be more in tune with the recent preference shift in programming languages within related communities. The library is available at https://github.com/py-why/causal-learn.

Via

Access Paper or Ask Questions

Greedy Relaxations of the Sparsest Permutation Algorithm

Jun 11, 2022

Wai-Yin Lam, Bryan Andrews, Joseph Ramsey

Figure 1 for Greedy Relaxations of the Sparsest Permutation Algorithm

Figure 2 for Greedy Relaxations of the Sparsest Permutation Algorithm

Figure 3 for Greedy Relaxations of the Sparsest Permutation Algorithm

Figure 4 for Greedy Relaxations of the Sparsest Permutation Algorithm

Abstract:There has been an increasing interest in methods that exploit permutation reasoning to search for directed acyclic causal models, including the "Ordering Search" of Teyssier and Kohler and GSP of Solus, Wang and Uhler. We extend the methods of the latter by a permutation-based operation, tuck, and develop a class of algorithms, namely GRaSP, that are efficient and pointwise consistent under increasingly weaker assumptions than faithfulness. The most relaxed form of GRaSP outperforms many state-of-the-art causal search algorithms in simulation, allowing efficient and accurate search even for dense graphs and graphs with more than 100 variables.

* 36 pages, 16 figures, 4 tables, 2 algorithms, accepted, UAI (Uncertainty in Artificial Intelligence) 2022

Via

Access Paper or Ask Questions

Causal discovery for observational sciences using supervised machine learning

Feb 25, 2022

Anne Helby Petersen, Joseph Ramsey, Claus Thorn Ekstrøm, Peter Spirtes

Figure 1 for Causal discovery for observational sciences using supervised machine learning

Figure 2 for Causal discovery for observational sciences using supervised machine learning

Figure 3 for Causal discovery for observational sciences using supervised machine learning

Figure 4 for Causal discovery for observational sciences using supervised machine learning

Abstract:Causal inference can estimate causal effects, but unless data are collected experimentally, statistical analyses must rely on pre-specified causal models. Causal discovery algorithms are empirical methods for constructing such causal models from data. Several asymptotically correct methods already exist, but they generally struggle on smaller samples. Moreover, most methods focus on very sparse causal models, which may not always be a realistic representation of real-life data generating mechanisms. Finally, while causal relationships suggested by the methods often hold true, their claims about causal non-relatedness have high error rates. This non-conservative error tradeoff is not ideal for observational sciences, where the resulting model is directly used to inform causal inference: A causal model with many missing causal relations entails too strong assumptions and may lead to biased effect estimates. We propose a new causal discovery method that addresses these three shortcomings: Supervised learning discovery (SLdisco). SLdisco uses supervised machine learning to obtain a mapping from observational data to equivalence classes of causal models. We evaluate SLdisco in a large simulation study based on Gaussian data and we consider several choices of model size and sample size. We find that SLdisco is more conservative, only moderately less informative and less sensitive towards sample size than existing procedures. We furthermore provide a real epidemiological data application. We use random subsampling to investigate real data performance on small samples and again find that SLdisco is less sensitive towards sample size and hence seems to better utilize the information available in small datasets.

Via

Access Paper or Ask Questions

FRITL: A Hybrid Method for Causal Discovery in the Presence of Latent Confounders

Mar 26, 2021

Wei Chen, Kun Zhang, Ruichu Cai, Biwei Huang, Joseph Ramsey, Zhifeng Hao, Clark Glymour

Figure 1 for FRITL: A Hybrid Method for Causal Discovery in the Presence of Latent Confounders

Figure 2 for FRITL: A Hybrid Method for Causal Discovery in the Presence of Latent Confounders

Figure 3 for FRITL: A Hybrid Method for Causal Discovery in the Presence of Latent Confounders

Figure 4 for FRITL: A Hybrid Method for Causal Discovery in the Presence of Latent Confounders

Abstract:We consider the problem of estimating a particular type of linear non-Gaussian model. Without resorting to the overcomplete Independent Component Analysis (ICA), we show that under some mild assumptions, the model is uniquely identified by a hybrid method. Our method leverages the advantages of constraint-based methods and independent noise-based methods to handle both confounded and unconfounded situations. The first step of our method uses the FCI procedure, which allows confounders and is able to produce asymptotically correct results. The results, unfortunately, usually determine very few unconfounded direct causal relations, because whenever it is possible to have a confounder, it will indicate it. The second step of our procedure finds the unconfounded causal edges between observed variables among only those adjacent pairs informed by the FCI results. By making use of the so-called Triad condition, the third step is able to find confounders and their causal relations with other variables. Afterward, we apply ICA on a notably smaller set of graphs to identify remaining causal relationships if needed. Extensive experiments on simulated data and real-world data validate the correctness and effectiveness of the proposed method.

Via

Access Paper or Ask Questions

Causal Discovery from Heterogeneous/Nonstationary Data

Mar 19, 2019

Biwei Huang, Kun Zhang, Jiji Zhang, Joseph Ramsey, Ruben Sanchez-Romero, Clark Glymour, Bernhard Schölkopf

Figure 1 for Causal Discovery from Heterogeneous/Nonstationary Data

Figure 2 for Causal Discovery from Heterogeneous/Nonstationary Data

Figure 3 for Causal Discovery from Heterogeneous/Nonstationary Data

Figure 4 for Causal Discovery from Heterogeneous/Nonstationary Data

Abstract:It is commonplace to encounter heterogeneous or nonstationary data, of which the underlying generating process changes across domains or over time. Such a distribution shift feature presents both challenges and opportunities for causal discovery. In this paper, we develop a framework for causal discovery from such data, called Constraint-based causal Discovery from heterogeneous/NOnstationary Data (CD-NOD), to find causal skeleton and directions and estimate the properties of mechanism changes. First, we propose an enhanced constraint-based procedure to detect variables whose local mechanisms change and recover the skeleton of the causal structure over observed variables. Second, we present a method to determine causal orientations by making use of independent changes in the data distribution implied by the underlying causal model, benefiting from information carried by changing distributions. After learning the causal structure, next, we investigate how to efficiently estimate the `driving force' of the nonstationarity of a causal mechanism. That is, we aim to extract from data a low-dimensional representation of changes. The proposed methods are nonparametric, with no hard restrictions on data distributions and causal mechanisms, and do not rely on window segmentation. Furthermore, we find that data heterogeneity benefits causal structure identification even with particular types of confounders. Finally, we show the connection between heterogeneity/nonstationarity and soft intervention in causal discovery. Experimental results on various synthetic and real-world data sets (task-fMRI and stock market data) are presented to demonstrate the efficacy of the proposed methods.

Via

Access Paper or Ask Questions

FASK with Interventional Knowledge Recovers Edges from the Sachs Model

May 06, 2018

Joseph Ramsey, Bryan Andrews

Figure 1 for FASK with Interventional Knowledge Recovers Edges from the Sachs Model

Figure 2 for FASK with Interventional Knowledge Recovers Edges from the Sachs Model

Figure 3 for FASK with Interventional Knowledge Recovers Edges from the Sachs Model

Figure 4 for FASK with Interventional Knowledge Recovers Edges from the Sachs Model

Abstract:We report a procedure that, in one step from continuous data with minimal preparation, recovers the graph found by Sachs et al. \cite{sachs2005causal}, with only a few edges different. The algorithm, Fast Adjacency Skewness (FASK), relies on a mixture of linear reasoning and reasoning from the skewness of variables; the Sachs data is a good candidate for this procedure since the skewness of the variables is quite pronounced. We review the ground truth model from Sachs et al. as well as some of the fluctuations seen in the protein abundances in the system, give the Sachs model and the FASK model, and perform a detailed comparison. Some variation in hyper-parameters is explored, though the main result uses values at or near the defaults learned from work modeling fMRI data.

* 13 pages, 21 figures, 2 tables, Technical Report

Via

Access Paper or Ask Questions

Causal Discovery in the Presence of Measurement Error: Identifiability Conditions

Jun 10, 2017

Kun Zhang, Mingming Gong, Joseph Ramsey, Kayhan Batmanghelich, Peter Spirtes, Clark Glymour

Figure 1 for Causal Discovery in the Presence of Measurement Error: Identifiability Conditions

Figure 2 for Causal Discovery in the Presence of Measurement Error: Identifiability Conditions

Figure 3 for Causal Discovery in the Presence of Measurement Error: Identifiability Conditions

Figure 4 for Causal Discovery in the Presence of Measurement Error: Identifiability Conditions

Abstract:Measurement error in the observed values of the variables can greatly change the output of various causal discovery methods. This problem has received much attention in multiple fields, but it is not clear to what extent the causal model for the measurement-error-free variables can be identified in the presence of measurement error with unknown variance. In this paper, we study precise sufficient identifiability conditions for the measurement-error-free causal model and show what information of the causal model can be recovered from observed data. In particular, we present two different sets of identifiability conditions, based on the second-order statistics and higher-order statistics of the data, respectively. The former was inspired by the relationship between the generating model of the measurement-error-contaminated data and the factor analysis model, and the latter makes use of the identifiability result of the over-complete independent component analysis problem.

* 15 pages, 5 figures, 1 table

Via

Access Paper or Ask Questions

Improving Accuracy and Scalability of the PC Algorithm by Maximizing P-value

Oct 05, 2016

Joseph Ramsey

Figure 1 for Improving Accuracy and Scalability of the PC Algorithm by Maximizing P-value

Figure 2 for Improving Accuracy and Scalability of the PC Algorithm by Maximizing P-value

Figure 3 for Improving Accuracy and Scalability of the PC Algorithm by Maximizing P-value

Abstract:A number of attempts have been made to improve accuracy and/or scalability of the PC (Peter and Clark) algorithm, some well known (Buhlmann, et al., 2010; Kalisch and Buhlmann, 2007; 2008; Zhang, 2012, to give some examples). We add here one more tool to the toolbox: the simple observation that if one is forced to choose between a variety of possible conditioning sets for a pair of variables, one should choose the one with the highest p-value. One can use the CPC (Conservative PC, Ramsey et al., 2012) algorithm as a guide to possible sepsets for a pair of variables. However, whereas CPC uses a voting rule to classify colliders versus noncolliders, our proposed algorithm, PC-Max, picks the conditioning set with the highest p-value, so that there are no ambiguities. We combine this with two other optimizations: (a) avoiding bidirected edges in the orientation of colliders, and (b) parallelization. For (b) we borrow ideas from the PC-Stable algorithm (Colombo and Maathuis, 2014). The result is an algorithm that scales quite well both in terms of accuracy and time, with no risk of bidirected edges.

* 11 pages, 4 figures, 2 tables, technical report

Via

Access Paper or Ask Questions

Adjacency-Faithfulness and Conservative Causal Inference

Jun 27, 2012

Joseph Ramsey, Jiji Zhang, Peter L. Spirtes

Figure 1 for Adjacency-Faithfulness and Conservative Causal Inference

Figure 2 for Adjacency-Faithfulness and Conservative Causal Inference

Figure 3 for Adjacency-Faithfulness and Conservative Causal Inference

Figure 4 for Adjacency-Faithfulness and Conservative Causal Inference

Abstract:Most causal inference algorithms in the literature (e.g., Pearl (2000), Spirtes et al. (2000), Heckerman et al. (1999)) exploit an assumption usually referred to as the causal Faithfulness or Stability condition. In this paper, we highlight two components of the condition used in constraint-based algorithms, which we call "Adjacency-Faithfulness" and "Orientation-Faithfulness". We point out that assuming Adjacency-Faithfulness is true, it is in principle possible to test the validity of Orientation-Faithfulness. Based on this observation, we explore the consequence of making only the Adjacency-Faithfulness assumption. We show that the familiar PC algorithm has to be modified to be (asymptotically) correct under the weaker, Adjacency-Faithfulness assumption. Roughly the modified algorithm, called Conservative PC (CPC), checks whether Orientation-Faithfulness holds in the orientation phase, and if not, avoids drawing certain causal conclusions the PC algorithm would draw. However, if the stronger, standard causal Faithfulness condition actually obtains, the CPC algorithm is shown to output the same pattern as the PC algorithm does in the large sample limit. We also present a simulation study showing that the CPC algorithm runs almost as fast as the PC algorithm, and outputs significantly fewer false causal arrowheads than the PC algorithm does on realistic sample sizes. We end our paper by discussing how score-based algorithms such as GES perform when the Adjacency-Faithfulness but not the standard causal Faithfulness condition holds, and how to extend our work to the FCI algorithm, which allows for the possibility of latent variables.

* Appears in Proceedings of the Twenty-Second Conference on Uncertainty in Artificial Intelligence (UAI2006)

Via

Access Paper or Ask Questions