Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Tibor Schuster

Causal Discovery with Language Models as Imperfect Experts

Jul 05, 2023

Stephanie Long, Alexandre Piché, Valentina Zantedeschi, Tibor Schuster, Alexandre Drouin

Abstract:Understanding the causal relationships that underlie a system is a fundamental prerequisite to accurate decision-making. In this work, we explore how expert knowledge can be used to improve the data-driven identification of causal graphs, beyond Markov equivalence classes. In doing so, we consider a setting where we can query an expert about the orientation of causal relationships between variables, but where the expert may provide erroneous information. We propose strategies for amending such expert knowledge based on consistency properties, e.g., acyclicity and conditional independencies in the equivalence class. We then report a case study, on real data, where a large language model is used as an imperfect expert.

* Peer reviewed and accepted for presentation at the Structured Probabilistic Inference & Generative Modeling (SPIGM) workshop at ICML 2023, Hawaii, USA

Via

Access Paper or Ask Questions

Can large language models build causal graphs?

Mar 07, 2023

Stephanie Long, Tibor Schuster, Alexandre Piché, Department of Family Medicine, McGill University, Mila, Université de Montreal, ServiceNow Research

Figure 1 for Can large language models build causal graphs?

Figure 2 for Can large language models build causal graphs?

Figure 3 for Can large language models build causal graphs?

Figure 4 for Can large language models build causal graphs?

Abstract:Building causal graphs can be a laborious process. To ensure all relevant causal pathways have been captured, researchers often have to discuss with clinicians and experts while also reviewing extensive relevant medical literature. By encoding common and medical knowledge, large language models (LLMs) represent an opportunity to ease this process by automatically scoring edges (i.e., connections between two variables) in potential graphs. LLMs however have been shown to be brittle to the choice of probing words, context, and prompts that the user employs. In this work, we evaluate if LLMs can be a useful tool in complementing causal graph development.

Via

Access Paper or Ask Questions

Learning Modular Safe Policies in the Bandit Setting with Application to Adaptive Clinical Trials

Mar 24, 2019

Hossein Aboutalebi, Doina Precup, Tibor Schuster

Figure 1 for Learning Modular Safe Policies in the Bandit Setting with Application to Adaptive Clinical Trials

Figure 2 for Learning Modular Safe Policies in the Bandit Setting with Application to Adaptive Clinical Trials

Figure 3 for Learning Modular Safe Policies in the Bandit Setting with Application to Adaptive Clinical Trials

Figure 4 for Learning Modular Safe Policies in the Bandit Setting with Application to Adaptive Clinical Trials

Abstract:The stochastic multi-armed bandit problem is a well-known model for studying the exploration-exploitation trade-off. It has significant possible applications in adaptive clinical trials, which allow for dynamic changes in the treatment allocation probabilities of patients. However, most bandit learning algorithms are designed with the goal of minimizing the expected regret. While this approach is useful in many areas, in clinical trials, it can be sensitive to outlier data, especially when the sample size is small. In this paper, we define and study a new robustness criterion for bandit problems. Specifically, we consider optimizing a function of the distribution of returns as a regret measure. This provides practitioners more flexibility to define an appropriate regret measure. The learning algorithm we propose to solve this type of problem is a modification of the BESA algorithm [Baransi et al., 2014], which considers a more general version of regret. We present a regret bound for our approach and evaluate it empirically both on synthetic problems as well as on a dataset from the clinical trial literature. Our approach compares favorably to a suite of standard bandit algorithms.

Via

Access Paper or Ask Questions