Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Susanne Still

Optimal Causal Inference: Estimating Stored Information and Approximating Causal Architecture

Aug 19, 2010

Susanne Still, James P. Crutchfield, Christopher J. Ellison

Figure 1 for Optimal Causal Inference: Estimating Stored Information and Approximating Causal Architecture

Figure 2 for Optimal Causal Inference: Estimating Stored Information and Approximating Causal Architecture

Figure 3 for Optimal Causal Inference: Estimating Stored Information and Approximating Causal Architecture

Figure 4 for Optimal Causal Inference: Estimating Stored Information and Approximating Causal Architecture

Abstract:We introduce an approach to inferring the causal architecture of stochastic dynamical systems that extends rate distortion theory to use causal shielding---a natural principle of learning. We study two distinct cases of causal inference: optimal causal filtering and optimal causal estimation. Filtering corresponds to the ideal case in which the probability distribution of measurement sequences is known, giving a principled method to approximate a system's causal structure at a desired level of representation. We show that, in the limit in which a model complexity constraint is relaxed, filtering finds the exact causal architecture of a stochastic dynamical system, known as the causal-state partition. From this, one can estimate the amount of historical information the process stores. More generally, causal filtering finds a graded model-complexity hierarchy of approximations to the causal architecture. Abrupt changes in the hierarchy, as a function of approximation, capture distinct scales of structural organization. For nonideal cases with finite data, we show how the correct number of underlying causal states can be found by optimal causal estimation. A previously derived model complexity control term allows us to correct for the effect of statistical fluctuations in probability estimates and thereby avoid over-fitting.

* 14 pages, 13 figures; http://cse.ucdavis.edu/~cmg/compmech/pubs/oci.htm; Updated figures and citations; added corrections and clarifications

Via

Access Paper or Ask Questions

Structure or Noise?

Jun 29, 2008

Susanne Still, James P. Crutchfield

Abstract:We show how rate-distortion theory provides a mechanism for automated theory building by naturally distinguishing between regularity and randomness. We start from the simple principle that model variables should, as much as possible, render the future and past conditionally independent. From this, we construct an objective function for model making whose extrema embody the trade-off between a model's structural complexity and its predictive power. The solutions correspond to a hierarchy of models that, at each level of complexity, achieve optimal predictive power at minimal cost. In the limit of maximal prediction the resulting optimal model identifies a process's intrinsic organization by extracting the underlying causal states. In this limit, the model's complexity is given by the statistical complexity, which is known to be minimal for achieving maximum prediction. Examples show how theory building can profit from analyzing a process's causal compressibility, which is reflected in the optimal models' rate-distortion curve--the process's characteristic for optimally balancing structure and noise at different levels of representation.

* 6 pages, 2 figures; http://cse.ucdavis.edu/~cmg/compmech/pubs/son.html

Via

Access Paper or Ask Questions