Abstract: Adaptive thermodynamic systems -- such as a biological organism attempting to gain survival advantage, an autonomous robot performing a functional task, or a motor protein transporting intracellular nutrients -- can improve their performance by effectively modeling the regularities and stochasticity in their environments. Analogously, but in a purely computational realm, machine learning algorithms seek to estimate models that capture predictable structure and identify irrelevant noise in training data by optimizing performance measures, such as a model's log-likelihood of having generated the data. Is there a sense in which these computational models are physically preferred? For adaptive physical systems, we introduce the organizing principle that thermodynamic work is the most relevant performance measure of advantageously modeling an environment. Specifically, a physical agent's model determines how much useful work it can harvest from an environment. We show that when such agents maximize work production they also maximize their environmental model's log-likelihood, establishing an equivalence between thermodynamics and learning. In this way, work maximization appears as an organizing principle that underlies learning in adaptive thermodynamic systems.
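As a minimal sketch of the claimed equivalence (notation assumed here; the paper's own symbols may differ): suppose an agent with model parameters \(\theta\) harvests average work \(\langle W \rangle_\theta\) from data \(x_{0:L}\), and the work decomposes as an affine function of the log-likelihood,
\[ \langle W \rangle_\theta = k_B T \ln \Pr_\theta(x_{0:L}) + C(x_{0:L}), \]
with \(C\) independent of \(\theta\). Then
\[ \operatorname{argmax}_\theta \, \langle W \rangle_\theta = \operatorname{argmax}_\theta \, \ln \Pr_\theta(x_{0:L}), \]
so the work-maximizing agent and the maximum-likelihood estimator select the same model.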
Abstract: Inferring models, predicting the future, and estimating the entropy rate of discrete-time, discrete-event processes is well-worn ground. However, a much broader class of discrete-event processes operates in continuous time. Here, we provide new methods for inferring models of, predicting, and estimating the entropy rate of such processes. The methods rely on an extension of Bayesian structural inference that takes advantage of neural networks' universal approximation power. Based on experiments with complex synthetic data, the methods are competitive with the state-of-the-art for prediction and entropy-rate estimation.
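The paper's estimators rely on neural density models inside Bayesian structural inference; as a far simpler stand-in, the sketch below illustrates one plug-in route to a continuous-time entropy rate, assuming a renewal process and the standard formula h = H[tau]/E[tau] (differential entropy of the inter-event density divided by the mean inter-event interval). The function name, the kernel-density estimator, and the synthetic gamma data are illustrative choices, not the paper's method.

```python
import numpy as np
from scipy.stats import gaussian_kde

def renewal_entropy_rate(intervals):
    """Plug-in entropy-rate estimate (nats per unit time) for a renewal
    process: differential entropy of the inter-event density, estimated
    by a Gaussian KDE, divided by the mean inter-event interval."""
    kde = gaussian_kde(intervals)
    log_density = np.log(kde(intervals))
    h_tau = -np.mean(log_density)        # differential entropy H[tau]
    return h_tau / np.mean(intervals)    # per-unit-time rate

rng = np.random.default_rng(0)
tau = rng.gamma(shape=2.0, scale=0.5, size=5000)  # synthetic inter-event times
print(renewal_entropy_rate(tau))
```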
Abstract: Reservoir computers (RCs) and recurrent neural networks (RNNs) can mimic any finite-state automaton in theory, and prior work has demonstrated that this can hold in practice. We test the capability of generalized linear models, RCs, and Long Short-Term Memory (LSTM) RNN architectures to predict the stochastic processes generated by a large suite of probabilistic deterministic finite-state automata (PDFAs). PDFAs provide an excellent performance benchmark in that they can be systematically enumerated, the randomness and correlation structure of their generated processes are exactly known, and their optimal memory-limited predictors are easily computed. Unsurprisingly, LSTMs outperform RCs, which outperform generalized linear models. Surprisingly, each of these methods can fall short of the maximal predictive accuracy by as much as 50% after training and, when optimized, tends to fall short of the maximal predictive accuracy by ~5%, even though previously available methods achieve maximal predictive accuracy with orders-of-magnitude less data. Thus, despite the representational universality of RCs and RNNs, using them can engender a surprising predictive gap for simple stimuli. One concludes that there is an important and underappreciated role for methods that infer "causal states" or "predictive state representations".
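For concreteness, a PDFA's optimal memory-limited predictor is computable directly from its labeled transition matrices: in each state, guess the likeliest next symbol, and weight the per-state success probabilities by the stationary state distribution. The sketch below does this for the two-state "even process", a standard example chosen here for illustration (the paper's enumerated suite is not reproduced).

```python
import numpy as np

# Labeled transition matrices of a tiny PDFA (the "even process"):
# T[x][i, j] = Pr(emit symbol x and move from state i to state j).
T = {
    0: np.array([[0.5, 0.0], [0.0, 0.0]]),
    1: np.array([[0.0, 0.5], [1.0, 0.0]]),
}

# Stationary state distribution: left Perron eigenvector of the summed matrix.
M = sum(T.values())
w, v = np.linalg.eig(M.T)
pi = np.real(v[:, np.argmax(np.real(w))])
pi /= pi.sum()

# Optimal predictive accuracy: guess the likeliest symbol in each state.
p_sym = np.stack([T[x].sum(axis=1) for x in T])  # Pr(symbol | state)
print(pi @ p_sym.max(axis=0))                    # -> 2/3 for this PDFA
```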
Abstract: Given a classical channel, a stochastic map from inputs to outputs, can we replace the input with a simple intermediate variable that still yields the correct conditional output distribution? We examine two cases: first, when the intermediate variable is classical; second, when the intermediate variable is quantum. We show that the quantum variable's size is generically smaller than the classical variable's, according to two different measures: cardinality and entropy. We demonstrate optimality conditions for a special case. We end with several related results: a proposal for extending the special case, a demonstration of the impact of quantum phases, and a case study concerning pure versus mixed states.
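In notation assumed here rather than taken from the paper: the classical question is whether the channel factors through an intermediate variable Z,
\[ \Pr(y \mid x) = \sum_{z} \Pr(y \mid z)\, \Pr(z \mid x), \]
while in the quantum case each input x is encoded in a state \(\rho_x\) on a small Hilbert space and the output is read out by a measurement \(\{E_y\}\),
\[ \Pr(y \mid x) = \operatorname{Tr}[E_y \rho_x]. \]
The two size measures then compare the cardinality of Z (or the Hilbert-space dimension) and the entropy of the intermediate variable.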
Abstract: Given a description of the stacking statistics of layered close-packed structures in the form of a hidden Markov model, we develop analytical expressions for the pairwise correlation functions between the layers. These may be calculated analytically as explicit functions of model parameters, or the expressions may be used as a fast, accurate, and efficient way to obtain numerical values. We present several examples, finding agreement with previous work as well as deriving new relations.
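As a generic illustration (not the paper's close-packed specialization), the pairwise statistics of any hidden Markov model with labeled transition matrices \(T^{(s)}\) have the closed form \(\Pr(x_0 = a, x_n = b) = \pi\, T^{(a)} M^{n-1} T^{(b)} \mathbf{1}\), where \(M = \sum_s T^{(s)}\) and \(\pi\) is the stationary state distribution. A minimal numpy sketch, with an example HMM chosen arbitrarily:

```python
import numpy as np

def pair_prob(T, a, b, n):
    """Pr(x_0 = a, x_n = b) for an HMM with labeled transition
    matrices T[s]; requires n >= 1."""
    M = sum(T.values())                        # full state-to-state matrix
    w, v = np.linalg.eig(M.T)                  # stationary distribution =
    pi = np.real(v[:, np.argmax(np.real(w))])  # left Perron eigenvector of M
    pi /= pi.sum()
    ones = np.ones(len(pi))
    return pi @ T[a] @ np.linalg.matrix_power(M, n - 1) @ T[b] @ ones

# Example: an arbitrary two-state HMM over the symbol alphabet {0, 1}.
T = {0: np.array([[0.0, 0.5], [0.0, 0.0]]),
     1: np.array([[0.5, 0.0], [1.0, 0.0]])}
print(pair_prob(T, 1, 1, 2))  # probability of symbol 1 at both positions 0 and 2
```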