-
Generalised Synchronisations, Embeddings, and Approximations for Continuous Time Reservoir Computers
Authors:
Allen G Hart
Abstract:
We establish conditions under which a continuous time reservoir computer, such as a leaky integrator echo state network, admits a generalised synchronisation $f$ between between the source dynamics and reservoir dynamics. We show that multiple generalised synchronisations can exist simultaneously, and connect this to the multi-Echo-State-Property (multi-ESP). In the special case of a linear reserv…
▽ More
We establish conditions under which a continuous time reservoir computer, such as a leaky integrator echo state network, admits a generalised synchronisation $f$ between between the source dynamics and reservoir dynamics. We show that multiple generalised synchronisations can exist simultaneously, and connect this to the multi-Echo-State-Property (multi-ESP). In the special case of a linear reservoir computer, we derive a closed form expression for the generalised synchronisation $f$. Furthermore, we establish conditions under which $f$ is of class $C^1$, and conditions under which $f$ is a topological embedding on the fixed points of the source system. This embedding result is closely related to Takens' embedding Theorem.
We also prove that the embedding of fixed points occurs almost surely for randomly generated linear reservoir systems. With an embedding achieved, we discuss how the universal approximation theorem makes it possible to forecast the future dynamics of the source system and replicate its topological properties. We illustrate the theory by embedding a fixed point of the Lorenz-63 system into the reservoir space using numerical methods. Finally, we show that if the observations are perturbed by white noise, the GS is preserved up to a perturbation by an Ornstein-Uhlenbeck process.
△ Less
Submitted 26 October, 2023; v1 submitted 17 November, 2022;
originally announced November 2022.
-
(Thesis) Reservoir Computing With Dynamical Systems
Authors:
Allen G Hart
Abstract:
A reservoir computer is a special type of neural network, where most of the weights are randomly fixed and only a subset are trained.
In this thesis we prove results about reservoir computers trained on deterministic dynamical systems, and stochastic processes. We focus mostly on a special type of reservoir computer called an Echo State Network (ESN).
In the deterministic case, we prove (under…
▽ More
A reservoir computer is a special type of neural network, where most of the weights are randomly fixed and only a subset are trained.
In this thesis we prove results about reservoir computers trained on deterministic dynamical systems, and stochastic processes. We focus mostly on a special type of reservoir computer called an Echo State Network (ESN).
In the deterministic case, we prove (under some assumptions) that if a reservoir computer has the Echo State Property (ESP), then there is a C1 generalised synchronisation between the input dynamical system and the dynamics in the reservoir space. Furthermore, we prove that a reservoir computer with the local ESP in several disjoint subsets of the reservoir space will admit several distinct generalised synchronisations. In the special case that the reservoir map is linear, and has the ESP, we prove that the generalised synchronisation is generically an embedding. This result admits Takens' embedding Theorem as a special case.
We go to show that ESNs trained on scalar observations of an ergodic dynamical system can approximate an arbitrary target function, including the next step map used in time series forecasting. This universal approximation property holds despite the training process being entirely linear.
We prove analogous results for ESNs trained on observations of a stochastic process, which are not be Markovian in general. We use these results to develop supervised learning, and reinforcement learning algorithms supported by an ESN.
In the penultimate chapter of this thesis, we use a reservoir computer to numerically solve linear PDEs. In the final chapter, we conclude and discuss directions for future work.
△ Less
Submitted 24 December, 2021; v1 submitted 28 November, 2021;
originally announced November 2021.
-
A hidden Markov model for describing turbostratic disorder applied to carbon blacks and graphene
Authors:
A G Hart,
T C Hansen,
W F Kuhs
Abstract:
We present a mathematical framework to represent turbostratic disorder in materials like carbon blacks, smectites, and twisted $n$-layer graphene. In particular, the set of all possible disordered layers, including rotated, shifted, and curved layers form a stochastic sequence governed by a hidden Markov model. The probability distribution over the set of layer types is treated as an element of a…
▽ More
We present a mathematical framework to represent turbostratic disorder in materials like carbon blacks, smectites, and twisted $n$-layer graphene. In particular, the set of all possible disordered layers, including rotated, shifted, and curved layers form a stochastic sequence governed by a hidden Markov model. The probability distribution over the set of layer types is treated as an element of a Hilbert space, and using tools of Fourier analysis and functional analysis, we develop expressions for the scattering cross sections of a broad class of disordered materials.
△ Less
Submitted 16 February, 2021;
originally announced February 2021.
-
A Markov theoretic description of stacking disordered aperiodic crystals including ice and opaline silica
Authors:
Allen G Hart,
Thomas C Hansen,
Werner F Kuhs
Abstract:
We review the Markov theoretic description of 1D aperiodic crystals, describing the stacking-faulted crystal polytype as a special case of an aperiodic crystal. Under this description we generalise the centrosymmetric unit cell underlying a topologically centrosymmetric crystal to a reversible Markov chain underlying a reversible aperiodic crystal. We show that for the close-packed structure, almo…
▽ More
We review the Markov theoretic description of 1D aperiodic crystals, describing the stacking-faulted crystal polytype as a special case of an aperiodic crystal. Under this description we generalise the centrosymmetric unit cell underlying a topologically centrosymmetric crystal to a reversible Markov chain underlying a reversible aperiodic crystal. We show that for the close-packed structure, almost all stackings are irreversible when the interaction reichweite is greater than 4. Moreover, we present an analytic expression of the scattering cross section of a large class of stacking disordered aperiodic crystals, lacking translational symmetry of their layers, including ice and opaline silica (opal CT). We then relate the observed stackings and their underlying reichweite to the physics of various nucleation and growth processes of disordered ice.
△ Less
Submitted 16 February, 2021;
originally announced February 2021.
-
Using Echo State Networks to Approximate Value Functions for Control
Authors:
Allen G. Hart,
Kevin R. Olding,
A. M. G. Cox,
Olga Isupova,
J. H. P. Dawes
Abstract:
An Echo State Network (ESN) is a type of single-layer recurrent neural network with randomly-chosen internal weights and a trainable output layer. We prove under mild conditions that a sufficiently large Echo State Network can approximate the value function of a broad class of stochastic and deterministic control problems. Such control problems are generally non-Markovian.
We describe how the ES…
▽ More
An Echo State Network (ESN) is a type of single-layer recurrent neural network with randomly-chosen internal weights and a trainable output layer. We prove under mild conditions that a sufficiently large Echo State Network can approximate the value function of a broad class of stochastic and deterministic control problems. Such control problems are generally non-Markovian.
We describe how the ESN can form the basis for novel and computationally efficient reinforcement learning algorithms in a non-Markovian framework. We demonstrate this theory with two examples. In the first, we use an ESN to solve a deterministic, partially observed, control problem which is a simple game we call `Bee World'. In the second example, we consider a stochastic control problem inspired by a market making problem in mathematical finance. In both cases we can compare the dynamics of the algorithms with analytic solutions to show that even after only a single reinforcement policy iteration the algorithms arrive at a good policy.
△ Less
Submitted 25 June, 2021; v1 submitted 11 February, 2021;
originally announced February 2021.
-
Echo State Networks trained by Tikhonov least squares are L2(μ) approximators of ergodic dynamical systems
Authors:
Allen G Hart,
James L Hook,
Jonathan H P Dawes
Abstract:
Echo State Networks (ESNs) are a class of single-layer recurrent neural networks with randomly generated internal weights, and a single layer of tuneable outer weights, which are usually trained by regularised linear least squares regression. Remarkably, ESNs still enjoy the universal approximation property despite the training procedure being entirely linear. In this paper, we prove that an ESN t…
▽ More
Echo State Networks (ESNs) are a class of single-layer recurrent neural networks with randomly generated internal weights, and a single layer of tuneable outer weights, which are usually trained by regularised linear least squares regression. Remarkably, ESNs still enjoy the universal approximation property despite the training procedure being entirely linear. In this paper, we prove that an ESN trained on a sequence of observations from an ergodic dynamical system (with invariant measure $μ$) using Tikhonov least squares regression against a set of targets, will approximate the target function in the $L^2(μ)$ norm. In the special case that the targets are future observations, the ESN is learning the next step map, which allows time series forecasting. We demonstrate the theory numerically by training an ESN using Tikhonov least squares on a sequence of scalar observations of the Lorenz system.
△ Less
Submitted 18 February, 2021; v1 submitted 14 May, 2020;
originally announced May 2020.
-
Embedding and Approximation Theorems for Echo State Networks
Authors:
Allen G Hart,
James L Hook,
Jonathan H P Dawes
Abstract:
Echo State Networks (ESNs) are a class of single layer recurrent neural networks that have enjoyed recent attention. In this paper we prove that a suitable ESN, trained on a series of measurements of an invertible dynamical system, induces a C1 map from the dynamical system's phase space to the ESN's reservoir space. We call this the Echo State Map. We then prove that the Echo State Map is generic…
▽ More
Echo State Networks (ESNs) are a class of single layer recurrent neural networks that have enjoyed recent attention. In this paper we prove that a suitable ESN, trained on a series of measurements of an invertible dynamical system, induces a C1 map from the dynamical system's phase space to the ESN's reservoir space. We call this the Echo State Map. We then prove that the Echo State Map is generically an embedding with positive probability. Under additional mild assumptions, we further conjecture that the Echo State Map is almost surely an embedding. For sufficiently large, and specially structured, but still randomly generated ESNs, we prove that there exists a linear readout layer that allows the ESN to predict the next observation of a dynamical system arbitrarily well. Consequently, if the dynamical system under observation is structurally stable then the trained ESN will exhibit dynamics that are topologically conjugate to the future behaviour of the observed dynamical system. Our theoretical results connect the theory of ESNs to the delay-embedding literature for dynamical systems, and are supported by numerical evidence from simulations of the traditional Lorenz equations. The simulations confirm that, from a one dimensional observation function, an ESN can accurately infer a range of geometric and topological features of the dynamics such as the eigenvalues of equilibrium points, Lyapunov exponents and homology groups.
△ Less
Submitted 18 May, 2020; v1 submitted 14 August, 2019;
originally announced August 2019.
-
A Markovian genomic concatenation model guided by persymmetric matrices
Authors:
Andrew G. Hart,
M. Sobottka
Abstract:
The aim of this work is to provide a rigorous mathematical analysis of a stochastic concatenation model presented by Sobottka and Hart (2011) which allows approximation of the first-order stochastic structure in bacterial DNA by means of a stationary Markov chain. Two probabilistic constructions that rigorously formalize the model are presented. Necessary and sufficient conditions for a Markov cha…
▽ More
The aim of this work is to provide a rigorous mathematical analysis of a stochastic concatenation model presented by Sobottka and Hart (2011) which allows approximation of the first-order stochastic structure in bacterial DNA by means of a stationary Markov chain. Two probabilistic constructions that rigorously formalize the model are presented. Necessary and sufficient conditions for a Markov chain to be generated by the model are given, as well as the theoretical background needed for designing new algorithms for statistical analyses of real bacterial genomes. It is shown that the model encompasses the Markov chains satisfying intra-strand parity, a property observed in most DNA sequences.
△ Less
Submitted 26 November, 2019; v1 submitted 6 May, 2018;
originally announced May 2018.
-
A model capturing novel strand symmetries in bacterial DNA
Authors:
Marcelo Sobottka,
Andrew G. Hart
Abstract:
Chargaff's second parity rule for short oligonucleotides states that the frequency of any short nucleotide sequence on a strand is approximately equal to the frequency of its reverse complement on the same strand. Recent studies have shown that, with the exception of organellar DNA, this parity rule generally holds for double stranded DNA genomes and fails to hold for single-stranded genomes. Whil…
▽ More
Chargaff's second parity rule for short oligonucleotides states that the frequency of any short nucleotide sequence on a strand is approximately equal to the frequency of its reverse complement on the same strand. Recent studies have shown that, with the exception of organellar DNA, this parity rule generally holds for double stranded DNA genomes and fails to hold for single-stranded genomes. While Chargaff's first parity rule is fully explained by the Watson-Crick pairing in the DNA double helix, a definitive explanation for the second parity rule has not yet been determined. In this work, we propose a model based on a hidden Markov process for approximating the distributional structure of primitive DNA sequences. Then, we use the model to provide another possible theoretical explanation for Chargaff's second parity rule, and to predict novel distributional aspects of bacterial DNA sequences.
△ Less
Submitted 14 January, 2014;
originally announced January 2014.
-
Radiative improvement of the lattice NRQCD action using the background field method with applications to quarkonium spectroscopy
Authors:
T. C. Hammant,
A. G. Hart,
G. M. von Hippel,
R. R. Horgan,
C. J. Monahan
Abstract:
We apply the background field (BF) method to Non-Relativistic QCD (NRQCD) on the lattice in order to determine the one-loop radiative corrections to the coefficients of the NRQCD action in a manifestly gauge-covariant manner by matching the NRQCD prediction for particular on-shell processes with those of relativistic continuum QCD. We explain how the BF method is implemented in automated perturbat…
▽ More
We apply the background field (BF) method to Non-Relativistic QCD (NRQCD) on the lattice in order to determine the one-loop radiative corrections to the coefficients of the NRQCD action in a manifestly gauge-covariant manner by matching the NRQCD prediction for particular on-shell processes with those of relativistic continuum QCD. We explain how the BF method is implemented in automated perturbation theory and discuss the technique for matching the relativistic and non-relativistic theories. We compute the one-loop radiative corrections to the sigma.B and Darwin terms for the NRQCD action currently used in simulations, as well as the one-loop coefficients of the spin-dependent O(alpha^2) four-fermion contact terms. The effect of the corrections on the hyperfine splitting of bottomonium is estimated using earlier simulation results; the corrected lattice prediction is found to be in agreement with experiment. Agreement of the hyperfine splitting of bottomonium and the B-meson system is confirmed by recent simulation studies (Dowdall et al.) which include our NRQCD radiative corrections for the first time.
△ Less
Submitted 16 November, 2015; v1 submitted 13 March, 2013;
originally announced March 2013.
-
Radiative improvement of spin and Darwin terms in the NRQCD action
Authors:
T. C. Hammant,
A. G. Hart,
G. M. von Hippel,
R. R. Horgan,
C. J. Monahan
Abstract:
We present updated results for the radiative improvement of the σ.B term and the spin-dependent four-fermion terms in the lattice NRQCD action, and first results for the radiative corrections to the NRQCD Darwin term and spin-independent four-fermion terms. The spin-dependent terms have significant impact on getting the correct hyperfine splitting for both bottomonium and heavy-light mesons, while…
▽ More
We present updated results for the radiative improvement of the σ.B term and the spin-dependent four-fermion terms in the lattice NRQCD action, and first results for the radiative corrections to the NRQCD Darwin term and spin-independent four-fermion terms. The spin-dependent terms have significant impact on getting the correct hyperfine splitting for both bottomonium and heavy-light mesons, while the spin-independent terms suffer from a conspiracy between lattice artifacts and severe IR divergences that complicates their evaluation.
△ Less
Submitted 12 December, 2012;
originally announced December 2012.
-
Radiative improvement of the lattice NRQCD action using the background field method and application to the hyperfine splitting of quarkonium states
Authors:
T. C. Hammant,
A. G. Hart,
G. M. von Hippel,
R. R. Horgan,
C. J. Monahan
Abstract:
We present the first application of the background field method to Non-Relativistic QCD (NRQCD) on the lattice in order to determine the one-loop radiative corrections to the coefficients of the NRQCD action in a manifestly gauge-covariant manner. The coefficient of the $σ\cdot B$ term in the NRQCD action is computed at the one-loop level; the resulting shift of the hyperfine splitting of bottomon…
▽ More
We present the first application of the background field method to Non-Relativistic QCD (NRQCD) on the lattice in order to determine the one-loop radiative corrections to the coefficients of the NRQCD action in a manifestly gauge-covariant manner. The coefficient of the $σ\cdot B$ term in the NRQCD action is computed at the one-loop level; the resulting shift of the hyperfine splitting of bottomonium is found to bring the lattice predictions in line with experiment.
△ Less
Submitted 19 June, 2015; v1 submitted 26 May, 2011;
originally announced May 2011.
-
Improved automated lattice perturbation theory in background field gauge
Authors:
T. C. Hammant,
R. R. Horgan,
C. J. Monahan,
A. G. Hart,
E. H. Müller,
A. Gray,
K. Sivalingham,
G. M. von Hippel
Abstract:
We present an algorithm to automatically derive Feynman rules for lattice perturbation theory in background field gauge. Vertices with an arbitrary number of both background and quantum legs can be derived automatically from both gluonic and fermionic actions. The algorithm is a generalisation of our earlier algorithm based on prior work by Lüscher and Weisz. We also present techniques allowing fo…
▽ More
We present an algorithm to automatically derive Feynman rules for lattice perturbation theory in background field gauge. Vertices with an arbitrary number of both background and quantum legs can be derived automatically from both gluonic and fermionic actions. The algorithm is a generalisation of our earlier algorithm based on prior work by Lüscher and Weisz. We also present techniques allowing for the parallelisation of the evaluation of the often rather complex lattice Feynman rules that should allow for efficient implementation on GPUs, but also give a significant speed-up when calculating the derivatives of Feynman diagrams with respect to external momenta.
△ Less
Submitted 11 November, 2010;
originally announced November 2010.