
Showing 1–17 of 17 results for author: Kidger, P

  1. arXiv:2405.06464  [pdf, other]

    math.NA cs.LG math.PR stat.CO

    Single-seed generation of Brownian paths and integrals for adaptive and high order SDE solvers

    Authors: Andraž Jelinčič, James Foster, Patrick Kidger

    Abstract: Despite the success of adaptive time-stepping in ODE simulation, it has so far seen few applications for Stochastic Differential Equations (SDEs). To simulate SDEs adaptively, methods such as the Virtual Brownian Tree (VBT) have been developed, which can generate Brownian motion (BM) non-chronologically. However, in most applications, knowing only the values of Brownian motion is not enough to ach…

    Submitted 25 May, 2024; v1 submitted 10 May, 2024; originally announced May 2024.

  2. arXiv:2402.09983  [pdf, other]

    math.OC cs.MS

    Optimistix: modular optimisation in JAX and Equinox

    Authors: Jason Rader, Terry Lyons, Patrick Kidger

    Abstract: We introduce Optimistix: a nonlinear optimisation library built in JAX and Equinox. Optimistix introduces a novel, modular approach for its minimisers and least-squares solvers. This modularity relies on new practical abstractions for optimisation which we call search and descent, and which generalise classical notions of line search, trust-region, and learning-rate algorithms. It provides high-le…

    Submitted 15 February, 2024; originally announced February 2024.

    Comments: 8 pages, 4 figures, 2 tables

  3. arXiv:2311.17283  [pdf, other]

    cs.MS

    Lineax: unified linear solves and linear least-squares in JAX and Equinox

    Authors: Jason Rader, Terry Lyons, Patrick Kidger

    Abstract: We introduce Lineax, a library bringing linear solves and linear least-squares to the JAX+Equinox scientific computing ecosystem. Lineax uses general linear operators, and unifies linear solves and least-squares into a single, autodifferentiable API. Solvers and operators are user-extensible, without requiring the user to implement any custom derivative rules to get differentiability. Lineax is av…

    Submitted 28 November, 2023; originally announced November 2023.

    Comments: 7 pages, 1 figure, NeurIPS 2023 AI for Science workshop

  4. arXiv:2202.02435  [pdf, other]

    cs.LG math.CA math.DS math.NA stat.ML

    On Neural Differential Equations

    Authors: Patrick Kidger

    Abstract: The conjoining of dynamical systems and deep learning has become a topic of great interest. In particular, neural differential equations (NDEs) demonstrate that neural networks and differential equations are two sides of the same coin. Traditional parameterised differential equations are a special case. Many popular neural network architectures, such as residual networks and recurrent networks, are…

    Submitted 4 February, 2022; originally announced February 2022.

    Comments: Doctoral thesis, Mathematical Institute, University of Oxford. 231 pages

  5. arXiv:2111.00254  [pdf, ps, other]

    cs.LG cs.PL

    Equinox: neural networks in JAX via callable PyTrees and filtered transformations

    Authors: Patrick Kidger, Cristian Garcia

    Abstract: JAX and PyTorch are two popular Python autodifferentiation frameworks. JAX is based around pure functions and functional programming. PyTorch has popularised the use of an object-oriented (OO) class-based syntax for defining parameterised functions, such as neural networks. That this seems like a fundamental difference means current libraries for building parameterised functions in JAX have either…

    Submitted 30 October, 2021; originally announced November 2021.

    Comments: Accepted at the Differentiable Programming workshop at NeurIPS 2021

  6. arXiv:2106.11028  [pdf, other]

    cs.LG

    Neural Controlled Differential Equations for Online Prediction Tasks

    Authors: James Morrill, Patrick Kidger, Lingyi Yang, Terry Lyons

    Abstract: Neural controlled differential equations (Neural CDEs) are a continuous-time extension of recurrent neural networks (RNNs), achieving state-of-the-art (SOTA) performance at modelling functions of irregular time series. In order to interpret discrete data in continuous time, current implementations rely on non-causal interpolations of the data. This is fine when the whole time series is observed in…

    Submitted 21 June, 2021; originally announced June 2021.

  7. arXiv:2105.13493  [pdf, other]

    cs.LG cs.AI math.DS stat.ML

    Efficient and Accurate Gradients for Neural SDEs

    Authors: Patrick Kidger, James Foster, Xuechen Li, Terry Lyons

    Abstract: Neural SDEs combine many of the best qualities of both RNNs and SDEs: memory efficient training, high-capacity function approximation, and strong priors on model space. This makes them a natural choice for modelling many types of temporal dynamics. Training a Neural SDE (either as a VAE or as a GAN) requires backpropagating through an SDE solve. This may be done by solving a backwards-in-time SDE…

    Submitted 19 October, 2021; v1 submitted 27 May, 2021; originally announced May 2021.

    Comments: Accepted at NeurIPS 2021

  8. arXiv:2102.03657  [pdf, other]

    cs.LG

    Neural SDEs as Infinite-Dimensional GANs

    Authors: Patrick Kidger, James Foster, Xuechen Li, Harald Oberhauser, Terry Lyons

    Abstract: Stochastic differential equations (SDEs) are a staple of mathematical modelling of temporal dynamics. However, a fundamental limitation has been that such models have typically been relatively inflexible, which recent work introducing Neural SDEs has sought to solve. Here, we show that the current classical approach to fitting SDEs may be approached as a special case of (Wasserstein) GANs, and in…

    Submitted 11 May, 2021; v1 submitted 6 February, 2021; originally announced February 2021.

    Comments: Published at ICML 2021

  9. arXiv:2009.09457  [pdf, other]

    cs.LG math.CA

    "Hey, that's not an ODE": Faster ODE Adjoints via Seminorms

    Authors: Patrick Kidger, Ricky T. Q. Chen, Terry Lyons

    Abstract: Neural differential equations may be trained by backpropagating gradients via the adjoint method, which is another differential equation typically solved using an adaptive-step-size numerical differential equation solver. A proposed step is accepted if its error, relative to some norm, is sufficiently small; else it is rejected, the step is shrunk, and the process is repeated. Here, we demo…

    Submitted 10 May, 2021; v1 submitted 20 September, 2020; originally announced September 2020.

    Comments: Published at ICML 2021

  10. arXiv:2009.08295  [pdf, other]

    cs.LG cs.AI math.DS stat.ML

    Neural Rough Differential Equations for Long Time Series

    Authors: James Morrill, Cristopher Salvi, Patrick Kidger, James Foster, Terry Lyons

    Abstract: Neural controlled differential equations (CDEs) are the continuous-time analogue of recurrent neural networks, as Neural ODEs are to residual networks, and offer a memory-efficient continuous-time way to model functions of potentially irregular time series. Existing methods for computing the forward pass of a Neural CDE involve embedding the incoming time series into path space, often via interpol…

    Submitted 21 June, 2021; v1 submitted 17 September, 2020; originally announced September 2020.

    Comments: Published at ICML 2021

  11. arXiv:2006.00873  [pdf, other]

    cs.LG stat.ML

    A Generalised Signature Method for Multivariate Time Series Feature Extraction

    Authors: James Morrill, Adeline Fermanian, Patrick Kidger, Terry Lyons

    Abstract: The 'signature method' refers to a collection of feature extraction techniques for multivariate time series, derived from the theory of controlled differential equations. There is a great deal of flexibility as to how this method can be applied. On the one hand, this flexibility allows the method to be tailored to specific problems, but on the other hand, can make precise application challenging.…

    Submitted 6 February, 2021; v1 submitted 1 June, 2020; originally announced June 2020.

    Comments: 25 pages

  12. arXiv:2005.13948  [pdf, other]

    cs.LG stat.ML

    Generalised Interpretable Shapelets for Irregular Time Series

    Authors: Patrick Kidger, James Morrill, Terry Lyons

    Abstract: The shapelet transform is a form of feature extraction for time series, in which a time series is described by its similarity to each of a collection of `shapelets'. However it has previously suffered from a number of limitations, such as being limited to regularly-spaced fully-observed time series, and having to choose between efficient training and interpretability. Here, we extend the method to…

    Submitted 29 May, 2020; v1 submitted 28 May, 2020; originally announced May 2020.

  13. arXiv:2005.08926  [pdf, ps, other]

    cs.LG stat.ML

    Neural Controlled Differential Equations for Irregular Time Series

    Authors: Patrick Kidger, James Morrill, James Foster, Terry Lyons

    Abstract: Neural ordinary differential equations are an attractive option for modelling temporal dynamics. However, a fundamental issue is that the solution to an ordinary differential equation is determined by its initial condition, and there is no mechanism for adjusting the trajectory based on subsequent observations. Here, we demonstrate how this may be resolved through the well-understood mathematics o…

    Submitted 5 November, 2020; v1 submitted 18 May, 2020; originally announced May 2020.

    Comments: Accepted at NeurIPS 2020 (Spotlight)

  14. arXiv:2001.00706  [pdf, other]

    cs.LG stat.ML

    Signatory: differentiable computations of the signature and logsignature transforms, on both CPU and GPU

    Authors: Patrick Kidger, Terry Lyons

    Abstract: Signatory is a library for calculating and performing functionality related to the signature and logsignature transforms. The focus is on machine learning, and as such includes features such as CPU parallelism, GPU support, and backpropagation. To our knowledge it is the first GPU-capable library for these operations. Signatory implements new features not available in previous libraries, such as e…

    Submitted 5 February, 2021; v1 submitted 2 January, 2020; originally announced January 2020.

    Comments: Published at ICLR 2021

    MSC Class: 60H30; 68T99; 68N30

  15. arXiv:2001.00668  [pdf, ps, other]

    math.CA

    The degree-$(n+1)$ polynomials are the most difficult $C^{\,n + 1}$ functions to uniformly approximate with degree-$n$ polynomials

    Authors: Patrick Kidger

    Abstract: There exist well-known tight bounds on the error between a function $f \in C^{\,n + 1}([-1, 1])$ and its best polynomial approximation of degree $n$. We show that the error meets these bounds when and only when $f$ is a polynomial of degree $n + 1$.

    Submitted 2 January, 2020; originally announced January 2020.

    MSC Class: 41A10; 41A05; 41A17

  16. arXiv:1905.08539  [pdf, ps, other]

    cs.LG math.CA stat.ML

    Universal Approximation with Deep Narrow Networks

    Authors: Patrick Kidger, Terry Lyons

    Abstract: The classical Universal Approximation Theorem holds for neural networks of arbitrary width and bounded depth. Here we consider the natural `dual' scenario for networks of bounded width and arbitrary depth. Precisely, let $n$ be the number of input neurons, $m$ be the number of output neurons, and let $ρ$ be any nonaffine continuous function, with a continuous nonzero derivative at some point. The…

    Submitted 8 June, 2020; v1 submitted 21 May, 2019; originally announced May 2019.

    Comments: Accepted at COLT 2020

    MSC Class: 41A46; 41A63; 68T07

  17. arXiv:1905.08494  [pdf, other]

    cs.LG stat.ML

    Deep Signature Transforms

    Authors: Patric Bonnier, Patrick Kidger, Imanol Perez Arribas, Cristopher Salvi, Terry Lyons

    Abstract: The signature is an infinite graded sequence of statistics known to characterise a stream of data up to a negligible equivalence class. It is a transform which has previously been treated as a fixed feature transformation, on top of which a model may be built. We propose a novel approach which combines the advantages of the signature transform with modern deep learning frameworks. By learning an a…

    Submitted 26 October, 2019; v1 submitted 21 May, 2019; originally announced May 2019.

    Comments: Published at NeurIPS 2019

    MSC Class: 68T01