Showing 1–30 of 30 results for author: Lyons, T

Searching in archive stat.
  1. arXiv:2304.01862  [pdf, other]

    stat.ME

    The insertion method to invert the signature of a path

    Authors: Adeline Fermanian, Jiawei Chang, Terry Lyons, Gérard Biau

    Abstract: The signature is a representation of a path as an infinite sequence of its iterated integrals. Under certain assumptions, the signature characterizes the path, up to translation and reparameterization. Therefore, a crucial question of interest is the development of efficient algorithms to invert the signature, i.e., to reconstruct the path from the information of its (truncated) signature. In this…

    Submitted 19 September, 2023; v1 submitted 4 April, 2023; originally announced April 2023.

  2. arXiv:2301.13112  [pdf, other]

    stat.ML cs.LG

    Benchmarking optimality of time series classification methods in distinguishing diffusions

    Authors: Zehong Zhang, Fei Lu, Esther Xu Fei, Terry Lyons, Yannis Kevrekidis, Tom Woolf

    Abstract: Statistical optimality benchmarking is crucial for analyzing and designing time series classification (TSC) algorithms. This study proposes to benchmark the optimality of TSC algorithms in distinguishing diffusion processes by the likelihood ratio test (LRT). The LRT is an optimal classifier by the Neyman-Pearson lemma. The LRT benchmarks are computationally efficient because the LRT does not need…

    Submitted 11 April, 2023; v1 submitted 30 January, 2023; originally announced January 2023.

    Comments: 23 pages, 8 figures

    MSC Class: 62M02; 62M10; 62M20

  3. arXiv:2301.09517  [pdf, other]

    math.NA cs.LG stat.ML

    Sampling-based Nyström Approximation and Kernel Quadrature

    Authors: Satoshi Hayakawa, Harald Oberhauser, Terry Lyons

    Abstract: We analyze the Nyström approximation of a positive definite kernel associated with a probability measure. We first prove an improved error bound for the conventional Nyström approximation with i.i.d. sampling and singular-value decomposition in the continuous regime; the proof techniques are borrowed from statistical learning theory. We further introduce a refined selection of subspaces in Nyström…

    Submitted 22 May, 2023; v1 submitted 23 January, 2023; originally announced January 2023.

    Comments: 22 pages, ICML 2023 camera-ready version. Typos fixed

  4. arXiv:2206.14674  [pdf, other]

    stat.ML cs.LG math.CA math.NA math.ST stat.ME

    Signature Methods in Machine Learning

    Authors: Terry Lyons, Andrew D. McLeod

    Abstract: Signature-based techniques give mathematical insight into the interactions between complex streams of evolving data. These insights can be quite naturally translated into numerical approaches to understanding streamed data, and perhaps because of their mathematical precision, have proved useful in analysing streamed data in situations where the data is irregular, and not stationary, and the dimens…

    Submitted 26 January, 2024; v1 submitted 29 June, 2022; originally announced June 2022.

    MSC Class: 60L10; 93C15; 68Q32; 34F05

  5. arXiv:2109.03582  [pdf, other]

    stat.ML cs.LG

    Higher Order Kernel Mean Embeddings to Capture Filtrations of Stochastic Processes

    Authors: Cristopher Salvi, Maud Lemercier, Chong Liu, Blanka Horvath, Theodoros Damoulas, Terry Lyons

    Abstract: Stochastic processes are random variables with values in some space of paths. However, reducing a stochastic process to a path-valued random variable ignores its filtration, i.e. the flow of information carried by the process through time. By conditioning the process on its filtration, we introduce a family of higher order kernel mean embeddings (KMEs) that generalizes the notion of KME and captur…

    Submitted 3 November, 2021; v1 submitted 8 September, 2021; originally announced September 2021.

    Comments: Published at NeurIPS 2021

    MSC Class: 60L10; 60L20

  6. arXiv:2107.09597  [pdf, other]

    math.NA cs.LG stat.ML

    Positively Weighted Kernel Quadrature via Subsampling

    Authors: Satoshi Hayakawa, Harald Oberhauser, Terry Lyons

    Abstract: We study kernel quadrature rules with convex weights. Our approach combines the spectral properties of the kernel with recombination results about point measures. This results in effective algorithms that construct convex quadrature rules using only access to i.i.d. samples from the underlying measure and evaluation of the kernel and that result in a small worst-case error. In addition to our theo…

    Submitted 11 October, 2022; v1 submitted 20 July, 2021; originally announced July 2021.

    Comments: 29 pages, NeurIPS 2022 camera-ready version

  7. arXiv:2105.13493  [pdf, other]

    cs.LG cs.AI math.DS stat.ML

    Efficient and Accurate Gradients for Neural SDEs

    Authors: Patrick Kidger, James Foster, Xuechen Li, Terry Lyons

    Abstract: Neural SDEs combine many of the best qualities of both RNNs and SDEs: memory efficient training, high-capacity function approximation, and strong priors on model space. This makes them a natural choice for modelling many types of temporal dynamics. Training a Neural SDE (either as a VAE or as a GAN) requires backpropagating through an SDE solve. This may be done by solving a backwards-in-time SDE…

    Submitted 19 October, 2021; v1 submitted 27 May, 2021; originally announced May 2021.

    Comments: Accepted at NeurIPS 2021

  8. arXiv:2105.04211  [pdf, other]

    stat.ML cs.LG

    SigGPDE: Scaling Sparse Gaussian Processes on Sequential Data

    Authors: Maud Lemercier, Cristopher Salvi, Thomas Cass, Edwin V. Bonilla, Theodoros Damoulas, Terry Lyons

    Abstract: Making predictions and quantifying their uncertainty when the input data is sequential is a fundamental learning challenge, recently attracting increasing attention. We develop SigGPDE, a new scalable sparse variational inference framework for Gaussian Processes (GPs) on sequential data. Our contribution is twofold. First, we construct inducing variables underpinning the sparse approximation so th…

    Submitted 12 October, 2021; v1 submitted 10 May, 2021; originally announced May 2021.

    Comments: Published at ICML 2021

    MSC Class: 60L10; 60L20

  9. arXiv:2009.08295  [pdf, other]

    cs.LG cs.AI math.DS stat.ML

    Neural Rough Differential Equations for Long Time Series

    Authors: James Morrill, Cristopher Salvi, Patrick Kidger, James Foster, Terry Lyons

    Abstract: Neural controlled differential equations (CDEs) are the continuous-time analogue of recurrent neural networks, as Neural ODEs are to residual networks, and offer a memory-efficient continuous-time way to model functions of potentially irregular time series. Existing methods for computing the forward pass of a Neural CDE involve embedding the incoming time series into path space, often via interpol…

    Submitted 21 June, 2021; v1 submitted 17 September, 2020; originally announced September 2020.

    Comments: Published at ICML 2021

  10. arXiv:2008.03408  [pdf, other]

    cs.LG cs.CL eess.AS stat.ML

    Learning to Detect Bipolar Disorder and Borderline Personality Disorder with Language and Speech in Non-Clinical Interviews

    Authors: Bo Wang, Yue Wu, Niall Taylor, Terry Lyons, Maria Liakata, Alejo J Nevado-Holgado, Kate E A Saunders

    Abstract: Bipolar disorder (BD) and borderline personality disorder (BPD) are both chronic psychiatric disorders. However, their overlapping symptoms and common comorbidity make it challenging for the clinicians to distinguish the two conditions on the basis of a clinical interview. In this work, we first present a new multi-modal dataset containing interviews involving individuals with BD or BPD being inte…

    Submitted 31 May, 2021; v1 submitted 7 August, 2020; originally announced August 2020.

    MSC Class: 60L10

  11. arXiv:2006.15030  [pdf, other]

    stat.AP

    Deriving information from missing data: implications for mood prediction

    Authors: Yue Wu, Terry J. Lyons, Kate E. A. Saunders

    Abstract: The availability of mobile technologies has enabled the efficient collection of prospective longitudinal, ecologically valid self-reported mood data from psychiatric patients. These data streams have potential for improving the efficiency and accuracy of psychiatric diagnosis as well as predicting future mood states enabling earlier intervention. However, missing responses are common in such datasets an…

    Submitted 8 July, 2020; v1 submitted 26 June, 2020; originally announced June 2020.

    MSC Class: 60L10; 60L90; 62D10; 92-08

  12. arXiv:2006.14498  [pdf, other]

    q-fin.ST cs.LG q-fin.CP q-fin.MF stat.ML

    A Data-driven Market Simulator for Small Data Environments

    Authors: Hans Bühler, Blanka Horvath, Terry Lyons, Imanol Perez Arribas, Ben Wood

    Abstract: Neural network based data-driven market simulation unveils a new and flexible way of modelling financial time series without imposing assumptions on the underlying stochastic dynamics. Though in this sense generative market simulation is model-free, the concrete modelling choices are nevertheless decisive for the features of the simulated paths. We give a brief overview of currently used generativ…

    Submitted 21 June, 2020; originally announced June 2020.

    Comments: 27 pages, 9 figures

  13. arXiv:2006.05805  [pdf, other]

    cs.LG stat.ML

    Distribution Regression for Sequential Data

    Authors: Maud Lemercier, Cristopher Salvi, Theodoros Damoulas, Edwin V. Bonilla, Terry Lyons

    Abstract: Distribution regression refers to the supervised learning problem where labels are only available for groups of inputs instead of individual inputs. In this paper, we develop a rigorous mathematical framework for distribution regression where inputs are complex data streams. Leveraging properties of the expected signature and a recent signature kernel trick for sequential data from stochastic anal…

    Submitted 29 September, 2021; v1 submitted 10 June, 2020; originally announced June 2020.

    Comments: Published at AISTATS 2021

    MSC Class: 60L10; 60L20

  14. arXiv:2006.03487  [pdf, other]

    cs.LG stat.ML

    Dimensionless Anomaly Detection on Multivariate Streams with Variance Norm and Path Signature

    Authors: Zhen Shao, Ryan Sze-Yin Chan, Thomas Cochrane, Peter Foster, Terry Lyons

    Abstract: In this paper, we propose a dimensionless anomaly detection method for multivariate streams. Our method is independent of the unit of measurement for the different stream channels, therefore dimensionless. We first propose the variance norm, a generalisation of Mahalanobis distance to handle infinite-dimensional feature space and singular empirical covariance matrix rigorously. We then combine the…

    Submitted 6 December, 2023; v1 submitted 5 June, 2020; originally announced June 2020.

  15. arXiv:2006.00873  [pdf, other]

    cs.LG stat.ML

    A Generalised Signature Method for Multivariate Time Series Feature Extraction

    Authors: James Morrill, Adeline Fermanian, Patrick Kidger, Terry Lyons

    Abstract: The 'signature method' refers to a collection of feature extraction techniques for multivariate time series, derived from the theory of controlled differential equations. There is a great deal of flexibility as to how this method can be applied. On the one hand, this flexibility allows the method to be tailored to specific problems, but on the other hand, can make precise application challenging.…

    Submitted 6 February, 2021; v1 submitted 1 June, 2020; originally announced June 2020.

    Comments: 25 pages

  16. arXiv:2005.13948  [pdf, other]

    cs.LG stat.ML

    Generalised Interpretable Shapelets for Irregular Time Series

    Authors: Patrick Kidger, James Morrill, Terry Lyons

    Abstract: The shapelet transform is a form of feature extraction for time series, in which a time series is described by its similarity to each of a collection of `shapelets'. However it has previously suffered from a number of limitations, such as being limited to regularly-spaced fully-observed time series, and having to choose between efficient training and interpretability. Here, we extend the method to…

    Submitted 29 May, 2020; v1 submitted 28 May, 2020; originally announced May 2020.

  17. arXiv:2005.08926  [pdf, ps, other]

    cs.LG stat.ML

    Neural Controlled Differential Equations for Irregular Time Series

    Authors: Patrick Kidger, James Morrill, James Foster, Terry Lyons

    Abstract: Neural ordinary differential equations are an attractive option for modelling temporal dynamics. However, a fundamental issue is that the solution to an ordinary differential equation is determined by its initial condition, and there is no mechanism for adjusting the trajectory based on subsequent observations. Here, we demonstrate how this may be resolved through the well-understood mathematics o…

    Submitted 5 November, 2020; v1 submitted 18 May, 2020; originally announced May 2020.

    Comments: Accepted at NeurIPS 2020 (Spotlight)

  18. arXiv:2004.04006  [pdf, other]

    cs.LG eess.SP stat.ML

    Signature features with the visibility transformation

    Authors: Yue Wu, Hao Ni, Terence J. Lyons, Robin L. Hudson

    Abstract: In this paper we put the visibility transformation on a clear theoretical footing and show that this transform is able to embed the effect of the absolute position of the data stream into signature features in a unified and efficient way. The generated feature set is particularly useful in pattern recognition tasks, for its simplifying role in allowing the signature feature set to accommodate nonl…

    Submitted 8 October, 2020; v1 submitted 8 April, 2020; originally announced April 2020.

    MSC Class: 60L10

  19. arXiv:2002.03419  [pdf, other]

    q-bio.PE stat.AP

    The Alzheimer's Disease Prediction Of Longitudinal Evolution (TADPOLE) Challenge: Results after 1 Year Follow-up

    Authors: Razvan V. Marinescu, Neil P. Oxtoby, Alexandra L. Young, Esther E. Bron, Arthur W. Toga, Michael W. Weiner, Frederik Barkhof, Nick C. Fox, Arman Eshaghi, Tina Toni, Marcin Salaterski, Veronika Lunina, Manon Ansart, Stanley Durrleman, Pascal Lu, Samuel Iddi, Dan Li, Wesley K. Thompson, Michael C. Donohue, Aviv Nahon, Yarden Levy, Dan Halbersberg, Mariya Cohen, Huiling Liao, Tengfei Li, et al. (71 additional authors not shown)

    Abstract: We present the findings of "The Alzheimer's Disease Prediction Of Longitudinal Evolution" (TADPOLE) Challenge, which compared the performance of 92 algorithms from 33 international teams at predicting the future trajectory of 219 individuals at risk of Alzheimer's disease. Challenge participants were required to make a prediction, for each month of a 5-year future time period, of three key outcome…

    Submitted 27 December, 2021; v1 submitted 9 February, 2020; originally announced February 2020.

    Comments: Presents final results of the TADPOLE competition. 60 pages, 7 tables, 14 figures

    Journal ref: Machine Learning for Biomedical Imaging (MELBA), Dec 2021

  20. arXiv:2001.00706  [pdf, other]

    cs.LG stat.ML

    Signatory: differentiable computations of the signature and logsignature transforms, on both CPU and GPU

    Authors: Patrick Kidger, Terry Lyons

    Abstract: Signatory is a library for calculating and performing functionality related to the signature and logsignature transforms. The focus is on machine learning, and as such includes features such as CPU parallelism, GPU support, and backpropagation. To our knowledge it is the first GPU-capable library for these operations. Signatory implements new features not available in previous libraries, such as e…

    Submitted 5 February, 2021; v1 submitted 2 January, 2020; originally announced January 2020.

    Comments: Published at ICLR 2021

    MSC Class: 60H30; 68T99; 68N30

  21. arXiv:1908.08286  [pdf, other]

    cs.LG stat.ML

    Learning stochastic differential equations using RNN with log signature features

    Authors: Shujian Liao, Terry Lyons, Weixin Yang, Hao Ni

    Abstract: This paper contributes to the challenge of learning a function on streamed multimodal data through evaluation. The core of the result of our paper is the combination of two quite different approaches to this problem. One comes from the mathematically principled technology of signatures and log-signatures as representations for streamed data, while the other draws on the techniques of recurrent neu…

    Submitted 22 September, 2019; v1 submitted 22 August, 2019; originally announced August 2019.

  22. arXiv:1905.08539  [pdf, ps, other]

    cs.LG math.CA stat.ML

    Universal Approximation with Deep Narrow Networks

    Authors: Patrick Kidger, Terry Lyons

    Abstract: The classical Universal Approximation Theorem holds for neural networks of arbitrary width and bounded depth. Here we consider the natural `dual' scenario for networks of bounded width and arbitrary depth. Precisely, let $n$ be the number of input neurons, $m$ be the number of output neurons, and let $ρ$ be any nonaffine continuous function, with a continuous nonzero derivative at some point. The…

    Submitted 8 June, 2020; v1 submitted 21 May, 2019; originally announced May 2019.

    Comments: Accepted at COLT 2020

    MSC Class: 41A46; 41A63; 68T07

  23. arXiv:1905.08494  [pdf, other]

    cs.LG stat.ML

    Deep Signature Transforms

    Authors: Patric Bonnier, Patrick Kidger, Imanol Perez Arribas, Cristopher Salvi, Terry Lyons

    Abstract: The signature is an infinite graded sequence of statistics known to characterise a stream of data up to a negligible equivalence class. It is a transform which has previously been treated as a fixed feature transformation, on top of which a model may be built. We propose a novel approach which combines the advantages of the signature transform with modern deep learning frameworks. By learning an a…

    Submitted 26 October, 2019; v1 submitted 21 May, 2019; originally announced May 2019.

    Comments: Published at NeurIPS 2019

    MSC Class: 68T01

  24. Using path signatures to predict a diagnosis of Alzheimer's disease

    Authors: P. J. Moore, J. Gallacher, T. J. Lyons

    Abstract: The path signature is a means of feature generation that can encode nonlinear interactions in the data as well as the usual linear features. It can distinguish the ordering of time-sequenced changes: for example whether or not the hippocampus shrinks fast, then slowly or the converse. It provides interpretable features and its output is a fixed length vector irrespective of the number of input poi…

    Submitted 16 August, 2018; originally announced August 2018.

    Comments: 5 pages, 3 figures. arXiv admin note: text overlap with arXiv:1808.03273

    MSC Class: 62J12; 92D30

  25. Random forest prediction of Alzheimer's disease using pairwise selection from time series data

    Authors: Paul Moore, Terry Lyons, John Gallacher

    Abstract: Time-dependent data collected in studies of Alzheimer's disease usually has missing and irregularly sampled data points. For this reason time series methods which assume regular sampling cannot be applied directly to the data without a pre-processing step. In this paper we use a machine learning method to learn the relationship between pairs of data points at different time separations. The input…

    Submitted 9 August, 2018; originally announced August 2018.

    Comments: 6 pages, 1 figure, 6 tables

    MSC Class: 62M10

  26. arXiv:1805.03911  [pdf, other]

    stat.ML cs.LG

    Labelling as an unsupervised learning problem

    Authors: Terry Lyons, Imanol Perez Arribas

    Abstract: Unravelling hidden patterns in datasets is a classical problem with many potential applications. In this paper, we present a challenge whose objective is to discover nonlinear relationships in a noisy cloud of points. If a set of points satisfies a nonlinear relationship that is unlikely to be due to randomness, we will label the set with this relationship. Since points can satisfy one, many or no su…

    Submitted 30 May, 2018; v1 submitted 10 May, 2018; originally announced May 2018.

  27. arXiv:1708.09708  [pdf, ps, other]

    stat.ML cs.DS math.ST

    Sketching the order of events

    Authors: Terry Lyons, Harald Oberhauser

    Abstract: We introduce features for massive data streams. These stream features can be thought of as "ordered moments" and generalize stream sketches from "moments of order one" to "ordered moments of arbitrary order". In analogy to classic moments, they have theoretical guarantees such as universality that are important for learning algorithms.

    Submitted 31 August, 2017; originally announced August 2017.

  28. arXiv:1708.01206  [pdf, other]

    stat.ML

    Detecting early signs of depressive and manic episodes in patients with bipolar disorder using the signature-based model

    Authors: Andrey Kormilitzin, Kate E. A. Saunders, Paul J. Harrison, John R. Geddes, Terry Lyons

    Abstract: Recurrent major mood episodes and subsyndromal mood instability cause substantial disability in patients with bipolar disorder. Early identification of mood episodes enabling timely mood stabilisation is an important clinical goal. Recent technological advances allow the prospective reporting of mood in real time enabling more accurate, efficient data capture. The complex nature of these data stre…

    Submitted 3 August, 2017; originally announced August 2017.

    Comments: 12 pages, 3 tables, 10 figures

  29. A signature-based machine learning model for bipolar disorder and borderline personality disorder

    Authors: Imanol Perez Arribas, Kate Saunders, Guy Goodwin, Terry Lyons

    Abstract: Mobile technologies offer opportunities for higher resolution monitoring of health conditions. This opportunity seems of particular promise in psychiatry where diagnoses often rely on retrospective and subjective recall of mood states. However, getting actionable information from these rather complex time series is challenging, and at present the implications for clinical care are largely hypothet…

    Submitted 4 October, 2017; v1 submitted 22 July, 2017; originally announced July 2017.

  30. arXiv:1606.02074  [pdf, ps, other]

    stat.AP stat.ML

    Application of the Signature Method to Pattern Recognition in the CEQUEL Clinical Trial

    Authors: A. B. Kormilitzin, K. E. A. Saunders, P. J. Harrison, J. R. Geddes, T. J. Lyons

    Abstract: The classification procedure of streaming data usually requires various ad hoc methods or particular heuristic models. We explore a novel non-parametric and systematic approach to analysis of heterogeneous sequential data. We demonstrate an application of this method to classification of the delays in responding to the prompts, from subjects with bipolar disorder collected during a clinical trial,…

    Submitted 7 June, 2016; originally announced June 2016.

    Comments: 16 pages, 7 figures