Skip to main content

Showing 1–24 of 24 results for author: Salvi, C

.
  1. arXiv:2407.08459  [pdf, other

    math.PR cs.LG

    Graph Expansions of Deep Neural Networks and their Universal Scaling Limits

    Authors: Nicola Muca Cirone, Jad Hamdan, Cristopher Salvi

    Abstract: We present a unified approach to obtain scaling limits of neural networks using the genus expansion technique from random matrix theory. This approach begins with a novel expansion of neural networks which is reminiscent of Butcher series for ODEs, and is obtained through a generalisation of Faà di Bruno's formula to an arbitrary number of compositions. In this expansion, the role of monomials is… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

  2. arXiv:2406.10354  [pdf, other

    cs.LG

    SigDiffusions: Score-Based Diffusion Models for Long Time Series via Log-Signature Embeddings

    Authors: Barbora Barancikova, Zhuoyue Huang, Cristopher Salvi

    Abstract: Score-based diffusion models have recently emerged as state-of-the-art generative models for a variety of data modalities. Nonetheless, it remains unclear how to adapt these models to generate long multivariate time series. Viewing a time series as the discretization of an underlying continuous process, we introduce SigDiffusion, a novel diffusion model operating on log-signature embeddings of the… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

  3. arXiv:2405.13587  [pdf, other

    stat.ML cs.LG math.PR

    Exact Gradients for Stochastic Spiking Neural Networks Driven by Rough Signals

    Authors: Christian Holberg, Cristopher Salvi

    Abstract: We introduce a mathematically rigorous framework based on rough path theory to model stochastic spiking neural networks (SSNNs) as stochastic differential equations with event discontinuities (Event SDEs) and driven by càdlàg rough paths. Our formalism is general enough to allow for potential jumps to be present both in the solution trajectories as well as in the driving noise. We then identify a… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  4. arXiv:2404.06583  [pdf, other

    cs.LG math.PR math.ST

    Lecture notes on rough paths and applications to machine learning

    Authors: Thomas Cass, Cristopher Salvi

    Abstract: These notes expound the recent use of the signature transform and rough path theory in data science and machine learning. We develop the core theory of the signature from first principles and then survey some recent popular applications of this approach, including signature-based kernel methods and neural rough differential equations. The notes are based on a course given by the two authors at Imp… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    MSC Class: 60L10; 60L20

  5. arXiv:2403.11738  [pdf, other

    math.NA q-fin.CP

    A path-dependent PDE solver based on signature kernels

    Authors: Alexandre Pannier, Cristopher Salvi

    Abstract: We develop a provably convergent kernel-based solver for path-dependent PDEs (PPDEs). Our numerical scheme leverages signature kernels, a recently introduced class of kernels on path-space. Specifically, we solve an optimal recovery problem by approximating the solution of a PPDE with an element of minimal norm in the signature reproducing kernel Hilbert space (RKHS) constrained to satisfy the PPD… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: 35 pages, 1 figure

    MSC Class: 35R15; 60L10; 65N35; 91G60

  6. arXiv:2402.19047  [pdf, other

    cs.LG math.DS

    Theoretical Foundations of Deep Selective State-Space Models

    Authors: Nicola Muca Cirone, Antonio Orvieto, Benjamin Walker, Cristopher Salvi, Terry Lyons

    Abstract: Structured state-space models (SSMs) such as S4, stemming from the seminal work of Gu et al., are gaining popularity as effective approaches for modeling sequential data. Deep SSMs demonstrate outstanding performance across a diverse set of domains, at a reduced training and inference cost compared to attention-based transformers. Recent developments show that if the linear recurrence powering SSM… ▽ More

    Submitted 4 March, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

  7. arXiv:2402.18477  [pdf, other

    cs.LG cs.AI stat.ML

    Signature Kernel Conditional Independence Tests in Causal Discovery for Stochastic Processes

    Authors: Georg Manten, Cecilia Casolo, Emilio Ferrucci, Søren Wengel Mogensen, Cristopher Salvi, Niki Kilbertus

    Abstract: Inferring the causal structure underlying stochastic dynamical systems from observational data holds great promise in domains ranging from science and health to finance. Such processes can often be accurately modeled via stochastic differential equations (SDEs), which naturally imply causal relationships via "which variables enter the differential of which other variables". In this paper, we devel… ▽ More

    Submitted 11 June, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

  8. arXiv:2401.12905  [pdf

    stat.ME stat.AP

    Estimating the construct validity of Principal Components Analysis

    Authors: Thomas M. H. Hope, Cathy J. Price, Ajay Halai, Carola Salvi, Jenny Crinion, Merel Keijsers, Christoph Sperber, Howard Bowman

    Abstract: In many scientific disciplines, the features of interest cannot be observed directly, so must instead be inferred from observed behaviour. Latent variable analyses are increasingly employed to systematise these inferences, and Principal Components Analysis (PCA) is perhaps the simplest and most popular of these methods. Here, we examine how the assumptions that we are prepared to entertain, about… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

    Comments: 26 pages, 3 figures, 3 tables

  9. arXiv:2306.14258  [pdf, ps, other

    cs.LG math.OC

    A Neural RDE approach for continuous-time non-Markovian stochastic control problems

    Authors: Melker Hoglund, Emilio Ferrucci, Camilo Hernandez, Aitor Muguruza Gonzalez, Cristopher Salvi, Leandro Sanchez-Betancourt, Yufei Zhang

    Abstract: We propose a novel framework for solving continuous-time non-Markovian stochastic control problems by means of neural rough differential equations (Neural RDEs) introduced in Morrill et al. (2021). Non-Markovianity naturally arises in control problems due to the time delay effects in the system coefficients or the driving noises, which leads to optimal control strategies depending explicitly on th… ▽ More

    Submitted 25 June, 2023; originally announced June 2023.

    Comments: Accepted at ICML 2023, Workshop on New Frontiers in Learning, Control, and Dynamical Systems

  10. arXiv:2305.16274  [pdf, other

    stat.ML cs.LG q-fin.CP

    Non-adversarial training of Neural SDEs with signature kernel scores

    Authors: Zacharia Issa, Blanka Horvath, Maud Lemercier, Cristopher Salvi

    Abstract: Neural SDEs are continuous-time generative models for sequential data. State-of-the-art performance for irregular time series generation has been previously obtained by training these models adversarially as GANs. However, as typical for GAN architectures, training is notoriously unstable, often suffers from mode collapse, and requires specialised techniques such as weight clip** and gradient pe… ▽ More

    Submitted 25 May, 2023; originally announced May 2023.

    Comments: Code available at https://github.com/issaz/sigker-nsdes/

  11. arXiv:2304.01479  [pdf, ps, other

    math.PR

    Optimal Stop** via Distribution Regression: a Higher Rank Signature Approach

    Authors: Blanka Horvath, Maud Lemercier, Chong Liu, Terry Lyons, Cristopher Salvi

    Abstract: Distribution Regression on path-space refers to the task of learning functions map** the law of a stochastic process to a scalar target. The learning procedure based on the notion of path-signature, i.e. a classical transform from rough path theory, was widely used to approximate weakly continuous functionals, such as the pricing functionals of path--dependent options' payoffs. However, this app… ▽ More

    Submitted 3 April, 2023; originally announced April 2023.

    Comments: 33 pages

    MSC Class: Primary 60L10; Secondary 60L20; 60G40; 91G60

  12. arXiv:2303.17671  [pdf, other

    math.DS cs.LG math.PR

    Neural signature kernels as infinite-width-depth-limits of controlled ResNets

    Authors: Nicola Muca Cirone, Maud Lemercier, Cristopher Salvi

    Abstract: Motivated by the paradigm of reservoir computing, we consider randomly initialized controlled ResNets defined as Euler-discretizations of neural controlled differential equations (Neural CDEs), a unified architecture which enconpasses both RNNs and ResNets. We show that in the infinite-width-depth limit and under proper scaling, these architectures converge weakly to Gaussian processes indexed on… ▽ More

    Submitted 4 June, 2023; v1 submitted 30 March, 2023; originally announced March 2023.

    Comments: Added commutativity of limits, ICML 2023 final version

    MSC Class: 60L10; 60L90

  13. arXiv:2302.04586  [pdf, other

    cs.LG

    New directions in the applications of rough path theory

    Authors: Adeline Fermanian, Terry Lyons, James Morrill, Cristopher Salvi

    Abstract: This article provides a concise overview of some of the recent advances in the application of rough path theory to machine learning. Controlled differential equations (CDEs) are discussed as the key mathematical model to describe the interaction of a stream with a physical control system. A collection of iterated integrals known as the signature naturally arises in the description of the response… ▽ More

    Submitted 9 February, 2023; originally announced February 2023.

  14. arXiv:2212.00134  [pdf, ps, other

    math.CO math.RA

    A structure theorem for streamed information

    Authors: Cristopher Salvi, Joscha Diehl, Terry Lyons, Rosa Preiss, Jeremy Reizenstein

    Abstract: We identify the free half shuffle algebra of Schützenberger (1958) with an algebra of real-valued functionals on paths, where the half shuffle emulates integration of a functional against another. We then provide two, to our knowledge, new identities in arity 3 involving its commutator (area), and show that these are sufficient to recover the Zinbiel and Tortkara identities of Dzhumadil'daev (2007… ▽ More

    Submitted 30 July, 2023; v1 submitted 30 November, 2022; originally announced December 2022.

  15. arXiv:2110.10249  [pdf, other

    cs.LG

    Neural Stochastic PDEs: Resolution-Invariant Learning of Continuous Spatiotemporal Dynamics

    Authors: Cristopher Salvi, Maud Lemercier, Andris Gerasimovics

    Abstract: Stochastic partial differential equations (SPDEs) are the mathematical tool of choice for modelling spatiotemporal PDE-dynamics under the influence of randomness. Based on the notion of mild solution of an SPDE, we introduce a novel neural architecture to learn solution operators of PDEs with (possibly stochastic) forcing from partially observed data. The proposed Neural SPDE model provides an ext… ▽ More

    Submitted 24 September, 2022; v1 submitted 19 October, 2021; originally announced October 2021.

    Comments: Published at NeurIPS 2022

    MSC Class: 60L50

  16. arXiv:2109.03582  [pdf, other

    stat.ML cs.LG

    Higher Order Kernel Mean Embeddings to Capture Filtrations of Stochastic Processes

    Authors: Cristopher Salvi, Maud Lemercier, Chong Liu, Blanka Hovarth, Theodoros Damoulas, Terry Lyons

    Abstract: Stochastic processes are random variables with values in some space of paths. However, reducing a stochastic process to a path-valued random variable ignores its filtration, i.e. the flow of information carried by the process through time. By conditioning the process on its filtration, we introduce a family of higher order kernel mean embeddings (KMEs) that generalizes the notion of KME and captur… ▽ More

    Submitted 3 November, 2021; v1 submitted 8 September, 2021; originally announced September 2021.

    Comments: Published at NeurIPS 2021

    MSC Class: 60L10; 60L20

  17. arXiv:2105.04211  [pdf, other

    stat.ML cs.LG

    SigGPDE: Scaling Sparse Gaussian Processes on Sequential Data

    Authors: Maud Lemercier, Cristopher Salvi, Thomas Cass, Edwin V. Bonilla, Theodoros Damoulas, Terry Lyons

    Abstract: Making predictions and quantifying their uncertainty when the input data is sequential is a fundamental learning challenge, recently attracting increasing attention. We develop SigGPDE, a new scalable sparse variational inference framework for Gaussian Processes (GPs) on sequential data. Our contribution is twofold. First, we construct inducing variables underpinning the sparse approximation so th… ▽ More

    Submitted 12 October, 2021; v1 submitted 10 May, 2021; originally announced May 2021.

    Comments: Published at ICML 2021

    MSC Class: 60L10; 60L20

  18. arXiv:2102.07904  [pdf, other

    cs.CR

    SK-Tree: a systematic malware detection algorithm on streaming trees via the signature kernel

    Authors: Thomas Cochrane, Peter Foster, Varun Chhabra, Maud Lemercier, Cristopher Salvi, Terry Lyons

    Abstract: The development of machine learning algorithms in the cyber security domain has been impeded by the complex, hierarchical, sequential and multimodal nature of the data involved. In this paper we introduce the notion of a streaming tree as a generic data structure encompassing a large portion of real-world cyber security data. Starting from host-based event logs we represent computer processes as s… ▽ More

    Submitted 29 September, 2021; v1 submitted 15 February, 2021; originally announced February 2021.

    Comments: Published at IEEE-CSR (International Conference on Cybersecurity and Resilience) 2021

    MSC Class: 60L10

  19. arXiv:2009.08295  [pdf, other

    cs.LG cs.AI math.DS stat.ML

    Neural Rough Differential Equations for Long Time Series

    Authors: James Morrill, Cristopher Salvi, Patrick Kidger, James Foster, Terry Lyons

    Abstract: Neural controlled differential equations (CDEs) are the continuous-time analogue of recurrent neural networks, as Neural ODEs are to residual networks, and offer a memory-efficient continuous-time way to model functions of potentially irregular time series. Existing methods for computing the forward pass of a Neural CDE involve embedding the incoming time series into path space, often via interpol… ▽ More

    Submitted 21 June, 2021; v1 submitted 17 September, 2020; originally announced September 2020.

    Comments: Published at ICML 2021

  20. arXiv:2006.14794  [pdf, other

    math.AP cs.LG

    The Signature Kernel is the solution of a Goursat PDE

    Authors: Cristopher Salvi, Thomas Cass, James Foster, Terry Lyons, Weixin Yang

    Abstract: Recently, there has been an increased interest in the development of kernel methods for learning with sequential data. The signature kernel is a learning tool with potential to handle irregularly sampled, multivariate time series. In "Kernels for sequentially ordered data" the authors introduced a kernel trick for the truncated version of this kernel avoiding the exponential complexity that would… ▽ More

    Submitted 20 March, 2021; v1 submitted 26 June, 2020; originally announced June 2020.

    MSC Class: 60L10; 60L20

  21. arXiv:2006.05805  [pdf, other

    cs.LG stat.ML

    Distribution Regression for Sequential Data

    Authors: Maud Lemercier, Cristopher Salvi, Theodoros Damoulas, Edwin V. Bonilla, Terry Lyons

    Abstract: Distribution regression refers to the supervised learning problem where labels are only available for groups of inputs instead of individual inputs. In this paper, we develop a rigorous mathematical framework for distribution regression where inputs are complex data streams. Leveraging properties of the expected signature and a recent signature kernel trick for sequential data from stochastic anal… ▽ More

    Submitted 29 September, 2021; v1 submitted 10 June, 2020; originally announced June 2020.

    Comments: Published at AISTATS 2021

    MSC Class: 60L10; 60L20

  22. arXiv:2006.00218  [pdf, other

    q-fin.CP q-fin.MF q-fin.PR

    Sig-SDEs model for quantitative finance

    Authors: Imanol Perez Arribas, Cristopher Salvi, Lukasz Szpruch

    Abstract: Mathematical models, calibrated to data, have become ubiquitous to make key decision processes in modern quantitative finance. In this work, we propose a novel framework for data-driven model selection by integrating a classical quantitative setup with a generative modelling approach. Leveraging the properties of the signature, a well-known path-transform from stochastic analysis that recently eme… ▽ More

    Submitted 3 June, 2020; v1 submitted 30 May, 2020; originally announced June 2020.

  23. arXiv:1905.08494  [pdf, other

    cs.LG stat.ML

    Deep Signature Transforms

    Authors: Patric Bonnier, Patrick Kidger, Imanol Perez Arribas, Cristopher Salvi, Terry Lyons

    Abstract: The signature is an infinite graded sequence of statistics known to characterise a stream of data up to a negligible equivalence class. It is a transform which has previously been treated as a fixed feature transformation, on top of which a model may be built. We propose a novel approach which combines the advantages of the signature transform with modern deep learning frameworks. By learning an a… ▽ More

    Submitted 26 October, 2019; v1 submitted 21 May, 2019; originally announced May 2019.

    Comments: Published at NeurIPS 2019

    MSC Class: 68T01

  24. arXiv:0711.3294  [pdf

    cs.OH

    Energy Conversion Using New Thermoelectric Generator

    Authors: Guillaume Savelli, Marc Plissonnier, Jacqueline Bablet, C. Salvi, J. M. Fournier

    Abstract: During recent years, microelectronics helped to develop complex and varied technologies. It appears that many of these technologies can be applied successfully to realize Seebeck micro generators: photolithography and deposition methods allow to elaborate thin thermoelectric structures at the micro-scale level. Our goal is to scavenge energy by develo** a miniature power source for operating e… ▽ More

    Submitted 21 November, 2007; originally announced November 2007.

    Comments: Submitted on behalf of TIMA Editions (http://irevues.inist.fr/tima-editions)

    Journal ref: Dans Symposium on Design, Test, Integration and Packaging of MEMS/MOEMS - DTIP 2006, Stresa, Lago Maggiore : Italie (2006)