Skip to main content

Showing 1–9 of 9 results for author: Fermanian, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.14578  [pdf, other

    stat.ML cs.LG math.OC

    Multivariate Online Linear Regression for Hierarchical Forecasting

    Authors: Massil Hihat, Guillaume Garrigos, Adeline Fermanian, Simon Bussy

    Abstract: In this paper, we consider a deterministic online linear regression model where we allow the responses to be multivariate. To address this problem, we introduce MultiVAW, a method that extends the well-known Vovk-Azoury-Warmuth algorithm to the multivariate setting, and show that it also enjoys logarithmic regret in time. We apply our results to the online hierarchical forecasting problem and reco… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

  2. arXiv:2402.02857  [pdf, other

    stat.ML cs.LG

    Non-asymptotic Analysis of Biased Adaptive Stochastic Approximation

    Authors: Sobihan Surendran, Antoine Godichon-Baggioni, Adeline Fermanian, Sylvain Le Corff

    Abstract: Stochastic Gradient Descent (SGD) with adaptive steps is now widely used for training deep neural networks. Most theoretical results assume access to unbiased gradient estimators, which is not the case in several recent deep learning and reinforcement learning applications that use Monte Carlo methods. This paper provides a comprehensive non-asymptotic analysis of SGD with biased gradients and ada… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

  3. arXiv:2401.17077  [pdf, other

    stat.ML cs.LG

    Dynamical Survival Analysis with Controlled Latent States

    Authors: Linus Bleistein, Van-Tuan Nguyen, Adeline Fermanian, Agathe Guilloux

    Abstract: We consider the task of learning individual-specific intensities of counting processes from a set of static variables and irregularly sampled time series. We introduce a novel modelization approach in which the intensity is the solution to a controlled differential equation. We first design a neural estimator by building on neural controlled differential equations. In a second time, we show that o… ▽ More

    Submitted 4 June, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

    Comments: ICML 2024

  4. arXiv:2302.04586  [pdf, other

    cs.LG

    New directions in the applications of rough path theory

    Authors: Adeline Fermanian, Terry Lyons, James Morrill, Cristopher Salvi

    Abstract: This article provides a concise overview of some of the recent advances in the application of rough path theory to machine learning. Controlled differential equations (CDEs) are discussed as the key mathematical model to describe the interaction of a stream with a physical control system. A collection of iterated integrals known as the signature naturally arises in the description of the response… ▽ More

    Submitted 9 February, 2023; originally announced February 2023.

  5. arXiv:2301.11647  [pdf, other

    stat.ML cs.LG

    Learning the Dynamics of Sparsely Observed Interacting Systems

    Authors: Linus Bleistein, Adeline Fermanian, Anne-Sophie Jannot, Agathe Guilloux

    Abstract: We address the problem of learning the dynamics of an unknown non-parametric system linking a target and a feature time series. The feature time series is measured on a sparse and irregular grid, while we have access to only a few points of the target time series. Once learned, we can use these dynamics to predict values of the target from the previous values of the feature time series. We frame t… ▽ More

    Submitted 31 May, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

    Comments: ICML 2023

  6. arXiv:2206.06929  [pdf, other

    cs.LG stat.ML

    Scaling ResNets in the Large-depth Regime

    Authors: Pierre Marion, Adeline Fermanian, Gérard Biau, Jean-Philippe Vert

    Abstract: Deep ResNets are recognized for achieving state-of-the-art results in complex machine learning tasks. However, the remarkable performance of these architectures relies on a training procedure that needs to be carefully crafted to avoid vanishing or exploding gradients, particularly as the depth $L$ increases. No consensus has been reached on how to mitigate this issue, although a widely discussed… ▽ More

    Submitted 10 June, 2024; v1 submitted 14 June, 2022; originally announced June 2022.

    Comments: 44 pages, 9 figures. Updated with clarifications and additional references

  7. arXiv:2106.01202  [pdf, other

    stat.ML cs.LG

    Framing RNN as a kernel method: A neural ODE approach

    Authors: Adeline Fermanian, Pierre Marion, Jean-Philippe Vert, Gérard Biau

    Abstract: Building on the interpretation of a recurrent neural network (RNN) as a continuous-time neural differential equation, we show, under appropriate conditions, that the solution of a RNN can be viewed as a linear function of a specific feature set of the input sequence, known as the signature. This connection allows us to frame a RNN as a kernel method in a suitable reproducing kernel Hilbert space.… ▽ More

    Submitted 29 October, 2021; v1 submitted 2 June, 2021; originally announced June 2021.

    Comments: 33 pages, 7 figures, accepted for an oral presentation at NeurIPS 2021

  8. arXiv:2006.00873  [pdf, other

    cs.LG stat.ML

    A Generalised Signature Method for Multivariate Time Series Feature Extraction

    Authors: James Morrill, Adeline Fermanian, Patrick Kidger, Terry Lyons

    Abstract: The 'signature method' refers to a collection of feature extraction techniques for multivariate time series, derived from the theory of controlled differential equations. There is a great deal of flexibility as to how this method can be applied. On the one hand, this flexibility allows the method to be tailored to specific problems, but on the other hand, can make precise application challenging.… ▽ More

    Submitted 6 February, 2021; v1 submitted 1 June, 2020; originally announced June 2020.

    Comments: 25 pages

  9. arXiv:1911.13211  [pdf, other

    stat.ML cs.LG

    Embedding and learning with signatures

    Authors: Adeline Fermanian

    Abstract: Sequential and temporal data arise in many fields of research, such as quantitative finance, medicine, or computer vision. A novel approach for sequential learning, called the signature method and rooted in rough path theory, is considered. Its basic principle is to represent multidimensional paths by a graded feature set of their iterated integrals, called the signature. This approach relies crit… ▽ More

    Submitted 9 December, 2020; v1 submitted 29 November, 2019; originally announced November 2019.