Search | arXiv e-print repository

arXiv:2202.02017 [pdf, other]

Flow Redirection for Epidemic Reaction-DIffusion Control

Authors: Pierre-Yves Massé, Quentin Laborde, Maria Cherifa, Jules Olayé, Laurent Oudre

Abstract: We show we can control an epidemic reaction-diffusion on a directed, and heterogeneous, network by redirecting the flows, thanks to the optimisation of well-designed loss functions, in particular the basic reproduction number of the model. We provide a final size relation linking the basic reproduction number to the epidemic final sizes, for diffusions around a reference diffusion with basic repro… ▽ More We show we can control an epidemic reaction-diffusion on a directed, and heterogeneous, network by redirecting the flows, thanks to the optimisation of well-designed loss functions, in particular the basic reproduction number of the model. We provide a final size relation linking the basic reproduction number to the epidemic final sizes, for diffusions around a reference diffusion with basic reproduction number less than 1. Experimentally, we show control is possible for different topologies, network heterogeneity levels, and speeds of diffusion. Our experimental results highlight the relevance of the basic reproduction number loss, compared to more straightforward losses. △ Less

Submitted 4 February, 2022; originally announced February 2022.

arXiv:2005.05645 [pdf, ps, other]

Convergence of Online Adaptive and Recurrent Optimization Algorithms

Authors: Pierre-Yves Massé, Yann Ollivier

Abstract: We prove local convergence of several notable gradient descent algorithms used in machine learning, for which standard stochastic gradient descent theory does not apply directly. This includes, first, online algorithms for recurrent models and dynamical systems, such as \emph{Real-time recurrent learning} (RTRL) and its computationally lighter approximations NoBackTrack and UORO; second, several a… ▽ More We prove local convergence of several notable gradient descent algorithms used in machine learning, for which standard stochastic gradient descent theory does not apply directly. This includes, first, online algorithms for recurrent models and dynamical systems, such as \emph{Real-time recurrent learning} (RTRL) and its computationally lighter approximations NoBackTrack and UORO; second, several adaptive algorithms such as RMSProp, online natural gradient, and Adam with $β^2\to 1$.Despite local convergence being a relatively weak requirement for a new optimization algorithm, no local analysis was available for these algorithms, as far as we knew. Analysis of these algorithms does not immediately follow from standard stochastic gradient (SGD) theory. In fact, Adam has been proved to lack local convergence in some simple situations \citep{j.2018on}. For recurrent models, online algorithms modify the parameter while the model is running, which further complicates the analysis with respect to simple SGD.Local convergence for these various algorithms results from a single, more general set of assumptions, in the setup of learning dynamical systems online. Thus, these results can cover other variants of the algorithms considered.We adopt an "ergodic" rather than probabilistic viewpoint, working with empirical time averages instead of probability distributions. This is more data-agnostic and creates differences with respect to standard SGD theory, especially for the range of possible learning rates. For instance, with cycling or per-epoch reshuffling over a finite dataset instead of pure i.i.d.\ sampling with replacement, empirical averages of gradients converge at rate $1/T$ instead of $1/\sqrt{T}$ (cycling acts as a variance reduction method), theoretically allowing for larger learning rates than in SGD. △ Less

Submitted 8 January, 2021; v1 submitted 12 May, 2020; originally announced May 2020.

arXiv:1511.02540 [pdf, ps, other]

Speed learning on the fly

Authors: Pierre-Yves Massé, Yann Ollivier

Abstract: The practical performance of online stochastic gradient descent algorithms is highly dependent on the chosen step size, which must be tediously hand-tuned in many applications. The same is true for more advanced variants of stochastic gradients, such as SAGA, SVRG, or AdaGrad. Here we propose to adapt the step size by performing a gradient descent on the step size itself, viewing the whole perform… ▽ More The practical performance of online stochastic gradient descent algorithms is highly dependent on the chosen step size, which must be tediously hand-tuned in many applications. The same is true for more advanced variants of stochastic gradients, such as SAGA, SVRG, or AdaGrad. Here we propose to adapt the step size by performing a gradient descent on the step size itself, viewing the whole performance of the learning trajectory as a function of step size. Importantly, this adaptation can be computed online at little cost, without having to iterate backward passes over the full data. △ Less

Submitted 8 November, 2015; originally announced November 2015.

Comments: preprint

arXiv:1207.3975 [pdf, ps, other]

Adaptive confidence bands in the nonparametric fixed design regression model

Authors: Pierre-Yves Massé, William Meiniel

Abstract: In this note, we consider the problem of existence of adaptive confidence bands in the fixed design regression model, adapting ideas in Hoffmann and Nickl (2011) to the present case. In the course of the proof, we show that sup-norm adaptive estimators exist as well in regression. In this note, we consider the problem of existence of adaptive confidence bands in the fixed design regression model, adapting ideas in Hoffmann and Nickl (2011) to the present case. In the course of the proof, we show that sup-norm adaptive estimators exist as well in regression. △ Less

Submitted 19 July, 2012; v1 submitted 17 July, 2012; originally announced July 2012.

Comments: draft

Showing 1–4 of 4 results for author: Massé, P