-
Flow Redirection for Epidemic Reaction-DIffusion Control
Authors:
Pierre-Yves Massé,
Quentin Laborde,
Maria Cherifa,
Jules Olayé,
Laurent Oudre
Abstract:
We show we can control an epidemic reaction-diffusion on a directed, and heterogeneous, network by redirecting the flows, thanks to the optimisation of well-designed loss functions, in particular the basic reproduction number of the model. We provide a final size relation linking the basic reproduction number to the epidemic final sizes, for diffusions around a reference diffusion with basic repro…
▽ More
We show we can control an epidemic reaction-diffusion on a directed, and heterogeneous, network by redirecting the flows, thanks to the optimisation of well-designed loss functions, in particular the basic reproduction number of the model. We provide a final size relation linking the basic reproduction number to the epidemic final sizes, for diffusions around a reference diffusion with basic reproduction number less than 1. Experimentally, we show control is possible for different topologies, network heterogeneity levels, and speeds of diffusion. Our experimental results highlight the relevance of the basic reproduction number loss, compared to more straightforward losses.
△ Less
Submitted 4 February, 2022;
originally announced February 2022.
-
Convergence of Online Adaptive and Recurrent Optimization Algorithms
Authors:
Pierre-Yves Massé,
Yann Ollivier
Abstract:
We prove local convergence of several notable gradient descent algorithms used in machine learning, for which standard stochastic gradient descent theory does not apply directly. This includes, first, online algorithms for recurrent models and dynamical systems, such as \emph{Real-time recurrent learning} (RTRL) and its computationally lighter approximations NoBackTrack and UORO; second, several a…
▽ More
We prove local convergence of several notable gradient descent algorithms used in machine learning, for which standard stochastic gradient descent theory does not apply directly. This includes, first, online algorithms for recurrent models and dynamical systems, such as \emph{Real-time recurrent learning} (RTRL) and its computationally lighter approximations NoBackTrack and UORO; second, several adaptive algorithms such as RMSProp, online natural gradient, and Adam with $β^2\to 1$.Despite local convergence being a relatively weak requirement for a new optimization algorithm, no local analysis was available for these algorithms, as far as we knew. Analysis of these algorithms does not immediately follow from standard stochastic gradient (SGD) theory. In fact, Adam has been proved to lack local convergence in some simple situations \citep{j.2018on}. For recurrent models, online algorithms modify the parameter while the model is running, which further complicates the analysis with respect to simple SGD.Local convergence for these various algorithms results from a single, more general set of assumptions, in the setup of learning dynamical systems online. Thus, these results can cover other variants of the algorithms considered.We adopt an "ergodic" rather than probabilistic viewpoint, working with empirical time averages instead of probability distributions. This is more data-agnostic and creates differences with respect to standard SGD theory, especially for the range of possible learning rates. For instance, with cycling or per-epoch reshuffling over a finite dataset instead of pure i.i.d.\ sampling with replacement, empirical averages of gradients converge at rate $1/T$ instead of $1/\sqrt{T}$ (cycling acts as a variance reduction method), theoretically allowing for larger learning rates than in SGD.
△ Less
Submitted 8 January, 2021; v1 submitted 12 May, 2020;
originally announced May 2020.
-
Speed learning on the fly
Authors:
Pierre-Yves Massé,
Yann Ollivier
Abstract:
The practical performance of online stochastic gradient descent algorithms is highly dependent on the chosen step size, which must be tediously hand-tuned in many applications. The same is true for more advanced variants of stochastic gradients, such as SAGA, SVRG, or AdaGrad. Here we propose to adapt the step size by performing a gradient descent on the step size itself, viewing the whole perform…
▽ More
The practical performance of online stochastic gradient descent algorithms is highly dependent on the chosen step size, which must be tediously hand-tuned in many applications. The same is true for more advanced variants of stochastic gradients, such as SAGA, SVRG, or AdaGrad. Here we propose to adapt the step size by performing a gradient descent on the step size itself, viewing the whole performance of the learning trajectory as a function of step size. Importantly, this adaptation can be computed online at little cost, without having to iterate backward passes over the full data.
△ Less
Submitted 8 November, 2015;
originally announced November 2015.
-
Adaptive confidence bands in the nonparametric fixed design regression model
Authors:
Pierre-Yves Massé,
William Meiniel
Abstract:
In this note, we consider the problem of existence of adaptive confidence bands in the fixed design regression model, adapting ideas in Hoffmann and Nickl (2011) to the present case. In the course of the proof, we show that sup-norm adaptive estimators exist as well in regression.
In this note, we consider the problem of existence of adaptive confidence bands in the fixed design regression model, adapting ideas in Hoffmann and Nickl (2011) to the present case. In the course of the proof, we show that sup-norm adaptive estimators exist as well in regression.
△ Less
Submitted 19 July, 2012; v1 submitted 17 July, 2012;
originally announced July 2012.