-
Local convergence rates for Wasserstein gradient flows and McKean-Vlasov equations with multiple stationary solutions
Authors:
Pierre Monmarché,
Julien Reygner
Abstract:
Non-linear versions of log-Sobolev inequalities, that link a free energy to its dissipation along the corresponding Wasserstein gradient flow (i.e. corresponds to Polyak-Lojasiewicz inequalities in this context), are known to provide global exponential long-time convergence to the free energy minimizers, and have been shown to hold in various contexts. However they cannot hold when the free energy…
▽ More
Non-linear versions of log-Sobolev inequalities, that link a free energy to its dissipation along the corresponding Wasserstein gradient flow (i.e. corresponds to Polyak-Lojasiewicz inequalities in this context), are known to provide global exponential long-time convergence to the free energy minimizers, and have been shown to hold in various contexts. However they cannot hold when the free energy admits critical points which are not global minimizers, which is for instance the case of the granular media equation in a double-well potential with quadratic attractive interaction at low temperature. This work addresses such cases, extending the general arguments when a log-Sobolev inequality only holds locally and, as an example, establishing such local inequalities for the granular media equation with quadratic interaction either in the one-dimensional symmetric double-well case or in higher dimension in the low temperature regime. The method provides quantitative convergence rates for initial conditions in a Wasserstein ball around the stationary solutions. The same analysis is carried out for the kinetic counterpart of the gradient flow, i.e. the corresponding Vlasov-Fokker-Planck equation. The local exponential convergence to stationary solutions for the mean-field equations, both elliptic and kinetic, is shown to induce for the corresponding particle systems a fast (i.e. uniform in the number or particles) decay of the particle system free energy toward the level of the non-linear limit.
△ Less
Submitted 13 May, 2024; v1 submitted 24 April, 2024;
originally announced April 2024.
-
Time-uniform log-Sobolev inequalities and applications to propagation of chaos
Authors:
Pierre Monmarché,
Zhenjie Ren,
Songbo Wang
Abstract:
Time-uniform log-Sobolev inequalities (LSI) satisfied by solutions of semi-linear mean-field equations have recently appeared to be a key tool to obtain time-uniform propagation of chaos estimates. This work addresses the more general settings of time-inhomogeneous Fokker-Planck equations. Time-uniform LSI are obtained in two cases, either with the bounded-Lipschitz perturbation argument with resp…
▽ More
Time-uniform log-Sobolev inequalities (LSI) satisfied by solutions of semi-linear mean-field equations have recently appeared to be a key tool to obtain time-uniform propagation of chaos estimates. This work addresses the more general settings of time-inhomogeneous Fokker-Planck equations. Time-uniform LSI are obtained in two cases, either with the bounded-Lipschitz perturbation argument with respect to a reference measure, or with a coupling approach at high temperature. These arguments are then applied to mean-field equations, where, on the one hand, sharp marginal propagation of chaos estimates are obtained in smooth cases and, on the other hand, time-uniform global propagation of chaos is shown in the case of vortex interactions with quadratic confinement potential on the whole space. In this second case, an important point is to establish global gradient and Hessian estimates, which is of independent interest. We prove these bounds in the more general situation of non-attractive logarithmic and Riesz singular interactions.
△ Less
Submitted 30 January, 2024; v1 submitted 15 January, 2024;
originally announced January 2024.
-
A note on a Vlasov-Fokker-Planck equation with non-symmetric interaction
Authors:
Pierre Monmarché
Abstract:
In the recent [3], Cesbron and Herda study a Vlasov-Fokker-Planck (VFP) equation with non-symmetric interaction, introduced in physics to model the distribution of electrons in a synchrotron particle accelerator. We make four remarks in view of their work: first, it is noticed in [3] that the free energy classically considered for the (symmetric) VFP equation is not a Lyapunov function in the non-…
▽ More
In the recent [3], Cesbron and Herda study a Vlasov-Fokker-Planck (VFP) equation with non-symmetric interaction, introduced in physics to model the distribution of electrons in a synchrotron particle accelerator. We make four remarks in view of their work: first, it is noticed in [3] that the free energy classically considered for the (symmetric) VFP equation is not a Lyapunov function in the non-symmetric case, and we will show however that this is still the case for a suitable definition of the free energy (with no explicit expression in general). Second, when the interaction is sufficiently small (in $W^{1,\infty}$), it is proven in [3] that the equation has a unique stationary solution which is locally attractive; in this spirit, we will see that, when the interaction force is Lipschitz with a sufficiently small constant, the convergence is global. Third, we also briefly discuss the mean-field interacting particle system corresponding to the VFP equation which, interestingly, is a non-equilibrium Langevin process. Finally, we will see that, in the small interaction regime, a suitable (explicit) non-linear Fisher information is contracted at constant rate, similarly to the situation of Wasserstein gradient flows for convex functionals (although here the dynamics is not a gradient flow).
△ Less
Submitted 9 November, 2023;
originally announced November 2023.
-
Reflection coupling for unadjusted generalized Hamiltonian Monte Carlo in the nonconvex stochastic gradient case
Authors:
Martin Chak,
Pierre Monmarché
Abstract:
Contraction in Wasserstein 1-distance with explicit rates is established for generalized Hamiltonian Monte Carlo with stochastic gradients under possibly nonconvex conditions. The algorithms considered include splitting schemes of kinetic Langevin diffusion. As consequence, quantitative Gaussian concentration bounds are provided for empirical averages. Convergence in Wasserstein 2-distance and tot…
▽ More
Contraction in Wasserstein 1-distance with explicit rates is established for generalized Hamiltonian Monte Carlo with stochastic gradients under possibly nonconvex conditions. The algorithms considered include splitting schemes of kinetic Langevin diffusion. As consequence, quantitative Gaussian concentration bounds are provided for empirical averages. Convergence in Wasserstein 2-distance and total variation are also given, together with numerical bias estimates.
△ Less
Submitted 17 April, 2024; v1 submitted 28 October, 2023;
originally announced October 2023.
-
$L^2$-Wasserstein contraction for Euler schemes of elliptic diffusions and interacting particle systems
Authors:
Linshan Liu,
Mateusz B. Majka,
Pierre Monmarché
Abstract:
We show the $L^2$-Wasserstein contraction for the transition kernel of a discretised diffusion process, under a contractivity at infinity condition on the drift and a sufficiently high diffusivity requirement. This extends recent results that, under similar assumptions on the drift but without the diffusivity restrictions, showed the $L^1$-Wasserstein contraction, or $L^p$-Wasserstein bounds for…
▽ More
We show the $L^2$-Wasserstein contraction for the transition kernel of a discretised diffusion process, under a contractivity at infinity condition on the drift and a sufficiently high diffusivity requirement. This extends recent results that, under similar assumptions on the drift but without the diffusivity restrictions, showed the $L^1$-Wasserstein contraction, or $L^p$-Wasserstein bounds for $p > 1$ that were, however, not true contractions. We explain how showing the true $L^2$-Wasserstein contraction is crucial for obtaining the local Poincaré inequality for the transition kernel of the Euler scheme of a diffusion. Moreover, we discuss other consequences of our contraction results, such as concentration inequalities and convergence rates in KL-divergence and total variation. We also study the corresponding $L^2$-Wasserstein contraction for discretisations of interacting diffusions. As a particular application, this allows us to analyse the behaviour of particle systems that can be used to approximate a class of McKean-Vlasov SDEs that were recently studied in the mean-field optimization literature.
△ Less
Submitted 24 October, 2023;
originally announced October 2023.
-
Asymptotic expansion of the invariant measurefor Markov-modulated ODEs at high frequency
Authors:
Pierre Monmarché,
Edouard Strickler
Abstract:
We consider time-inhomogeneous ODEs whose parameters are governed by an underlying ergodic Markov process. When this underlying process is accelerated by a factor $\varepsilon^{-1}$, an averaging phenomenon occurs and the solution of the ODE converges to a deterministic ODE as $\varepsilon$ vanishes. We are interested in cases where this averaged flow is globally attracted to a point. In that case…
▽ More
We consider time-inhomogeneous ODEs whose parameters are governed by an underlying ergodic Markov process. When this underlying process is accelerated by a factor $\varepsilon^{-1}$, an averaging phenomenon occurs and the solution of the ODE converges to a deterministic ODE as $\varepsilon$ vanishes. We are interested in cases where this averaged flow is globally attracted to a point. In that case, the equilibrium distribution of the solution of the ODE converges to a Dirac mass at this point. We prove an asymptotic expansion in terms of $\varepsilon$ for this convergence, with a somewhat explicit formula for the first order term. The results are applied in three contexts: linear Markov-modulated ODEs, randomized splitting schemes, and Lotka-Volterra models in random environment. In particular, as a corollary, we prove the existence of two matrices whose convex combinations are all stable but such that, for a suitable jump rate, the top Lyapunov exponent of a Markov-modulated linear ODE switching between these two matrices is positive.
△ Less
Submitted 28 September, 2023;
originally announced September 2023.
-
Leveraging Analog Quantum Computing with Neutral Atoms for Solvent Configuration Prediction in Drug Discovery
Authors:
Mauro D'Arcangelo,
Daniele Loco,
Fresnel team,
Nicolaï Gouraud,
Stanislas Angebault,
Jules Sueiro,
Pierre Monmarché,
Jérôme Forêt,
Louis-Paul Henry,
Loïc Henriet,
Jean-Philip Piquemal
Abstract:
We introduce quantum algorithms able to sample equilibrium water solvent molecules configurations within proteins thanks to analog quantum computing. To do so, we combine a quantum placement strategy to the 3D Reference Interaction Site Model (3D-RISM), an approach capable of predicting continuous solvent distributions. The intrinsic quantum nature of such coupling guarantees molecules not to be p…
▽ More
We introduce quantum algorithms able to sample equilibrium water solvent molecules configurations within proteins thanks to analog quantum computing. To do so, we combine a quantum placement strategy to the 3D Reference Interaction Site Model (3D-RISM), an approach capable of predicting continuous solvent distributions. The intrinsic quantum nature of such coupling guarantees molecules not to be placed too close to each other, a constraint usually imposed by hand in classical approaches. We present first a full quantum adiabatic evolution model that uses a local Rydberg Hamiltonian to cast the general problem into an anti-ferromagnetic Ising model. Its solution, an NP-hard problem in classical computing, is embodied into a Rydberg atom array Quantum Processing Unit (QPU). Following a classical emulator implementation, a QPU portage allows to experimentally validate the algorithm performances on an actual quantum computer. As a perspective of use on next generation devices, we emulate a second hybrid quantum-classical version of the algorithm. Such a variational quantum approach (VQA) uses a classical Bayesian minimization routine to find the optimal laser parameters. Overall, these Quantum-3D-RISM (Q-3D-RISM) algorithms open a new route towards the application of analog quantum computing in molecular modelling and drug design.
△ Less
Submitted 22 September, 2023; v1 submitted 21 September, 2023;
originally announced September 2023.
-
Logarithmic Sobolev inequalities for non-equilibrium steady states
Authors:
Pierre Monmarché,
Songbo Wang
Abstract:
We consider two methods to establish log-Sobolev inequalities for the invariant measure of a diffusion process when its density is not explicit and the curvature is not positive everywhere. In the first approach, based on the Holley-Stroock and Aida-Shigekawa perturbation arguments [16, 1], the control on the (non-explicit) perturbation is obtained by stochastic control methods, following the comp…
▽ More
We consider two methods to establish log-Sobolev inequalities for the invariant measure of a diffusion process when its density is not explicit and the curvature is not positive everywhere. In the first approach, based on the Holley-Stroock and Aida-Shigekawa perturbation arguments [16, 1], the control on the (non-explicit) perturbation is obtained by stochastic control methods, following the comparison technique introduced by Conforti [7]. The second method combines the Wasserstein-$2$ contraction method, used in [24] to prove a Poincaré inequality in some non-equilibrium cases, with Wang's hypercontractivity results [29].
△ Less
Submitted 14 September, 2023;
originally announced September 2023.
-
Some remarks on the effect of the Random Batch Method on phase transition
Authors:
Arnaud Guillin,
Pierre Le Bris,
Pierre Monmarché
Abstract:
In this article, we focus on two toy models : the Curie-Weiss model and the system of $N$ particles in linear interactions in a double well confining potential. Both models, which have been extensively studied, describe a large system of particles with a mean-field limit that admits a phase transition. We are concerned with the numerical simulation of these particle systems. To deal with the quadr…
▽ More
In this article, we focus on two toy models : the Curie-Weiss model and the system of $N$ particles in linear interactions in a double well confining potential. Both models, which have been extensively studied, describe a large system of particles with a mean-field limit that admits a phase transition. We are concerned with the numerical simulation of these particle systems. To deal with the quadratic complexity of the numerical scheme, corresponding to the computation of the $O(N^2)$ interactions per time step, the Random Batch Method (RBM) has been suggested. It consists in randomly (and uniformly) dividing the particles into batches of size $p>1$, and computing the interactions only within each batch, thus reducing the numerical complexity to $O(Np)$ per time step. The convergence of this numerical method has been proved in other works.
This work is motivated by the observation that the RBM, via the random constructions of batches, artificially adds noise to the particle system. The goal of this article is to study the effect of this added noise on the phase transition of the nonlinear limit, and more precisely we study the effective dynamics of the two models to show how a phase transition may still be observed with the RBM but at a lower critical temperature.
△ Less
Submitted 3 August, 2023;
originally announced August 2023.
-
Wasserstein contraction for the stochastic Morris-Lecar neuron model
Authors:
Maxime Herda,
Pierre Monmarché,
Benoît Perthame
Abstract:
Neuron models have attracted a lot of attention recently, both in mathematics and neuroscience. We are interested in studying long-time and large-population emerging properties in a simplified toy model. From a mathematical perspective, this amounts to study the long-time behaviour of a degenerate reflected diffusion process. Using coupling arguments, the flow is proven to be a contraction of the…
▽ More
Neuron models have attracted a lot of attention recently, both in mathematics and neuroscience. We are interested in studying long-time and large-population emerging properties in a simplified toy model. From a mathematical perspective, this amounts to study the long-time behaviour of a degenerate reflected diffusion process. Using coupling arguments, the flow is proven to be a contraction of the Wasserstein distance for long times, which implies the exponential relaxation toward a (non-explicit) unique globally attractive equilibrium distribution. The result is extended to a McKean-Vlasov type non-linear variation of the model, when the mean-field interaction is sufficiently small. The ergodicity of the process results from a combination of deterministic contraction properties and local diffusion, the noise being sufficient to drive the system away from non-contractive domains.
△ Less
Submitted 30 January, 2024; v1 submitted 25 July, 2023;
originally announced July 2023.
-
Lambda-ABF: Simplified, Portable, Accurate and Cost-effective Alchemical Free Energy Computations
Authors:
Louis Lagardère,
Lise Maurin,
Olivier Adjoua,
Krystel El Hage,
Pierre Monmarché,
Jean-Philip Piquemal,
Jérôme Hénin
Abstract:
We introduce an efficient and robust method to compute alchemical free energy differences, resulting from the application of multiple walker Adaptive Biasing Force (ABF) in conjunction with strongly damped Langevin $λ$-dynamics. Unbiased alchemical free energy surfaces are naturally recovered by Thermodynamic Integration (TI). No manual optimization of the $λ$ schedule is required as the sampling…
▽ More
We introduce an efficient and robust method to compute alchemical free energy differences, resulting from the application of multiple walker Adaptive Biasing Force (ABF) in conjunction with strongly damped Langevin $λ$-dynamics. Unbiased alchemical free energy surfaces are naturally recovered by Thermodynamic Integration (TI). No manual optimization of the $λ$ schedule is required as the sampling of the $λ$ variable is continuous and converges towards a uniform distribution. Free diffusion of $λ$ improves orthogonal relaxation compared to fixed $λ$ methods such as standard TI or Free Energy Perturbation (FEP). Furthermore, the multiple walker strategy provides coverage of orthogonal space in a generic way with minimal user input and negligible computational overhead. Of practical importance, no adiabatic decoupling between the alchemical and Cartesian degrees of freedom is assumed, ensuring unbiased estimates for a wide envelope of numerical parameters. We present two high-performance implementations of the method in production molecular dynamics engines, namely NAMD and Tinker-HP, through coupling with the Colvars open source library. These interfaces enable the combination of the rich feature sets of those packages. We demonstrate the correctness and efficiency of the approach on several real-world cases: from solvation free energies up to ligand-receptor binding (using a recently proposed binding restraint scheme) with both fixed-charge and polarizable models. We find that, for a chosen accuracy, the computational cost is strongly reduced compared to state-of-the-art fixed-lambda methods and that results within 1~kcal/mol of experimental value are recovered for the most complex system. The implementation is publicly available and readily usable by practitioners of current alchemical methods.
△ Less
Submitted 14 May, 2024; v1 submitted 16 July, 2023;
originally announced July 2023.
-
Second order quantitative bounds for unadjusted generalized Hamiltonian Monte Carlo
Authors:
Evan Camrud,
Alain Durmus,
Pierre Monmarché,
Gabriel Stoltz
Abstract:
This paper provides a convergence analysis for generalized Hamiltonian Monte Carlo samplers, a family of Markov Chain Monte Carlo methods based on leapfrog integration of Hamiltonian dynamics and kinetic Langevin diffusion, that encompasses the unadjusted Hamiltonian Monte Carlo method. Assuming that the target distribution $π$ satisfies a log-Sobolev inequality and mild conditions on the correspo…
▽ More
This paper provides a convergence analysis for generalized Hamiltonian Monte Carlo samplers, a family of Markov Chain Monte Carlo methods based on leapfrog integration of Hamiltonian dynamics and kinetic Langevin diffusion, that encompasses the unadjusted Hamiltonian Monte Carlo method. Assuming that the target distribution $π$ satisfies a log-Sobolev inequality and mild conditions on the corresponding potential function, we establish quantitative bounds on the relative entropy of the iterates defined by the algorithm, with respect to $π$. Our approach is based on a perturbative and discrete version of the modified entropy method developed to establish hypocoercivity for the continuous-time kinetic Langevin process. As a corollary of our main result, we are able to derive complexity bounds for the class of algorithms at hand. In particular, we show that the total number of iterations to achieve a target accuracy $\varepsilon >0$ is of order $d/\varepsilon^{1/4}$, where $d$ is the dimension of the problem. This result can be further improved in the case of weakly interacting mean field potentials, for which we find a total number of iterations of order $(d/\varepsilon)^{1/4}$.
△ Less
Submitted 13 May, 2024; v1 submitted 15 June, 2023;
originally announced June 2023.
-
Recent advances in the long-time analysis of killed degenerate processes and their particle approximation
Authors:
Bertrand Cloez,
Lucas Journel,
Pierre Monmarché,
Boris Nectoux,
Mouad Ramil
Abstract:
We review some recent results of quantitative long-time convergence for the law of a killed Markov process conditioned to survival toward a quasi-stationary distribution, and on the analogous question for the particle systems used in practice to sample these distributions. With respect to the existing literature, one of the novelties of these works is the degeneracy of the underlying process with…
▽ More
We review some recent results of quantitative long-time convergence for the law of a killed Markov process conditioned to survival toward a quasi-stationary distribution, and on the analogous question for the particle systems used in practice to sample these distributions. With respect to the existing literature, one of the novelties of these works is the degeneracy of the underlying process with respect to classical elliptic diffusion, namely it can be a non-elliptic hypoelliptic diffusion, a piecewise deterministic Markov process or an Euler numerical scheme.
△ Less
Submitted 25 May, 2023;
originally announced May 2023.
-
Switched diffusion processes for non-convex optimization and saddle points search
Authors:
Lucas Journel,
Pierre Monmarché
Abstract:
We introduce and investigate stochastic processes designed to find local minimizers and saddle points of non-convex functions, exploring the landscape more efficiently than the standard noisy gradient descent. The processes switch between two behaviours, a noisy gradient descent and a noisy saddle point search. It is proven to be well-defined and to converge to a stationary distribution in the lon…
▽ More
We introduce and investigate stochastic processes designed to find local minimizers and saddle points of non-convex functions, exploring the landscape more efficiently than the standard noisy gradient descent. The processes switch between two behaviours, a noisy gradient descent and a noisy saddle point search. It is proven to be well-defined and to converge to a stationary distribution in the long time. Numerical experiments are provided on low-dimensional toy models and for Lennard-Jones clusters.
△ Less
Submitted 23 March, 2023;
originally announced March 2023.
-
Piecewise deterministic sampling with splitting schemes
Authors:
Andrea Bertazzi,
Paul Dobson,
Pierre Monmarché
Abstract:
We introduce novel Markov chain Monte Carlo (MCMC) algorithms based on numerical approximations of piecewise-deterministic Markov processes obtained with the framework of splitting schemes. We present unadjusted as well as adjusted algorithms, for which the asymptotic bias due to the discretisation error is removed applying a non-reversible Metropolis-Hastings filter. In a general framework we dem…
▽ More
We introduce novel Markov chain Monte Carlo (MCMC) algorithms based on numerical approximations of piecewise-deterministic Markov processes obtained with the framework of splitting schemes. We present unadjusted as well as adjusted algorithms, for which the asymptotic bias due to the discretisation error is removed applying a non-reversible Metropolis-Hastings filter. In a general framework we demonstrate that the unadjusted schemes have weak error of second order in the step size, while typically maintaining a computational cost of only one gradient evaluation of the negative log-target function per iteration. Focusing then on unadjusted schemes based on the Bouncy Particle and Zig-Zag samplers, we provide conditions ensuring geometric ergodicity and consider the expansion of the invariant measure in terms of the step size. We analyse the dependence of the leading term in this expansion on the refreshment rate and on the structure of the splitting scheme, giving a guideline on which structure is best. Finally, we illustrate the competitiveness of our samplers with numerical experiments on a Bayesian imaging inverse problem and a system of interacting particles.
△ Less
Submitted 20 October, 2023; v1 submitted 6 January, 2023;
originally announced January 2023.
-
An entropic approach for Hamiltonian Monte Carlo: the idealized case
Authors:
Pierre Monmarché
Abstract:
Quantitative long-time entropic convergence and short-time regularization are established for an idealized Hamiltonian Monte Carlo chain which alternatively follows an Hamiltonian dynamics for a fixed time and then partially or totally refreshes its velocity with an auto-regressive Gaussian step. These results, in discrete time, are the analogous of similar results for the continuous-time kinetic…
▽ More
Quantitative long-time entropic convergence and short-time regularization are established for an idealized Hamiltonian Monte Carlo chain which alternatively follows an Hamiltonian dynamics for a fixed time and then partially or totally refreshes its velocity with an auto-regressive Gaussian step. These results, in discrete time, are the analogous of similar results for the continuous-time kinetic Langevin diffusion, and the latter can be obtained from our bounds in a suitable limit regime. The dependency in the log-Sobolev constant of the target measure is sharp and is illustrated on a mean-field case and on a low-temperature regime, with an application to the simulated annealing algorithm. The practical unadjusted algorithm is briefly discussed.
△ Less
Submitted 3 June, 2023; v1 submitted 27 September, 2022;
originally announced September 2022.
-
Uniform convergence of the Fleming-Viot process in a hard killing metastable case
Authors:
Lucas Journel,
Pierre Monmarché
Abstract:
We study the long-time convergence of a Fleming-Viot process, in the case where the underlying process is a metastable diffusion killed when it reaches some level set. Through a coupling argument, we establish the long-time convergence of the Fleming-Viot process toward some stationary measure at an exponential rate independent of $N$, the size of the system, as well as uniform in time propagation…
▽ More
We study the long-time convergence of a Fleming-Viot process, in the case where the underlying process is a metastable diffusion killed when it reaches some level set. Through a coupling argument, we establish the long-time convergence of the Fleming-Viot process toward some stationary measure at an exponential rate independent of $N$, the size of the system, as well as uniform in time propagation of chaos estimates.
△ Less
Submitted 20 June, 2024; v1 submitted 5 July, 2022;
originally announced July 2022.
-
On systems of particles in singular repulsive interaction in dimension one : log and Riesz gas
Authors:
Arnaud Guillin,
Pierre Le Bris,
Pierre Monmarché
Abstract:
In this article, we prove the first quantitative uniform in time propagation of chaos for a class of systems of particles in singular repulsive interaction in dimension one that contains the Dyson Brownian motion. We start by establishing existence and uniqueness for the Riesz gases, before proving propagation of chaos with an original approach to the problem, namely coupling with a Cauchy sequenc…
▽ More
In this article, we prove the first quantitative uniform in time propagation of chaos for a class of systems of particles in singular repulsive interaction in dimension one that contains the Dyson Brownian motion. We start by establishing existence and uniqueness for the Riesz gases, before proving propagation of chaos with an original approach to the problem, namely coupling with a Cauchy sequence type argument. We also give a general argument to turn a result of weak propagation of chaos into a strong and uniform in time result using the long time behavior and some bounds on moments, in particular enabling us to get a uniform in time version of the result of Cépa-Lé**le.
△ Less
Submitted 22 April, 2022;
originally announced April 2022.
-
HMC and underdamped Langevin united in the unadjusted convex smooth case
Authors:
Nicolaï Gouraud,
Pierre Le Bris,
Adrien Majka,
Pierre Monmarché
Abstract:
We consider a family of unadjusted generalized HMC samplers, which includes standard position HMC samplers and discretizations of the underdamped Langevin process. A detailed analysis and optimization of the parameters is conducted in the Gaussian case, which shows an improvement from $1/κ$ to $1/\sqrtκ$ for the convergence rate in terms of the condition number $κ$ by using partial velocity refres…
▽ More
We consider a family of unadjusted generalized HMC samplers, which includes standard position HMC samplers and discretizations of the underdamped Langevin process. A detailed analysis and optimization of the parameters is conducted in the Gaussian case, which shows an improvement from $1/κ$ to $1/\sqrtκ$ for the convergence rate in terms of the condition number $κ$ by using partial velocity refreshment, with respect to classical full refreshments. A similar effect is observed empirically for two related algorithms, namely Metropolis-adjusted gHMC and kinetic piecewise-deterministic Markov processes. Then, a stochastic gradient version of the samplers is considered, for which dimension-free convergence rates are established for log-concave smooth targets over a large range of parameters, gathering in a unified framework previous results on position HMC and underdamped Langevin and extending them to HMC with inertia.
△ Less
Submitted 22 May, 2024; v1 submitted 2 February, 2022;
originally announced February 2022.
-
Wasserstein contraction and Poincaré inequalities for elliptic diffusions at high temperature
Authors:
Pierre Monmarché
Abstract:
We consider elliptic diffusion processes on $\mathbb R^d$. Assuming that the drift contracts distances outside a compact set, we prove that, at a sufficiently high temperature, the Markov semi-group associated to the process is a contraction of the $\mathcal W_2$ Wasserstein distance, which implies a Poincaré inequality for its invariant measure. The result doesn't require neither reversibility no…
▽ More
We consider elliptic diffusion processes on $\mathbb R^d$. Assuming that the drift contracts distances outside a compact set, we prove that, at a sufficiently high temperature, the Markov semi-group associated to the process is a contraction of the $\mathcal W_2$ Wasserstein distance, which implies a Poincaré inequality for its invariant measure. The result doesn't require neither reversibility nor an explicit expression of the invariant measure, and the estimates have a sharp dependency on the dimension. Some variations of the arguments are then used to study, first, the stability of the invariant measure of the process with respect to its drift and, second, systems of interacting particles, yielding a criterion for dimension-free Poincaré inequalities and quantitative long-time convergence for non-linear McKean-Vlasov type processes.
△ Less
Submitted 19 July, 2023; v1 submitted 19 January, 2022;
originally announced January 2022.
-
Position-dependent memory kernel in generalized Langevin equations: theory and numerical estimation
Authors:
Hadrien Vroylandt,
Pierre Monmarché
Abstract:
Generalized Langevin equations with non-linear forces and position-dependent linear friction memory kernels, such as commonly used to describe the effective dynamics of coarse-grained variables in molecular dynamics, are rigorously derived within the Mori-Zwanzig formalism. A fluctuation-dissipation theorem relating the properties of the noise to the memory kernel is shown. The derivation also yie…
▽ More
Generalized Langevin equations with non-linear forces and position-dependent linear friction memory kernels, such as commonly used to describe the effective dynamics of coarse-grained variables in molecular dynamics, are rigorously derived within the Mori-Zwanzig formalism. A fluctuation-dissipation theorem relating the properties of the noise to the memory kernel is shown. The derivation also yields Volterra-type equations for the kernel, which can be used for a numerical parametrization of the model from all-atom simulations.
△ Less
Submitted 6 July, 2022; v1 submitted 7 January, 2022;
originally announced January 2022.
-
On the gap between deterministic and probabilistic Lyapunov exponents for continuous-time linear systems
Authors:
Yacine Chitour,
Guilherme Mazanti,
Pierre Monmarché,
Mario Sigalotti
Abstract:
Consider a non-autonomous continuous-time linear system in which the time-dependent matrix determining the dynamics is piecewise constant and takes finitely many values $A_1, \dotsc, A_N$. This paper studies the equality cases between the maximal Lyapunov exponent associated with the set of matrices $\{A_1, \dotsc, A_N\}$, on the one hand, and the corresponding ones for piecewise deterministic Mar…
▽ More
Consider a non-autonomous continuous-time linear system in which the time-dependent matrix determining the dynamics is piecewise constant and takes finitely many values $A_1, \dotsc, A_N$. This paper studies the equality cases between the maximal Lyapunov exponent associated with the set of matrices $\{A_1, \dotsc, A_N\}$, on the one hand, and the corresponding ones for piecewise deterministic Markov processes with modes $A_1, \dotsc, A_N$, on the other hand. A fundamental step in this study consists in establishing a result of independent interest, namely, that any sequence of Markov processes associated with the matrices $A_1,\dotsc, A_N$ converges, up to extracting a subsequence, to a Markov process associated with a suitable convex combination of those matrices.
△ Less
Submitted 21 November, 2022; v1 submitted 13 December, 2021;
originally announced December 2021.
-
Reducing exit-times of diffusions with repulsive interactions
Authors:
Paul-Eric Chaudru de Raynal,
Manh Hong Duong,
Pierre Monmarché,
Milica Tomašević,
Julian Tugaut
Abstract:
In this work we prove a Kramers' type law for the low-temperature behavior of the exit-times from a metastable state for a class of self-interacting nonlinear diffusion processes. Contrary to previous works, the interaction is not assumed to be convex, which means that this result covers cases where the exit-time for the interacting process is smaller than the exit-time for the associated non-inte…
▽ More
In this work we prove a Kramers' type law for the low-temperature behavior of the exit-times from a metastable state for a class of self-interacting nonlinear diffusion processes. Contrary to previous works, the interaction is not assumed to be convex, which means that this result covers cases where the exit-time for the interacting process is smaller than the exit-time for the associated non-interacting process. The technique of the proof is based on the fact that, under an appropriate contraction condition, the interacting process is conveniently coupled with a non-interacting (linear) Markov process where the interacting law is replaced by a constant Dirac mass at the fixed point of the deterministic zero-temperature process.
△ Less
Submitted 25 October, 2021;
originally announced October 2021.
-
Likelihood-based non-Markovian models from molecular dynamics
Authors:
Hadrien Vroylandt,
Ludovic Goudenège,
Pierre Monmarché,
Fabio Pietrucci,
Benjamin Rotenberg
Abstract:
We introduce a new method to accurately and efficiently estimate the effective dynamics of collective variables in molecular simulations. Such reduced dynamics play an essential role in the study of a broad class of processes, ranging from chemical reactions in solution to conformational changes in biomolecules or phase transitions in condensed matter systems. The standard Markovian approximation…
▽ More
We introduce a new method to accurately and efficiently estimate the effective dynamics of collective variables in molecular simulations. Such reduced dynamics play an essential role in the study of a broad class of processes, ranging from chemical reactions in solution to conformational changes in biomolecules or phase transitions in condensed matter systems. The standard Markovian approximation often breaks down due to the lack of a proper separation of time scales and memory effects must be taken into account. Using a parametrization based on hidden auxiliary variables, we obtain a generalized Langevin equation by maximizing the statistical likelihood of the observed trajectories. Both the memory kernel and random noise are correctly recovered by this procedure. This data-driven approach provides a reduced dynamical model for multidimensional collective variables, enabling the accurate sampling of their long-time dynamical properties at a computational cost drastically reduced with respect to all-atom numerical simulations. The present strategy, based on the reproduction of the dynamics of trajectories rather than the memory kernel or the velocity-autocorrelation function, conveniently provides other observables beyond these two, including e.g. stationary currents in non-equilibrium situations, or the distribution of first passage times between metastable states.
△ Less
Submitted 23 February, 2022; v1 submitted 8 October, 2021;
originally announced October 2021.
-
Overdamped limit at stationarity for non-equilibrium Langevin diffusions
Authors:
Pierre Monmarché,
Mouad Ramil
Abstract:
In this note, we establish that the stationary distribution of a possibly non-equilibrium Langevin diffusion converges, as the dam** parameter goes to infinity (or equivalently in the Smoluchowski-Kramers vanishing mass limit), toward a tensor product of the stationary distribution of the corresponding overdamped process and of a Gaussian distribution.
In this note, we establish that the stationary distribution of a possibly non-equilibrium Langevin diffusion converges, as the dam** parameter goes to infinity (or equivalently in the Smoluchowski-Kramers vanishing mass limit), toward a tensor product of the stationary distribution of the corresponding overdamped process and of a Gaussian distribution.
△ Less
Submitted 11 January, 2022; v1 submitted 4 October, 2021;
originally announced October 2021.
-
Uniform in time propagation of chaos for the 2D vortex model and other singular stochastic systems
Authors:
Arnaud Guillin,
Pierre Le Bris,
Pierre Monmarché
Abstract:
In this article, we adapt the work of Jabin and Wang (2018) to show the first result of uniform in time propagation of chaos for a class of singular interaction kernels. In particular, our models contain the Biot-Savart kernel on the torus and thus the 2D vortex model.
In this article, we adapt the work of Jabin and Wang (2018) to show the first result of uniform in time propagation of chaos for a class of singular interaction kernels. In particular, our models contain the Biot-Savart kernel on the torus and thus the 2D vortex model.
△ Less
Submitted 4 October, 2023; v1 submitted 19 August, 2021;
originally announced August 2021.
-
Convergence of the kinetic annealing for general potentials
Authors:
Lucas Journel,
Pierre Monmarché
Abstract:
The convergence of the kinetic Langevin simulated annealing is proven under mild assumptions on the potential $U$ for slow logarithmic cooling schedules. Moreover, non-convergence for fast logarithmic and non-logarithmic cooling schedules is established. The results are based on an adaptation to non-elliptic non-reversible kinetic settings of a localization/local convergence strategy developed by…
▽ More
The convergence of the kinetic Langevin simulated annealing is proven under mild assumptions on the potential $U$ for slow logarithmic cooling schedules. Moreover, non-convergence for fast logarithmic and non-logarithmic cooling schedules is established. The results are based on an adaptation to non-elliptic non-reversible kinetic settings of a localization/local convergence strategy developed by Fournier and Tardif in the overdamped elliptic case, and on precise quantitative high order Sobolev hypocoercive estimates.
△ Less
Submitted 7 December, 2022; v1 submitted 24 July, 2021;
originally announced July 2021.
-
Convergence rates for the Vlasov-Fokker-Planck equation and uniform in time propagation of chaos in non convex cases
Authors:
Arnaud Guillin,
Pierre Le Bris,
Pierre Monmarché
Abstract:
We prove the existence of a contraction rate for Vlasov-Fokker-Planck equation in Wasserstein distance, provided the interaction potential is (locally) Lipschitz continuous and the confining potential is both Lipschitz continuous and greater than a quadratic function, thus requiring no convexity conditions. Our strategy relies on coupling methods suggested by A. Eberle adapted to the kinetic setti…
▽ More
We prove the existence of a contraction rate for Vlasov-Fokker-Planck equation in Wasserstein distance, provided the interaction potential is (locally) Lipschitz continuous and the confining potential is both Lipschitz continuous and greater than a quadratic function, thus requiring no convexity conditions. Our strategy relies on coupling methods suggested by A. Eberle adapted to the kinetic setting enabling also to obtain uniform in time propagation of chaos in a non convex setting.
△ Less
Submitted 16 July, 2021; v1 submitted 19 May, 2021;
originally announced May 2021.
-
Discrete sticky couplings of functional autoregressive processes
Authors:
Alain Durmus,
Andreas Eberle,
Aurélien Enfroy,
Arnaud Guillin,
Pierre Monmarché
Abstract:
In this paper, we provide bounds in Wasserstein and total variation distances between the distributions of the successive iterates of two functional autoregressive processes with isotropic Gaussian noise of the form $Y_{k+1} = \mathrm{T}_γ(Y_k) + \sqrt{γσ^2} Z_{k+1}$ and $\tilde{Y}_{k+1} = \tilde{\mathrm{T}}_γ(\tilde{Y}_k) + \sqrt{γσ^2} \tilde{Z}_{k+1}$. More precisely, we give non-asymptotic boun…
▽ More
In this paper, we provide bounds in Wasserstein and total variation distances between the distributions of the successive iterates of two functional autoregressive processes with isotropic Gaussian noise of the form $Y_{k+1} = \mathrm{T}_γ(Y_k) + \sqrt{γσ^2} Z_{k+1}$ and $\tilde{Y}_{k+1} = \tilde{\mathrm{T}}_γ(\tilde{Y}_k) + \sqrt{γσ^2} \tilde{Z}_{k+1}$. More precisely, we give non-asymptotic bounds on $ρ(\mathcal{L}(Y_{k}),\mathcal{L}(\tilde{Y}_k))$, where $ρ$ is an appropriate weighted Wasserstein distance or a $V$-distance, uniformly in the parameter $γ$, and on $ρ(π_γ,\tildeπ_γ)$, where $π_γ$ and $\tildeπ_γ$ are the respective stationary measures of the two processes. The class of considered processes encompasses the Euler-Maruyama discretization of Langevin diffusions and its variants. The bounds we derive are of order $γ$ as $γ\to 0$. To obtain our results, we rely on the construction of a discrete sticky Markov chain $(W_k^{(γ)})_{k \in \mathbb{N}}$ which bounds the distance between an appropriate coupling of the two processes. We then establish stability and quantitative convergence results for this process uniformly on $γ$. In addition, we show that it converges in distribution to the continuous sticky process studied in previous work. Finally, we apply our result to Bayesian inference of ODE parameters and numerically illustrate them on two particular problems.
△ Less
Submitted 28 November, 2023; v1 submitted 14 April, 2021;
originally announced April 2021.
-
The Adaptive Biasing Force algorithm with non-conservative forces and related topics
Authors:
Tony Lelièvre,
Lise Maurin,
Pierre Monmarché
Abstract:
We propose a study of the Adaptive Biasing Force method's robustness under generic (possibly non-conservative) forces. We first ensure the flat histogram property is satisfied in all cases. We then introduce a fixed point problem yielding the existence of a stationary state for both the Adaptive Biasing Force and Projected Adapted Biasing Force algorithms, relying on generic bounds on the invarian…
▽ More
We propose a study of the Adaptive Biasing Force method's robustness under generic (possibly non-conservative) forces. We first ensure the flat histogram property is satisfied in all cases. We then introduce a fixed point problem yielding the existence of a stationary state for both the Adaptive Biasing Force and Projected Adapted Biasing Force algorithms, relying on generic bounds on the invariant probability measures of homogeneous diffusions. Using classical entropy techniques, we prove the exponential convergence of both biasing force and law as time goes to infinity, for both the Adaptive Biasing Force and the Projected Adaptive Biasing Force methods.
△ Less
Submitted 19 February, 2021;
originally announced February 2021.
-
Almost sure contraction for diffusions on $\mathbb R^d$. Application to generalised Langevin diffusions
Authors:
Pierre Monmarché
Abstract:
In the case of diffusions on $\mathbb R^d$ with constant diffusion matrix, without assuming reversibility nor hypoellipticity, we prove that the contractivity of the deterministic drift is equivalent to the constant rate contraction of Wasserstein distances $\mathcal W_p$, $p\in[1,\infty]$. It also implies concentration inequalities for ergodic means of the process. Such a contractivity property i…
▽ More
In the case of diffusions on $\mathbb R^d$ with constant diffusion matrix, without assuming reversibility nor hypoellipticity, we prove that the contractivity of the deterministic drift is equivalent to the constant rate contraction of Wasserstein distances $\mathcal W_p$, $p\in[1,\infty]$. It also implies concentration inequalities for ergodic means of the process. Such a contractivity property is then established for some non-equilibrium chains of anharmonic oscillators and for some generalised Langevin diffusions when the potential is convex with bounded Hessian and the friction is sufficiently high. This extends previous known results for the usual (kinetic) Langevin diffusion.
△ Less
Submitted 5 April, 2023; v1 submitted 22 September, 2020;
originally announced September 2020.
-
Exact targeting of Gibbs distributions using velocity-jump processes
Authors:
Pierre Monmarché,
Mathias Rousset,
Pierre-André Zitt
Abstract:
This work introduces and studies a new family of velocity jump Markov processes directly amenable to exact simulation with the following two properties: i) trajectories converge in law when a time-step parameter vanishes towards a given Langevin or Hamil-tonian dynamics; ii) the stationary distribution of the process is always exactly given by the product of a Gaussian (for velocities) by any targ…
▽ More
This work introduces and studies a new family of velocity jump Markov processes directly amenable to exact simulation with the following two properties: i) trajectories converge in law when a time-step parameter vanishes towards a given Langevin or Hamil-tonian dynamics; ii) the stationary distribution of the process is always exactly given by the product of a Gaussian (for velocities) by any target log-density whose gradient is pointwise computabe together with some additional explicit appropriate upper bound. The process does not exhibit any velocity reflections (jump sizes can be controlled) and is suitable for the 'factorization method'. We provide a rigorous mathematical proof of: i) the small time-step convergence towards Hamiltonian/Langevin dynamics, as well as ii) the exponentially fast convergence towards the target distribution when suitable noise on velocity is present. Numerical implementation is detailed and illustrated.
△ Less
Submitted 14 September, 2020; v1 submitted 21 August, 2020;
originally announced August 2020.
-
Adaptive force biasing algorithms: new convergence results and tensor approximations of the bias
Authors:
Virginie Ehrlacher,
Tony Lelièvre,
Pierre Monmarché
Abstract:
A modification of the Adaptive Biasing Force method is introduced, in which the free energy is approximated by a sum of tensor products of one-dimensional functions. This enables to handle a larger number of reaction coordinates than the classical algorithm. We prove the algorithm is well-defined and prove the long-time convergence toward a regularized version of the free energy for an idealized v…
▽ More
A modification of the Adaptive Biasing Force method is introduced, in which the free energy is approximated by a sum of tensor products of one-dimensional functions. This enables to handle a larger number of reaction coordinates than the classical algorithm. We prove the algorithm is well-defined and prove the long-time convergence toward a regularized version of the free energy for an idealized version of the algorithm. Numerical experiments demonstrate that the method is able to capture correlations between reaction coordinates.
△ Less
Submitted 20 July, 2020;
originally announced July 2020.
-
High-dimensional MCMC with a standard splitting scheme for the underdamped Langevin diffusion
Authors:
Pierre Monmarché
Abstract:
The efficiency of a Markov sampler based on the underdamped Langevin diffusion is studied for high dimensional targets with convex and smooth potentials. We consider a classical second-order integrator which requires only one gradient computation per iteration. Contrary to previous works on similar samplers, a dimension-free contraction of Wasserstein distances and convergence rate for the total v…
▽ More
The efficiency of a Markov sampler based on the underdamped Langevin diffusion is studied for high dimensional targets with convex and smooth potentials. We consider a classical second-order integrator which requires only one gradient computation per iteration. Contrary to previous works on similar samplers, a dimension-free contraction of Wasserstein distances and convergence rate for the total variance distance are proven for the discrete time chain itself. Non-asymptotic Wasserstein and total variation efficiency bounds and concentration inequalities are obtained for both the Metropolis adjusted and unadjusted chains. \nv{In particular, for the unadjusted chain,} in terms of the dimension $d$ and the desired accuracy $\varepsilon$, the Wasserstein efficiency bounds are of order $\sqrt d / \varepsilon$ in the general case, $\sqrt{d/\varepsilon}$ if the Hessian of the potential is Lipschitz, and $d^{1/4}/\sqrt\varepsilon$ in the case of a separable target, in accordance with known results for other kinetic Langevin or HMC schemes.
△ Less
Submitted 18 June, 2021; v1 submitted 10 July, 2020;
originally announced July 2020.
-
Metastability for systems of interacting neurons
Authors:
Eva Löcherbach,
Pierre Monmarché
Abstract:
We study a stochastic system of interacting neurons and its metastable properties. The system consists of $N$ neurons, each spiking randomly with rate depending on its membrane potential. At its spiking time, the neuron potential is reset to $0$ and all other neurons receive an additional amount $h/N$ of potential. In between successive spike times, each neuron looses potential at exponential spee…
▽ More
We study a stochastic system of interacting neurons and its metastable properties. The system consists of $N$ neurons, each spiking randomly with rate depending on its membrane potential. At its spiking time, the neuron potential is reset to $0$ and all other neurons receive an additional amount $h/N$ of potential. In between successive spike times, each neuron looses potential at exponential speed. We study this system in the supercritical regime, that is, for sufficiently high values of the synaptic weight $h.$ Under very mild conditions on the spiking rate function, is has been shown in Duarte and Ost \cite{do} that the only invariant distribution of the finite system is the trivial measure $ δ_{\bf 0}$ corresponding to extinction of the process. Under minimal conditions on the behavior of the spiking rate function in the vicinity of $0$, we prove that the extinction time arrives at exponentially late times in $ N$, and discuss the stability of the equilibrium $δ_{\bf 0}$ for the non-linear mean-field limit process depending on the parameters of the dynamics. We then specify our study to the case of saturating spiking rates and show that, under suitable conditions on the parameters of the model, 1) the non-linear mean-field limit admits a unique and globally attracting equilibrium and 2) the rescaled exit times for the mean spiking rate of a finite system from a neighbourhood of the non-linear equilibrium rate converge in law to an exponential distribution, as the system size diverges. In other words, the system exhibits a metastable behavior.
△ Less
Submitted 8 December, 2020; v1 submitted 28 April, 2020;
originally announced April 2020.
-
Uniform long-time and propagation of chaos estimates for mean field kinetic particles in non-convex landscapes
Authors:
Arnaud Guillin,
Pierre Monmarché
Abstract:
Combining the results of [14] and [10], the trend to equilibrium in large time is studied for a large particle system associated to a Vlasov-Fokker-Planck equation. Under some conditions (that allow non-convex confining potentials) the convergence rate is proven to be independent from the number of particles. From this are derived uniform in time propagation of chaos estimates and an exponentially…
▽ More
Combining the results of [14] and [10], the trend to equilibrium in large time is studied for a large particle system associated to a Vlasov-Fokker-Planck equation. Under some conditions (that allow non-convex confining potentials) the convergence rate is proven to be independent from the number of particles. From this are derived uniform in time propagation of chaos estimates and an exponentially fast convergence for the nonlinear equation itself.
△ Less
Submitted 10 April, 2020; v1 submitted 2 March, 2020;
originally announced March 2020.
-
Velocity jump processes : an alternative to multi-timestep methods for faster and accurate molecular dynamics simulations
Authors:
Pierre Monmarché,
Jérémy Weisman,
Louis Lagardère,
Jean-Philip Piquemal
Abstract:
We propose a new route to accelerate molecular dynamics through the use of velocity jump processes allowing for an adaptive time-step specific to each atom-atom pair (2-body) interactions. We start by introducing the formalism of the new velocity jump molecular dynamics, ergodic with respect to the canonical measure. We then introduce the new BOUNCE integrator that allows for long-range forces to…
▽ More
We propose a new route to accelerate molecular dynamics through the use of velocity jump processes allowing for an adaptive time-step specific to each atom-atom pair (2-body) interactions. We start by introducing the formalism of the new velocity jump molecular dynamics, ergodic with respect to the canonical measure. We then introduce the new BOUNCE integrator that allows for long-range forces to be evaluated at random and optimal time-steps, leading to strong savings in direct space. The accuracy and computational performances of a first BOUNCE implementation dedicated to classical (non-polarizable) force fields is tested in the cases of pure direct-space droplet-like simulations and of periodic boundary conditions (PBC) simulations using Smooth Particule Mesh Ewald. An analysis of the capability of BOUNCE to reproduce several condensed phase properties is provided. Since electrostatics and van der Waals 2-body contributions are evaluated much less often than with standard integrators using a 1fs timestep, up to a 400 % direct-space acceleration is observed. Applying the reversible reference system propagator algorithms (RESPA(1)) to reciprocal space (many-body) interactions allows BOUNCE-RESPA(1) to maintain large speedups in PBC while maintaining precision. Overall, we show that replacing the BAOAB integrator by the BOUNCE adaptive framework preserves a similar accuracy and leads to significant computational savings.
△ Less
Submitted 9 June, 2020; v1 submitted 17 February, 2020;
originally announced February 2020.
-
Hypocoercivité $L^2$, inégalité de concentration, temps d'atteinte et fonctions de Lyapunov
Authors:
Pierre Monmarché
Abstract:
We establish that, for a Markov semi-group, $L^2$ hypocoercivity, i.e. contractivity for a modified $L^2$ norm, implies quantitative deviation bounds for additive functionals of the associated Markov process and exponential integrability of the hitting time of sets with positive measure. Moreover, in the case of diffusion processes and under a strong hypoellipticity assumption, we prove that…
▽ More
We establish that, for a Markov semi-group, $L^2$ hypocoercivity, i.e. contractivity for a modified $L^2$ norm, implies quantitative deviation bounds for additive functionals of the associated Markov process and exponential integrability of the hitting time of sets with positive measure. Moreover, in the case of diffusion processes and under a strong hypoellipticity assumption, we prove that $L^2$ hypocoercivity implies the existence of a Lyapunov function for the generator. An english translation of the original article in french is provided.
-----
On montre que, pour un semi-groupe de Markov, l'hypocoercivité $L^2$ -- c'est-à-dire la contractivité d'une norme $L^2$ modifiée -- implique des inégalités de concentration quantitatives et l'intégrabilité exponentielle des temps d'atteinte des ensembles de mesure positive. D'autre part, pour les diffusions et sous une hypothèse forte d'hypoellipticité, on établit que l'hypocoercivité $L^2$ implique l'existence d'une fonction de Lyapunov pour le générateur associé. Une traduction en anglais est disponible.
△ Less
Submitted 19 December, 2019; v1 submitted 5 November, 2019;
originally announced November 2019.
-
Convergence of a particle approximation for the quasi-stationary distribution of a diffusion process: uniform estimates in a compact soft case
Authors:
Lucas Journel,
Pierre Monmarché
Abstract:
We establish the convergences (with respect to the simulation time $t$; the number of particles $N$; the timestep $γ$) of a Moran/Fleming-Viot type particle scheme toward the quasi-stationary distribution of a diffusion on the $d$-dimensional torus, killed at a smooth rate. In these conditions, quantitative bounds are obtained that, for each parameter ($t\rightarrow \infty$, $N\rightarrow \infty$…
▽ More
We establish the convergences (with respect to the simulation time $t$; the number of particles $N$; the timestep $γ$) of a Moran/Fleming-Viot type particle scheme toward the quasi-stationary distribution of a diffusion on the $d$-dimensional torus, killed at a smooth rate. In these conditions, quantitative bounds are obtained that, for each parameter ($t\rightarrow \infty$, $N\rightarrow \infty$ or $γ\rightarrow 0$) are independent from the two others.
△ Less
Submitted 20 October, 2020; v1 submitted 11 October, 2019;
originally announced October 2019.
-
Analysis of an Adaptive Biasing Force method based on self-interacting dynamics
Authors:
Michel Benaïm,
Charles-Edouard Bréhier,
Pierre Monmarché
Abstract:
This article fills a gap in the mathematical analysis of Adaptive Biasing algorithms, which are extensively used in molecular dynamics computations. Given a reaction coordinate, ideally, the bias in the overdamped Langevin dynamics would be given by the gradient of the associated free energy function, which is unknown. We consider an adaptive biased version of the overdamped dynamics, where the bi…
▽ More
This article fills a gap in the mathematical analysis of Adaptive Biasing algorithms, which are extensively used in molecular dynamics computations. Given a reaction coordinate, ideally, the bias in the overdamped Langevin dynamics would be given by the gradient of the associated free energy function, which is unknown. We consider an adaptive biased version of the overdamped dynamics, where the bias depends on the past of the trajectory and is designed to approximate the free energy.
The main result of this article is the consistency and efficiency of this approach. More precisely we prove the almost sure convergence of the bias as time goes to infinity, and that the limit is close to the ideal bias, as an auxiliary parameter of the algorithm goes to $0$.
The proof is based on interpreting the process as a self-interacting dynamics, and on the study of a non-trivial fixed point problem for the limiting flow obtained using the ODE method.
△ Less
Submitted 10 October, 2019;
originally announced October 2019.
-
Simulated Annealing In $\mathbf{R}^d$ With Slowly Growing Potentials
Authors:
Nicolas Fournier,
Pierre Monmarché,
Camille Tardif
Abstract:
We use a localization procedure to weaken the growth assumptions of Royer [8], Miclo [4] and Zitt [9] concerning the continuous-time simulated annealing in $\mathbf{R}^d$. We show that a transition occurs for potentials growing like $a \log \log |x|$ at infinity. We also study a class of potentials with possibly unbounded sets of local minima.
We use a localization procedure to weaken the growth assumptions of Royer [8], Miclo [4] and Zitt [9] concerning the continuous-time simulated annealing in $\mathbf{R}^d$. We show that a transition occurs for potentials growing like $a \log \log |x|$ at infinity. We also study a class of potentials with possibly unbounded sets of local minima.
△ Less
Submitted 17 September, 2020; v1 submitted 4 September, 2019;
originally announced September 2019.
-
Kinetic walks for sampling
Authors:
Pierre Monmarché
Abstract:
The persistent walk is a classical model in kinetic theory, which has also been studied as a toy model for MCMC questions. Its continuous limit, the telegraph process, has recently been extended to various velocity jump processes (Bouncy Particle Sampler, Zig-Zag process, etc.) in order to sample general target distributions on $\mathbb R^d$. This paper studies, from a sampling point of view, gene…
▽ More
The persistent walk is a classical model in kinetic theory, which has also been studied as a toy model for MCMC questions. Its continuous limit, the telegraph process, has recently been extended to various velocity jump processes (Bouncy Particle Sampler, Zig-Zag process, etc.) in order to sample general target distributions on $\mathbb R^d$. This paper studies, from a sampling point of view, general kinetic walks that are natural discrete-time (and possibly discrete-space) counterparts of these continuous-space processes. The main contributions of the paper are the definition and study of a discrete-space Zig-Zag sampler and the definition and time-discretisation of hybrid jump/diffusion kinetic samplers for multi-scale potentials on $\mathbb R^d$.
△ Less
Submitted 18 February, 2020; v1 submitted 1 March, 2019;
originally announced March 2019.
-
Elementary coupling approach for non-linear perturbation of Markov processes with mean-field jump mechanims and related problems
Authors:
Pierre Monmarché
Abstract:
Mean-field integro-differential equations are studied in an abstract framework, through couplings of the corresponding stochastic processes. In the perturbative regime, the equation is proven to admit a unique equilibrium, toward which the process converges exponentially fast. Similarly, in this case, the associated particle system is proven to converge toward its equilibrium at a rate independent…
▽ More
Mean-field integro-differential equations are studied in an abstract framework, through couplings of the corresponding stochastic processes. In the perturbative regime, the equation is proven to admit a unique equilibrium, toward which the process converges exponentially fast. Similarly, in this case, the associated particle system is proven to converge toward its equilibrium at a rate independent from the number of particles.
△ Less
Submitted 13 January, 2023; v1 submitted 28 September, 2018;
originally announced September 2018.
-
Piecewise Deterministic Markov Processes and their invariant measures
Authors:
Alain Durmus,
Arnaud Guillin,
Pierre Monmarché
Abstract:
Piecewise Deterministic Markov Processes (PDMPs) are studied in a general framework. First, different constructions are proven to be equivalent. Second, we introduce a coupling between two PDMPs following the same differential flow which implies quantitative bounds on the total variation between the marginal distributions of the two processes. Finally two results are established regarding the inva…
▽ More
Piecewise Deterministic Markov Processes (PDMPs) are studied in a general framework. First, different constructions are proven to be equivalent. Second, we introduce a coupling between two PDMPs following the same differential flow which implies quantitative bounds on the total variation between the marginal distributions of the two processes. Finally two results are established regarding the invariant measures of PDMPs. A practical condition to show that a probability measure is invariant for the associated PDMP semi-group is presented. In a second time, a bound on the invariant probability measures in $V$-norm of two PDMPs following the same differential flow is established. This last result is then applied to study the asymptotic bias of some non-exact PDMP MCMC methods.
△ Less
Submitted 2 August, 2021; v1 submitted 14 July, 2018;
originally announced July 2018.
-
Geometric ergodicity of the bouncy particle sampler
Authors:
Alain Durmus,
Arnaud Guillin,
Pierre Monmarché
Abstract:
The Bouncy Particle Sampler (BPS) is a Monte Carlo Markov Chain algorithm to sample from a target density known up to a multiplicative constant. This method is based on a kinetic piecewise deterministic Markov process for which the target measure is invariant. This paper deals with theoretical properties of BPS. First, we establish geometric ergodicity of the associated semi-group under weaker con…
▽ More
The Bouncy Particle Sampler (BPS) is a Monte Carlo Markov Chain algorithm to sample from a target density known up to a multiplicative constant. This method is based on a kinetic piecewise deterministic Markov process for which the target measure is invariant. This paper deals with theoretical properties of BPS. First, we establish geometric ergodicity of the associated semi-group under weaker conditions than in [10] both on the target distribution and the velocity probability distribution. This result is based on a new coupling of the process which gives a quantitative minorization condition and yields more insights on the convergence. In addition, we study on a toy model the dependency of the convergence rates on the dimension of the state space. Finally, we apply our results to the analysis of simulated annealing algorithms based on BPS.
△ Less
Submitted 15 November, 2019; v1 submitted 14 July, 2018;
originally announced July 2018.
-
Entropic multipliers method for langevin diffusion and weighted log sobolev inequalities
Authors:
Patrick Cattiaux,
Arnaud Guillin,
Pierre Monmarché,
Chaoen Zhang
Abstract:
In his work about hypocercivity, Villani [18] considers in particular convergence to equilibrium for the kinetic Langevin process. While his convergence results in L 2 are given in a quite general setting, convergence in entropy requires some boundedness condition on the Hessian of the Hamiltonian. We will show here how to get rid of this assumption in the study of the hypocoercive entropic relaxa…
▽ More
In his work about hypocercivity, Villani [18] considers in particular convergence to equilibrium for the kinetic Langevin process. While his convergence results in L 2 are given in a quite general setting, convergence in entropy requires some boundedness condition on the Hessian of the Hamiltonian. We will show here how to get rid of this assumption in the study of the hypocoercive entropic relaxation to equilibrium for the Langevin diffusion. Our method relies on a generalization to entropy of the multipliers method and an adequate functional inequality. As a byproduct, we also give tractable conditions for this functional inequality, which is a particular instance of a weighted logarithmic Sobolev inequality, to hold.
△ Less
Submitted 3 August, 2017;
originally announced August 2017.
-
A note on Fisher Information hypocoercive decay for the linear Boltzmann equation
Authors:
Pierre Monmarché
Abstract:
This note deals with the linear Boltzmann equation in the non-compact setting with a confining potential which is close to quadratic. We prove that in this case, starting from a smooth initial datum, the Fisher Information (hence, the relative entropy) with respect to the stationary state converges exponentially fast to zero.
This note deals with the linear Boltzmann equation in the non-compact setting with a confining potential which is close to quadratic. We prove that in this case, starting from a smooth initial datum, the Fisher Information (hence, the relative entropy) with respect to the stationary state converges exponentially fast to zero.
△ Less
Submitted 27 October, 2020; v1 submitted 30 March, 2017;
originally announced March 2017.
-
Weakly self-interacting velocity jump processes for bacterial chemotaxis and adaptive algorithms
Authors:
Pierre Monmarché
Abstract:
A self-interacting velocity jump process is introduced, which behaves in large time similarly to the corresponding self-interacting diffusion, namely the evolution of its normalized occupation measure approaches a deterministic flow.
A self-interacting velocity jump process is introduced, which behaves in large time similarly to the corresponding self-interacting diffusion, namely the evolution of its normalized occupation measure approaches a deterministic flow.
△ Less
Submitted 31 October, 2017; v1 submitted 31 January, 2017;
originally announced January 2017.
-
Strongly self-interacting processes on the circle
Authors:
Carl-Erik Gauthier,
Pierre Monmarché
Abstract:
The purpose of this paper is to investigate the long time behaviour for a self-interacting diffusion and a self-interacting velocity jump process. While the diffusion case has already been studied for some particular potential function, the second one, which belongs to the family of piecewise deterministic processes, is new.
Depending on the underlying potential function's shape, we prove either…
▽ More
The purpose of this paper is to investigate the long time behaviour for a self-interacting diffusion and a self-interacting velocity jump process. While the diffusion case has already been studied for some particular potential function, the second one, which belongs to the family of piecewise deterministic processes, is new.
Depending on the underlying potential function's shape, we prove either the almost sure convergence or the recurrence for a natural extended process given by a change a variable.
△ Less
Submitted 31 January, 2019; v1 submitted 9 June, 2016;
originally announced June 2016.
-
Optimal linear drift for the speed of convergence of an hypoelliptic diffusion
Authors:
Arnaud Guillin,
Pierre Monmarché
Abstract:
Among all generalized Ornstein-Uhlenbeck processes which sample the same invariant measure and for which the same amount of randomness (a $N$-dimensional Brownian motion) is injected in the system, we prove that the asymptotic rate of convergence is maximized by a non-reversible hypoelliptic one.
Among all generalized Ornstein-Uhlenbeck processes which sample the same invariant measure and for which the same amount of randomness (a $N$-dimensional Brownian motion) is injected in the system, we prove that the asymptotic rate of convergence is maximized by a non-reversible hypoelliptic one.
△ Less
Submitted 5 October, 2021; v1 submitted 25 April, 2016;
originally announced April 2016.