-
Heat flow, log-concavity, and Lipschitz transport maps
Authors:
Giovanni Brigati,
Francesco Pedrotti
Abstract:
In this paper we derive estimates for the Hessian of the logarithm (log-Hessian) for solutions to the heat equation. For initial data in the form of log-Lipschitz perturbation of strongly log-concave measures, the log-Hessian admits an explicit, uniform (in space) lower bound. This yields a new estimate for the Lipschitz constant of a transport map pushing forward the standard Gaussian to a measur…
▽ More
In this paper we derive estimates for the Hessian of the logarithm (log-Hessian) for solutions to the heat equation. For initial data in the form of log-Lipschitz perturbation of strongly log-concave measures, the log-Hessian admits an explicit, uniform (in space) lower bound. This yields a new estimate for the Lipschitz constant of a transport map pushing forward the standard Gaussian to a measure in this class. Further connections are discussed with score-based diffusion models and improved Gaussian logarithmic Sobolev inequalities. Finally, we show that assuming only fast decay of the tails of the initial datum does not suffice to guarantee uniform log-Hessian upper bounds.
△ Less
Submitted 6 May, 2024; v1 submitted 23 April, 2024;
originally announced April 2024.
-
$L^\infty$-optimal transport of anisotropic log-concave measures and exponential convergence in Fisher's infinitesimal model
Authors:
Ksenia A. Khudiakova,
Jan Maas,
Francesco Pedrotti
Abstract:
We prove upper bounds on the $L^\infty$-Wasserstein distance from optimal transport between strongly log-concave probability densities and log-Lipschitz perturbations. In the simplest setting, such a bound amounts to a transport-information inequality involving the $L^\infty$-Wasserstein metric and the relative $L^\infty$-Fisher information. We show that this inequality can be sharpened significan…
▽ More
We prove upper bounds on the $L^\infty$-Wasserstein distance from optimal transport between strongly log-concave probability densities and log-Lipschitz perturbations. In the simplest setting, such a bound amounts to a transport-information inequality involving the $L^\infty$-Wasserstein metric and the relative $L^\infty$-Fisher information. We show that this inequality can be sharpened significantly in situations where the involved densities are anisotropic. Our proof is based on probabilistic techniques using Langevin dynamics. As an application of these results, we obtain sharp exponential rates of convergence in Fisher's infinitesimal model from quantitative genetics, generalising recent results by Calvez, Poyato, and Santambrogio in dimension 1 to arbitrary dimensions.
△ Less
Submitted 6 February, 2024;
originally announced February 2024.
-
Contractive coupling rates and curvature lower bounds for Markov chains
Authors:
Francesco Pedrotti
Abstract:
Contractive coupling rates have been recently introduced by Conforti as a tool to establish convex Sobolev inequalities (including modified log-Sobolev and Poincaré inequality) for some classes of Markov chains. In this work, we show how contractive coupling rates can also be used to prove stronger inequalities, in the form of curvature lower bounds for Markov chains and geodesic convexity of entr…
▽ More
Contractive coupling rates have been recently introduced by Conforti as a tool to establish convex Sobolev inequalities (including modified log-Sobolev and Poincaré inequality) for some classes of Markov chains. In this work, we show how contractive coupling rates can also be used to prove stronger inequalities, in the form of curvature lower bounds for Markov chains and geodesic convexity of entropic functionals. We illustrate this in several examples discussed by Conforti, where in particular, after appropriately choosing a parameter function, we establish positive curvature in the entropic and (discrete) Bakry--Émery sense. In addition, we recall and give straightforward generalizations of some notions of coarse Ricci curvature, and we discuss some of their properties and relations with the concepts of couplings and coupling rates: as an application, we show exponential contraction of the $p$-Wasserstein distance for the heat flow in the aforementioned examples.
△ Less
Submitted 1 August, 2023;
originally announced August 2023.
-
Improved Convergence of Score-Based Diffusion Models via Prediction-Correction
Authors:
Francesco Pedrotti,
Jan Maas,
Marco Mondelli
Abstract:
Score-based generative models (SGMs) are powerful tools to sample from complex data distributions. Their underlying idea is to (i) run a forward process for time $T_1$ by adding noise to the data, (ii) estimate its score function, and (iii) use such estimate to run a reverse process. As the reverse process is initialized with the stationary distribution of the forward one, the existing analysis pa…
▽ More
Score-based generative models (SGMs) are powerful tools to sample from complex data distributions. Their underlying idea is to (i) run a forward process for time $T_1$ by adding noise to the data, (ii) estimate its score function, and (iii) use such estimate to run a reverse process. As the reverse process is initialized with the stationary distribution of the forward one, the existing analysis paradigm requires $T_1\to\infty$. This is however problematic: from a theoretical viewpoint, for a given precision of the score approximation, the convergence guarantee fails as $T_1$ diverges; from a practical viewpoint, a large $T_1$ increases computational costs and leads to error propagation. This paper addresses the issue by considering a version of the popular predictor-corrector scheme: after running the forward process, we first estimate the final distribution via an inexact Langevin dynamics and then revert the process. Our key technical contribution is to provide convergence guarantees which require to run the forward process only for a fixed finite time $T_1$. Our bounds exhibit a mild logarithmic dependence on the input dimension and the subgaussian norm of the target distribution, have minimal assumptions on the data, and require only to control the $L^2$ loss on the score approximation, which is the quantity minimized in practice.
△ Less
Submitted 4 June, 2024; v1 submitted 23 May, 2023;
originally announced May 2023.
-
Local Conditions for Global Convergence of Gradient Flows and Proximal Point Sequences in Metric Spaces
Authors:
Lorenzo Dello Schiavo,
Jan Maas,
Francesco Pedrotti
Abstract:
This paper deals with local criteria for the convergence to a global minimiser for gradient flow trajectories and their discretisations. To obtain quantitative estimates on the speed of convergence, we consider variations on the classical Kurdyka--Łojasiewicz inequality for a large class of parameter functions. Our assumptions are given in terms of the initial data, without any reference to an equ…
▽ More
This paper deals with local criteria for the convergence to a global minimiser for gradient flow trajectories and their discretisations. To obtain quantitative estimates on the speed of convergence, we consider variations on the classical Kurdyka--Łojasiewicz inequality for a large class of parameter functions. Our assumptions are given in terms of the initial data, without any reference to an equilibrium point. The main results are convergence statements for gradient flow curves and proximal point sequences to a global minimiser, together with sharp quantitative estimates on the speed of convergence. These convergence results apply in the general setting of lower semicontinuous functionals on complete metric spaces, generalising recent results for smooth functionals on $\mathbb{R}^n$. While the non-smooth setting covers very general spaces, it is also useful for (non)-smooth functionals on $\mathbb{R}^n$.
△ Less
Submitted 11 April, 2023;
originally announced April 2023.