Search | arXiv e-print repository

Entropy-regularized optimal transport over networks with incomplete marginals information

Authors: Aayan Masood Pathan, Michele Pavon

Abstract: We study in this paper optimal mass transport over a strongly connected, directed graph on a given discrete time interval. Differently from previous literature, we do not assume full knowledge of the initial and final goods distribution over the network nodes. In spite of the meager information, we show that it is possible to characterize the most likely flow in two important cases: The first one… ▽ More We study in this paper optimal mass transport over a strongly connected, directed graph on a given discrete time interval. Differently from previous literature, we do not assume full knowledge of the initial and final goods distribution over the network nodes. In spite of the meager information, we show that it is possible to characterize the most likely flow in two important cases: The first one is when the initial and/or final distribution is only known on proper subsets of the nodes. The second case is when only some moments of the marginal distributions are known. As an important by-product, we determine the most likely initial and final marginals on the whole state space. △ Less

Submitted 30 March, 2024; originally announced April 2024.

MSC Class: 60J10

arXiv:2311.06824 [pdf, ps, other]

On the rate of change of Varentropy for Markov diffusion processes

Authors: Michele Pavon

Abstract: Relying on the reverse-time space-time harmonic property of the ratio of two solutions of the Fokker-Plank equation, we establish an explicit formula for derivative of the \emph{varentropy} for a Markov diffusion process. The formula involves a nonlinear function of the {\em local free energy} $\ln(p_t(x)/\bar{p}(x))$.We then verify that our formula yields the correct result in the simple case of… ▽ More Relying on the reverse-time space-time harmonic property of the ratio of two solutions of the Fokker-Plank equation, we establish an explicit formula for derivative of the \emph{varentropy} for a Markov diffusion process. The formula involves a nonlinear function of the {\em local free energy} $\ln(p_t(x)/\bar{p}(x))$.We then verify that our formula yields the correct result in the simple case of a scalar Gaussian diffusion. In the latter case, varentropy is exponentially decaying to zero. △ Less

Submitted 12 November, 2023; originally announced November 2023.

arXiv:2307.05103 [pdf, other]

Control and estimation of multi-commodity network flow under aggregation

Authors: Yongxin Chen, Tryphon T. Georgiou, Michele Pavon

Abstract: A paradigm put forth by E. Schrödinger in 1931/32, known as Schrödinger bridges, represents a formalism to pose and solve control and estimation problems seeking a perturbation from an initial control schedule (in the case of control), or from a prior probability law (in the case of estimation), sufficient to reconcile data in the form of marginal distributions and minimal in the sense of relative… ▽ More A paradigm put forth by E. Schrödinger in 1931/32, known as Schrödinger bridges, represents a formalism to pose and solve control and estimation problems seeking a perturbation from an initial control schedule (in the case of control), or from a prior probability law (in the case of estimation), sufficient to reconcile data in the form of marginal distributions and minimal in the sense of relative entropy to the prior. In the same spirit, we consider traffic-flow and apply a Schrödinger-type dictum, to perturb minimally with respect to a suitable relative entropy functional a prior schedule/law so as to reconcile the traffic flow with scarce aggregate distributions on families of indistinguishable individuals. Specifically, we consider the problem to regulate/estimate multi-commodity network flow rates based only on empirical distributions of commodities being transported (e.g., types of vehicles through a network, in motion) at two given times. Thus, building on Schrödinger's large deviation rationale, we develop a method to identify {\em the most likely flow rates (traffic flow)}, given prior information and aggregate observations. Our method further extends the Schrödinger bridge formalism to the multi-commodity setting, allowing commodities to exit or enter the flow field as well (e.g., vehicles to enter and stop and park) at any time. The behavior of entering or exiting the flow field, by commodities or vehicles, is modeled by a Markov chains with killing and creation states. Our method is illustrated with a numerical experiment. △ Less

Submitted 11 July, 2023; originally announced July 2023.

Comments: 12 pages, 5 figures

MSC Class: 93E20; 90B10; 90C35; 90B06; 15B48; 97M40; 05C81; 82C41

arXiv:2204.13049 [pdf, ps, other]

On local entropy, stochastic control and deep neural networks

Authors: Michele Pavon

Abstract: In this paper, we connect some recent papers on smoothing of energy landscapes and scored-based generative models of machine learning to classical work in stochastic control. We clarify these connections providing rigorous statements and representations which may serve as guidelines for further learning models. In this paper, we connect some recent papers on smoothing of energy landscapes and scored-based generative models of machine learning to classical work in stochastic control. We clarify these connections providing rigorous statements and representations which may serve as guidelines for further learning models. △ Less

Submitted 11 July, 2022; v1 submitted 27 April, 2022; originally announced April 2022.

arXiv:2109.14338 [pdf, other]

Harmonic Analysis of Social Cognition

Authors: Anne Maass, Michele Pavon, Caterina Suitner

Abstract: In this paper, we argue that some fundamental concepts and tools of signal processing may be effectively applied to represent and interpret social cognition processes. From this viewpoint, individuals or, more generally, social stimuli are thought of as a weighted sum of harmonics with different frequencies: Low frequencies represent general categories such as gender, ethnic group, nationality, et… ▽ More In this paper, we argue that some fundamental concepts and tools of signal processing may be effectively applied to represent and interpret social cognition processes. From this viewpoint, individuals or, more generally, social stimuli are thought of as a weighted sum of harmonics with different frequencies: Low frequencies represent general categories such as gender, ethnic group, nationality, etc., whereas high frequencies account for personal characteristics. Individuals are then seen by observers as the output of a filter that emphasizes a certain range of high or low frequencies. The selection of the filter depends on the social distance between the observing individual or group and the person being observed as well as on motivation, cognitive resources and cultural background. Enhancing low- or high-frequency harmonics is not on equal footing, the latter requiring supplementary energy. This mirrors a well-known property of signal processing filters. More generally, in the light of this correspondence, we show that several established results of social cognition admit a natural interpretation and integration in the signal processing language. While the potential of this connection between an area of social psychology and one of information engineering appears considerable (compression, information retrieval, filtering, feedback, feedforward, sampling, aliasing, etc.), in this paper we shall limit ourselves to laying down what we consider the pillars of this bridge on which future research may be founded. △ Less

Submitted 29 September, 2021; originally announced September 2021.

arXiv:2108.02879 [pdf, other]

The most likely evolution of diffusing and vanishing particles: Schrodinger Bridges with unbalanced marginals

Authors: Yongxin Chen, Tryphon T. Georgiou, Michele Pavon

Abstract: Stochastic flows of an advective-diffusive nature are ubiquitous in physical sciences. Of particular interest is the problem to reconcile observed marginal distributions with a given prior posed by E. Schrodinger in 1932/32 and known as the Schrodinger Bridge Problem (SBP). Due to its fundamental significance, interest in SBP has in recent years enticed a broad spectrum of disciplines. Yet, while… ▽ More Stochastic flows of an advective-diffusive nature are ubiquitous in physical sciences. Of particular interest is the problem to reconcile observed marginal distributions with a given prior posed by E. Schrodinger in 1932/32 and known as the Schrodinger Bridge Problem (SBP). Due to its fundamental significance, interest in SBP has in recent years enticed a broad spectrum of disciplines. Yet, while the mathematics and applications of SBP have been develo** at a considerable pace, accounting for marginals of unequal mass has received scant attention; the problem to interpolate between unbalanced marginals has been approached by introducing source/sink terms in an Adhoc manner. Nevertheless, losses are inherent in many physical processes and, thereby, models that account for lossy transport may also need to be reconciled with observed marginals following Schrodinger's dictum; that is, to adjust the probability of trajectories of particles, including those that do not make it to the terminal observation point, so that the updated law represents the most likely way that particles may have been transported, or vanished, at some intermediate point. Thus, the purpose of this work is to develop such a natural generalization of the SBP for stochastic evolution with losses, whereupon particles are "killed" according to a probabilistic law. Through a suitable embedding, we turn the problem into an SBP for stochastic processes that combine diffusive and jump characteristics. Then, following a large-deviations formalism, given a prior law that allows for losses, we ask for the most probable evolution of particles along with the most likely killing rate as the particles transition between the specified marginals. Our approach differs sharply from previous work involving a Feynman-Kac multiplicative reweighing of the reference measure: The latter, as we argue, is far from Schrodinger's quest. △ Less

Submitted 5 August, 2021; originally announced August 2021.

Comments: 22 pages, 4 figures

MSC Class: 49Q22; 47B93; 60F10; 82Cxx; 93Exx

arXiv:2102.12628 [pdf, ps, other]

Optimal steering to invariant distributions for networks flows

Authors: Yongxin Chen, Tryphon T. Georgiou, Michele Pavon

Abstract: We derive novel results on the ergodic theory of irreducible, aperiodic Markov chains. We show how to optimally steer the network flow to a stationary distribution over a finite or infinite time horizon. Optimality is with respect to an entropic distance between distributions on feasible paths. When the prior is reversible, it shown that solutions to this discrete time and space steering problem a… ▽ More We derive novel results on the ergodic theory of irreducible, aperiodic Markov chains. We show how to optimally steer the network flow to a stationary distribution over a finite or infinite time horizon. Optimality is with respect to an entropic distance between distributions on feasible paths. When the prior is reversible, it shown that solutions to this discrete time and space steering problem are reversible as well. A notion of temperature is defined for Boltzmann distributions on networks, and problems analogous to cooling (in this case, for evolutions in discrete space and time) are discussed. △ Less

Submitted 24 February, 2021; originally announced February 2021.

Comments: 7 pages

MSC Class: 93E20; 90B10

arXiv:2006.10000 [pdf, other]

Regularized transport between singular covariance matrices

Authors: Valentina Ciccone, Yongxin Chen, Tryphon T. Georgiou, Michele Pavon

Abstract: We consider the problem of steering a linear stochastic system between two end-point degenerate Gaussian distributions in finite time. This accounts for those situations in which some but not all of the state entries are uncertain at the initial, t = 0, and final time, t = T . This problem entails non-trivial technical challenges as the singularity of terminal state-covariance causes the control t… ▽ More We consider the problem of steering a linear stochastic system between two end-point degenerate Gaussian distributions in finite time. This accounts for those situations in which some but not all of the state entries are uncertain at the initial, t = 0, and final time, t = T . This problem entails non-trivial technical challenges as the singularity of terminal state-covariance causes the control to grow unbounded at the final time T. Consequently, the entropic interpolation (Schroedinger Bridge) is provided by a diffusion process which is not finite-energy, thereby placing this case outside of most of the current theory. In this paper, we show that a feasible interpolation can be derived as a limiting case of earlier results for non-degenerate cases, and that it can be expressed in closed form. Moreover, we show that such interpolation belongs to the same reciprocal class of the uncontrolled evolution. By doing so we also highlight a time-symmetry of the problem, contrasting dual formulations in the forward and reverse time-directions, where in each the control grows unbounded as time approaches the end-point (in the forward and reverse time-direction, respectively). △ Less

Submitted 17 June, 2020; originally announced June 2020.

Comments: 8 pages

MSC Class: 93E20

arXiv:2005.10963 [pdf, other]

Stochastic control liaisons: Richard Sinkhorn meets Gaspard Monge on a Schroedinger bridge

Authors: Yongxin Chen, Tryphon T. Georgiou, Michele Pavon

Abstract: In 1931/32, Schroedinger studied a hot gas Gedankenexperiment, an instance of large deviations of the empirical distribution and an early example of the so-called maximum entropy inference method. This so-called Schroedinger bridge problem (SBP) was recently recognized as a regularization of the Monge-Kantorovich Optimal Mass Transport (OMT), leading to effective computation of the latter. Specifi… ▽ More In 1931/32, Schroedinger studied a hot gas Gedankenexperiment, an instance of large deviations of the empirical distribution and an early example of the so-called maximum entropy inference method. This so-called Schroedinger bridge problem (SBP) was recently recognized as a regularization of the Monge-Kantorovich Optimal Mass Transport (OMT), leading to effective computation of the latter. Specifically, OMT with quadratic cost may be viewed as a zero-temperature limit of SBP, which amounts to minimization of the Helmholtz's free energy over probability distributions constrained to possess given marginals. The problem features a delicate compromise, mediated by a temperature parameter, between minimizing the internal energy and maximizing the entropy. These concepts are central to a rapidly expanding area of modern science dealing with the so-called {\em Sinkhorn algorithm} which appears as a special case of an algorithm first studied by the French analyst Robert Fortet in 1938/40 specifically for Schroedinger bridges. Due to the constraint on end-point distributions, dynamic programming is not a suitable tool to attack these problems. Instead, Fortet's iterative algorithm and its discrete counterpart, the Sinkhorn iteration, permit computation by iteratively solving the so-called {\em Schroedinger system}. In both the continuous as well as the discrete-time and space settings, {\em stochastic control} provides a reformulation and dynamic versions of these problems. The formalism behind these control problems have attracted attention as they lead to a variety of new applications in spacecraft guidance, control of robot or biological swarms, sensing, active cooling, network routing as well as in computer and data science. This multifacet and versatile framework, intertwining SBP and OMT, provides the substrate for a historical and technical overview of the field taken up in this paper. △ Less

Submitted 26 November, 2020; v1 submitted 21 May, 2020; originally announced May 2020.

Comments: 67 pages

MSC Class: 93E20; 60F10; 35Qxx; 90C25; 49J45; 90B06; 49M99

arXiv:1909.05468 [pdf, ps, other]

Covariance steering in zero-sum linear-quadratic two-player differential games

Authors: Yongxin Chen, Tryphon T. Georgiou, Michele Pavon

Abstract: We formulate a new class of two-person zero-sum differential games, in a stochastic setting, where a specification on a target terminal state distribution is imposed on the players. We address such added specification by introducing incentives to the game that guides the players to steer the join distribution accordingly. In the present paper, we only address linear quadratic games with Gaussian t… ▽ More We formulate a new class of two-person zero-sum differential games, in a stochastic setting, where a specification on a target terminal state distribution is imposed on the players. We address such added specification by introducing incentives to the game that guides the players to steer the join distribution accordingly. In the present paper, we only address linear quadratic games with Gaussian target distribution. The solution is characterized by a coupled Riccati equations system, resembling that in the standard linear quadratic differential games. Indeed, once the incentive function is calculated, our problem reduces to a standard one. Tthe framework developed in this paper extends previous results in covariance control, a fast growing research area. On the numerical side, problems herein are reformulated as convex-concave minimax problems for which efficient and reliable algorithms are available. △ Less

Submitted 12 September, 2019; originally announced September 2019.

Comments: 10 pages

MSC Class: 93E20; 49N70; 91A10; 60G99

arXiv:1903.03638 [pdf]

Controlled and Uncontrolled Stochastic Norton-Simon-Massagué Tumor Growth Models

Authors: Zehor Belkhatir, Michele Pavon, James C. Mathews, Maryam Pouryahya, Joseph O. Deasy, Larry Norton, Allen R. Tannenbaum

Abstract: Tumorigenesis is a complex process that is heterogeneous and affected by numerous sources of variability. This study presents a stochastic extension of a biologically grounded tumor growth model, referred to as the Norton-Simon-Massagué (NSM) tumor growth model. We first study the uncontrolled version of the model where the effect of chemotherapeutic drug agent is absent. Conditions on the model's… ▽ More Tumorigenesis is a complex process that is heterogeneous and affected by numerous sources of variability. This study presents a stochastic extension of a biologically grounded tumor growth model, referred to as the Norton-Simon-Massagué (NSM) tumor growth model. We first study the uncontrolled version of the model where the effect of chemotherapeutic drug agent is absent. Conditions on the model's parameters are derived to guarantee the positivity of the tumor volume and hence the validity of the proposed stochastic NSM model. To calibrate the proposed model we utilize a maximum likelihood-based estimation algorithm and population mixed-effect modeling formulation. The algorithm is tested by fitting previously published tumor volume mice data. Then, we study the controlled version of the model which includes the effect of chemotherapy treatment. Analysis of the influence of adding the control drug agent into the model and how sensitive it is to the stochastic parameters is performed both in open-loop and closed-loop viewpoints through different numerical simulations. △ Less

Submitted 8 March, 2019; originally announced March 2019.

arXiv:1903.00525 [pdf, other]

Optimal steering for non-Markovian Gaussian processes

Authors: Daniele Alpago, Yongxin Chen, Tryphon Georgiou, Michele Pavon

Abstract: At present, the problem to steer a non-Markovian process with minimum energy between specified end-point marginal distributions remains unsolved. Herein, we consider the special case for a non-Markovian process y(t) which, however, assumes a finite-dimensional stochastic realization with a Markov state process that is fully observable. In this setting, and over a finite time horizon [0,T], we dete… ▽ More At present, the problem to steer a non-Markovian process with minimum energy between specified end-point marginal distributions remains unsolved. Herein, we consider the special case for a non-Markovian process y(t) which, however, assumes a finite-dimensional stochastic realization with a Markov state process that is fully observable. In this setting, and over a finite time horizon [0,T], we determine an optimal (least) finite-energy control law that steers the stochastic system to a final distribution that is compatible with a specified distribution for the terminal output process y(T); the solution is given in closed-form. This work provides a key step towards the important problem to steer a stochastic system based on partial observations of the state (i.e., an output process) corrupted by noise, which will be the subject of forthcoming work. △ Less

Submitted 1 March, 2019; originally announced March 2019.

Comments: 5 pages, 2 figures

arXiv:1806.01364 [pdf, other]

The data-driven Schroedinger bridge

Authors: Michele Pavon, Esteban G Tabak, Giulio Trigila

Abstract: Erwin Schroedinger posed, and to a large extent solved in 1931/32 the problem of finding the most likely random evolution between two continuous probability distributions. This article considers this problem in the case when only samples of the two distributions are available. A novel iterative procedure is proposed, inspired by Fortet-Sinkhorn type algorithms. Since only samples of the marginals… ▽ More Erwin Schroedinger posed, and to a large extent solved in 1931/32 the problem of finding the most likely random evolution between two continuous probability distributions. This article considers this problem in the case when only samples of the two distributions are available. A novel iterative procedure is proposed, inspired by Fortet-Sinkhorn type algorithms. Since only samples of the marginals are available, the new approach features constrained maximum likelihood estimation in place of the nonlinear boundary couplings, and importance sampling to propagate the functions $\varphi$ and $\hat{\varphi}$ solving the Schroedinger system. This method is well-suited to high-dimensional settings, where introducing grids leads to numerically unfeasible or unreliable methods. The methodology is illustrated in two applications: entropic interpolation of two-dimensional Gaussian mixtures, and the estimation of integrals through a variation of importance sampling. △ Less

Submitted 5 June, 2018; v1 submitted 4 June, 2018; originally announced June 2018.

arXiv:1805.02695 [pdf, ps, other]

Traversing the Schroedinger Bridge strait: Robert Fortet's marvelous proof redux

Authors: Montacer Essid, Michele Pavon

Abstract: In the early 1930's, Erwin Schroedinger, motivated by his quest for a more classical formulation of quantum mechanics, posed a large deviation problem for a cloud of independent Brownian particles. He showed that the solution to the problem could be obtained trough a system of two linear equations with nonlinear coupling at the boundary (Schrödinger system). Existence and uniqueness for such a sys… ▽ More In the early 1930's, Erwin Schroedinger, motivated by his quest for a more classical formulation of quantum mechanics, posed a large deviation problem for a cloud of independent Brownian particles. He showed that the solution to the problem could be obtained trough a system of two linear equations with nonlinear coupling at the boundary (Schrödinger system). Existence and uniqueness for such a system, which represents a sort of bottleneck for the problem, was first established by R. Fortet in 1938/40 under rather general assumptions by proving convergence of an ingenious but complex approximation method. It is the first proof of what are nowadays called Sinkhorn-type algorithms in the much more challenging continuous case. Schrödinger bridges are also an early example of the maximum entropy approach and have been more recently recognized as a regularization of the important Optimal Mass Transport problem. Unfortunately, Fortet's contribution is by and large ignored in contemporary literature. This is likely due to the complexity of his approach coupled with an idiosyncratic exposition style and to missing details and steps in the proofs. Nevertheless, Fortet's approach maintains its importance to this day as it provides the only existing algorithmic proof under rather mild assumptions. It can be adapted, in principle, to other relevant problems such as the regularized Wasserstein barycenter problem. It is the purpose of this paper to remedy this situation by rewriting the bulk of his paper with all the missing passages and in a transparent fashion so as to make it fully available to the scientific community. We consider the problem in $R^d$ rather than $R$ and use as much as possible his notation to facilitate comparison. △ Less

Submitted 19 September, 2018; v1 submitted 7 May, 2018; originally announced May 2018.

arXiv:1802.04436 [pdf, ps, other]

Ruelle-Bowen continuous-time random walk

Authors: Yongxin Chen, Tryphon T. Georgiou, Michele Pavon

Abstract: We define the probability structure of a continuous-time time-homogeneous Markov jump process, on a finite graph, that represents the continuous-time counterpart of the so-called Ruelle-Bowen discrete-time random walk. It constitutes the unique jump process having maximal entropy rate. Moreover, it has the property that, given the number of jumps between any two specified end-points on the graph,… ▽ More We define the probability structure of a continuous-time time-homogeneous Markov jump process, on a finite graph, that represents the continuous-time counterpart of the so-called Ruelle-Bowen discrete-time random walk. It constitutes the unique jump process having maximal entropy rate. Moreover, it has the property that, given the number of jumps between any two specified end-points on the graph, the probability of traversing any one of the alternative paths that are consistent with the specified number of jumps and end-points, is the same for all, and thereby depends only on the number of jumps and the end-points and not the particular path being traversed. △ Less

Submitted 12 February, 2018; originally announced February 2018.

Comments: 4 pages

MSC Class: 60Gxx; 68Wxx

arXiv:1801.07852 [pdf, other]

Relaxed Schroedinger bridges and robust network routing

Authors: Yongxin Chen, Tryphon Georgiou, Michele Pavon, Allen Tannenbaum

Abstract: We consider network routing under random link failures with a desired final distribution. We provide a mathematical formulation of a relaxed transport problem where the final distribution only needs to be close to the desired one. The problem is a maximum entropy problem for path distributions with an extra terminal cost. We show that the unique solution may be obtained solving a generalized Schro… ▽ More We consider network routing under random link failures with a desired final distribution. We provide a mathematical formulation of a relaxed transport problem where the final distribution only needs to be close to the desired one. The problem is a maximum entropy problem for path distributions with an extra terminal cost. We show that the unique solution may be obtained solving a generalized Schroedinger system. An iterative algorithm to compute the solution is provided. It contracts the Hilbert metric with contraction ratio less than 1/2 leading to extremely fast convergence. △ Less

Submitted 23 January, 2018; originally announced January 2018.

Comments: 16 pages, 1 figure

arXiv:1712.03578 [pdf, other]

Steering the distribution of agents in mean-field and cooperative games

Authors: Yongxin Chen, Tryphon T. Georgiou, Michele Pavon

Abstract: The purpose of this work is to pose and solve the problem to guide a collection of weakly interacting dynamical systems (agents, particles, etc.) to a specified terminal distribution. The framework is that of mean-field and of cooperative games. A terminal cost is used to accomplish the task; we establish that the map between terminal costs and terminal probability distributions is onto. Our appro… ▽ More The purpose of this work is to pose and solve the problem to guide a collection of weakly interacting dynamical systems (agents, particles, etc.) to a specified terminal distribution. The framework is that of mean-field and of cooperative games. A terminal cost is used to accomplish the task; we establish that the map between terminal costs and terminal probability distributions is onto. Our approach relies on and extends the theory of optimal mass transport and its generalizations. △ Less

Submitted 10 December, 2017; originally announced December 2017.

Comments: 20 pages, 8 figures

MSC Class: 82C22; 91A10; 93E20

arXiv:1712.02257 [pdf, ps, other]

Extremal flows on Wasserstein space

Authors: Giovanni Conforti, Michele Pavon

Abstract: We develop an intrinsic geometric approach to calculus of variations on Wasserstein space. We show that the flows associated to the Schroedinger bridge with general prior, to Optimal Mass Transport and to the Madelung fluid can all be characterized as annihilating the first variation of a suitable action. We then discuss the implications of this unified framework for stochastic mechanics: It entai… ▽ More We develop an intrinsic geometric approach to calculus of variations on Wasserstein space. We show that the flows associated to the Schroedinger bridge with general prior, to Optimal Mass Transport and to the Madelung fluid can all be characterized as annihilating the first variation of a suitable action. We then discuss the implications of this unified framework for stochastic mechanics: It entails, in particular, a sort of fluid-dynamic reconciliation between Bohm's and Nelson's stochastic mechanics. △ Less

Submitted 6 December, 2017; originally announced December 2017.

Comments: 19 pages

arXiv:1712.00680 [pdf, other]

A variational derivation of a class of BFGS-like methods

Authors: Michele Pavon

Abstract: We provide a maximum entropy derivation of a new family of BFGS-like methods. Similar results are then derived for block BFGS methods. This also yields an independent proof of a result of Fletcher 1991 and its generalisation to the block case. We provide a maximum entropy derivation of a new family of BFGS-like methods. Similar results are then derived for block BFGS methods. This also yields an independent proof of a result of Fletcher 1991 and its generalisation to the block case. △ Less

Submitted 8 May, 2018; v1 submitted 2 December, 2017; originally announced December 2017.

Comments: 10 pages

arXiv:1701.07625 [pdf, other]

Efficient-robust routing for single commodity network flows

Authors: Yongxin Chen, Tryphon T. Georgiou, Michele Pavon, Allen Tannenbaum

Abstract: We study single commodity network flows with suitable robustness and efficiency specs. An original use of a maximum entropy problem for distributions on the paths of the graph turns this problem into a steering problem for Markov chains with prescribed initial and final marginals. From a computational standpoint, viewing scheduling this way is especially attractive in light of the existence of an… ▽ More We study single commodity network flows with suitable robustness and efficiency specs. An original use of a maximum entropy problem for distributions on the paths of the graph turns this problem into a steering problem for Markov chains with prescribed initial and final marginals. From a computational standpoint, viewing scheduling this way is especially attractive in light of the existence of an iterative algorithm to compute the solution. The present paper builds on [13] by introducing an index of efficiency of a transportation plan and points, accordingly, to efficient-robust transport policies. In develo** the theory, we establish two new invariance properties of the solution (called bridge) - an iterated bridge invariance property and the invariance of the most probable paths. These properties, which were tangentially mentioned in our previous work, are fully developed here. We also show that the distribution on paths of the optimal transport policy, which depends on a "temperature" parameter, tends to the solution of the "most economical" but possibly less robust optimal mass transport problem as the temperature goes to zero. The relevance of all of these properties for transport over networks is illustrated in an example. △ Less

Submitted 27 September, 2017; v1 submitted 26 January, 2017; originally announced January 2017.

Comments: 18 pages

arXiv:1610.03307 [pdf, ps, other]

Ball Intersection Properties in Metric Spaces

Authors: Benjamin Miesch, Maël Pavón

Abstract: We show that in complete metric spaces, $4$-hyperconvexity is equivalent to finite hyperconvexity. Moreover, every complete, almost $n$-hyperconvex metric space is $n$-hyperconvex. This generalizes among others results of Lindenstrauss and answers questions of Aronszajn-Panitchpakdi. Furthermore, we prove local-to-global results for externally and weakly externally hyperconvex subsets of hyperco… ▽ More We show that in complete metric spaces, $4$-hyperconvexity is equivalent to finite hyperconvexity. Moreover, every complete, almost $n$-hyperconvex metric space is $n$-hyperconvex. This generalizes among others results of Lindenstrauss and answers questions of Aronszajn-Panitchpakdi. Furthermore, we prove local-to-global results for externally and weakly externally hyperconvex subsets of hyperconvex metric spaces and find sufficient conditions in order for those classes of subsets to be convex with respect to a geodesic bicombing. △ Less

Submitted 11 October, 2016; originally announced October 2016.

Comments: 21 pages

arXiv:1608.03622 [pdf, other]

Optimal steering of a linear stochastic system to a final probability distribution, Part III

Authors: Yongxin Chen, Tryphon T. Georgiou, Michele Pavon

Abstract: The subject of this work has its roots in the so called Schroedginer Bridge Problem (SBP) which asks for the most likely distribution of Brownian particles in their passage between observed empirical marginal distributions at two distinct points in time. Renewed interest in this problem was sparked by a reformulation in the language of stochastic control. In earlier works, presented as Part I and… ▽ More The subject of this work has its roots in the so called Schroedginer Bridge Problem (SBP) which asks for the most likely distribution of Brownian particles in their passage between observed empirical marginal distributions at two distinct points in time. Renewed interest in this problem was sparked by a reformulation in the language of stochastic control. In earlier works, presented as Part I and Part II, we explored a generalization of the original SBP that amounts to optimal steering of linear stochastic dynamical systems between state-distributions, at two points in time, under full state feedback. In these works the cost was quadratic in the control input. The purpose of the present work is to detail the technical steps in extending the framework to the case where a quadratic cost in the state is also present. In the zero-noise limit, we obtain the solution of a (deterministic) mass transport problem with general quadratic cost. △ Less

Submitted 11 August, 2016; originally announced August 2016.

Comments: 7 pages, 8 figures

MSC Class: 93E20

arXiv:1603.08129 [pdf, other]

Robust transport over networks

Authors: Yongxin Chen, Tryphon T. Georgiou, Michele Pavon, Allen Tannenbaum

Abstract: We consider transport over a strongly connected, directed graph. The scheduling amounts to selecting transition probabilities for a discrete-time Markov evolution which is designed to be consistent with certain initial and final marginals. The random evolution is selected to be closest to a prior measure on paths in the relative entropy sense, i.e., a Schroedinger bridge between the two marginals.… ▽ More We consider transport over a strongly connected, directed graph. The scheduling amounts to selecting transition probabilities for a discrete-time Markov evolution which is designed to be consistent with certain initial and final marginals. The random evolution is selected to be closest to a prior measure on paths in the relative entropy sense, i.e., a Schroedinger bridge between the two marginals. This is an atypical stochastic control problem where the control consists in suitably modifying the transition mechanism. The prior can incorporate cost of traversing edges or allocate equal probability to all paths of equal length connecting any two given nodes, i.e., a uniform measure on paths. This latter choice relies on the so-called Ruelle-Bowen random walk and gives rise to a scheduling that tends to utilize all paths as uniformly as the topology allows. Thus, when the Ruelle-Bowen law is taken as prior, the transportation plan tends to lessen congestion and ensure a level of robustness. We show that the Ruelle-Bowen law is itself a Schroedinger bridge albeit with a prior that is not a probability measure. The paradigm of Schroedinger bridges as a mechanism for scheduling transport on networks can be adapted to graphs that are not strongly connected as well as to weighted graphs. The latter leads to transportation plans that effect a compromise between robustness and transportation cost. △ Less

Submitted 26 March, 2016; originally announced March 2016.

Comments: 9 pages, 2 figures

MSC Class: 90C35; 90B06; 15B48; 97M40; 05C81; 82C41

arXiv:1601.04891 [pdf, ps, other]

Stochastic control, entropic interpolation and gradient flows on Wasserstein product spaces

Authors: Yongxin Chen, Tryphon Georgiou, Michele Pavon

Abstract: Since the early nineties, it has been observed that the Schroedinger bridge problem can be formulated as a stochastic control problem with atypical boundary constraints. This in turn has a fluid dynamic counterpart where the flow of probability densities represents an entropic interpolation between the given initial and final marginals. In the zero noise limit, such entropic interpolation converge… ▽ More Since the early nineties, it has been observed that the Schroedinger bridge problem can be formulated as a stochastic control problem with atypical boundary constraints. This in turn has a fluid dynamic counterpart where the flow of probability densities represents an entropic interpolation between the given initial and final marginals. In the zero noise limit, such entropic interpolation converges in a suitable sense to the displacement interpolation of optimal mass transport (OMT). We consider two absolutely continuous curves in Wasserstein space ${\cal W}_2$ and study the evolution of the relative entropy on ${\cal W}_2\times {\cal W}_2$ on a finite time interval. Thus, this study differs from previous work in OMT theory concerning relative entropy from a fixed (often equilibrium) distribution (density). We derive a gradient flow on Wasserstein product space. We find the remarkable property that fluxes in the two components are opposite. Plugging in the "steepest descent" into the evolution of the relative entropy we get what appears to be a new formula: The two flows approach each other at a faster rate than that of two solutions of the same Fokker-Planck. We then study the evolution of relative entropy in the case of uncontrolled-controlled diffusions. In two special cases of the Schroedinger bridge problem, we show that such relative entropy may be monotonically decreasing or monotonically increasing. △ Less

Submitted 19 January, 2016; originally announced January 2016.

Comments: 7 pages, submitted to MTNS2016

arXiv:1511.04761 [pdf, ps, other]

Injective Subsets of $l_{\infty}(I)$

Authors: Dominic Descombes, Maël Pavón

Abstract: We give an explicit characterization of all injective subsets of the model space $l_{\infty}(I)$ for a general set $I$, in terms of inequalities involving $1$-Lipschitz functions. Since the class of all injective metric spaces coincides with the one of all absolute $1$-Lipschitz retracts, the present work yields a characterization of all the subsets of $l_{\infty}(I)$ that are absolute $1$-Lipschi… ▽ More We give an explicit characterization of all injective subsets of the model space $l_{\infty}(I)$ for a general set $I$, in terms of inequalities involving $1$-Lipschitz functions. Since the class of all injective metric spaces coincides with the one of all absolute $1$-Lipschitz retracts, the present work yields a characterization of all the subsets of $l_{\infty}(I)$ that are absolute $1$-Lipschitz retracts. △ Less

Submitted 15 November, 2015; originally announced November 2015.

Comments: This generalizes arXiv:1510.04181

arXiv:1507.07795 [pdf, ps, other]

Weakly Externally Hyperconvex Subsets and Hyperconvex Gluings

Authors: Benjamin Miesch, Maël Pavón

Abstract: We give a necessary and sufficient condition for gluings of hyperconvex metric spaces along weakly externally hyperconvex subsets in order that the resulting space be hyperconvex. This leads to a full characterization of gluings of two isometric copies of the same hyperconvex space. Furthermore, we investigate the case of gluings of finite dimensional hyperconvex linear spaces along linear subspac… ▽ More We give a necessary and sufficient condition for gluings of hyperconvex metric spaces along weakly externally hyperconvex subsets in order that the resulting space be hyperconvex. This leads to a full characterization of gluings of two isometric copies of the same hyperconvex space. Furthermore, we investigate the case of gluings of finite dimensional hyperconvex linear spaces along linear subspaces. For this purpose, we characterize the convex polyhedra in $l_\infty^n$ which are weakly externally hyperconvex. △ Less

Submitted 28 July, 2015; originally announced July 2015.

Comments: 22 pages

arXiv:1506.04255 [pdf, other]

Entropic and displacement interpolation: a computational approach using the Hilbert metric

Authors: Yongxin Chen, Tryphon T. Georgiou, Michele Pavon

Abstract: Monge-Kantorovich optimal mass transport (OMT) provides a blueprint for geometries in the space of positive densities -- it quantifies the cost of transporting a mass distribution into another. In particular, it provides natural options for interpolation of distributions (displacement interpolation) and for modeling flows. As such it has been the cornerstone of recent developments in physics, prob… ▽ More Monge-Kantorovich optimal mass transport (OMT) provides a blueprint for geometries in the space of positive densities -- it quantifies the cost of transporting a mass distribution into another. In particular, it provides natural options for interpolation of distributions (displacement interpolation) and for modeling flows. As such it has been the cornerstone of recent developments in physics, probability theory, image processing, time-series analysis, and several other fields. In spite of extensive work and theoretical developments, the computation of OMT for large scale problems has remained a challenging task. An alternative framework for interpolating distributions, rooted in statistical mechanics and large deviations, is that of Schroedinger bridges (entropic interpolation). This may be seen as a stochastic regularization of OMT and can be cast as the stochastic control problem of steering the probability density of the state-vector of a dynamical system between two marginals. In this approach, however, the actual computation of flows had hardly received any attention. In recent work on Schroedinger bridges for Markov chains and quantum evolutions, we noted that the solution can be efficiently obtained from the fixed-point of a map which is contractive in the Hilbert metric. Thus, the purpose of this paper is to show that a similar approach can be taken in the context of diffusion processes which i) leads to a new proof of a classical result on Schroedinger bridges and ii) provides an efficient computational scheme for both, Schroedinger bridges and OMT. We illustrate this new computational approach by obtaining interpolation of densities in representative examples such as interpolation of images. △ Less

Submitted 13 June, 2015; originally announced June 2015.

Comments: 20 pages, 7 figures

MSC Class: 47H07; 47H09; 60J25; 34A34; 49J20

arXiv:1505.07807 [pdf, ps, other]

Injective Hulls of Infinite Totally Split-Decomposable Metric Spaces

Authors: Maël Pavón

Abstract: We consider the class of (possibly) infinite metric spaces with integer-valued totally split-decomposable metric and possessing an injective hull which has the structure of a polyhedral complex. For this class, we give a characterization for the injective hull to be combinatorially equivalent to a CAT(0) cube complex. In order to obtain these results, we extend the decomposition theory introduced… ▽ More We consider the class of (possibly) infinite metric spaces with integer-valued totally split-decomposable metric and possessing an injective hull which has the structure of a polyhedral complex. For this class, we give a characterization for the injective hull to be combinatorially equivalent to a CAT(0) cube complex. In order to obtain these results, we extend the decomposition theory introduced by Bandelt and Dress in 1992 as well as results on the tight span of totally split-decomposable metric spaces proved by Huber, Koolen and Moulton in 2006. As an application, and using results of Lang of 2013, we obtain proper actions on CAT(0) cube complexes for finitely generated groups endowed with a totally split-decomposable word metric whose associated splits satisfy an easy combinatorial property. In the case of Gromov hyperbolic groups, the action is proper as well as cocompact. △ Less

Submitted 28 May, 2015; originally announced May 2015.

Comments: 49 pages, 4 figures

arXiv:1504.00874 [pdf, other]

Steering state statistics with output feedback

Authors: Yongxin Chen, Tryphon T. Georgiou, Michele Pavon

Abstract: Consider a linear stochastic system whose initial state is a random vector with a specified Gaussian distribution. Such a distribution may represent a collection of particles abiding by the specified system dynamics. In recent publications, we have shown that, provided the system is controllable, it is always possible to steer the state covariance to any specified terminal Gaussian distribution us… ▽ More Consider a linear stochastic system whose initial state is a random vector with a specified Gaussian distribution. Such a distribution may represent a collection of particles abiding by the specified system dynamics. In recent publications, we have shown that, provided the system is controllable, it is always possible to steer the state covariance to any specified terminal Gaussian distribution using state feedback. The purpose of the present work is to show that, in the case where only partial state observation is available, a necessary and sufficient condition for being able to steer the system to a specified terminal Gaussian distribution for the state vector is that the terminal state covariance be greater (in the positive-definite sense) than the error covariance of a corresponding Kalman filter. △ Less

Submitted 3 April, 2015; originally announced April 2015.

Comments: 10 pages, 2 figures

MSC Class: 93E20

arXiv:1503.04885 [pdf, other]

Optimal control of the state statistics for a linear stochastic system

Authors: Yongxin Chen, Tryphon Georgiou, Michele Pavon

Abstract: We consider a variant of the classical linear quadratic Gaussian regulator (LQG) in which penalties on the endpoint state are replaced by the specification of the terminal state distribution. The resulting theory considerably differs from LQG as well as from formulations that bound the probability of violating state constraints. We develop results for optimal state-feedback control in the two case… ▽ More We consider a variant of the classical linear quadratic Gaussian regulator (LQG) in which penalties on the endpoint state are replaced by the specification of the terminal state distribution. The resulting theory considerably differs from LQG as well as from formulations that bound the probability of violating state constraints. We develop results for optimal state-feedback control in the two cases where i) steering of the state distribution is to take place over a finite window of time with minimum energy, and ii) the goal is to maintain the state at a stationary distribution over an infinite horizon with minimum power. For both problems the distribution of noise and state are Gaussian. In the first case, we show that provided the system is controllable, the state can be steered to any terminal Gaussian distribution over any specified finite time-interval. In the second case, we characterize explicitly the covariance of admissible stationary state distributions that can be maintained with constant state-feedback control. The conditions for optimality are expressed in terms of a system of dynamically coupled Riccati equations in the finite horizon case and in terms of algebraic conditions for the stationary case. In the case where the noise and control share identical input channels, the Riccati equations for finite-horizon steering become homogeneous and can be solved in closed form. The present paper is largely based on our recent work in arxiv.longhoe.net/abs/1408.2222, arxiv.longhoe.net/abs/1410.3447 and presents an overview of certain key results. △ Less

Submitted 16 March, 2015; originally announced March 2015.

Comments: 7 pages, 4 figures. arXiv admin note: substantial text overlap with arXiv:1410.3447

MSC Class: 93E20

arXiv:1503.00215 [pdf, ps, other]

Optimal mass transport over bridges

Authors: Yongxin Chen, Tryphon Georgiou, Michele Pavon

Abstract: We present an overview of our recent work on implementable solutions to the Schroedinger bridge problem and their potential application to optimal transport and various generalizations. We present an overview of our recent work on implementable solutions to the Schroedinger bridge problem and their potential application to optimal transport and various generalizations. △ Less

Submitted 28 February, 2015; originally announced March 2015.

Comments: 9 pages, 1 figure

MSC Class: 93E20;

arXiv:1502.01328 [pdf, ps, other]

A layman's note on a class of frequentist hypothesis testing problems

Authors: Michele Pavon

Abstract: It is observed that for testing between simple hypotheses where the cost of Type I and Type II errors can be quantified, it is better to let the optimization choose the test size. It is observed that for testing between simple hypotheses where the cost of Type I and Type II errors can be quantified, it is better to let the optimization choose the test size. △ Less

Submitted 4 February, 2015; originally announced February 2015.

arXiv:1502.01265 [pdf, other]

Optimal transport over a linear dynamical system

Authors: Yongxin Chen, Tryphon Georgiou, Michele Pavon

Abstract: We consider the problem of steering an initial probability density for the state vector of a linear system to a final one, in finite time, using minimum energy control. In the case where the dynamics correspond to an integrator ($\dot x(t) = u(t)$) this amounts to a Monge-Kantorovich Optimal Mass Transport (OMT) problem. In general, we show that the problem can again be reduced to solving an OMT p… ▽ More We consider the problem of steering an initial probability density for the state vector of a linear system to a final one, in finite time, using minimum energy control. In the case where the dynamics correspond to an integrator ($\dot x(t) = u(t)$) this amounts to a Monge-Kantorovich Optimal Mass Transport (OMT) problem. In general, we show that the problem can again be reduced to solving an OMT problem and that it has a unique solution. In parallel, we study the optimal steering of the state-density of a linear stochastic system with white noise disturbance; this is known to correspond to a Schrödinger bridge. As the white noise intensity tends to zero, the flow of densities converges to that of the deterministic dynamics and can serve as a way to compute the solution of its deterministic counterpart. The solution can be expressed in closed-form for Gaussian initial and final state densities in both cases. △ Less

Submitted 4 February, 2015; originally announced February 2015.

Comments: 25 pages, 13 figures

MSC Class: 93E20; 49L99; 60G99

arXiv:1412.4430 [pdf, other]

On the relation between optimal transport and Schrödinger bridges: A stochastic control viewpoint

Authors: Yongxin Chen, Tryphon Georgiou, Michele Pavon

Abstract: We take a new look at the relation between the optimal transport problem and the Schrödinger bridge problem from the stochastic control perspective. We show that the connections are richer and deeper than described in existing literature. In particular: a) We give an elementary derivation of the Benamou-Brenier fluid dynamics version of the optimal transport problem; b) We provide a new fluid dyna… ▽ More We take a new look at the relation between the optimal transport problem and the Schrödinger bridge problem from the stochastic control perspective. We show that the connections are richer and deeper than described in existing literature. In particular: a) We give an elementary derivation of the Benamou-Brenier fluid dynamics version of the optimal transport problem; b) We provide a new fluid dynamics version of the Schrödinger bridge problem; c) We observe that the latter provides an important connection with optimal transport without zero noise limits; d) We propose and solve a fluid dynamic version of optimal transport with prior; e) We can then view optimal transport with prior as the zero noise limit of Schrödinger bridges when the prior is any Markovian evolution. In particular, we work out the Gaussian case. A numerical example of the latter convergence involving Brownian particles is also provided. △ Less

Submitted 14 December, 2014; originally announced December 2014.

Comments: 28 pages

MSC Class: 93E20

arXiv:1411.1323 [pdf, other]

Fast cooling for a system of stochastic oscillators

Authors: Yongxin Chen, Tryphon Georgiou, Michele Pavon

Abstract: We study feedback control of coupled nonlinear stochastic oscillators in a force field. We first consider the problem of asymptotically driving the system to a desired {\em steady state} corresponding to reduced thermal noise. Among the feedback controls achieving the desired asymptotic transfer, we find that the most efficient one {from an energy point of view} is characterized by {\em time-rever… ▽ More We study feedback control of coupled nonlinear stochastic oscillators in a force field. We first consider the problem of asymptotically driving the system to a desired {\em steady state} corresponding to reduced thermal noise. Among the feedback controls achieving the desired asymptotic transfer, we find that the most efficient one {from an energy point of view} is characterized by {\em time-reversibility}. We also extend the theory of Schrödinger bridges to this model, thereby steering the system in {\em finite time} and with minimum effort to a target steady-state distribution. The system can then be maintained in this state through the optimal steady-state feedback control. The solution, in the finite-horizon case, involves a space-time harmonic function $\varphi$, and $-\log\varphi$ plays the role of an artificial, time-varying potential in which the desired evolution occurs. This framework appears extremely general and flexible and can be viewed as a considerable generalization of existing active control strategies such as macromolecular cooling. In the case of a quadratic potential, the results assume a form particularly attractive from the algorithmic viewpoint as the optimal control can be computed via deterministic matricial differential equations. An example involving inertial particles illustrates both transient and steady state optimal feedback control. △ Less

Submitted 30 June, 2015; v1 submitted 5 November, 2014; originally announced November 2014.

Comments: 21 pages, 2 figures

MSC Class: 93E20

arXiv:1410.7306 [pdf, ps, other]

Injective Convex Polyhedra

Authors: Maël Pavón

Abstract: It was shown by Nachbin in 1950 that an $n$-dimensional normed space $X$ is injective or equivalently is an absolute 1-Lipschitz retract if and only if $X$ is linearly isometric to $l_\infty^n$ (i.e., $\mathbb{R}^n$ endowed with the $l_{\infty}$-metric). We give an effective convex geometric characterization of injective convex polyhedra in $l_{\infty}^n$. As an application, we prove that if the s… ▽ More It was shown by Nachbin in 1950 that an $n$-dimensional normed space $X$ is injective or equivalently is an absolute 1-Lipschitz retract if and only if $X$ is linearly isometric to $l_\infty^n$ (i.e., $\mathbb{R}^n$ endowed with the $l_{\infty}$-metric). We give an effective convex geometric characterization of injective convex polyhedra in $l_{\infty}^n$. As an application, we prove that if the set of solutions to a linear system of inequalities with at most two variables per inequality is non-empty, then it is injective when endowed with the $l_{\infty}$-metric. △ Less

Submitted 27 October, 2014; originally announced October 2014.

Comments: 24 pages

arXiv:1410.3447 [pdf, other]

Optimal steering of a linear stochastic system to a final probability distribution, part II

Authors: Yongxin Chen, Tryphon Georgiou, Michele Pavon

Abstract: We consider the problem of minimum energy steering of a linear stochastic system to a final prescribed distribution over a finite horizon and to maintain a stationary distribution over an infinite horizon. We present sufficient conditions for optimality in terms of a system of dynamically coupled Riccati equations in the finite horizon case and algebraic in the stationary case. We then address the… ▽ More We consider the problem of minimum energy steering of a linear stochastic system to a final prescribed distribution over a finite horizon and to maintain a stationary distribution over an infinite horizon. We present sufficient conditions for optimality in terms of a system of dynamically coupled Riccati equations in the finite horizon case and algebraic in the stationary case. We then address the question of feasibility for both problems. For the finite-horizon case, provided the system is controllable, we prove that without any restriction on the directionality of the stochastic disturbance it is always possible to steer the state to any arbitrary Gaussian distribution over any specified finite time-interval. For the stationary infinite horizon case, it is not always possible to maintain the state at an arbitrary Gaussian distribution through constant state-feedback. It is shown that covariances of admissible stationary Gaussian distributions are characterized by a certain Lyapunov-like equation. We finally present an alternative to solving the system of coupled Riccati equations, by expressing the optimal controls in the form of solutions to (convex) semi-definite programs for both cases. We conclude with an example to steer the state covariance of the distribution of inertial particles to an admissible stationary Gaussian distribution over a finite interval, to be maintained at that stationary distribution thereafter by constant-gain state-feedback control. △ Less

Submitted 13 October, 2014; originally announced October 2014.

Comments: 15 pages, 4 figures

MSC Class: 93E20

arXiv:1410.1605 [pdf, other]

Optimal steering of inertial particles diffusing anisotropically with losses

Authors: Yongxin Chen, Tryphon T. Georgiou, Michele Pavon

Abstract: Exploiting a fluid dynamic formulation for which a probabilistic counterpart might not be available, we extend the theory of Schroedinger bridges to the case of inertial particles with losses and general, possibly singular diffusion coefficient. We find that, as for the case of constant diffusion coefficient matrix, the optimal control law is obtained by solving a system of two p.d.e.'s involving… ▽ More Exploiting a fluid dynamic formulation for which a probabilistic counterpart might not be available, we extend the theory of Schroedinger bridges to the case of inertial particles with losses and general, possibly singular diffusion coefficient. We find that, as for the case of constant diffusion coefficient matrix, the optimal control law is obtained by solving a system of two p.d.e.'s involving adjoint operators and coupled through their boundary values. In the linear case with quadratic loss function, the system turns into two matrix Riccati equations with coupled split boundary conditions. An alternative formulation of the control problem as a semidefinite programming problem allows computation of suboptimal solutions. This is illustrated in one example of inertial particles subject to a constant rate killing. △ Less

Submitted 6 October, 2014; originally announced October 2014.

Comments: 15 pages, 4 figures

MSC Class: 93E20

arXiv:1408.2222 [pdf, other]

Optimal steering of a linear stochastic system to a final probability distribution

Authors: Yongxin Chen, Tryphon Georgiou, Michele Pavon

Abstract: We consider the problem to steer a linear dynamical system with full state observation from an initial gaussian distribution in state-space to a final one with minimum energy control. The system is stochastically driven through the control channels; an example for such a system is that of an inertial particle experiencing random "white noise" forcing. We show that a target probability distribution… ▽ More We consider the problem to steer a linear dynamical system with full state observation from an initial gaussian distribution in state-space to a final one with minimum energy control. The system is stochastically driven through the control channels; an example for such a system is that of an inertial particle experiencing random "white noise" forcing. We show that a target probability distribution can always be achieved in finite time. The optimal control is given in state-feedback form and is computed explicitely by solving a pair of differential Lyapunov equations that are coupled through their boundary values. This result, given its attractive algorithmic nature, appears to have several potential applications such as to active control of nanomechanical systems and molecular cooling. The problem to steer a diffusion process between end-point marginals has a long history (Schrödinger bridges) and therefore, the present case of steering a linear stochastic system constitutes a Schrödinger bridge for possibly degenerate diffusions. Our results, however, provide the first implementable form of the optimal control for a general Gauss-Markov process. Illustrative examples of the optimal evolution and control for inertial particles and a stochastic oscillator are provided. A final result establishes directly the property of Schrödinger bridges as the most likely random evolution between given marginals to the present context of linear stochastic systems. △ Less

Submitted 10 August, 2014; originally announced August 2014.

Comments: 11 pages, 7 figures

MSC Class: 93E20

arXiv:1405.6650 [pdf, ps, other]

doi 10.1063/1.4915289

Positive contraction map**s for classical and quantum Schrodinger systems

Authors: Tryphon T. Georgiou, Michele Pavon

Abstract: The classical Schrodinger bridge seeks the most likely probability law for a diffusion process, in path space, that matches marginals at two end points in time; the likelihood is quantified by the relative entropy between the sought law and a prior, and the law dictates a controlled path that abides by the specified marginals. Schrodinger proved that the optimal steering of the density between the… ▽ More The classical Schrodinger bridge seeks the most likely probability law for a diffusion process, in path space, that matches marginals at two end points in time; the likelihood is quantified by the relative entropy between the sought law and a prior, and the law dictates a controlled path that abides by the specified marginals. Schrodinger proved that the optimal steering of the density between the two end points is effected by a multiplicative functional transformation of the prior; this transformation represents an automorphism on the space of probability measures and has since been studied by Fortet, Beurling and others. A similar question can be raised for processes evolving in a discrete time and space as well as for processes defined over non-commutative probability spaces. The present paper builds on earlier work by Pavon and Ticozzi and begins with the problem of steering a Markov chain between given marginals. Our approach is based on the Hilbert metric and leads to an alternative proof which, however, is constructive. More specifically, we show that the solution to the Schrodinger bridge is provided by the fixed point of a contractive map. We approach in a similar manner the steering of a quantum system across a quantum channel. We are able to establish existence of quantum transitions that are multiplicative functional transformations of a given Kraus map, but only for the case of uniform marginals. As in the Markov chain case, and for uniform density matrices, the solution of the quantum bridge can be constructed from the fixed point of a certain contractive map. For arbitrary marginal densities, extensive numerical simulations indicate that iteration of a similar map leads to fixed points from which we can construct a quantum bridge. For this general case, however, a proof of convergence remains elusive. △ Less

Submitted 7 October, 2014; v1 submitted 26 May, 2014; originally announced May 2014.

Comments: 27 pages

MSC Class: 81P45; 94A40; 60J10

arXiv:1303.6826 [pdf, ps, other]

Metric stability of trees and tight spans

Authors: Urs Lang, Maël Pavón, Roger Züst

Abstract: In this note, we prove optimal extension results for roughly isometric relations between metric (R-)trees and injective metric spaces. This yields sharp stability estimates, in terms of the Gromov-Hausdorff (GH) distance, for certain metric spanning constructions: The GH distance of two metric trees spanned by some subsets is smaller than or equal to the GH distance of these sets. The GH distance… ▽ More In this note, we prove optimal extension results for roughly isometric relations between metric (R-)trees and injective metric spaces. This yields sharp stability estimates, in terms of the Gromov-Hausdorff (GH) distance, for certain metric spanning constructions: The GH distance of two metric trees spanned by some subsets is smaller than or equal to the GH distance of these sets. The GH distance of the injective hulls, or tight spans, of two metric spaces is at most twice the GH distance between themselves. △ Less

Submitted 27 March, 2013; originally announced March 2013.

Comments: 8 pages, 1 figure

arXiv:1303.0707 [pdf, other]

doi 10.1109/TIFS.2015.2392565

On the Achievable Error Region of Physical Layer Authentication Techniques over Rayleigh Fading Channels

Authors: Augusto Ferrante, Nicola Laurenti, Chiara Masiero, Michele Pavon, Stefano Tomasin

Abstract: For a physical layer message authentication procedure based on the comparison of channel estimates obtained from the received messages, we focus on an outer bound on the type I/II error probability region. Channel estimates are modelled as multivariate Gaussian vectors, and we assume that the attacker has only some side information on the channel estimate, which he does not know directly. We deriv… ▽ More For a physical layer message authentication procedure based on the comparison of channel estimates obtained from the received messages, we focus on an outer bound on the type I/II error probability region. Channel estimates are modelled as multivariate Gaussian vectors, and we assume that the attacker has only some side information on the channel estimate, which he does not know directly. We derive the attacking strategy that provides the tightest bound on the error region, given the statistics of the side information. This turns out to be a zero mean, circularly symmetric Gaussian density whose correlation matrices may be obtained by solving a constrained optimization problem. We propose an iterative algorithm for its solution: Starting from the closed form solution of a relaxed problem, we obtain, by projection, an initial feasible solution; then, by an iterative procedure, we look for the fixed point solution of the problem. Numerical results show that for cases of interest the iterative approach converges, and perturbation analysis shows that the found solution is a local minimum. △ Less

Submitted 18 April, 2014; v1 submitted 4 March, 2013; originally announced March 2013.

Journal ref: IEEE Transactions on Information Forensics and Security, Volume 10, Issue 5, 1 May 2015, Article number 7010914, Pages 941-952

arXiv:1301.4823 [pdf, other]

A note on the geometric interpretation of Bell's inequalities

Authors: Paolo Dai Pra, Michele Pavon, Neeraja Sahasrabudhe

Abstract: Using results of Pitowsky and Gupta, we show in a direct, elementary fashion that, in the case of three spins, Bell's inequalities indeed provide a representation of the tetrahedron of all spin correlation matrices as intersection of half-spaces. Using results of Pitowsky and Gupta, we show in a direct, elementary fashion that, in the case of three spins, Bell's inequalities indeed provide a representation of the tetrahedron of all spin correlation matrices as intersection of half-spaces. △ Less

Submitted 12 April, 2013; v1 submitted 21 January, 2013; originally announced January 2013.

Comments: 7 pages

arXiv:1112.5529 [pdf, other]

On the Geometry of Maximum Entropy Problems

Authors: Michele Pavon, Augusto Ferrante

Abstract: We show that a simple geometric result suffices to derive the form of the optimal solution in a large class of finite and infinite-dimensional maximum entropy problems concerning probability distributions, spectral densities and covariance matrices. These include Burg's spectral estimation method and Dempster's covariance completion, as well as various recent generalizations of the above. We then… ▽ More We show that a simple geometric result suffices to derive the form of the optimal solution in a large class of finite and infinite-dimensional maximum entropy problems concerning probability distributions, spectral densities and covariance matrices. These include Burg's spectral estimation method and Dempster's covariance completion, as well as various recent generalizations of the above. We then apply this orthogonality principle to the new problem of completing a block-circulant covariance matrix when an a priori estimate is available. △ Less

Submitted 21 December, 2012; v1 submitted 23 December, 2011; originally announced December 2011.

Comments: 22 pages

arXiv:1107.2465 [pdf, other]

doi 10.1109/TAC.2011.2125050

An Efficient Algorithm for Maximum-Entropy Extension of Block-Circulant Covariance Matrices

Authors: Francesca P. Carli, Augusto Ferrante, Michele Pavon, Giorgio Picci

Abstract: This paper deals with maximum entropy completion of partially specified block-circulant matrices. Since positive definite symmetric circulants happen to be covariance matrices of stationary periodic processes, in particular of stationary reciprocal processes, this problem has applications in signal processing, in particular to image modeling. In fact it is strictly related to maximum likelihood es… ▽ More This paper deals with maximum entropy completion of partially specified block-circulant matrices. Since positive definite symmetric circulants happen to be covariance matrices of stationary periodic processes, in particular of stationary reciprocal processes, this problem has applications in signal processing, in particular to image modeling. In fact it is strictly related to maximum likelihood estimation of bilateral AR-type representations of acausal signals subject to certain conditional independence constraints. The maximum entropy completion problem for block-circulant matrices has recently been solved by the authors, although leaving open the problem of an efficient computation of the solution. In this paper, we provide an effcient algorithm for computing its solution which compares very favourably with existing algorithms designed for positive definite matrix extension problems. The proposed algorithm benefits from the analysis of the relationship between our problem and the band-extension problem for block-Toeplitz matrices also developed in this paper. △ Less

Submitted 8 February, 2013; v1 submitted 13 July, 2011; originally announced July 2011.

Comments: 25 pages

Journal ref: IEEE Trans. on Automatic Control, 56(9):1999 - 2012, 2011

arXiv:1103.5602 [pdf, other]

Time and spectral domain relative entropy: A new approach to multivariate spectral estimation

Authors: Augusto Ferrante, Chiara Masiero, Michele Pavon

Abstract: The concept of spectral relative entropy rate is introduced for jointly stationary Gaussian processes. Using classical information-theoretic results, we establish a remarkable connection between time and spectral domain relative entropy rates. This naturally leads to a new spectral estimation technique where a multivariate version of the Itakura-Saito distance is employed}. It may be viewed as an… ▽ More The concept of spectral relative entropy rate is introduced for jointly stationary Gaussian processes. Using classical information-theoretic results, we establish a remarkable connection between time and spectral domain relative entropy rates. This naturally leads to a new spectral estimation technique where a multivariate version of the Itakura-Saito distance is employed}. It may be viewed as an extension of the approach, called THREE, introduced by Byrnes, Georgiou and Lindquist in 2000 which, in turn, followed in the footsteps of the Burg-Jaynes Maximum Entropy Method. Spectral estimation is here recast in the form of a constrained spectrum approximation problem where the distance is equal to the processes relative entropy rate. The corresponding solution entails a complexity upper bound which improves on the one so far available in the multichannel framework. Indeed, it is equal to the one featured by THREE in the scalar case. The solution is computed via a globally convergent matricial Newton-type algorithm. Simulations suggest the effectiveness of the new technique in tackling multivariate spectral estimation tasks, especially in the case of short data records. △ Less

Submitted 29 September, 2011; v1 submitted 29 March, 2011; originally announced March 2011.

Comments: 32 pages, submitted for publication

MSC Class: 46N10

arXiv:1101.4849 [pdf, other]

doi 10.1109/TAC.2011.2125050

A Maximum Entropy solution of the Covariance Extension Problem for Reciprocal Processes

Authors: Francesca Carli, Augusto Ferrante, Michele Pavon, Giorgio Picci

Abstract: Stationary reciprocal processes defined on a finite interval of the integer line can be seen as a special class of Markov random fields restricted to one dimension. Non stationary reciprocal processes have been extensively studied in the past especially by Jamison, Krener, Levy and co-workers. The specialization of the non-stationary theory to the stationary case, however, does not seem to have be… ▽ More Stationary reciprocal processes defined on a finite interval of the integer line can be seen as a special class of Markov random fields restricted to one dimension. Non stationary reciprocal processes have been extensively studied in the past especially by Jamison, Krener, Levy and co-workers. The specialization of the non-stationary theory to the stationary case, however, does not seem to have been pursued in sufficient depth in the literature. Stationary reciprocal processes (and reciprocal stochastic models) are potentially useful for describing signals which naturally live in a finite region of the time (or space) line. Estimation or identification of these models starting from observed data seems still to be an open problem which can lead to many interesting applications in signal and image processing. In this paper, we discuss a class of reciprocal processes which is the acausal analog of auto-regressive (AR) processes, familiar in control and signal processing. We show that maximum likelihood identification of these processes leads to a covariance extension problem for block-circulant covariance matrices. This generalizes the famous covariance band extension problem for stationary processes on the integer line. As in the usual stationary setting on the integer line, the covariance extension problem turns out to be a basic conceptual and practical step in solving the identification problem. We show that the maximum entropy principle leads to a complete solution of the problem. △ Less

Submitted 25 January, 2011; originally announced January 2011.

Comments: 33 pages, to appear in the IEEE Trans. Aut. Contr

arXiv:1006.5385 [pdf, other]

Matrix Completion by the Principle of Parsimony

Authors: Augusto Ferrante, Michele Pavon

Abstract: Dempster's covariance selection method is extended first to general nonsingular matrices and then to full rank rectangular matrices. Dempster observed that his completion solved a maximum entropy problem. We show that our generalized completions are also solutions of a suitable entropy-like variational problem. Dempster's covariance selection method is extended first to general nonsingular matrices and then to full rank rectangular matrices. Dempster observed that his completion solved a maximum entropy problem. We show that our generalized completions are also solutions of a suitable entropy-like variational problem. △ Less

Submitted 28 June, 2010; originally announced June 2010.

Comments: 15 pages

MSC Class: 15A83

arXiv:0911.0440 [pdf, other]

On the well-posedness of multivariate spectrum approximation and convergence of high-resolution spectral estimators

Authors: Federico Ramponi, Augusto Ferrante, Michele Pavon

Abstract: In this paper, we establish the well-posedness of the generalized moment problems recently studied by Byrnes-Georgiou-Lindquist and coworkers, and by Ferrante-Pavon-Ramponi. We then apply these continuity results to prove almost sure convergence of a sequence of high-resolution spectral estimators indexed by the sample size. In this paper, we establish the well-posedness of the generalized moment problems recently studied by Byrnes-Georgiou-Lindquist and coworkers, and by Ferrante-Pavon-Ramponi. We then apply these continuity results to prove almost sure convergence of a sequence of high-resolution spectral estimators indexed by the sample size. △ Less

Submitted 2 November, 2009; originally announced November 2009.

arXiv:0811.0933 [pdf, other]

Discrete-time classical and quantum Markovian evolutions: Maximum entropy problems on path space

Authors: Michele Pavon, Francesco Ticozzi

Abstract: The theory of Schroedinger bridges for diffusion processes is extended to classical and quantum discrete-time Markovian evolutions. The solution of the path space maximum entropy problems is obtained from the a priori model in both cases via a suitable multiplicative functional transformation. In the quantum case, nonequilibrium time reversal of quantum channels is discussed and space-time harmo… ▽ More The theory of Schroedinger bridges for diffusion processes is extended to classical and quantum discrete-time Markovian evolutions. The solution of the path space maximum entropy problems is obtained from the a priori model in both cases via a suitable multiplicative functional transformation. In the quantum case, nonequilibrium time reversal of quantum channels is discussed and space-time harmonic processes are introduced. △ Less

Submitted 29 April, 2009; v1 submitted 6 November, 2008; originally announced November 2008.

Comments: 34 pages

MSC Class: 60J10; 81Q99

Showing 1–50 of 62 results for author: Pavon, M