-
Directed Metric Structures arising in Large Language Models
Authors:
Stéphane Gaubert,
Yiannis Vlassopoulos
Abstract:
Large Language Models are transformer neural networks which are trained to produce a probability distribution on the possible next words to given texts in a corpus, in such a way that the most likely word predicted is the actual word in the training text. In this paper we find what is the mathematical structure defined by such conditional probability distributions of text extensions. Changing the…
▽ More
Large Language Models are transformer neural networks which are trained to produce a probability distribution on the possible next words to given texts in a corpus, in such a way that the most likely word predicted is the actual word in the training text. In this paper we find what is the mathematical structure defined by such conditional probability distributions of text extensions. Changing the view point from probabilities to -log probabilities we observe that the subtext order is completely encoded in a metric structure defined on the space of texts $\mathcal{L}$, by -log probabilities. We then construct a metric polyhedron $P(\mathcal{L})$ and an isometric embedding (called Yoneda embedding) of $\mathcal{L}$ into $P(\mathcal{L})$ such that texts map to generators of certain special extremal rays. We explain that $P(\mathcal{L})$ is a $(\min,+)$ (tropical) linear span of these extremal ray generators. The generators also satisfy a system of $(\min+)$ linear equations. We then show that $P(\mathcal{L})$ is compatible with adding more text and from this we derive an approximation of a text vector as a Boltzmann weighted linear combination of the vectors for words in that text. We then prove a duality theorem showing that texts extensions and text restrictions give isometric polyhedra (even though they look a priory very different). Moreover we prove that $P(\mathcal{L})$ is the lattice closure of (a version of) the so called, Isbell completion of $\mathcal{L}$ which turns out to be the $(\max,+)$ span of the text extremal ray generators. All constructions have interpretations in category theory but we don't use category theory explicitly. The categorical interpretations are briefly explained in an appendix. In the final appendix we describe how the syntax to semantics problem could fit in a general well known mathematical duality.
△ Less
Submitted 20 May, 2024;
originally announced May 2024.
-
Order isomorphisms of sup-stable function spaces: continuous, Lipschitz, c-convex, and beyond
Authors:
Pierre-Cyril Aubin-Frankowski,
Stéphane Gaubert
Abstract:
There have been many parallel streams of research studying order isomorphisms of some specific sets $\mathcal{G}$ of functions from a set $\mathcal{X}$ to $\mathbb{R}\cup\{\pm\infty\}$, such as the sets of convex or Lipschitz functions. We provide in this article a unified abstract approach inspired by $c$-convex functions. Our results are obtained highlighting the role of inf and sup-irreducible…
▽ More
There have been many parallel streams of research studying order isomorphisms of some specific sets $\mathcal{G}$ of functions from a set $\mathcal{X}$ to $\mathbb{R}\cup\{\pm\infty\}$, such as the sets of convex or Lipschitz functions. We provide in this article a unified abstract approach inspired by $c$-convex functions. Our results are obtained highlighting the role of inf and sup-irreducible elements of $\mathcal{G}$ and the usefulness of characterizing them, to subsequently derive the structure of order isomorphisms, and in particular of those commuting with the addition of scalars. We show that in many cases all these isomorphisms $J:\mathcal{G}\to\mathcal{G}$ are of the form $Jf=g+f\circ φ$ for a translation $g:\mathcal{X}\to\mathbb{R}$ and a bijective reparametrization $φ:\mathcal{X}\to \mathcal{X}$. We apply our theory to the sets of $c$-convex functions on compact Hausdorff spaces, to the set of lower semicontinuous (convex) functions on a Hausdorff topological vector space and to Lipschitz and 1-Lipschitz functions of complete metric spaces.
△ Less
Submitted 10 April, 2024;
originally announced April 2024.
-
The Nullstellensatz and Positivstellensatz for Sparse Tropical Polynomial Systems
Authors:
Marianne Akian,
Antoine Béreau,
Stéphane Gaubert
Abstract:
Grigoriev and Podolskii (2018) have established a tropical analogue of the effective Nullstellensatz, showing that a system of tropical polynomial equations is solvable if and only if a linearized system obtained from a truncated Macaulay matrix is solvable. They provided an upper bound of the minimal admissible truncation degree, as a function of the degrees of the tropical polynomials. We establ…
▽ More
Grigoriev and Podolskii (2018) have established a tropical analogue of the effective Nullstellensatz, showing that a system of tropical polynomial equations is solvable if and only if a linearized system obtained from a truncated Macaulay matrix is solvable. They provided an upper bound of the minimal admissible truncation degree, as a function of the degrees of the tropical polynomials. We establish a tropical Nullstellensatz adapted to {\em sparse} tropical polynomial systems. Our approach is inspired by a construction of Canny-Emiris (1993), refined by Sturmfels (1994). This leads to an improved bound of the truncation degree, which coincides with the classical Macaulay degree in the case of $n+1$ equations in $n$ unknowns. We also establish a tropical Positivstellensatz, allowing one to decide the inclusion of tropical basic semialgebraic sets. This allows one to reduce decision problems for tropical semi-algebraic sets to the solution of systems of tropical linear equalities and inequalities.
△ Less
Submitted 10 December, 2023;
originally announced December 2023.
-
Linear algebra over T-pairs
Authors:
Marianne Akian,
Stephane Gaubert,
Louis Rowen
Abstract:
This paper treats linear algebra over a semiring pair, in a wide range of applications to tropical algebra and related areas such as hyperrings and fuzzy rings. First we present a more general category of ``pairs'' with their morphisms, called ``weak morphisms,'' paying special attention to supertropical pairs, hyperpairs, and the doubling functor. Then we turn to matrices and the question of whet…
▽ More
This paper treats linear algebra over a semiring pair, in a wide range of applications to tropical algebra and related areas such as hyperrings and fuzzy rings. First we present a more general category of ``pairs'' with their morphisms, called ``weak morphisms,'' paying special attention to supertropical pairs, hyperpairs, and the doubling functor. Then we turn to matrices and the question of whether the row rank, column rank, and submatrix rank of a matrix are equal. Often the submatrix rank is less than or equal to the row rank and the column rank, but there is a counterexample to equality, discovered some time ago by the second author, which we provide in a more general setting (``pairs of the second kind'') that includes the hyperfield of signs. Additional positive results include a version of Cramer's rule, and we find situations when equality holds, encompassing results by Akian, Gaubert, Guterman, Izhakian, Knebusch, and Rowen. We pay special attention to the question of whether $n+1$ vectors of length $n$ need be dependent. At the end, we introduce a category with stronger morphisms, that preserve a surpassing relation.
△ Less
Submitted 22 March, 2024; v1 submitted 8 October, 2023;
originally announced October 2023.
-
Analysis of the vanishing discount limit for optimal control problems in continuous and discrete time
Authors:
Piermarco Cannarsa,
Stephane Gaubert,
Cristian Mendico,
Marc Quincampoix
Abstract:
A classical problem in ergodic continuous time control consists of studying the limit behavior of the optimal value of a discounted cost functional with infinite horizon as the discount factor $λ$ tends to zero. In the literature, this problem has been addressed under various controllability or ergodicity conditions ensuring that the rescaled value function converges uniformly to a constant limit.…
▽ More
A classical problem in ergodic continuous time control consists of studying the limit behavior of the optimal value of a discounted cost functional with infinite horizon as the discount factor $λ$ tends to zero. In the literature, this problem has been addressed under various controllability or ergodicity conditions ensuring that the rescaled value function converges uniformly to a constant limit. In this case the limit can be characterized as the unique constant such that a suitable Hamilton-Jacobi equation has at least one continuous viscosity solution. In this paper, we study this problem without such conditions, so that the aforementioned limit needs not be constant. Our main result characterizes the uniform limit (when it exists) as the maximal subsolution of a system of Hamilton-Jacobi equations. Moreover, when such a subsolution is a viscosity solution, we obtain the convergence of optimal values as well as a rate of convergence. This mirrors the analysis of the discrete time case, where we characterize the uniform limit as the supremum over a set of sub-invariant half-lines of the dynamic programming operator. The emerging structure in both discrete and continuous time models shows that the supremum over sub-invariato half-lines with respect to the Lax-Oleinik semigroup/dynamic programming operator, captures the behavior of the limit cost as discount vanishes.
△ Less
Submitted 22 January, 2024; v1 submitted 12 June, 2023;
originally announced June 2023.
-
Signed tropicalization of polar cones
Authors:
Marianne Akian,
Xavier Allamigeon,
Stéphane Gaubert,
Sergei Sergeev
Abstract:
We study the tropical analogue of the notion of polar of a cone, working over the semiring of tropical numbers with signs. We characterize the cones which arise as polars of sets of tropically nonnegative vectors by an invariance property with respect to a tropical analogue of Fourier-Motzkin elimination. We also relate tropical polars with images by the nonarchimedean valuation of classical polar…
▽ More
We study the tropical analogue of the notion of polar of a cone, working over the semiring of tropical numbers with signs. We characterize the cones which arise as polars of sets of tropically nonnegative vectors by an invariance property with respect to a tropical analogue of Fourier-Motzkin elimination. We also relate tropical polars with images by the nonarchimedean valuation of classical polars over real closed nonarchimedean fields and show, in particular, that for semi-algebraic sets over such fields, the operation of taking the polar commutes with the operation of signed valuation (kee** track both of the nonarchimedean valuation and sign). We apply these results to characterize images by the signed valuation of classical cones of matrices, including the cones of positive semidefinite matrices, completely positive matrices, completely positive semidefinite matrices, and their polars, including the cone of co-positive matrices, showing that hierarchies of classical cones collapse under tropicalization. We finally discuss an application of these ideas to optimization with signed tropical numbers.
△ Less
Submitted 28 February, 2024; v1 submitted 9 May, 2023;
originally announced May 2023.
-
Solving irreducible stochastic mean-payoff games and entropy games by relative Krasnoselskii-Mann iteration
Authors:
Marianne Akian,
Stéphane Gaubert,
Ulysse Naepels,
Basile Terver
Abstract:
We analyse an algorithm solving stochastic mean-payoff games, combining the ideas of relative value iteration and of Krasnoselskii-Mann dam**. We derive parameterized complexity bounds for several classes of games satisfying irreducibility conditions. We show in particular that an $ε$-approximation of the value of an irreducible concurrent stochastic game can be computed in a number of iteration…
▽ More
We analyse an algorithm solving stochastic mean-payoff games, combining the ideas of relative value iteration and of Krasnoselskii-Mann dam**. We derive parameterized complexity bounds for several classes of games satisfying irreducibility conditions. We show in particular that an $ε$-approximation of the value of an irreducible concurrent stochastic game can be computed in a number of iterations in $O(|\logε|)$ where the constant in the $O(\cdot)$ is explicit, depending on the smallest non-zero transition probabilities. This should be compared with a bound in $O(|ε|^{-1}|\log(ε)|)$ obtained by Chatterjee and Ibsen-Jensen (ICALP 2014) for the same class of games, and to a $O(|ε|^{-1})$ bound by Allamigeon, Gaubert, Katz and Skomra (ICALP 2022) for turn-based games. We also establish parameterized complexity bounds for entropy games, a class of matrix multiplication games introduced by Asarin, Cervelle, Degorre, Dima, Horn and Kozyakin. We derive these results by methods of variational analysis, establishing contraction properties of the relative Krasnoselskii-Mann iteration with respect to Hilbert's semi-norm.
△ Less
Submitted 3 May, 2023;
originally announced May 2023.
-
An Adaptive Multi-Level Max-Plus Method for Deterministic Optimal Control Problems
Authors:
Marianne Akian,
Stéphane Gaubert,
Shanqing Liu
Abstract:
We introduce a new numerical method to approximate the solution of a finite horizon deterministic optimal control problem. We exploit two Hamilton-Jacobi-Bellman PDE, arising by considering the dynamics in forward and backward time. This allows us to compute a neighborhood of the set of optimal trajectories, in order to reduce the search space. The solutions of both PDE are successively approximat…
▽ More
We introduce a new numerical method to approximate the solution of a finite horizon deterministic optimal control problem. We exploit two Hamilton-Jacobi-Bellman PDE, arising by considering the dynamics in forward and backward time. This allows us to compute a neighborhood of the set of optimal trajectories, in order to reduce the search space. The solutions of both PDE are successively approximated by max-plus linear combinations of appropriate basis functions, using a hierarchy of finer and finer grids. We show that the sequence of approximate value functions obtained in this way does converge to the viscosity solution of the HJB equation in a neighborhood of optimal trajectories. Then, under certain regularity assumptions, we show that the number of arithmetic operations needed to compute an approximate optimal solution of a $d$-dimensional problem, up to a precision $\varepsilon$, is bounded by $O(C^d (1/\varepsilon) )$, for some constant $C>1$, whereas ordinary grid-based methods have a complexity in$O(1/\varepsilon^{ad}$) for some constant $a>0$.
△ Less
Submitted 20 April, 2023;
originally announced April 2023.
-
A Quantization Procedure for Nonlinear Pricing with an Application to Electricity Markets
Authors:
Quentin Jacquet,
Wim van Ackooij,
Clémence Alasseur,
Stéphane Gaubert
Abstract:
We consider a revenue maximization model, in which a company aims at designing a menu of contracts, given a population of customers. A standard approach consists in constructing an incentive-compatible continuum of contracts, i.e., a menu composed of an infinite number of contracts, where each contract is especially adapted to an infinitesimal customer, taking his type into account. Nonetheless, i…
▽ More
We consider a revenue maximization model, in which a company aims at designing a menu of contracts, given a population of customers. A standard approach consists in constructing an incentive-compatible continuum of contracts, i.e., a menu composed of an infinite number of contracts, where each contract is especially adapted to an infinitesimal customer, taking his type into account. Nonetheless, in many applications, the company is constrained to offering a limited number of contracts. We show that this question reduces to an optimal quantization problem, similar to the pruning problem that appeared in the max-plus based numerical methods in optimal control. We develop a new quantization algorithm, which, given an initial menu of contracts, iteratively prunes the less important contracts, to construct an implementable menu of the desired cardinality, while minimizing the revenue loss. We apply this algorithm to solve a pricing problem with price-elastic demand, originating from the electricity retail market. Numerical results show an improved performance by comparison with earlier pruning algorithms.
△ Less
Submitted 30 March, 2023;
originally announced March 2023.
-
A Multi-Level Fast-Marching Method For The Minimum Time Problem
Authors:
Marianne Akian,
Stéphane Gaubert,
Shanqing Liu
Abstract:
We introduce a new numerical method to approximate the solutions of a class of stationary Hamilton-Jacobi (HJ) partial differential equations arising from minimum time optimal control problems. We rely on nested grid approximations, and look for the optimal trajectories by using the coarse grid approximations to reduce the search space in fine grids. This provides an infinitesimal version of the `…
▽ More
We introduce a new numerical method to approximate the solutions of a class of stationary Hamilton-Jacobi (HJ) partial differential equations arising from minimum time optimal control problems. We rely on nested grid approximations, and look for the optimal trajectories by using the coarse grid approximations to reduce the search space in fine grids. This provides an infinitesimal version of the ``highway hierarchy'' method which has been developed to solve shortest path problems (with discrete time and discrete state). We obtain, for each level, an approximate value function on a sub-domain of the state space. We show that the sequence obtained in this way does converge to the viscosity solution of the HJ equation. Moreover, for our multi-level algorithm, if $0<γ\leq 1$ is the convergence rate of the classical numerical scheme, then the number of arithmetic operations needed to obtain an error in $O(\varepsilon)$ is in $\widetilde{O}(\varepsilon^{-θ})$, with $θ< \frac{d}γ$, to be compared with $\widetilde{O}(\varepsilon^{-d/ γ})$ for ordinary grid-based methods. Here $d$ is the dimension of the problem, $θ$ depends on $d,γ$ and on the ``stiffness" of the value function around optimal trajectories, and the notation $\widetilde{O}$ ignores logarithmic factors. In particular, in typical smooth cases, one has $γ=1$ and $θ=(d+1)/2$.
△ Less
Submitted 8 July, 2024; v1 submitted 19 March, 2023;
originally announced March 2023.
-
Complexity of Geometric programming in the Turing model and application to nonnegative tensors
Authors:
Shmuel Friedland,
Stéphane Gaubert
Abstract:
We consider a geometric programming problem consisting in minimizing a function given by the supremum of finitely many log-Laplace transforms of discrete nonnegative measures on a Euclidean space. Under a coerciveness assumption, we show that a $\varepsilon$-minimizer can be computed in a time that is polynomial in the input size and in $|\log\varepsilon|$. This is obtained by establishing bit-siz…
▽ More
We consider a geometric programming problem consisting in minimizing a function given by the supremum of finitely many log-Laplace transforms of discrete nonnegative measures on a Euclidean space. Under a coerciveness assumption, we show that a $\varepsilon$-minimizer can be computed in a time that is polynomial in the input size and in $|\log\varepsilon|$. This is obtained by establishing bit-size estimates on approximate minimizers and by applying the ellipsoid method. We also derive polynomial iteration complexity bounds for the interior point method applied to the same class of problems. We deduce that the spectral radius of a partially symmetric, weakly irreducible nonnegative tensor can be approximated within $\varepsilon$ error in poly-time. For strongly irreducible tensors, we also show that the logarithm of the positive eigenvector is poly-time computable. Our results also yield that the the maximum of a nonnegative homogeneous $d$-form in the unit ball with respect to $d$-Hölder norm can be approximated in poly-time. In particular, the spectral radius of uniform weighted hypergraphs and some known upper bounds for the clique number of uniform hypergraphs are poly-time computable.
△ Less
Submitted 19 March, 2024; v1 submitted 25 January, 2023;
originally announced January 2023.
-
Factorization of polynomials over the symmetrized tropical semiring and Descartes' rule of sign over ordered valued fields
Authors:
Marianne Akian,
Stephane Gaubert,
Hanieh Tavakolipour
Abstract:
The symmetrized tropical semiring is an extension of the tropical semifield, initially introduced to solve tropical linear systems using Cramer's rule. It is equivalent to the real tropical hyperfield, which has been used in the study of tropicalizations of semialgebraic sets. Polynomials over the symmetrized tropical semiring, and their factorizations, were considered by Quadrat. Recently, Baker…
▽ More
The symmetrized tropical semiring is an extension of the tropical semifield, initially introduced to solve tropical linear systems using Cramer's rule. It is equivalent to the real tropical hyperfield, which has been used in the study of tropicalizations of semialgebraic sets. Polynomials over the symmetrized tropical semiring, and their factorizations, were considered by Quadrat. Recently, Baker and Lorscheid introduced a notion of multiplicity for the roots of univariate polynomials over hyperfields. In the special case of the hyperfield of signs, they related multiplicities with Descarte's rule of sign for real polynomials. We investigate here the factorizations of univariate polynomial functions over symmetrized tropical semirings, and relate them with the multiplicities of roots over these semirings. We deduce a Descartes' rule for "signs and valuations", which applies to polynomials over a real closed field with a convex valuation and an arbitrary (divisible) value group. We show in particular that the inequality of the Descartes' rule is tight when the value group is non-trivial. This extends to arbitrary value groups a characterization of Gunn in the rank one case, answering also to the tightness question. Our results are obtained using the framework of semiring systems introduced by Rowen, together with model theory of valued fields.
△ Less
Submitted 4 June, 2024; v1 submitted 13 January, 2023;
originally announced January 2023.
-
Semiring systems arising from hyperrings
Authors:
Marianne Akian,
Stephane Gaubert,
Louis Rowen
Abstract:
Hyperfields and systems are two algebraic frameworks which have been developed to provide a unified approach to classical and tropical structures. All hyperfields, and more generally hyperrings, can be represented by systems. Conversely, we show that the systems arising in this way, called {\it hypersystems}, are characterized by certain elimination axioms. Systems are preserved under standard alg…
▽ More
Hyperfields and systems are two algebraic frameworks which have been developed to provide a unified approach to classical and tropical structures. All hyperfields, and more generally hyperrings, can be represented by systems. Conversely, we show that the systems arising in this way, called {\it hypersystems}, are characterized by certain elimination axioms. Systems are preserved under standard algebraic constructions; for instance matrices and polynomials over hypersystems are systems, but not hypersystems. We illustrate these results by discussing several examples of systems and hyperfields, and constructions like matroids over systems.
△ Less
Submitted 27 April, 2023; v1 submitted 14 July, 2022;
originally announced July 2022.
-
Universal Complexity Bounds Based on Value Iteration and Application to Entropy Games
Authors:
Xavier Allamigeon,
Stéphane Gaubert,
Ricardo D. Katz,
Mateusz Skomra
Abstract:
We develop value iteration-based algorithms to solve in a unified manner different classes of combinatorial zero-sum games with mean-payoff type rewards. These algorithms rely on an oracle, evaluating the dynamic programming operator up to a given precision. We show that the number of calls to the oracle needed to determine exact optimal (positional) strategies is, up to a factor polynomial in the…
▽ More
We develop value iteration-based algorithms to solve in a unified manner different classes of combinatorial zero-sum games with mean-payoff type rewards. These algorithms rely on an oracle, evaluating the dynamic programming operator up to a given precision. We show that the number of calls to the oracle needed to determine exact optimal (positional) strategies is, up to a factor polynomial in the dimension, of order R/sep, where the "separation" sep is defined as the minimal difference between distinct values arising from strategies, and R is a metric estimate, involving the norm of approximate sub and super-eigenvectors of the dynamic programming operator. We illustrate this method by two applications. The first one is a new proof, leading to improved complexity estimates, of a theorem of Boros, Elbassioni, Gurvich and Makino, showing that turn-based mean-payoff games with a fixed number of random positions can be solved in pseudo-polynomial time. The second one concerns entropy games, a model introduced by Asarin, Cervelle, Degorre, Dima, Horn and Kozyakin. The rank of an entropy game is defined as the maximal rank among all the ambiguity matrices determined by strategies of the two players. We show that entropy games with a fixed rank, in their original formulation, can be solved in polynomial time, and that an extension of entropy games incorporating weights can be solved in pseudo-polynomial time under the same fixed rank condition.
△ Less
Submitted 17 June, 2022;
originally announced June 2022.
-
Ergodic control of a heterogeneous population and application to electricity pricing
Authors:
Quentin Jacquet,
Wim van Ackooij,
Clémence Alasseur,
Stéphane Gaubert
Abstract:
We consider a control problem for a heterogeneous population composed of agents able to switch at any time between different options. The controller aims to maximize an average gain per time unit, supposing that the population is of infinite size. This leads to an ergodic control problem for a "mean-field" Markov Decision Process in which the state space is a product of simplices, and the populati…
▽ More
We consider a control problem for a heterogeneous population composed of agents able to switch at any time between different options. The controller aims to maximize an average gain per time unit, supposing that the population is of infinite size. This leads to an ergodic control problem for a "mean-field" Markov Decision Process in which the state space is a product of simplices, and the population evolves according to controlled linear dynamics. By exploiting contraction properties of the dynamics in Hilbert's projective metric, we prove that the infinite-dimensional ergodic eigenproblem admits a solution and show that the latter is in general non unique. This allows us to obtain optimal strategies, and to quantify the gap between steady-state strategies and optimal ones. In particular, we prove in the one-dimensional case that there exist cyclic policies -- alternating between discount and profit taking stages -- which secure a greater gain than constant-price policies. On numerical aspects, we develop a policy iteration algorithm with "on-the-fly" generated transitions, specifically adapted to decomposable models, leading to substantial memory savings. We finally apply our results on realistic instances coming from an electricity pricing problem encountered in the retail markets, and numerically observe the emergence of cyclic promotions for sufficient inertia in the customer behavior.
△ Less
Submitted 4 April, 2024; v1 submitted 4 April, 2022;
originally announced April 2022.
-
Tropical reproducing kernels and optimization
Authors:
Pierre-Cyril Aubin-Frankowski,
Stéphane Gaubert
Abstract:
Hilbertian kernel methods and their positive semidefinite kernels have been extensively used in various fields of applied mathematics and machine learning, owing to their several equivalent characterizations. We here unveil an analogy with concepts from tropical geometry, proving that tropical positive semidefinite kernels are also endowed with equivalent viewpoints, stemming from Fenchel-Moreau c…
▽ More
Hilbertian kernel methods and their positive semidefinite kernels have been extensively used in various fields of applied mathematics and machine learning, owing to their several equivalent characterizations. We here unveil an analogy with concepts from tropical geometry, proving that tropical positive semidefinite kernels are also endowed with equivalent viewpoints, stemming from Fenchel-Moreau conjugations. This tropical analogue of Aronszajn's theorem shows that these kernels correspond to a feature map, define monotonous operators, and generate max-plus function spaces endowed with a reproducing property. They furthermore include all the Hilbertian kernels classically studied as well as Monge arrays. However, two relevant notions of tropical reproducing kernels must be distinguished, based either on linear or sesquilinear interpretations. The sesquilinear interpretation is the most expressive one, since reproducing spaces then encompass classical max-plus spaces, such as those of (semi)convex functions. In contrast, in the linear interpretation, the reproducing kernels are characterized by a restrictive condition, von Neumann regularity. Finally, we provide a tropical analogue of the ``representer theorems'', showing that a class of infinite dimensional regression and interpolation problems admit solutions lying in finite dimensional spaces. We illustrate this theorem by an application to optimal control, in which tropical kernels allow one to represent the value function.
△ Less
Submitted 8 January, 2023; v1 submitted 23 February, 2022;
originally announced February 2022.
-
Computing Transience Bounds of Emergency Call Centers: a Hierarchical Timed Petri Net Approach
Authors:
Xavier Allamigeon,
Marin Boyet,
Stephane Gaubert
Abstract:
A fundamental issue in the analysis of emergency call centers is to estimate the time needed to return to a congestion-free regime after an unusual event with a massive arrival of calls. Call centers can generally be represented by timed Petri nets with a hierarchical structure, in which several layers describe the successive steps of treatments of calls. We study a continuous approximation of the…
▽ More
A fundamental issue in the analysis of emergency call centers is to estimate the time needed to return to a congestion-free regime after an unusual event with a massive arrival of calls. Call centers can generally be represented by timed Petri nets with a hierarchical structure, in which several layers describe the successive steps of treatments of calls. We study a continuous approximation of the Petri net dynamics (with infinitesimal tokens). Then, we show that a counter function, measuring the deviation to the stationary regime, coincides with the value function of a semi-Markov decision problem. Then, we establish a finite time convergence result, exploiting the hierarchical structure of the Petri net. We obtain an explicit bound for the transience time, as a function of the initial marking and sojourn times. This is based on methods from the theory of stochastic shortest paths and non-linear Perron--Frobenius theory. We illustrate the bound on a case study of a medical emergency call center.
△ Less
Submitted 6 February, 2022;
originally announced February 2022.
-
No self-concordant barrier interior point method is strongly polynomial
Authors:
Xavier Allamigeon,
Stéphane Gaubert,
Nicolas Vandame
Abstract:
It is an open question to determine if the theory of self-concordant barriers can provide an interior point method with strongly polynomial complexity in linear programming. In the special case of the logarithmic barrier, it was shown in [Allamigeon, Benchimol, Gaubert and Joswig, SIAM J. on Applied Algebra and Geometry, 2018] that the answer is negative. In this paper, we show that none of the se…
▽ More
It is an open question to determine if the theory of self-concordant barriers can provide an interior point method with strongly polynomial complexity in linear programming. In the special case of the logarithmic barrier, it was shown in [Allamigeon, Benchimol, Gaubert and Joswig, SIAM J. on Applied Algebra and Geometry, 2018] that the answer is negative. In this paper, we show that none of the self-concordant barrier interior point methods is strongly polynomial. This result is obtained by establishing that, on parametric families of convex optimization problems, the log-limit of the central path degenerates to a piecewise linear curve, independently of the choice of the barrier function. We provide an explicit linear program that falls in the same class as the Klee-Minty counterexample, i.e., in dimension $n$ with $2n$ constraints, in which the number of iterations is $Ω(2^n)$.
△ Less
Submitted 6 January, 2022;
originally announced January 2022.
-
Quadratic Regularization of Bilevel Pricing Problems and Application to Electricity Retail Markets
Authors:
Quentin Jacquet,
Wim van Ackooij,
Clémence Alasseur,
Stéphane Gaubert
Abstract:
We consider the profit-maximization problem solved by an electricity retailer who aims at designing a menu of contracts. This is an extension of the unit-demand envy-free pricing problem: customers aim to choose a contract maximizing their utility based on a reservation bill and multiple price coefficients (attributes). A basic approach supposes that the customers have deterministic utilities; the…
▽ More
We consider the profit-maximization problem solved by an electricity retailer who aims at designing a menu of contracts. This is an extension of the unit-demand envy-free pricing problem: customers aim to choose a contract maximizing their utility based on a reservation bill and multiple price coefficients (attributes). A basic approach supposes that the customers have deterministic utilities; then, the response of each customer is highly sensitive to price since it concentrates on the best offer. A second classical approach is to consider logit model to add a probabilistic behavior in the customers' choices. To circumvent the intrinsic instability of the former and the resolution difficulties of the latter, we introduce a quadratically regularized model of customer's response, which leads to a quadratic program under complementarity constraints (QPCC). This allows to robustify the deterministic model, while kee** a strong geometrical structure. In particular, we show that the customer's response is governed by a polyhedral complex, in which every polyhedral cell determines a set of contracts which is effectively chosen. Moreover, the deterministic model is recovered as a limit case of the regularized one. We exploit these geometrical properties to develop a pivoting heuristic, which we compare with implicit or non-linear methods from bilevel programming, showing the effectiveness of the approach. Throughout the paper, the electricity retailer problem is our guideline, and we present a numerical study on this application case.
△ Less
Submitted 3 April, 2023; v1 submitted 6 October, 2021;
originally announced October 2021.
-
Multi-stage Stochastic Alternating Current Optimal Power Flow with Storage: Bounding the Relaxation Gap
Authors:
Maxime Grangereau,
Wim van Ackooij,
Stéphane Gaubert
Abstract:
We propose a generic multistage stochastic model for the Alternating Current Optimal Power Flow (AC OPF) problem for radial distribution networks, to account for the random electricity production of renewable energy sources and dynamic constraints of storage systems. We consider single-phase radial networks. Radial three-phase balanced networks (medium-voltage distribution networks typically have…
▽ More
We propose a generic multistage stochastic model for the Alternating Current Optimal Power Flow (AC OPF) problem for radial distribution networks, to account for the random electricity production of renewable energy sources and dynamic constraints of storage systems. We consider single-phase radial networks. Radial three-phase balanced networks (medium-voltage distribution networks typically have this structure) reduce to the former case. This induces a large scale optimization problem, which, given the non-convex nature of the AC OPF, is generally challenging to solve to global optimality. We derive a priori conditions guaranteeing a vanishing relaxation gap for the multi-stage AC OPF problem, which can thus be solved using convex optimization algorithms. We also give an a posteriori upper bound on the relaxation gap. In particular, we show that a null or low relaxation gap may be expected for applications with light reverse power flows or if sufficient storage capacities with low cost are available. Then, we discuss the validity of our results when incorporating voltage regulation devices. Finally, we illustrate our results on problems of planning of a realistic distribution feeder with distributed solar production and storage systems. Scenario trees for solar production are constructed from a stochastic model, by a quantile-based algorithm.
△ Less
Submitted 18 November, 2021; v1 submitted 30 September, 2021;
originally announced September 2021.
-
Ambitropical geometry, hyperconvexity and zero-sum games
Authors:
Marianne Akian,
Stephane Gaubert,
Sara Vannucci
Abstract:
Shapley operators of undiscounted zero-sum two-player games are order-preserving maps that commute with the addition of a constant. We characterize the fixed point sets of Shapley operators, in finite dimension (i.e., for games with a finite state space). Some of these characterizations are of a lattice theoretical nature, whereas some other rely on metric or tropical geometry. More precisely, we…
▽ More
Shapley operators of undiscounted zero-sum two-player games are order-preserving maps that commute with the addition of a constant. We characterize the fixed point sets of Shapley operators, in finite dimension (i.e., for games with a finite state space). Some of these characterizations are of a lattice theoretical nature, whereas some other rely on metric or tropical geometry. More precisely, we show that fixed point sets of Shapley operators are special instances of hyperconvex spaces: they are sup-norm non-expansive retracts of $\R^n$, and also lattices in the induced partial order. Moreover, they retain properties of convex sets, with a notion of ``convex hull'' defined only up to isomorphism. This provides an effective construction of the injective hull or tight span, in the case of additive cones. For deterministic games with finite action spaces, these fixed point sets are supports of polyhedral complexes, with a cell decomposition attached to stationary strategies of the players, in which each cell is an alcoved polyhedron of $A_n$ type. We finally provide an explicit local representation of the latter fixed point sets, as polyhedral fans canonically associated to lattices included in the Boolean hypercube.
△ Less
Submitted 6 July, 2023; v1 submitted 17 August, 2021;
originally announced August 2021.
-
Exact quantization of multistage stochastic linear problems
Authors:
Maël Forcier,
Stéphane Gaubert,
Vincent Leclère
Abstract:
We show that the multistage linear problem (MSLP) with an arbitrary cost distribution is equivalent to a MSLP on a finite scenario tree. We establish this exact quantization result by analyzing the polyhedral structure of MSLPs. In particular, we show that the expected cost-to-go functions are polyhedral and affine on the cells of a chamber complex, which is independent of the cost distribution. T…
▽ More
We show that the multistage linear problem (MSLP) with an arbitrary cost distribution is equivalent to a MSLP on a finite scenario tree. We establish this exact quantization result by analyzing the polyhedral structure of MSLPs. In particular, we show that the expected cost-to-go functions are polyhedral and affine on the cells of a chamber complex, which is independent of the cost distribution. This leads to new complexity results, showing that MSLP is fixed-parameter tractable.
△ Less
Submitted 20 July, 2021;
originally announced July 2021.
-
Tropical linear regression and mean payoff games: or, how to measure the distance to equilibria
Authors:
Marianne Akian,
Stéphane Gaubert,
Yang Qi,
Omar Saadi
Abstract:
We study a tropical linear regression problem consisting in finding the best approximation of a set of points by a tropical hyperplane. We establish a strong duality theorem, showing that the value of this problem coincides with the maximal radius of a Hilbert's ball included in a tropical polyhedron. We also show that this regression problem is polynomial-time equivalent to mean payoff games. We…
▽ More
We study a tropical linear regression problem consisting in finding the best approximation of a set of points by a tropical hyperplane. We establish a strong duality theorem, showing that the value of this problem coincides with the maximal radius of a Hilbert's ball included in a tropical polyhedron. We also show that this regression problem is polynomial-time equivalent to mean payoff games. We illustrate our results by solving an inverse problem from auction theory. In this setting, a tropical hyperplane represents the set of equilibrium prices. Tropical linear regression allows us to quantify the distance of a market to the set of equilibria, and infer secret preferences of a decision maker.
△ Less
Submitted 21 June, 2021; v1 submitted 3 June, 2021;
originally announced June 2021.
-
Tropical complementarity problems and Nash equilibria
Authors:
Xavier Allamigeon,
Stéphane Gaubert,
Frédéric Meunier
Abstract:
Linear complementarity programming is a generalization of linear programming which encompasses the computation of Nash equilibria for bimatrix games. While the latter problem is PPAD-complete, we show that the tropical analogue of the complementarity problem associated with Nash equilibria can be solved in polynomial time. Moreover, we prove that the Lemke--Howson algorithm carries over the tropic…
▽ More
Linear complementarity programming is a generalization of linear programming which encompasses the computation of Nash equilibria for bimatrix games. While the latter problem is PPAD-complete, we show that the tropical analogue of the complementarity problem associated with Nash equilibria can be solved in polynomial time. Moreover, we prove that the Lemke--Howson algorithm carries over the tropical setting and performs a linear number of pivots in the worst case. A consequence of this result is a new class of (classical) bimatrix games for which Nash equilibria computation can be done in polynomial time.
△ Less
Submitted 4 November, 2022; v1 submitted 9 December, 2020;
originally announced December 2020.
-
The tropicalization of the entropic barrier
Authors:
Xavier Allamigeon,
Abdellah Aznag,
Stéphane Gaubert,
Yassine Hamdi
Abstract:
The entropic barrier, studied by Bubeck and Eldan (Proc. Mach. Learn. Research, 2015), is a self-concordant barrier with asymptotically optimal self-concordance parameter. In this paper, we study the tropicalization of the central path associated with the entropic barrier, i.e., the logarithmic limit of this central path for a parametric family of linear programs defined over the field of Puiseux…
▽ More
The entropic barrier, studied by Bubeck and Eldan (Proc. Mach. Learn. Research, 2015), is a self-concordant barrier with asymptotically optimal self-concordance parameter. In this paper, we study the tropicalization of the central path associated with the entropic barrier, i.e., the logarithmic limit of this central path for a parametric family of linear programs defined over the field of Puiseux series. Our main result is that the tropicalization of the entropic central path is a piecewise linear curve which coincides with the tropicalization of the logarithmic central path studied by Allamigeon et al. (SIAM J. Applied Alg. Geom., 2018). One consequence is that the number of linear pieces in the tropical entropic central path can be exponential in the dimension and the number of inequalities defining the linear program.
△ Less
Submitted 20 October, 2020;
originally announced October 2020.
-
Multiply Accelerated Value Iteration for Non-Symmetric Affine Fixed Point Problems and application to Markov Decision Processes
Authors:
Marianne Akian,
Stéphane Gaubert,
Zheng Qu,
Omar Saadi
Abstract:
We analyze a modified version of Nesterov accelerated gradient algorithm, which applies to affine fixed point problems with non self-adjoint matrices, such as the ones appearing in the theory of Markov decision processes with discounted or mean payoff criteria. We characterize the spectra of matrices for which this algorithm does converge with an accelerated asymptotic rate. We also introduce a…
▽ More
We analyze a modified version of Nesterov accelerated gradient algorithm, which applies to affine fixed point problems with non self-adjoint matrices, such as the ones appearing in the theory of Markov decision processes with discounted or mean payoff criteria. We characterize the spectra of matrices for which this algorithm does converge with an accelerated asymptotic rate. We also introduce a $d$th-order algorithm, and show that it yields a multiply accelerated rate under more demanding conditions on the spectrum. We subsequently apply these methods to develop accelerated schemes for non-linear fixed point problems arising from Markov decision processes. This is illustrated by numerical experiments.
△ Less
Submitted 2 July, 2021; v1 submitted 22 September, 2020;
originally announced September 2020.
-
Probabilistic and mean-field model of COVID-19 epidemics with user mobility and contact tracing
Authors:
M. Akian,
L. Ganassali,
S. Gaubert,
L. Massoulié
Abstract:
We propose a detailed discrete-time model of COVID-19 epidemics coming in two flavours, mean-field and probabilistic. The main contribution lies in several extensions of the basic model that capture i) user mobility - distinguishing routing, i.e. change of residence, from commuting, i.e. daily mobility - and ii) contact tracing procedures. We confront this model to public data on daily hospitaliza…
▽ More
We propose a detailed discrete-time model of COVID-19 epidemics coming in two flavours, mean-field and probabilistic. The main contribution lies in several extensions of the basic model that capture i) user mobility - distinguishing routing, i.e. change of residence, from commuting, i.e. daily mobility - and ii) contact tracing procedures. We confront this model to public data on daily hospitalizations, and discuss its application as well as underlying estimation procedures.
△ Less
Submitted 11 September, 2020;
originally announced September 2020.
-
Understanding and monitoring the evolution of the Covid-19 epidemic from medical emergency calls: the example of the Paris area
Authors:
Stéphane Gaubert,
Marianne Akian,
Xavier Allamigeon,
Marin Boyet,
Baptiste Colin,
Théotime Grohens,
Laurent Massoulié,
David P. Parsons,
Frédéric Adnet,
Érick Chanzy,
Laurent Goix,
Frédéric Lapostolle,
Éric Lecarpentier,
Christophe Leroy,
Thomas Loeb,
Jean-Sébastien Marx,
Caroline Télion,
Laurent Tréluyer,
Pierre Carli
Abstract:
We portray the evolution of the Covid-19 epidemic during the crisis of March-April 2020 in the Paris area, by analyzing the medical emergency calls received by the EMS of the four central departments of this area (Centre 15 of SAMU 75, 92, 93 and 94). Our study reveals strong dissimilarities between these departments. We show that the logarithm of each epidemic observable can be approximated by a…
▽ More
We portray the evolution of the Covid-19 epidemic during the crisis of March-April 2020 in the Paris area, by analyzing the medical emergency calls received by the EMS of the four central departments of this area (Centre 15 of SAMU 75, 92, 93 and 94). Our study reveals strong dissimilarities between these departments. We show that the logarithm of each epidemic observable can be approximated by a piecewise linear function of time. This allows us to distinguish the different phases of the epidemic, and to identify the delay between sanitary measures and their influence on the load of EMS. This also leads to an algorithm, allowing one to detect epidemic resurgences. We rely on a transport PDE epidemiological model, and we use methods from Perron-Frobenius theory and tropical geometry.
△ Less
Submitted 20 July, 2020; v1 submitted 28 May, 2020;
originally announced May 2020.
-
A convex programming approach to solve posynomial systems
Authors:
Marianne Akian,
Xavier Allamigeon,
Marin Boyet,
Stéphane Gaubert
Abstract:
We exhibit a class of classical or tropical posynomial systems which can be solved by reduction to linear or convex programming problems. This relies on a notion of colorful vectors with respect to a collection of Newton polytopes. This extends the convex programming approach of one player stochastic games.
We exhibit a class of classical or tropical posynomial systems which can be solved by reduction to linear or convex programming problems. This relies on a notion of colorful vectors with respect to a collection of Newton polytopes. This extends the convex programming approach of one player stochastic games.
△ Less
Submitted 14 May, 2020;
originally announced May 2020.
-
Piecewise Affine Dynamical Models of Timed Petri Nets -- Application to Emergency Call Centers
Authors:
Xavier Allamigeon,
Marin Boyet,
Stéphane Gaubert
Abstract:
We study timed Petri nets, with preselection and priority routing. We represent the behavior of these systems by piecewise affine dynamical systems. We use tools from the theory of nonexpansive map**s to analyze these systems. We establishan equivalence theorem between priority-free fluid timed Petri nets and semi-Markov decision processes, from which we derive the convergence to a periodic regi…
▽ More
We study timed Petri nets, with preselection and priority routing. We represent the behavior of these systems by piecewise affine dynamical systems. We use tools from the theory of nonexpansive map**s to analyze these systems. We establishan equivalence theorem between priority-free fluid timed Petri nets and semi-Markov decision processes, from which we derive the convergence to a periodic regime and the polynomial-time computability of the throughput. More generally, we develop an approach inspired by tropical geometry, characterizing the congestion phases as the cells of a polyhedral complex. We illustrate these results by a current application to the performance evaluation of emergency call centers in the Paris area. We show that priorities can lead to a paradoxical behavior: in certain regimes, the throughput of the most prioritary task may not be an increasing function of the resources.
△ Less
Submitted 9 December, 2021; v1 submitted 20 April, 2020;
originally announced April 2020.
-
Tropical planar networks
Authors:
Stephane Gaubert,
Adi Niv
Abstract:
We show that every tropical totally positive matrix can be uniquely represented as the transfer matrix of a canonical totally connected weighted planar network. We deduce a uniqueness theorem for the factorization of a tropical totally positive in terms of elementary Jacobi matrices.
We show that every tropical totally positive matrix can be uniquely represented as the transfer matrix of a canonical totally connected weighted planar network. We deduce a uniqueness theorem for the factorization of a tropical totally positive in terms of elementary Jacobi matrices.
△ Less
Submitted 28 October, 2019;
originally announced October 2019.
-
Solving Ergodic Markov Decision Processes and Perfect Information Zero-sum Stochastic Games by Variance Reduced Deflated Value Iteration
Authors:
Marianne Akian,
Stéphane Gaubert,
Zheng Qu,
Omar Saadi
Abstract:
Recently, Sidford, Wang, Wu and Ye (2018) developed an algorithm combining variance reduction techniques with value iteration to solve discounted Markov decision processes. This algorithm has a sublinear complexity when the discount factor is fixed. Here, we extend this approach to mean-payoff problems, including both Markov decision processes and perfect information zero-sum stochastic games. We…
▽ More
Recently, Sidford, Wang, Wu and Ye (2018) developed an algorithm combining variance reduction techniques with value iteration to solve discounted Markov decision processes. This algorithm has a sublinear complexity when the discount factor is fixed. Here, we extend this approach to mean-payoff problems, including both Markov decision processes and perfect information zero-sum stochastic games. We obtain sublinear complexity bounds, assuming there is a distinguished state which is accessible from all initial states and for all policies. Our method is based on a reduction from the mean payoff problem to the discounted problem by a Doob h-transform, combined with a deflation technique. The complexity analysis of this algorithm uses at the same time the techniques developed by Sidford et al. in the discounted case and non-linear spectral theory techniques (Collatz-Wielandt characterization of the eigenvalue).
△ Less
Submitted 13 September, 2019;
originally announced September 2019.
-
A Privacy-preserving Method to Optimize Distributed Resource Allocation
Authors:
Olivier Beaude,
Pascal Benchimol,
Stéphane Gaubert,
Paulin Jacquot,
Nadia Oudjane
Abstract:
We consider a resource allocation problem involving a large number of agents with individual constraints subject to privacy, and a central operator whose objective is to optimize a global, possibly nonconvex, cost while satisfying the agents' constraints, for instance an energy operator in charge of the management of energy consumption flexibilities of many individual consumers. We provide a priva…
▽ More
We consider a resource allocation problem involving a large number of agents with individual constraints subject to privacy, and a central operator whose objective is to optimize a global, possibly nonconvex, cost while satisfying the agents' constraints, for instance an energy operator in charge of the management of energy consumption flexibilities of many individual consumers. We provide a privacy-preserving algorithm that does compute the optimal allocation of resources, avoiding each agent to reveal her private information (constraints and individual solution profile) neither to the central operator nor to a third party. Our method relies on an aggregation procedure: we compute iteratively a global allocation of resources, and gradually ensure existence of a disaggregation, that is individual profiles satisfying agents' private constraints, by a protocol involving the generation of polyhedral cuts and secure multiparty computations (SMC). To obtain these cuts, we use an alternate projection method, which is implemented locally by each agent, preserving her privacy needs. We adress especially the case in which the local and global constraints define a transportation polytope. Then, we provide theoretical convergence estimates together with numerical results, showing that the algorithm can be effectively used to solve the allocation problem in high dimension, while addressing privacy issues.
△ Less
Submitted 22 June, 2020; v1 submitted 7 August, 2019;
originally announced August 2019.
-
A Universal Approximation Result for Difference of log-sum-exp Neural Networks
Authors:
Giuseppe C. Calafiore,
Stephane Gaubert,
Member,
Corrado Possieri
Abstract:
We show that a neural network whose output is obtained as the difference of the outputs of two feedforward networks with exponential activation function in the hidden layer and logarithmic activation function in the output node (LSE networks) is a smooth universal approximator of continuous functions over convex, compact sets. By using a logarithmic transform, this class of networks maps to a fami…
▽ More
We show that a neural network whose output is obtained as the difference of the outputs of two feedforward networks with exponential activation function in the hidden layer and logarithmic activation function in the output node (LSE networks) is a smooth universal approximator of continuous functions over convex, compact sets. By using a logarithmic transform, this class of networks maps to a family of subtraction-free ratios of generalized posynomials, which we also show to be universal approximators of positive functions over log-convex, compact subsets of the positive orthant. The main advantage of Difference-LSE networks with respect to classical feedforward neural networks is that, after a standard training phase, they provide surrogate models for design that possess a specific difference-of-convex-functions form, which makes them optimizable via relatively efficient numerical methods. In particular, by adapting an existing difference-of-convex algorithm to these models, we obtain an algorithm for performing effective optimization-based design. We illustrate the proposed approach by applying it to data-driven design of a diet for a patient with type-2 diabetes.
△ Less
Submitted 21 May, 2019;
originally announced May 2019.
-
The operator approach to entropy games
Authors:
Marianne Akian,
Stéphane Gaubert,
Julien Grand-Clément,
Jérémie Guillaud
Abstract:
Entropy games and matrix multiplication games have been recently introduced by Asarin et al. They model the situation in which one player (Despot) wishes to minimize the growth rate of a matrix product, whereas the other player (Tribune) wishes to maximize it. We develop an operator approach to entropy games. This allows us to show that entropy games can be cast as stochastic mean payoff games in…
▽ More
Entropy games and matrix multiplication games have been recently introduced by Asarin et al. They model the situation in which one player (Despot) wishes to minimize the growth rate of a matrix product, whereas the other player (Tribune) wishes to maximize it. We develop an operator approach to entropy games. This allows us to show that entropy games can be cast as stochastic mean payoff games in which some action spaces are simplices and payments are given by a relative entropy (Kullback-Leibler divergence). In this way, we show that entropy games with a fixed number of states belonging to Despot can be solved in polynomial time. This approach also allows us to solve these games by a policy iteration algorithm, which we compare with the spectral simplex algorithm developed by Protasov.
△ Less
Submitted 10 April, 2019;
originally announced April 2019.
-
A Privacy-preserving Disaggregation Algorithm for Non-intrusive Management of Flexible Energy
Authors:
Paulin Jacquot,
Olivier Beaude,
Pascal Benchimol,
Stéphane Gaubert,
Nadia Oudjane
Abstract:
We consider a resource allocation problem involving a large number of agents with individual constraints subject to privacy, and a central operator whose objective is to optimizing a global, possibly non-convex, cost while satisfying the agents'c onstraints. We focus on the practical case of the management of energy consumption flexibilities by the operator of a microgrid. This paper provides a pr…
▽ More
We consider a resource allocation problem involving a large number of agents with individual constraints subject to privacy, and a central operator whose objective is to optimizing a global, possibly non-convex, cost while satisfying the agents'c onstraints. We focus on the practical case of the management of energy consumption flexibilities by the operator of a microgrid. This paper provides a privacy-preserving algorithm that does compute the optimal allocation of resources, avoiding each agent to reveal her private information (constraints and individual solution profile) neither to the central operator nor to a third party. Our method relies on an aggregation procedure: we maintain a global allocation of resources, and gradually disaggregate this allocation to enforce the satisfaction of private contraints, by a protocol involving the generation of polyhedral cuts and secure multiparty computations (SMC). To obtain these cuts, we use an alternate projections method à la Von Neumann, which is implemented locally by each agent, preserving her privacy needs. Our theoretical and numerical results show that the method scales well as the number of agents gets large, and thus can be used to solve the allocation problem in high dimension, while addressing privacy issues.
△ Less
Submitted 7 March, 2019;
originally announced March 2019.
-
A bilevel optimization model for load balancing in mobile networks through price incentives
Authors:
Marianne Akian,
Mustapha Bouhtou,
Jean Bernard Eytard,
Stéphane Gaubert
Abstract:
We propose a model of incentives for data pricing in large mobile networks, in which an operator wishes to balance the number of connections (active users) of different classes of users in the different cells and at different time instants, in order to ensure them a sufficient quality of service. We assume that each user has a given total demand per day for different types of applications, which h…
▽ More
We propose a model of incentives for data pricing in large mobile networks, in which an operator wishes to balance the number of connections (active users) of different classes of users in the different cells and at different time instants, in order to ensure them a sufficient quality of service. We assume that each user has a given total demand per day for different types of applications, which he may assign to different time slots and locations, depending on his own mobility, on his preferences and on price discounts proposed by the operator. We show that this can be cast as a bilevel programming problem with a special structure allowing us to develop a polynomial time decomposition algorithm suitable for large networks. First, we determine the optimal number of connections (which maximizes a measure of balance); next, we solve an inverse problem and determine the prices generating this traffic. Our results exploit a recently developed application of tropical geometry methods to mixed auction problems, as well as algorithms in discrete convexity (minimization of discrete convex functions in the sense of Murota). We finally present an application on real data provided by Orange and we show the efficiency of the model to reduce the peaks of congestion.
△ Less
Submitted 8 January, 2019;
originally announced January 2019.
-
Matrix versions of the Hellinger distance
Authors:
Rajendra Bhatia,
Stephane Gaubert,
Tanvi Jain
Abstract:
On the space of positive definite matrices we consider distance functions of the form $d(A,B)=\left[\tr\mathcal{A}(A,B)-\tr\mathcal{G}(A,B)\right]^{1/2},$ where $\mathcal{A}(A,B)$ is the arithmetic mean and $\mathcal{G}(A,B)$ is one of the different versions of the geometric mean. When $\mathcal{G}(A,B)=A^{1/2}B^{1/2}$ this distance is $\|A^{1/2}-B^{1/2}\|_2,$ and when…
▽ More
On the space of positive definite matrices we consider distance functions of the form $d(A,B)=\left[\tr\mathcal{A}(A,B)-\tr\mathcal{G}(A,B)\right]^{1/2},$ where $\mathcal{A}(A,B)$ is the arithmetic mean and $\mathcal{G}(A,B)$ is one of the different versions of the geometric mean. When $\mathcal{G}(A,B)=A^{1/2}B^{1/2}$ this distance is $\|A^{1/2}-B^{1/2}\|_2,$ and when $\mathcal{G}(A,B)=(A^{1/2}BA^{1/2})^{1/2}$ it is the Bures-Wasserstein metric. We study two other cases: $\mathcal{G}(A,B)=A^{1/2}(A^{-1/2}BA^{-1/2})^{1/2}A^{1/2},$ the Pusz-Woronowicz geometric mean, and $\mathcal{G}(A,B)=\exp\big(\frac{\log A+\log B}{2}\big),$ the log Euclidean mean. With these choices $d(A,B)$ is no longer a metric, but it turns out that $d^2(A,B)$ is a divergence. We establish some (strict) convexity properties of these divergences. We obtain characterisations of barycentres of $m$ positive definite matrices with respect to these distance measures.
△ Less
Submitted 8 April, 2020; v1 submitted 5 January, 2019;
originally announced January 2019.
-
A game theory approach to the existence and uniqueness of nonlinear Perron-Frobenius eigenvectors
Authors:
Marianne Akian,
Stéphane Gaubert,
Antoine Hochart
Abstract:
We establish a generalized Perron-Frobenius theorem, based on a combinatorial criterion which entails the existence of an eigenvector for any nonlinear order-preserving and positively homogeneous map $f$ acting on the open orthant $\mathbb{R}_{\scriptscriptstyle >0}^n$. This criterion involves dominions, i.e., sets of states that can be made invariant by one player in a two-person game that only d…
▽ More
We establish a generalized Perron-Frobenius theorem, based on a combinatorial criterion which entails the existence of an eigenvector for any nonlinear order-preserving and positively homogeneous map $f$ acting on the open orthant $\mathbb{R}_{\scriptscriptstyle >0}^n$. This criterion involves dominions, i.e., sets of states that can be made invariant by one player in a two-person game that only depends on the behavior of $f$ "at infinity". In this way, we characterize the situation in which for all $α, β> 0$, the "slice space" $\mathcal{S}_α^β:= \{ x \in \mathbb{R}_{\scriptscriptstyle >0}^n \mid αx \leq f(x) \leq βx \}$ is bounded in Hilbert's projective metric, or, equivalently, for all uniform perturbations $g$ of $f$, all the orbits of $g$ are bounded in Hilbert's projective metric. This solves a problem raised by Gaubert and Gunawardena (Trans. AMS, 2004). We also show that the uniqueness of an eigenvector is characterized by a dominion condition, involving a different game depending now on the local behavior of $f$ near an eigenvector. We show that the dominion conditions can be verified by directed hypergraph methods. We finally illustrate these results by considering specific classes of nonlinear maps, including Shapley operators, generalized means and nonnegative tensors.
△ Less
Submitted 24 December, 2018;
originally announced December 2018.
-
Log-sum-exp neural networks and posynomial models for convex and log-log-convex data
Authors:
Giuseppe C. Calafiore,
Stephane Gaubert,
Corrado Possieri
Abstract:
We show in this paper that a one-layer feedforward neural network with exponential activation functions in the inner layer and logarithmic activation in the output neuron is an universal approximator of convex functions. Such a network represents a family of scaled log-sum exponential functions, here named LSET. Under a suitable exponential transformation, the class of LSET functions maps to a fam…
▽ More
We show in this paper that a one-layer feedforward neural network with exponential activation functions in the inner layer and logarithmic activation in the output neuron is an universal approximator of convex functions. Such a network represents a family of scaled log-sum exponential functions, here named LSET. Under a suitable exponential transformation, the class of LSET functions maps to a family of generalized posynomials GPOST, which we similarly show to be universal approximators for log-log-convex functions. A key feature of an LSET network is that, once it is trained on data, the resulting model is convex in the variables, which makes it readily amenable to efficient design based on convex optimization. Similarly, once a GPOST model is trained on data, it yields a posynomial model that can be efficiently optimized with respect to its variables by using geometric programming (GP). The proposed methodology is illustrated by two numerical examples, in which, first, models are constructed from simulation data of the two physical processes (namely, the level of vibration in a vehicle suspension system, and the peak power generated by the combustion of propane), and then optimization-based design is performed on these models.
△ Less
Submitted 8 December, 2018; v1 submitted 20 June, 2018;
originally announced June 2018.
-
A convergent hierarchy of non-linear eigenproblems to compute the joint spectral radius of nonnegative matrices
Authors:
Stephane Gaubert,
Nikolas Stott
Abstract:
We show that the joint spectral radius of a finite collection of nonnegative matrices can be bounded by the eigenvalue of a non-linear operator. This eigenvalue coincides with the ergodic constant of a risk-sensitive control problem, or of an entropy game, in which the state space consists of all switching sequences of a given length. We show that, by increasing this length, we arrive at a converg…
▽ More
We show that the joint spectral radius of a finite collection of nonnegative matrices can be bounded by the eigenvalue of a non-linear operator. This eigenvalue coincides with the ergodic constant of a risk-sensitive control problem, or of an entropy game, in which the state space consists of all switching sequences of a given length. We show that, by increasing this length, we arrive at a convergent approximation scheme to compute the joint spectral radius. The complexity of this method is exponential in the length of the switching sequences, but it is quite insensitive to the size of the matrices, allowing us to solve very large scale instances (several matrices in dimensions of order 1000 within a minute). An idea of this method is to replace a hierarchy of optimization problems, introduced by Ahmadi, Jungers, Parrilo and Roozbehani, by a hierarchy of nonlinear eigenproblems. To solve the latter eigenproblems, we introduce a projective version of Krasnoselskii-Mann iteration. This method is of independent interest as it applies more generally to the nonlinear eigenproblem for a monotone positively homogeneous map. Here, this method allows for scalability by avoiding the recourse to linear or semidefinite programming techniques.
△ Less
Submitted 8 May, 2018;
originally announced May 2018.
-
Spectral inequalities for nonnegative tensors and their tropical analogues
Authors:
Shmuel Friedland,
Stéphane Gaubert
Abstract:
We extend some characterizations and inequalities for the eigenvalues of nonnegative matrices, such as Donsker-Varadhan, Friedland-Karlin, Karlin-Ost inequalities, to nonnegative tensors. Our approach involves a correspondence between nonnegative tensors, ergodic control and entropy maximization: we show in particular that the logarithm of the spectral radius of a tensor is given by en entropy max…
▽ More
We extend some characterizations and inequalities for the eigenvalues of nonnegative matrices, such as Donsker-Varadhan, Friedland-Karlin, Karlin-Ost inequalities, to nonnegative tensors. Our approach involves a correspondence between nonnegative tensors, ergodic control and entropy maximization: we show in particular that the logarithm of the spectral radius of a tensor is given by en entropy maximization problem over a space of occupation measures. We study in particular the tropical analogue of the spectral radius, that we characterize as a limit of the classical spectral radius, and we give an explicit combinatorial formula for this tropical spectral radius.
△ Less
Submitted 13 September, 2019; v1 submitted 31 March, 2018;
originally announced April 2018.
-
Condition numbers of stochastic mean payoff games and what they say about nonarchimedean semidefinite programming
Authors:
Xavier Allamigeon,
Stéphane Gaubert,
Ricardo D. Katz,
Mateusz Skomra
Abstract:
Semidefinite programming can be considered over any real closed field, including fields of Puiseux series equipped with their nonarchimedean valuation. Nonarchimedean semidefinite programs encode parametric families of classical semidefinite programs, for sufficiently large values of the parameter. Recently, a correspondence has been established between nonarchimedean semidefinite programs and sto…
▽ More
Semidefinite programming can be considered over any real closed field, including fields of Puiseux series equipped with their nonarchimedean valuation. Nonarchimedean semidefinite programs encode parametric families of classical semidefinite programs, for sufficiently large values of the parameter. Recently, a correspondence has been established between nonarchimedean semidefinite programs and stochastic mean payoff games with perfect information. This correspondence relies on tropical geometry. It allows one to solve generic nonarchimedean semidefinite feasibility problems, of large scale, by means of stochastic game algorithms. In this paper, we show that the mean payoff of these games can be interpreted as a condition number for the corresponding nonarchimedean feasibility problems. This number measures how close a feasible instance is from being infeasible, and vice versa. We show that it coincides with the maximal radius of a ball in Hilbert's projective metric, that is included in the feasible set. The geometric interpretation of the condition number relies in particular on a duality theorem for tropical semidefinite feasibility programs. Then, we bound the complexity of the feasibility problem in terms of the condition number. We finally give explicit bounds for this condition number, in terms of the characteristics of the stochastic game. As a consequence, we show that the simplest algorithm to decide whether a stochastic mean payoff game is winning, namely value iteration, has a pseudopolynomial complexity when the number of random positions is fixed.
△ Less
Submitted 21 February, 2018;
originally announced February 2018.
-
The tropical analogue of the Helton-Nie conjecture is true
Authors:
Xavier Allamigeon,
Stéphane Gaubert,
Mateusz Skomra
Abstract:
Helton and Nie conjectured that every convex semialgebraic set over the field of real numbers can be written as the projection of a spectrahedron. Recently, Scheiderer disproved this conjecture. We show, however, that the following result, which may be thought of as a tropical analogue of this conjecture, is true: over a real closed nonarchimedean field of Puiseux series, the convex semialgebraic…
▽ More
Helton and Nie conjectured that every convex semialgebraic set over the field of real numbers can be written as the projection of a spectrahedron. Recently, Scheiderer disproved this conjecture. We show, however, that the following result, which may be thought of as a tropical analogue of this conjecture, is true: over a real closed nonarchimedean field of Puiseux series, the convex semialgebraic sets and the projections of spectrahedra have precisely the same images by the nonarchimedean valuation. The proof relies on game theory methods.
△ Less
Submitted 6 January, 2018;
originally announced January 2018.
-
Analysis and Implementation of a Hourly Billing Mechanism for Demand Response Management
Authors:
Paulin Jacquot,
Olivier Beaude,
Stéphane Gaubert,
Nadia Oudjane
Abstract:
An important part of the Smart Grid literature on residential Demand Response deals with game-theoretic consumption models. Among those papers, the hourly billing model is of special interest as an intuitive and fair mechanism. We focus on this model and answer to several theoretical and practical questions. First, we prove the uniqueness of the consumption profile corresponding to the Nash equili…
▽ More
An important part of the Smart Grid literature on residential Demand Response deals with game-theoretic consumption models. Among those papers, the hourly billing model is of special interest as an intuitive and fair mechanism. We focus on this model and answer to several theoretical and practical questions. First, we prove the uniqueness of the consumption profile corresponding to the Nash equilibrium, and we analyze its efficiency by providing a bound on the Price of Anarchy. Next, we address the computational issue of the equilibrium profile by providing two algorithms: the cycling best response dynamics and a projected gradient descent method, and by giving an upper bound on their convergence rate to the equilibrium. Last, we simulate this demand response framework in a stochastic environment where the parameters depend on forecasts. We show numerically the relevance of an online demand response procedure, which reduces the impact of inaccurate forecasts.
△ Less
Submitted 22 December, 2017;
originally announced December 2017.
-
Demand Response in the Smart Grid: the Impact of Consumers Temporal Preferences
Authors:
Paulin Jacquot,
Olivier Beaude,
Nadia Oudjane,
Stephane Gaubert
Abstract:
In Demand Response programs, price incentives might not be sufficient to modify residential consumers load profile. Here, we consider that each consumer has a preferred profile and a discomfort cost when deviating from it. Consumers can value this discomfort at a varying level that we take as a parameter. This work analyses Demand Response as a game theoretic environment. We study the equilibria o…
▽ More
In Demand Response programs, price incentives might not be sufficient to modify residential consumers load profile. Here, we consider that each consumer has a preferred profile and a discomfort cost when deviating from it. Consumers can value this discomfort at a varying level that we take as a parameter. This work analyses Demand Response as a game theoretic environment. We study the equilibria of the game between consumers with preferences within two different dynamic pricing mechanisms, respectively the daily proportional mechanism introduced by Mohsenian-Rad et al, and an hourly proportional mechanism. We give new results about equilibria as functions of the preference level in the case of quadratic system costs and prove that, whatever the preference level, system costs are smaller with the hourly mechanism. We simulate the Demand Response environment using real consumption data from PecanStreet database. While the Price of Anarchy remains always close to one up to 0.1% with the hourly mechanism, it can be more than 10% bigger with the daily mechanism.
△ Less
Submitted 30 November, 2017;
originally announced November 2017.
-
Demand Side Management in the Smart Grid: an Efficiency and Fairness Tradeoff
Authors:
Paulin Jacquot,
Olivier Beaude,
Stéphane Gaubert,
Nadia Oudjane
Abstract:
We compare two Demand Side Management (DSM) mechanisms, introduced respectively by Mohsenian-Rad et al (2010) and Baharlouei et al (2012), in terms of efficiency and fairness. Each mechanism defines a game where the consumers optimize their flexible consumption to reduce their electricity bills. Mohsenian-Rad et al propose a daily mechanism for which they prove the social optimality. Baharlouei et…
▽ More
We compare two Demand Side Management (DSM) mechanisms, introduced respectively by Mohsenian-Rad et al (2010) and Baharlouei et al (2012), in terms of efficiency and fairness. Each mechanism defines a game where the consumers optimize their flexible consumption to reduce their electricity bills. Mohsenian-Rad et al propose a daily mechanism for which they prove the social optimality. Baharlouei et al propose a hourly billing mechanism for which we give theoretical results: we prove the uniqueness of an equilibrium in the associated game and give an upper bound on its price of anarchy. We evaluate numerically the two mechanisms, using real consumption data from Pecan Street Inc. The simulations show that the equilibrium reached with the hourly mechanism is socially optimal up to 0.1%, and that it achieves an important fairness property according to a quantitative indicator we define. We observe that the two DSM mechanisms avoid the synchronization effect induced by non- game theoretic mechanisms, e.g. Peak/OffPeak hours contracts.
△ Less
Submitted 29 November, 2017;
originally announced November 2017.
-
Log-barrier interior point methods are not strongly polynomial
Authors:
Xavier Allamigeon,
Pascal Benchimol,
Stéphane Gaubert,
Michael Joswig
Abstract:
We prove that primal-dual log-barrier interior point methods are not strongly polynomial, by constructing a family of linear programs with $3r+1$ inequalities in dimension $2r$ for which the number of iterations performed is in $Ω(2^r)$. The total curvature of the central path of these linear programs is also exponential in $r$, disproving a continuous analogue of the Hirsch conjecture proposed by…
▽ More
We prove that primal-dual log-barrier interior point methods are not strongly polynomial, by constructing a family of linear programs with $3r+1$ inequalities in dimension $2r$ for which the number of iterations performed is in $Ω(2^r)$. The total curvature of the central path of these linear programs is also exponential in $r$, disproving a continuous analogue of the Hirsch conjecture proposed by Deza, Terlaky and Zinchenko. Our method is to tropicalize the central path in linear programming. The tropical central path is the piecewise-linear limit of the central paths of parameterized families of classical linear programs viewed through logarithmic glasses. This allows us to provide combinatorial lower bounds for the number of iterations and the total curvature, in a general setting.
△ Less
Submitted 8 August, 2017; v1 submitted 4 August, 2017;
originally announced August 2017.
-
Approximating the Volume of Tropical Polytopes is Difficult
Authors:
Stephane Gaubert,
Marie MacCaig
Abstract:
We investigate the complexity of counting the number of integer points in tropical polytopes, and the complexity of calculating their volume. We study the tropical analogue of the outer parallel body and establish bounds for its volume. We deduce that there is no approximation algorithm of factor $α=2^{\text{poly}(m,n)}$ for the volume of a tropical polytope given by $n$ vertices in a space of dim…
▽ More
We investigate the complexity of counting the number of integer points in tropical polytopes, and the complexity of calculating their volume. We study the tropical analogue of the outer parallel body and establish bounds for its volume. We deduce that there is no approximation algorithm of factor $α=2^{\text{poly}(m,n)}$ for the volume of a tropical polytope given by $n$ vertices in a space of dimension $m$, unless P$=$NP. Neither is there such an approximation algorithm for counting the number of integer points in tropical polytopes described by vertices. If follows that approximating these values for tropical polytopes is more difficult than for classical polytopes. Our proofs use a reduction from the problem of calculating the tropical rank. For tropical polytopes described by inequalities we prove that counting the number of integer points and calculating the volume are $\#$P-hard.
△ Less
Submitted 20 June, 2017;
originally announced June 2017.
-
Tropical Kraus maps for optimal control of switched systems
Authors:
Stéphane Gaubert,
Nikolas Stott
Abstract:
Kraus maps (completely positive trace preserving maps) arise classically in quantum information, as they describe the evolution of noncommutative probability measures. We introduce tropical analogues of Kraus maps, obtained by replacing the addition of positive semidefinite matrices by a multivalued supremum with respect to the Löwner order. We show that non-linear eigenvectors of tropical Kraus m…
▽ More
Kraus maps (completely positive trace preserving maps) arise classically in quantum information, as they describe the evolution of noncommutative probability measures. We introduce tropical analogues of Kraus maps, obtained by replacing the addition of positive semidefinite matrices by a multivalued supremum with respect to the Löwner order. We show that non-linear eigenvectors of tropical Kraus maps determine piecewise quadratic approximations of the value functions of switched optimal control problems. This leads to a new approximation method, which we illustrate by two applications: 1) approximating the joint spectral radius, 2) computing approximate solutions of Hamilton-Jacobi PDE arising from a class of switched linear quadratic problems studied previously by McEneaney. We report numerical experiments, indicating a major improvement in terms of scalability by comparison with earlier numerical schemes, owing to the "LMI-free" nature of our method.
△ Less
Submitted 10 November, 2017; v1 submitted 14 June, 2017;
originally announced June 2017.