Revisit the Arimoto-Blahut algorithm: New Analysis with Approximation

Authors: Michail Fasoulakis, Konstantinos Varsos, Apostolos Traganitis

Abstract: Since the seminal paper of Claude Shannon \cite{Shannon48}, the computation of the capacity of a discrete memoryless channel has been considered one of the most important and fundamental problems in Information Theory. Nearly 50 years ago, Arimoto and Blahut independently proposed identical algorithms to solve this problem in their seminal papers \cite{Arimoto1972AnAF, Blahut1972ComputationOC}. The Arimoto-Blahut algorithm was proven to converge to the capacity of the channel as $t \to \infty$, with the convergence rate upper bounded by $O\left(\log(m)/t\right)$, where $m$ is the size of the input distribution, and being inverse exponential when there is a unique solution in the interior of the input probability simplex \cite{Arimoto1972AnAF}. Recently it was proved, in \cite{Nakagawa2020AnalysisOT}, that the convergence rate is at worst inverse linear $O(1/t)$ in some specific cases. In this paper, we revisit this fundamental algorithm, looking at the rate of convergence to the capacity and the time complexity, given $m,n$, where $n$ is the size of the output of the channel, focusing on the approximation of the capacity. We prove that the rate of convergence to an $\varepsilon$-optimal solution, for any constant $\varepsilon > 0$, is inverse exponential $O\left(\log(m)/c^t\right)$, for a constant $c > 1$, and that at most $O\left(\log \left(\log (m)/\varepsilon\right)\right)$ iterations are needed, implying $O\left(m n\log \left(\log (m)/\varepsilon\right)\right)$ total complexity of the algorithm.
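
For orientation, the classical Arimoto-Blahut iteration that this abstract analyzes can be sketched as follows. This is the textbook multiplicative update, not code from the paper; the function name `arimoto_blahut`, the row-stochastic layout `W[i][j] = P(output j | input i)`, the fixed iteration count, and the use of nats are all assumptions made for illustration. Each pass over the channel matrix costs $O(mn)$ operations, which is where the $mn$ factor in the stated complexity comes from.

```python
import math


def _divergences(W, p):
    """Return D[i] = KL(W[i] || q), where q is the output law induced by p."""
    m, n = len(W), len(W[0])
    q = [sum(p[i] * W[i][j] for i in range(m)) for j in range(n)]
    return [
        sum(W[i][j] * math.log(W[i][j] / q[j]) for j in range(n) if W[i][j] > 0)
        for i in range(m)
    ]


def arimoto_blahut(W, iterations=200):
    """Estimate channel capacity (in nats) and a maximizing input distribution.

    W is an m x n row-stochastic matrix (hypothetical layout, see lead-in).
    """
    m = len(W)
    p = [1.0 / m] * m  # start from the uniform input distribution
    for _ in range(iterations):
        d = _divergences(W, p)
        # Multiplicative update: p_i <- p_i * exp(D_i), then renormalize.
        weights = [p[i] * math.exp(d[i]) for i in range(m)]
        total = sum(weights)
        p = [w / total for w in weights]
    # Mutual information at the final p is a lower bound on the capacity.
    d = _divergences(W, p)
    capacity = sum(p[i] * d[i] for i in range(m))
    return capacity, p
```

For a binary symmetric channel the uniform input is already optimal, so the estimate matches the closed form $\ln 2 - H(\text{crossover})$ from the first iteration on; asymmetric channels need more iterations.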

Submitted 8 July, 2024; originally announced July 2024.

arXiv:2207.07007 [pdf, ps, other]

A Polynomial-Time Algorithm for 1/2-Well-Supported Nash Equilibria in Bimatrix Games

Authors: Argyrios Deligkas, Michail Fasoulakis, Evangelos Markakis

Abstract: Since the seminal PPAD-completeness result for computing a Nash equilibrium even in two-player games, an important line of research has focused on relaxations achievable in polynomial time. In this paper, we consider the notion of $\varepsilon$-well-supported Nash equilibrium, where $\varepsilon \in [0,1]$ corresponds to the approximation guarantee. Put simply, in an $\varepsilon$-well-supported equilibrium, every player chooses with positive probability actions that are within $\varepsilon$ of the maximum achievable payoff, against the other player's strategy. Ever since the initial approximation guarantee of 2/3 for well-supported equilibria, which was established more than a decade ago, the progress on this problem has been extremely slow and incremental. Notably, the small improvements to 0.6608, and finally to 0.6528, were achieved by algorithms of growing complexity. Our main result is a simple and intuitive algorithm that improves the approximation guarantee to 1/2. Our algorithm is based on linear programming and in particular on exploiting suitably defined zero-sum games that arise from the payoff matrices of the two players. As a byproduct, we show how to achieve the same approximation guarantee in a query-efficient way.
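
The definition of an $\varepsilon$-well-supported Nash equilibrium given in this abstract can be made concrete with a small checker. The sketch below is only an illustration of the definition, not the paper's algorithm; the helper name `wsne_epsilon`, the list-of-lists payoff layout (`R` for the row player, `C` for the column player), and the support tolerance `tol` are assumptions.

```python
def wsne_epsilon(R, C, x, y, tol=1e-12):
    """Smallest eps such that (x, y) is an eps-well-supported Nash equilibrium.

    R[i][j] and C[i][j] are the row and column player's payoffs when the row
    player plays i and the column player plays j; x and y are mixed strategies.
    """
    m, n = len(R), len(R[0])
    # Expected payoff of each pure strategy against the opponent's mix.
    row_payoffs = [sum(R[i][j] * y[j] for j in range(n)) for i in range(m)]
    col_payoffs = [sum(x[i] * C[i][j] for i in range(m)) for j in range(n)]
    # Worst shortfall, relative to a best response, over strategies in the support.
    row_gap = max(max(row_payoffs) - row_payoffs[i] for i in range(m) if x[i] > tol)
    col_gap = max(max(col_payoffs) - col_payoffs[j] for j in range(n) if y[j] > tol)
    return max(row_gap, col_gap)
```

In matching pennies, for example, the uniform profile yields a gap of 0 (an exact equilibrium), while a pure profile in which the column player loses yields a gap of 1.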

Submitted 14 July, 2022; originally announced July 2022.

arXiv:2204.11525 [pdf, ps, other]

A Polynomial-Time Algorithm for 1/3-Approximate Nash Equilibria in Bimatrix Games

Authors: Argyrios Deligkas, Michail Fasoulakis, Evangelos Markakis

Abstract: Since the celebrated PPAD-completeness result for Nash equilibria in bimatrix games, a long line of research has focused on polynomial-time algorithms that compute $\varepsilon$-approximate Nash equilibria. Finding the best possible approximation guarantee that we can have in polynomial time has been a fundamental and non-trivial pursuit in settling the complexity of approximate equilibria. Despite a significant amount of effort, the algorithm of Tsaknakis and Spirakis, with an approximation guarantee of $(0.3393+\delta)$, remains the state of the art over the last 15 years. In this paper, we propose a new refinement of the Tsaknakis-Spirakis algorithm, resulting in a polynomial-time algorithm that computes a $(\frac{1}{3}+\delta)$-Nash equilibrium, for any constant $\delta>0$. The main idea of our approach is to go beyond the use of convex combinations of primal and dual strategies, as defined in the optimization framework of Tsaknakis and Spirakis, and enrich the pool of strategies from which we build the strategy profiles that we output in certain bottleneck cases of the algorithm.

Submitted 19 May, 2022; v1 submitted 25 April, 2022; originally announced April 2022.

arXiv:2106.03579 [pdf, other]

Forward Looking Best-Response Multiplicative Weights Update Methods for Bilinear Zero-sum Games

Authors: Michail Fasoulakis, Evangelos Markakis, Yannis Pantazis, Constantinos Varsos

Abstract: Our work focuses on extra-gradient learning algorithms for finding Nash equilibria in bilinear zero-sum games. The proposed method, which can be formally considered as a variant of Optimistic Mirror Descent \cite{DBLP:conf/iclr/MertikopoulosLZ19}, uses a large learning rate for the intermediate gradient step, which essentially leads to computing (approximate) best response strategies against the profile of the previous iteration. Although counter-intuitive at first sight due to the irrationally large, for an iterative algorithm, intermediate learning step, we prove that the method guarantees last-iterate convergence to an equilibrium. In particular, we show that the algorithm first reaches an $\eta^{1/\rho}$-approximate Nash equilibrium, with $\rho > 1$, by decreasing the Kullback-Leibler divergence of each iterate by at least $\Omega(\eta^{1+\frac{1}{\rho}})$, for a sufficiently small learning rate $\eta$, until the method becomes a contracting map and converges to the exact equilibrium. Furthermore, we perform experimental comparisons with the optimistic variant of the multiplicative weights update method by \cite{Daskalakis2019LastIterateCZ} and show that our algorithm has significant practical potential, since it offers substantial gains in terms of accelerated convergence.

Submitted 8 March, 2022; v1 submitted 7 June, 2021; originally announced June 2021.

arXiv:2006.06625 [pdf, other]

Cumulant GAN

Authors: Yannis Pantazis, Dipjyoti Paul, Michail Fasoulakis, Yannis Stylianou, Markos Katsoulakis

Abstract: In this paper, we propose a novel loss function for training Generative Adversarial Networks (GANs) aiming towards deeper theoretical understanding as well as improved stability and performance for the underlying optimization problem. The new loss function is based on cumulant generating functions giving rise to \emph{Cumulant GAN}. Relying on a recently-derived variational formula, we show that the corresponding optimization problem is equivalent to Rényi divergence minimization, thus offering a (partially) unified perspective of GAN losses: the Rényi family encompasses Kullback-Leibler divergence (KLD), reverse KLD, Hellinger distance and $\chi^2$-divergence. Wasserstein GAN is also a member of cumulant GAN. In terms of stability, we rigorously prove the linear convergence of cumulant GAN to the Nash equilibrium for a linear discriminator, Gaussian distributions and the standard gradient descent ascent algorithm. Finally, we experimentally demonstrate that image generation is more robust relative to Wasserstein GAN and it is substantially improved in terms of both inception score and Fréchet inception distance when both weaker and stronger discriminators are considered.

Submitted 24 August, 2021; v1 submitted 11 June, 2020; originally announced June 2020.

Comments: 37 pages, 27 figures

arXiv:1811.02598 [pdf, ps, other]

Training Generative Adversarial Networks with Weights

Authors: Yannis Pantazis, Dipjyoti Paul, Michail Fasoulakis, Yannis Stylianou

Abstract: The impressive success of Generative Adversarial Networks (GANs) is often overshadowed by the difficulties in their training. Despite the continuous efforts and improvements, there are still open issues regarding their convergence properties. In this paper, we propose a simple training variation where suitable weights are defined and assist the training of the Generator. We provide theoretical arguments for why the proposed algorithm is better than the baseline training in the sense of speeding up the training process and of creating a stronger Generator. Performance results showed that the new algorithm is more accurate on both synthetic and image datasets, resulting in improvements ranging between 5% and 50%.

Submitted 6 November, 2018; originally announced November 2018.

Comments: 6 pages, 3 figures, submitted to Icassp2019

arXiv:1807.09749 [pdf, ps, other]

Jamming in multiple independent Gaussian channels as a game

Authors: Michail Fasoulakis, Apostolos Traganitis, Anthony Ephremides

Abstract: We study the problem of \emph{jamming} in multiple independent \emph{Gaussian channels} as a zero-sum game. We show that in the unique Nash equilibrium of the game the best-response strategy of the transmitter is the \emph{waterfilling} to the sum of the jamming and the noise power in each channel and the best-response strategy of the jammer is the \emph{waterfilling} only to the noise power.
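
The waterfilling allocation mentioned in this abstract can be sketched with a generic routine: given per-channel floor levels and a power budget, it raises a common water level until the budget is spent. This is the standard textbook procedure, not code from the paper; the function name `water_fill` and the example noise levels and budgets in the usage note are hypothetical.

```python
def water_fill(levels, budget):
    """Allocate `budget` across channels so that level + allocation is equalized
    (at a common water level mu) on every channel that receives power.

    levels: non-negative floor level per channel (e.g. noise power).
    budget: non-negative total power to distribute.
    """
    order = sorted(levels)
    prefix = 0.0
    mu = order[0] + budget
    for k, level in enumerate(order, start=1):
        prefix += level
        # Water level if exactly the k lowest channels are active.
        candidate = (budget + prefix) / k
        # Accept once the water does not spill over the next channel's floor.
        if k == len(order) or candidate <= order[k]:
            mu = candidate
            break
    return [max(0.0, mu - level) for level in levels]
```

Read against the abstract's statement, the jammer's allocation would be `water_fill(noise, jam_budget)` and the transmitter's would be `water_fill([n + j for n, j in zip(noise, jam)], tx_budget)`, where `jam` is the jammer's allocation and `jam_budget` and `tx_budget` are assumed total power constraints.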

Submitted 25 July, 2018; originally announced July 2018.

arXiv:1806.01905 [pdf, ps, other]

A New Theoretical Evaluation Framework for Satisfaction Equilibria in Wireless Networks

Authors: Michail Fasoulakis, Eirini-Eleni Tsiropoulou, Symeon Papavassiliou

Abstract: In this paper, a theoretical evaluation framework regarding the \textit{Satisfaction Equilibrium (SE)} in wireless communication networks is introduced and examined. To study these equilibrium operating points, we coin some new concepts, namely the \textit{Valued Satisfaction Equilibrium}, the \textit{Price of Efficiency} and the \textit{Max Price of Satisfaction}, which can be used for measuring the efficiency of the obtained equilibrium solutions. The aforementioned framework is analyzed and evaluated in a wireless communication environment in the presence of the Gaussian Interference Channel (GIC). Within this setting, a non-cooperative game among the users is studied, where users aim in a selfish manner to meet their Quality of Service (QoS) prerequisite. However, instead of maximizing the QoS, which is generally energy costly, we argue that better energy efficiency is achieved by targeting satisfactory QoS levels only. The necessary and sufficient conditions that lead to the \textit{Satisfaction Equilibrium} are provided for the two-user case, and the \textit{Efficient Satisfaction Equilibrium (ESE)} is determined, where the users satisfy their QoS constraints with the lowest possible cost. Moreover, specific measures for evaluating the efficiency of various satisfaction equilibria in a formal and quantitative manner, expressing the tradeoff between the achieved utility (or a given objective function) and the corresponding cost, are defined and analyzed.

Submitted 5 June, 2018; originally announced June 2018.

Comments: A version of this paper was accepted for publication in BalkanCom 2018

arXiv:1512.03315 [pdf, ps, other]

doi 10.1007/s00453-018-0465-y

Distributed Methods for Computing Approximate Equilibria

Authors: Artur Czumaj, Argyrios Deligkas, Michail Fasoulakis, John Fearnley, Marcin Jurdziński, Rahul Savani

Abstract: We present a new, distributed method to compute approximate Nash equilibria in bimatrix games. In contrast to previous approaches that analyze the two payoff matrices at the same time (for example, by solving a single LP that combines the two players' payoffs), our algorithm first solves two independent LPs, each of which is derived from one of the two payoff matrices, and then computes approximate Nash equilibria using only limited communication between the players. Our method has several applications for improved bounds for efficient computations of approximate Nash equilibria in bimatrix games. First, it yields the best known polynomial-time algorithm for computing \emph{approximate well-supported Nash equilibria (WSNE)}, which is guaranteed to find a 0.6528-WSNE in polynomial time. Furthermore, since our algorithm solves the two LPs separately, it can be used to improve upon the best known algorithms in the limited communication setting: the algorithm can be implemented to obtain a randomized expected-polynomial-time algorithm that uses poly-logarithmic communication and finds a 0.6528-WSNE. The algorithm can also be carried out to beat the best known bound in the query complexity setting, requiring $O(n \log n)$ payoff queries to compute a 0.6528-WSNE. Finally, our approach can also be adapted to provide the best known communication-efficient algorithm for computing \emph{approximate Nash equilibria}: it uses poly-logarithmic communication to find a 0.382-approximate Nash equilibrium.

Submitted 10 December, 2015; originally announced December 2015.

arXiv:1407.3004 [pdf, ps, other]

Approximate well-supported Nash equilibria in symmetric bimatrix games

Authors: Artur Czumaj, Michail Fasoulakis, Marcin Jurdziński

Abstract: The $\varepsilon$-well-supported Nash equilibrium is a strong notion of approximation of a Nash equilibrium, where no player has an incentive greater than $\varepsilon$ to deviate from any of the pure strategies that she uses in her mixed strategy. The smallest constant $\varepsilon$ currently known for which there is a polynomial-time algorithm that computes an $\varepsilon$-well-supported Nash equilibrium in bimatrix games is slightly below $2/3$. In this paper we study this problem for symmetric bimatrix games and we provide a polynomial-time algorithm that gives a $(1/2+\delta)$-well-supported Nash equilibrium, for an arbitrarily small positive constant $\delta$.

Submitted 10 July, 2014; originally announced July 2014.

Showing 1–10 of 10 results for author: Fasoulakis, M