Showing 1–14 of 14 results for author: Pattathil, S

  1. arXiv:2305.00474  [pdf, other]

    cs.SI econ.TH

    Learning, Diversity and Adaptation in Changing Environments: The Role of Weak Links

    Authors: Daron Acemoglu, Asuman Ozdaglar, Sarath Pattathil

    Abstract: Adaptation to dynamic conditions requires a certain degree of diversity. If all agents take the best current action, learning that the underlying state has changed and behavior should adapt will be slower. Diversity is harder to maintain when there is fast communication between agents, because they tend to find out and pursue the best action rapidly. We explore these issues using a model of (Bayes…

    Submitted 30 April, 2023; originally announced May 2023.

  2. arXiv:2301.13306  [pdf, other]

    cs.GT cs.LG

    Autobidders with Budget and ROI Constraints: Efficiency, Regret, and Pacing Dynamics

    Authors: Brendan Lucier, Sarath Pattathil, Aleksandrs Slivkins, Mengxiao Zhang

    Abstract: We study a game between autobidding algorithms that compete in an online advertising platform. Each autobidder is tasked with maximizing its advertiser's total value over multiple rounds of a repeated auction, subject to budget and return-on-investment constraints. We propose a gradient-based learning algorithm that is guaranteed to satisfy all constraints and achieves vanishing individual regret.…

    Submitted 11 June, 2024; v1 submitted 30 January, 2023; originally announced January 2023.

  3. arXiv:2212.13861  [pdf, ps, other]

    cs.LG math.OC stat.ML

    Revisiting the Linear-Programming Framework for Offline RL with General Function Approximation

    Authors: Asuman Ozdaglar, Sarath Pattathil, Jiawei Zhang, Kaiqing Zhang

    Abstract: Offline reinforcement learning (RL) aims to find an optimal policy for sequential decision-making using a pre-collected dataset, without further interaction with the environment. Recent theoretical progress has focused on developing sample-efficient offline RL algorithms with various relaxed assumptions on data coverage and function approximators, especially to handle the case with excessively lar…

    Submitted 8 February, 2023; v1 submitted 28 December, 2022; originally announced December 2022.

    Comments: 35 pages

  4. arXiv:2210.12812  [pdf, ps, other]

    math.OC cs.LG cs.MA stat.ML

    Symmetric (Optimistic) Natural Policy Gradient for Multi-agent Learning with Parameter Convergence

    Authors: Sarath Pattathil, Kaiqing Zhang, Asuman Ozdaglar

    Abstract: Multi-agent interactions are increasingly important in the context of reinforcement learning, and the theoretical foundations of policy gradient methods have attracted surging research interest. We investigate the global convergence of natural policy gradient (NPG) algorithms in multi-agent learning. We first show that vanilla NPG may not have parameter convergence, i.e., the convergence of the ve…

    Submitted 20 March, 2023; v1 submitted 23 October, 2022; originally announced October 2022.

    Comments: Initially submitted for publication in January 2022

  5. arXiv:2206.04502  [pdf, other]

    stat.ML cs.LG math.OC

    What is a Good Metric to Study Generalization of Minimax Learners?

    Authors: Asuman Ozdaglar, Sarath Pattathil, Jiawei Zhang, Kaiqing Zhang

    Abstract: Minimax optimization has served as the backbone of many machine learning (ML) problems. Although the convergence behavior of optimization algorithms has been extensively studied in the minimax settings, their generalization guarantees in stochastic minimax optimization problems, i.e., how the solution trained on empirical data performs on unseen testing data, have been relatively underexplored. A…

    Submitted 20 June, 2022; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 34 pages, 2 figures

  6. arXiv:2010.13724  [pdf, ps, other]

    cs.LG math.OC

    Tight last-iterate convergence rates for no-regret learning in multi-player games

    Authors: Noah Golowich, Sarath Pattathil, Constantinos Daskalakis

    Abstract: We study the question of obtaining last-iterate convergence rates for no-regret learning algorithms in multi-player games. We show that the optimistic gradient (OG) algorithm with a constant step-size, which is no-regret, achieves a last-iterate rate of $O(1/\sqrt{T})$ with respect to the gap function in smooth monotone games. This result addresses a question of Mertikopoulos & Zhou (2018), who as…

    Submitted 26 October, 2020; originally announced October 2020.

    Comments: To appear at NeurIPS 2020. 41 pages

  7. arXiv:2002.05683  [pdf, ps, other]

    math.OC cs.LG stat.ML

    An Optimal Multistage Stochastic Gradient Method for Minimax Problems

    Authors: Alireza Fallah, Asuman Ozdaglar, Sarath Pattathil

    Abstract: In this paper, we study the minimax optimization problem in the smooth and strongly convex-strongly concave setting when we have access to noisy estimates of gradients. In particular, we first analyze the stochastic Gradient Descent Ascent (GDA) method with constant stepsize, and show that it converges to a neighborhood of the solution of the minimax problem. We further provide tight bounds on the…

    Submitted 13 February, 2020; originally announced February 2020.

  8. arXiv:2002.00057  [pdf, ps, other]

    cs.LG math.OC stat.ML

    Last Iterate is Slower than Averaged Iterate in Smooth Convex-Concave Saddle Point Problems

    Authors: Noah Golowich, Sarath Pattathil, Constantinos Daskalakis, Asuman Ozdaglar

    Abstract: In this paper we study the smooth convex-concave saddle point problem. Specifically, we analyze the last iterate convergence properties of the Extragradient (EG) algorithm. It is well known that the ergodic (averaged) iterates of EG converge at a rate of $O(1/T)$ (Nemirovski, 2004). In this paper, we show that the last iterate of EG converges at a rate of $O(1/\sqrt{T})$. To the best of our knowle…

    Submitted 6 July, 2020; v1 submitted 31 January, 2020; originally announced February 2020.

    Comments: 27 pages

  9. arXiv:1910.14380  [pdf, other]

    math.OC cs.LG stat.ML

    A Decentralized Proximal Point-type Method for Saddle Point Problems

    Authors: Weijie Liu, Aryan Mokhtari, Asuman Ozdaglar, Sarath Pattathil, Zebang Shen, Nenggan Zheng

    Abstract: In this paper, we focus on solving a class of constrained non-convex non-concave saddle point problems in a decentralized manner by a group of nodes in a network. Specifically, we assume that each node has access to a summand of a global objective function and nodes are allowed to exchange information only with their neighboring nodes. We propose a decentralized variant of the proximal point metho…

    Submitted 31 October, 2019; originally announced October 2019.

    Comments: 18 pages

  10. arXiv:1906.01115  [pdf, ps, other]

    math.OC cs.LG stat.ML

    Convergence Rate of $\mathcal{O}(1/k)$ for Optimistic Gradient and Extra-gradient Methods in Smooth Convex-Concave Saddle Point Problems

    Authors: Aryan Mokhtari, Asuman Ozdaglar, Sarath Pattathil

    Abstract: We study the iteration complexity of the optimistic gradient descent-ascent (OGDA) method and the extra-gradient (EG) method for finding a saddle point of a convex-concave unconstrained min-max problem. To do so, we first show that both OGDA and EG can be interpreted as approximate variants of the proximal point method. This is similar to the approach taken in [Nemirovski, 2004] which analyzes EG…

    Submitted 29 September, 2020; v1 submitted 3 June, 2019; originally announced June 2019.

    Comments: 19 pages

  11. arXiv:1901.08511  [pdf, ps, other]

    math.OC cs.LG stat.ML

    A Unified Analysis of Extra-gradient and Optimistic Gradient Methods for Saddle Point Problems: Proximal Point Approach

    Authors: Aryan Mokhtari, Asuman Ozdaglar, Sarath Pattathil

    Abstract: In this paper we consider solving saddle point problems using two variants of Gradient Descent-Ascent algorithms, Extra-gradient (EG) and Optimistic Gradient Descent Ascent (OGDA) methods. We show that both of these algorithms admit a unified analysis as approximations of the classical proximal point method for solving saddle point problems. This viewpoint enables us to develop a new framework for…

    Submitted 5 September, 2019; v1 submitted 24 January, 2019; originally announced January 2019.

    Comments: 25 pages, 3 figures

  12. arXiv:1712.08712  [pdf, other]

    cs.SI cs.DS

    Persistence of the Jordan center in Random Growing Trees

    Authors: Sarath Pattathil, Nikhil Karamchandani, Dhruti Shah

    Abstract: The Jordan center of a graph is defined as a vertex whose maximum distance to other nodes in the graph is minimal, and it finds applications in facility location and source detection problems. We study properties of the Jordan Center in the case of random growing trees. In particular, we consider a regular tree graph on which an infection starts from a root node and then spreads along the edges of…

    Submitted 21 October, 2018; v1 submitted 22 December, 2017; originally announced December 2017.

    Comments: 28 pages, 14 figures

  13. arXiv:1710.11471  [pdf, other]

    cs.PF eess.SY

    Distributed Server Allocation for Content Delivery Networks

    Authors: Sarath Pattathil, Vivek S. Borkar, Gaurav S. Kasbekar

    Abstract: We propose a dynamic formulation of file-sharing networks in terms of an average cost Markov decision process with constraints. By analyzing a Whittle-like relaxation thereof, we propose an index policy in the spirit of Whittle and compare it by simulations with other natural heuristics.

    Submitted 9 February, 2019; v1 submitted 28 October, 2017; originally announced October 2017.

    Comments: 22 pages, 10 figures

  14. arXiv:1610.09849  [pdf, other]

    cs.IT

    Massive Machine-Type Communication (mMTC) Access with Integrated Authentication

    Authors: Nuno K. Pratas, Sarath Pattathil, Cedomir Stefanovic, Petar Popovski

    Abstract: We present a connection establishment protocol with integrated authentication, suited for Massive Machine-Type Communications (mMTC). The protocol is contention-based and its main feature is that a device contends with a unique signature that also enables the authentication of the device towards the network. The signatures are inspired by Bloom filters and are created based on the output of the MI…

    Submitted 15 March, 2017; v1 submitted 31 October, 2016; originally announced October 2016.

    Comments: Accepted for presentation at ICC 2017