Skip to main content

Showing 1–31 of 31 results for author: Shahrampour, S

Searching in archive math. Search in all archives.
.
  1. arXiv:2406.01484  [pdf, other

    math.OC cs.LG eess.SY

    Online Optimization Perspective on First-Order and Zero-Order Decentralized Nonsmooth Nonconvex Stochastic Optimization

    Authors: Emre Sahinoglu, Shahin Shahrampour

    Abstract: We investigate the finite-time analysis of finding ($δ,ε$)-stationary points for nonsmooth nonconvex objectives in decentralized stochastic optimization. A set of agents aim at minimizing a global function using only their local information by interacting over a network. We present a novel algorithm, called Multi Epoch Decentralized Online Learning (ME-DOL), for which we establish the sample compl… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: To appear in ICML 2024

  2. arXiv:2405.11590  [pdf, other

    cs.LG math.OC

    Global Convergence of Decentralized Retraction-Free Optimization on the Stiefel Manifold

    Authors: Youbang Sun, Shixiang Chen, Alfredo Garcia, Shahin Shahrampour

    Abstract: Many classical and modern machine learning algorithms require solving optimization tasks under orthogonal constraints. Solving these tasks often require calculating retraction-based gradient descent updates on the corresponding Riemannian manifold, which can be computationally expensive. Recently Ablin et al. proposed an infeasible retraction-free algorithm, which is significantly more efficient.… ▽ More

    Submitted 19 May, 2024; originally announced May 2024.

  3. arXiv:2405.02769  [pdf, other

    cs.LG cs.MA math.OC

    Linear Convergence of Independent Natural Policy Gradient in Games with Entropy Regularization

    Authors: Youbang Sun, Tao Liu, P. R. Kumar, Shahin Shahrampour

    Abstract: This work focuses on the entropy-regularized independent natural policy gradient (NPG) algorithm in multi-agent reinforcement learning. In this work, agents are assumed to have access to an oracle with exact policy evaluation and seek to maximize their respective independent rewards. Each individual's reward is assumed to depend on the actions of all the agents in the multi-agent system, leading t… ▽ More

    Submitted 4 May, 2024; originally announced May 2024.

  4. arXiv:2403.08553  [pdf, other

    math.OC cs.LG eess.SY

    Regret Analysis of Policy Optimization over Submanifolds for Linearly Constrained Online LQG

    Authors: Ting-Jui Chang, Shahin Shahrampour

    Abstract: Recent advancement in online optimization and control has provided novel tools to study online linear quadratic regulator (LQR) problems, where cost matrices are varying adversarially over time. However, the controller parameterization of existing works may not satisfy practical conditions like sparsity due to physical connections. In this work, we study online linear quadratic Gaussian problems w… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

  5. arXiv:2310.09727  [pdf, other

    cs.LG math.OC

    Provably Fast Convergence of Independent Natural Policy Gradient for Markov Potential Games

    Authors: Youbang Sun, Tao Liu, Ruida Zhou, P. R. Kumar, Shahin Shahrampour

    Abstract: This work studies an independent natural policy gradient (NPG) algorithm for the multi-agent reinforcement learning problem in Markov potential games. It is shown that, under mild technical assumptions and the introduction of the \textit{suboptimality gap}, the independent NPG method with an oracle providing exact policy evaluation asymptotically reaches an $ε$-Nash Equilibrium (NE) within… ▽ More

    Submitted 27 October, 2023; v1 submitted 15 October, 2023; originally announced October 2023.

    Comments: Will appear in NeurIPS 2023

  6. arXiv:2310.03206  [pdf, other

    math.OC cs.LG eess.SY

    Regret Analysis of Distributed Online Control for LTI Systems with Adversarial Disturbances

    Authors: Ting-Jui Chang, Shahin Shahrampour

    Abstract: This paper addresses the distributed online control problem over a network of linear time-invariant (LTI) systems (with possibly unknown dynamics) in the presence of adversarial perturbations. There exists a global network cost that is characterized by a time-varying convex function, which evolves in an adversarial manner and is sequentially and partially observed by local agents. The goal of each… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

  7. arXiv:2302.12320  [pdf, other

    math.OC cs.LG eess.SY

    Dynamic Regret Analysis of Safe Distributed Online Optimization for Convex and Non-convex Problems

    Authors: Ting-Jui Chang, Sapana Chaudhary, Dileep Kalathil, Shahin Shahrampour

    Abstract: This paper addresses safe distributed online optimization over an unknown set of linear safety constraints. A network of agents aims at jointly minimizing a global, time-varying function, which is only partially observable to each individual agent. Therefore, agents must engage in local communications to generate a safe sequence of actions competitive with the best minimizer sequence in hindsight,… ▽ More

    Submitted 23 February, 2023; originally announced February 2023.

  8. arXiv:2209.12307  [pdf, other

    cs.LG eess.SY math.OC

    On the Stability Analysis of Open Federated Learning Systems

    Authors: Youbang Sun, Heshan Fernando, Tianyi Chen, Shahin Shahrampour

    Abstract: We consider the open federated learning (FL) systems, where clients may join and/or leave the system during the FL process. Given the variability of the number of present clients, convergence to a fixed model cannot be guaranteed in open systems. Instead, we resort to a new performance metric that we term the stability of open FL systems, which quantifies the magnitude of the learned model in open… ▽ More

    Submitted 12 March, 2023; v1 submitted 25 September, 2022; originally announced September 2022.

  9. arXiv:2207.01062  [pdf, other

    cs.LG eess.SY math.OC stat.ML

    Distributed Online System Identification for LTI Systems Using Reverse Experience Replay

    Authors: Ting-Jui Chang, Shahin Shahrampour

    Abstract: Identification of linear time-invariant (LTI) systems plays an important role in control and reinforcement learning. Both asymptotic and finite-time offline system identification are well-studied in the literature. For online system identification, the idea of stochastic-gradient descent with reverse experience replay (SGD-RER) was recently proposed, where the data sequence is stored in several bu… ▽ More

    Submitted 15 September, 2022; v1 submitted 3 July, 2022; originally announced July 2022.

  10. arXiv:2105.14385  [pdf, other

    math.OC cs.LG eess.SY

    On Centralized and Distributed Mirror Descent: Convergence Analysis Using Quadratic Constraints

    Authors: Youbang Sun, Mahyar Fazlyab, Shahin Shahrampour

    Abstract: Mirror descent (MD) is a powerful first-order optimization technique that subsumes several optimization algorithms including gradient descent (GD). In this work, we develop a semi-definite programming (SDP) framework to analyze the convergence rate of MD in centralized and distributed settings under both strongly convex and non-strongly convex assumptions. We view MD with a dynamical system lens a… ▽ More

    Submitted 18 January, 2022; v1 submitted 29 May, 2021; originally announced May 2021.

  11. arXiv:2105.07310  [pdf, other

    math.OC cs.LG eess.SY

    Regret Analysis of Distributed Online LQR Control for Unknown LTI Systems

    Authors: Ting-Jui Chang, Shahin Shahrampour

    Abstract: Online optimization has recently opened avenues to study optimal control for time-varying cost functions that are unknown in advance. Inspired by this line of research, we study the distributed online linear quadratic regulator (LQR) problem for linear time-invariant (LTI) systems with unknown dynamics. Consider a multi-agent network where each agent is modeled as a LTI system. The network has a g… ▽ More

    Submitted 6 February, 2022; v1 submitted 15 May, 2021; originally announced May 2021.

    Comments: arXiv admin note: substantial text overlap with arXiv:2009.13749

  12. arXiv:2102.07091  [pdf, other

    math.OC cs.LG eess.SY

    Decentralized Riemannian Gradient Descent on the Stiefel Manifold

    Authors: Shixiang Chen, Alfredo Garcia, Mingyi Hong, Shahin Shahrampour

    Abstract: We consider a distributed non-convex optimization where a network of agents aims at minimizing a global function over the Stiefel manifold. The global function is represented as a finite sum of smooth local functions, where each local function is associated with one agent and agents communicate with each other over an undirected connected graph. The problem is non-convex as local functions are pos… ▽ More

    Submitted 14 February, 2021; originally announced February 2021.

  13. arXiv:2101.09346  [pdf, ps, other

    math.OC cs.LG eess.SY

    On the Local Linear Rate of Consensus on the Stiefel Manifold

    Authors: Shixiang Chen, Alfredo Garcia, Mingyi Hong, Shahin Shahrampour

    Abstract: We study the convergence properties of Riemannian gradient method for solving the consensus problem (for an undirected connected graph) over the Stiefel manifold. The Stiefel manifold is a non-convex set and the standard notion of averaging in the Euclidean space does not work for this problem. We propose Distributed Riemannian Consensus on Stiefel Manifold (DRCS) and prove that it enjoys a local… ▽ More

    Submitted 22 January, 2021; originally announced January 2021.

  14. arXiv:2011.12233  [pdf, other

    math.OC cs.LG eess.SY

    Linear Convergence of Distributed Mirror Descent with Integral Feedback for Strongly Convex Problems

    Authors: Youbang Sun, Shahin Shahrampour

    Abstract: Distributed optimization often requires finding the minimum of a global objective function written as a sum of local functions. A group of agents work collectively to minimize the global function. We study a continuous-time decentralized mirror descent algorithm that uses purely local gradient information to converge to the global optimal solution. The algorithm enforces consensus among agents usi… ▽ More

    Submitted 24 November, 2020; originally announced November 2020.

    Comments: 12 pages, 1 figure

  15. arXiv:2009.13749  [pdf, other

    math.OC cs.LG eess.SY

    Distributed Online Linear Quadratic Control for Linear Time-invariant Systems

    Authors: Ting-Jui Chang, Shahin Shahrampour

    Abstract: Classical linear quadratic (LQ) control centers around linear time-invariant (LTI) systems, where the control-state pairs introduce a quadratic cost with time-invariant parameters. Recent advancement in online optimization and control has provided novel tools to study LQ problems that are robust to time-varying cost parameters. Inspired by this line of research, we study the distributed online LQ… ▽ More

    Submitted 28 September, 2020; originally announced September 2020.

  16. arXiv:2009.06747  [pdf, other

    math.OC cs.LG stat.ML

    Distributed Mirror Descent with Integral Feedback: Asymptotic Convergence Analysis of Continuous-time Dynamics

    Authors: Youbang Sun, Shahin Shahrampour

    Abstract: This work addresses distributed optimization, where a network of agents wants to minimize a global strongly convex objective function. The global function can be written as a sum of local convex functions, each of which is associated with an agent. We propose a continuous-time distributed mirror descent algorithm that uses purely local information to converge to the global optimum. Unlike previous… ▽ More

    Submitted 14 September, 2020; originally announced September 2020.

  17. arXiv:2006.03912  [pdf, other

    cs.LG math.OC stat.ML

    Unconstrained Online Optimization: Dynamic Regret Analysis of Strongly Convex and Smooth Problems

    Authors: Ting-Jui Chang, Shahin Shahrampour

    Abstract: The regret bound of dynamic online learning algorithms is often expressed in terms of the variation in the function sequence ($V_T$) and/or the path-length of the minimizer sequence after $T$ rounds. For strongly convex and smooth functions, , Zhang et al. establish the squared path-length of the minimizer sequence ($C^*_{2,T}$) as a lower bound on regret. They also show that online gradient desce… ▽ More

    Submitted 14 August, 2020; v1 submitted 6 June, 2020; originally announced June 2020.

  18. arXiv:2004.13233  [pdf, other

    math.OC cs.LG stat.ML

    On Distributed Non-convex Optimization: Projected Subgradient Method For Weakly Convex Problems in Networks

    Authors: Shixiang Chen, Alfredo Garcia, Shahin Shahrampour

    Abstract: The stochastic subgradient method is a widely-used algorithm for solving large-scale optimization problems arising in machine learning. Often these problems are neither smooth nor convex. Recently, Davis et al. [1-2] characterized the convergence of the stochastic subgradient method for the weakly convex case, which encompasses many important applications (e.g., robust phase retrieval, blind decon… ▽ More

    Submitted 23 February, 2021; v1 submitted 27 April, 2020; originally announced April 2020.

  19. arXiv:1702.06219  [pdf, other

    math.OC cs.MA stat.ML

    An Online Optimization Approach for Multi-Agent Tracking of Dynamic Parameters in the Presence of Adversarial Noise

    Authors: Shahin Shahrampour, Ali Jadbabaie

    Abstract: This paper addresses tracking of a moving target in a multi-agent network. The target follows a linear dynamics corrupted by an adversarial noise, i.e., the noise is not generated from a statistical distribution. The location of the target at each time induces a global time-varying loss function, and the global loss is a sum of local losses, each of which is associated to one agent. Agents noisy o… ▽ More

    Submitted 20 February, 2017; originally announced February 2017.

    Comments: 8 pages, To appear in American Control Conference 2017

  20. arXiv:1609.02845  [pdf, other

    math.OC cs.DC cs.LG stat.ML

    Distributed Online Optimization in Dynamic Environments Using Mirror Descent

    Authors: Shahin Shahrampour, Ali Jadbabaie

    Abstract: This work addresses decentralized online optimization in non-stationary environments. A network of agents aim to track the minimizer of a global time-varying convex function. The minimizer evolves according to a known dynamics corrupted by an unknown, unstructured noise. At each time, the global function can be cast as a sum of a finite number of local functions, each of which is assigned to one a… ▽ More

    Submitted 9 September, 2016; originally announced September 2016.

  21. arXiv:1603.04954  [pdf, other

    cs.LG math.OC

    Online Optimization in Dynamic Environments: Improved Regret Rates for Strongly Convex Problems

    Authors: Aryan Mokhtari, Shahin Shahrampour, Ali Jadbabaie, Alejandro Ribeiro

    Abstract: In this paper, we address tracking of a time-varying parameter with unknown dynamics. We formalize the problem as an instance of online optimization in a dynamic setting. Using online gradient descent, we propose a method that sequentially predicts the value of the parameter and in turn suffers a loss. The objective is to minimize the accumulation of losses over the time horizon, a notion that is… ▽ More

    Submitted 16 March, 2016; originally announced March 2016.

  22. arXiv:1603.00576  [pdf, ps, other

    math.OC cs.LG cs.SI

    Distributed Estimation of Dynamic Parameters : Regret Analysis

    Authors: Shahin Shahrampour, Alexander Rakhlin, Ali Jadbabaie

    Abstract: This paper addresses the estimation of a time- varying parameter in a network. A group of agents sequentially receive noisy signals about the parameter (or moving target), which does not follow any particular dynamics. The parameter is not observable to an individual agent, but it is globally identifiable for the whole network. Viewing the problem with an online optimization lens, we aim to provid… ▽ More

    Submitted 1 March, 2016; originally announced March 2016.

    Comments: 6 pages, To appear in American Control Conference 2016

  23. arXiv:1512.09311  [pdf, ps, other

    eess.SY math.OC

    Finite-time Analysis of the Distributed Detection Problem

    Authors: Shahin Shahrampour, Alexander Rakhlin, Ali Jadbabaie

    Abstract: This paper addresses the problem of distributed detection in fixed and switching networks. A network of agents observe partially informative signals about the unknown state of the world. Hence, they collaborate with each other to identify the true state. We propose an update rule building on distributed, stochastic optimization methods. Our main focus is on the finite-time analysis of the problem.… ▽ More

    Submitted 31 December, 2015; originally announced December 2015.

    Comments: 6 pages, To Appear in Allerton Conference on Communication, Control, and Computing 2015

  24. arXiv:1509.04332  [pdf, ps, other

    eess.SY math.OC stat.ML

    Learning without Recall by Random Walks on Directed Graphs

    Authors: Mohammad Amin Rahimian, Shahin Shahrampour, Ali Jadbabaie

    Abstract: We consider a network of agents that aim to learn some unknown state of the world using private observations and exchange of beliefs. At each time, agents observe private signals generated based on the true unknown state. Each agent might not be able to distinguish the true state based only on her private observations. This occurs when some other states are observationally equivalent to the true s… ▽ More

    Submitted 14 September, 2015; originally announced September 2015.

    Comments: 6 pages, To Appear in Conference on Decision and Control 2015

  25. arXiv:1503.03517  [pdf, ps, other

    cs.LG math.OC stat.ML

    Switching to Learn

    Authors: Shahin Shahrampour, Mohammad Amin Rahimian, Ali Jadbabaie

    Abstract: A network of agents attempt to learn some unknown state of the world drawn by nature from a finite set. Agents observe private signals conditioned on the true state, and form beliefs about the unknown state accordingly. Each agent may face an identification problem in the sense that she cannot distinguish the truth in isolation. However, by communicating with each other, agents are able to benefit… ▽ More

    Submitted 11 March, 2015; originally announced March 2015.

    Comments: 6 pages, To appear in American Control Conference 2015

  26. arXiv:1501.06225  [pdf, ps, other

    cs.LG math.OC stat.ML

    Online Optimization : Competing with Dynamic Comparators

    Authors: Ali Jadbabaie, Alexander Rakhlin, Shahin Shahrampour, Karthik Sridharan

    Abstract: Recent literature on online learning has focused on develo** adaptive algorithms that take advantage of a regularity of the sequence of observations, yet retain worst-case performance guarantees. A complementary direction is to develop prediction methods that perform well against complex benchmarks. In this paper, we address these two directions together. We present a fully adaptive method that… ▽ More

    Submitted 25 January, 2015; originally announced January 2015.

    Comments: 23 pages, To appear in International Conference on Artificial Intelligence and Statistics (AISTATS) 2015

  27. arXiv:1409.8606  [pdf, other

    math.OC cs.LG cs.SI stat.ML

    Distributed Detection : Finite-time Analysis and Impact of Network Topology

    Authors: Shahin Shahrampour, Alexander Rakhlin, Ali Jadbabaie

    Abstract: This paper addresses the problem of distributed detection in multi-agent networks. Agents receive private signals about an unknown state of the world. The underlying state is globally identifiable, yet informative signals may be dispersed throughout the network. Using an optimization-based framework, we develop an iterative local strategy for updating individual beliefs. In contrast to the existin… ▽ More

    Submitted 30 September, 2014; originally announced September 2014.

    Comments: 29 pages, 5 figures

  28. arXiv:1310.0432  [pdf, ps, other

    math.OC cs.LG cs.SI stat.ML

    Online Learning of Dynamic Parameters in Social Networks

    Authors: Shahin Shahrampour, Alexander Rakhlin, Ali Jadbabaie

    Abstract: This paper addresses the problem of online learning in a dynamic setting. We consider a social network in which each individual observes a private signal about the underlying state of the world and communicates with her neighbors at each time period. Unlike many existing approaches, the underlying state is dynamic, and evolves according to a geometric random walk. We view the scenario as an optimi… ▽ More

    Submitted 1 October, 2013; originally announced October 2013.

    Comments: 12 pages, To appear in Neural Information Processing Systems (NIPS) 2013

  29. arXiv:1309.2350  [pdf, ps, other

    cs.LG cs.SI math.OC stat.ML

    Exponentially Fast Parameter Estimation in Networks Using Distributed Dual Averaging

    Authors: Shahin Shahrampour, Ali Jadbabaie

    Abstract: In this paper we present an optimization-based view of distributed parameter estimation and observational social learning in networks. Agents receive a sequence of random, independent and identically distributed (i.i.d.) signals, each of which individually may not be informative about the underlying true state, but the signals together are globally informative enough to make the true state identif… ▽ More

    Submitted 9 September, 2013; originally announced September 2013.

    Comments: 6 pages, To appear in Conference on Decision and Control 2013

  30. arXiv:1308.2248  [pdf, ps, other

    eess.SY math.DS math.OC

    Topology Identification of Directed Dynamical Networks via Power Spectral Analysis

    Authors: Shahin Shahrampour, Victor M. Preciado

    Abstract: We address the problem of identifying the topology of an unknown weighted, directed network of LTI systems stimulated by wide-sense stationary noises of unknown power spectral densities. We propose several reconstruction algorithms based on the cross-power spectral densities of the network's response to the input noises. Our first algorithm reconstructs the Boolean structure (i.e., existence and d… ▽ More

    Submitted 9 August, 2013; originally announced August 2013.

    Comments: 17 pages

  31. arXiv:1303.3250  [pdf, ps, other

    cs.SI math.OC physics.soc-ph

    Reconstruction of Directed Networks from Consensus Dynamics

    Authors: Shahin Shahrampour, Victor M. Preciado

    Abstract: This paper addresses the problem of identifying the topology of an unknown, weighted, directed network running a consensus dynamics. We propose a methodology to reconstruct the network topology from the dynamic response when the system is stimulated by a wide-sense stationary noise of unknown power spectral density. The method is based on a node-knockout, or grounding, procedure wherein the ground… ▽ More

    Submitted 15 March, 2013; v1 submitted 13 March, 2013; originally announced March 2013.

    Comments: 6 pages

    Journal ref: S. Shahrampour and V.M. Preciado,"Reconstruction of Directed Networks from Consensus Dynamics," in Proc. American Control Conference, 2013