Skip to main content

Showing 1–26 of 26 results for author: Shamma, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.08844  [pdf, other

    cs.GT math.OC

    Equilibrium Selection for Multi-agent Reinforcement Learning: A Unified Framework

    Authors: Runyu Zhang, Jeff Shamma, Na Li

    Abstract: While there are numerous works in multi-agent reinforcement learning (MARL), most of them focus on designing algorithms and proving convergence to a Nash equilibrium (NE) or other equilibrium such as coarse correlated equilibrium. However, NEs can be non-unique and their performance varies drastically. Thus, it is important to design algorithms that converge to Nash equilibrium with better rewards… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  2. arXiv:2309.06705  [pdf, other

    cs.GT

    Distributed Learning Dynamics for Coalitional Games

    Authors: Aya Hamed, Jeff S. Shamma

    Abstract: In the framework of transferable utility coalitional games, a scoring (characteristic) function determines the value of any subset/coalition of agents. Agents decide on both which coalitions to form and the allocations of the values of the formed coalitions among their members. An important concept in coalitional games is that of a core solution, which is a partitioning of agents into coalitions a… ▽ More

    Submitted 27 November, 2023; v1 submitted 13 September, 2023; originally announced September 2023.

    Comments: 9 pages, 5 figures; accepted for CDC 2023

  3. arXiv:2304.04282  [pdf, other

    cs.GT

    Higher-Order Uncoupled Dynamics Do Not Lead to Nash Equilibrium -- Except When They Do

    Authors: Sarah A. Toonsi, Jeff S. Shamma

    Abstract: The framework of multi-agent learning explores the dynamics of how individual agent strategies evolve in response to the evolving strategies of other agents. Of particular interest is whether or not agent strategies converge to well known solution concepts such as Nash Equilibrium (NE). Most "fixed order" learning dynamics restrict an agent's underlying state to be its own strategy. In "higher ord… ▽ More

    Submitted 19 November, 2023; v1 submitted 9 April, 2023; originally announced April 2023.

  4. arXiv:2207.01346  [pdf, other

    eess.SY cs.MA cs.RO

    Can Competition Outperform Collaboration? The Role of Misbehaving Agents

    Authors: Luca Ballotta, Giacomo Como, Jeff S. Shamma, Luca Schenato

    Abstract: We investigate a novel approach to resilient distributed optimization with quadratic costs in a multi-agent system prone to unexpected events that make some agents misbehave. In contrast to commonly adopted filtering strategies, we draw inspiration from phenomena modeled through the Friedkin-Johnsen dynamics and argue that adding competition to the mix can improve resilience in the presence of mis… ▽ More

    Submitted 30 October, 2023; v1 submitted 4 July, 2022; originally announced July 2022.

    Comments: Accepted in IEEE TAC; 17 pages, 44 figures

    MSC Class: 93D50 (Primary) 93B70 (Secondary) ACM Class: I.2.8; I.2.9

  5. arXiv:2203.14099  [pdf, other

    eess.SY cs.MA math.OC

    Competition-Based Resilience in Distributed Quadratic Optimization

    Authors: Luca Ballotta, Giacomo Como, Jeff S. Shamma, Luca Schenato

    Abstract: This paper proposes a novel approach to resilient distributed optimization with quadratic costs in a networked control system (e.g., wireless sensor network, power grid, robotic team) prone to external attacks (e.g., hacking, power outage) that cause agents to misbehave. Departing from classical filtering strategies proposed in literature, we draw inspiration from a game-theoretic formulation of t… ▽ More

    Submitted 10 January, 2024; v1 submitted 26 March, 2022; originally announced March 2022.

    Comments: 7 pages, 8 figures; accepted for CDC 2022

    MSC Class: 93B70 (Primary) 68M18; 93D09; 93D50 (Secondary) ACM Class: I.2.11

  6. arXiv:2111.09411  [pdf, other

    cs.NI

    Multi-sided Matching for the Association of Space-Air-Ground Integrated Systems

    Authors: Doha Hamza, Hajar El Hammouti, Jeff S Shamma, Mohamed-Slim Alouini

    Abstract: Space-air-ground integrated networks (SAGINs) will play a key role in 6G communication systems. They are considered a promising technology to enhance the network capacity in highly dense agglomerations and to provide connectivity in rural areas. The multi-layer and heterogeneous nature of SAGINs necessitates an innovative design of their multi-tier associations. We propose a modeling of the SAGINs… ▽ More

    Submitted 19 October, 2021; originally announced November 2021.

    Comments: Submitted to IEEE Communications Magazine

  7. arXiv:2111.00411  [pdf, other

    eess.SY cs.LG math.OC

    Safe Adaptive Learning-based Control for Constrained Linear Quadratic Regulators with Regret Guarantees

    Authors: Yingying Li, Subhro Das, Jeff Shamma, Na Li

    Abstract: We study the adaptive control of an unknown linear system with a quadratic cost function subject to safety constraints on both the states and actions. The challenges of this problem arise from the tension among safety, exploration, performance, and computation. To address these challenges, we propose a polynomial-time algorithm that guarantees feasibility and constraint satisfaction with high prob… ▽ More

    Submitted 31 October, 2021; originally announced November 2021.

  8. arXiv:2012.06182  [pdf, other

    eess.SP cs.NI eess.SY

    Point-to-Point Communication in Integrated Satellite-Aerial Networks: State-of-the-art and Future Challenges

    Authors: Nasir Saeed, Heba Almorad, Hayssam Dahrouj, Tareq Y. Al-Naffouri, Jeff S. Shamma, Mohamed-Slim Alouini

    Abstract: This paper overviews point-to-point (P2P) links for integrated satellite-aerial networks, which are envisioned to be among the key enablers of the sixth-generation (6G) of wireless networks vision. The paper first outlines the unique characteristics of such integrated large-scale complex networks, often denoted by spatial networks, and focuses on two particular space-air infrastructures, namely, s… ▽ More

    Submitted 11 December, 2020; originally announced December 2020.

    Comments: 17 pages

  9. RISCuer: A Reliable Multi-UAV Search and Rescue Testbed

    Authors: Mohamed Abdelkader, Usman A. Fiaz, Noureddine Toumi, Mohamed A. Mabrok, Jeff S. Shamma

    Abstract: We present the Robotics Intelligent Systems & Control (RISC) Lab multiagent testbed for reliable search and rescue and aerial transport in outdoor environments. The system consists of a team of three multirotor unmanned aerial vehicles (UAVs), which are capable of autonomously searching, picking up, and transporting randomly distributed objects in an outdoor field. The method involves vision based… ▽ More

    Submitted 7 December, 2021; v1 submitted 12 June, 2020; originally announced June 2020.

    Comments: Book chapter: 41 pages, 27 figures (Minor revision: Corrected references)

    Journal ref: Unmanned Aerial Systems, 2021

  10. arXiv:1904.11184  [pdf, other

    cs.GT eess.SP eess.SY

    Smart Jammer and LTE Network Strategies in An Infinite-Horizon Zero-Sum Repeated Game with Asymmetric and Incomplete Information

    Authors: Farhan M. Aziz, Lichun Li, Jeff S. Shamma, Gordon L. Stuber

    Abstract: LTE/LTE-Advanced networks are known to be vulnerable to denial-of-service and loss-of-service attacks from smart jammers. In this article, the interaction between a smart jammer and LTE network is modeled as an infinite-horizon, zero-sum, asymmetric repeated game. The smart jammer and eNode B are modeled as the informed and the uninformed player, respectively. The main purpose of this article is t… ▽ More

    Submitted 25 April, 2019; originally announced April 2019.

  11. usBot: A Modular Robotic Testbed for Programmable Self-Assembly

    Authors: Usman A. Fiaz, Jeff S. Shamma

    Abstract: We present the design, characterization, and experimental results for a new modular robotic system for programmable self-assembly. The proposed system uses the Hybrid Cube Model (HCM), which integrates classical features from both deterministic and stochastic self-organization models. Thus, for instance, the modules are passive as far as their locomotion is concerned (stochastic), and yet they pos… ▽ More

    Submitted 29 June, 2019; v1 submitted 5 January, 2019; originally announced January 2019.

    Comments: Accepted as a conference paper at 2019 IFAC Joint MECHATRONICS and NOLCOS

  12. arXiv:1809.08218  [pdf, other

    cs.RO

    Infrastructure-free Localization of Aerial Robots with Ultrawideband Sensors

    Authors: Samet Guler, Mohamed Abdelkader, Jeff S. Shamma

    Abstract: Robots in a swarm take advantage of a motion capture system or GPS sensors to obtain their global position. However, motion capture systems are environment-dependent and GPS sensors are not reliable in occluded environments. For a reliable and versatile operation in a swarm, robots must sense each other and interact locally. Motivated by this requirement, here we propose an on-board localization f… ▽ More

    Submitted 21 September, 2018; originally announced September 2018.

    Comments: 14 pages

  13. arXiv:1804.04449  [pdf, ps, other

    eess.SY cs.SI physics.soc-ph

    Herding Positive, Complex Networks

    Authors: Sebastian F. Ruf, Magnus Egersted, Jeff S. Shamma

    Abstract: The problem of controlling complex networks is of interest to disciplines ranging from biology to swarm robotics. However, controllability can be too strict a condition, failing to capture a range of desirable behaviors. Herdability, which describes the ability to drive a system to a specific set in the state space, was recently introduced as an alternative network control notion. This paper consi… ▽ More

    Submitted 28 April, 2018; v1 submitted 12 April, 2018; originally announced April 2018.

    Comments: Updated the proof of Theorem 2

  14. arXiv:1804.02693  [pdf, ps, other

    cs.LG stat.ML

    Path to Stochastic Stability: Comparative Analysis of Stochastic Learning Dynamics in Games

    Authors: Hassan Jaleel, Jeff S. Shamma

    Abstract: Stochastic stability is a popular solution concept for stochastic learning dynamics in games. However, a critical limitation of this solution concept is its inability to distinguish between different learning rules that lead to the same steady-state behavior. We address this limitation for the first time and develop a framework for the comparative analysis of stochastic learning dynamics with diff… ▽ More

    Submitted 8 April, 2018; originally announced April 2018.

  15. arXiv:1711.02308  [pdf, ps, other

    cs.GT

    Security Strategies of Both Players in Asymmetric Information Zero-Sum Stochastic Games with an Informed Controller

    Authors: Lichun Li, Cedric Langbort, Jeff S. Shamma

    Abstract: This paper considers a zero-sum two-player asymmetric information stochastic game where only one player knows the system state, and the transition law is controlled by the informed player only. For the informed player, it has been shown that the security strategy only depends on the belief and the current stage. We provide LP formulations whose size is only linear in the size of the uninformed pla… ▽ More

    Submitted 7 November, 2017; originally announced November 2017.

    Comments: submitted to special issue in the journal Dynamic Games and Applications

  16. arXiv:1703.01957  [pdf, ps, other

    cs.GT

    An LP Approach for Solving Two-Player Zero-Sum Repeated Bayesian Games

    Authors: Lichun Li, Cedric Langbort, Jeff Shamma

    Abstract: This paper studies two-player zero-sum repeated Bayesian games in which every player has a private type that is unknown to the other player, and the initial probability of the type of every player is publicly known. The types of players are independently chosen according to the initial probabilities, and are kept the same all through the game. At every stage, players simultaneously choose actions,… ▽ More

    Submitted 7 November, 2017; v1 submitted 6 March, 2017; originally announced March 2017.

    Comments: submitted to TAC, under review

  17. arXiv:1703.01952  [pdf, ps, other

    cs.GT

    Efficient Strategy Computation in Zero-Sum Asymmetric Repeated Games

    Authors: Lichun Li, Jeff S. Shamma

    Abstract: Zero-sum asymmetric games model decision making scenarios involving two competing players who have different information about the game being played. A particular case is that of nested information, where one (informed) player has superior information over the other (uninformed) player. This paper considers the case of nested information in repeated zero-sum games and studies the computation of st… ▽ More

    Submitted 7 November, 2017; v1 submitted 6 March, 2017; originally announced March 2017.

    Comments: sumbitted to IEEE TAC, under review

  18. arXiv:1607.02502  [pdf, other

    cs.SI eess.SY physics.soc-ph

    Networked SIS Epidemics with Awareness

    Authors: Keith Paarporn, Ceyhun Eksin, Joshua S. Weitz, Jeff S. Shamma

    Abstract: We study an SIS epidemic process over a static contact network where the nodes have partial information about the epidemic state. They react by limiting their interactions with their neighbors when they believe the epidemic is currently prevalent. A node's awareness is weighted by the fraction of infected neighbors in their social network, and a global broadcast of the fraction of infected nodes i… ▽ More

    Submitted 12 July, 2016; v1 submitted 8 July, 2016; originally announced July 2016.

    Comments: 10 pages, 5 figures

  19. arXiv:1605.00306  [pdf, other

    cs.GT

    BLMA: A Blind Matching Algorithm with Application to Cognitive Radio Networks

    Authors: Doha Hamza, Jeff S. Shamma

    Abstract: We consider a two-sided matching problem with a defined notion of pairwise stability. We propose a distributed blind matching algorithm (BLMA) to solve the problem. We prove the solution produced by BLMA will converge to an $ε$-pairwise stable outcome with probability one. We then consider a matching problem in cognitive radio networks. Secondary users (SUs) are allowed access time to the spectrum… ▽ More

    Submitted 1 May, 2016; originally announced May 2016.

  20. arXiv:1604.03240  [pdf, other

    cs.SI physics.soc-ph

    Disease dynamics on a network game: a little empathy goes a long way

    Authors: Ceyhun Eksin, Jeff S. Shamma, Joshua S. Weitz

    Abstract: Individuals change their behavior during an epidemic in response to whether they and/or those they interact with are healthy or sick. Healthy individuals are concerned about contracting a disease from their sick contacts and may utilize protective measures. Sick individuals may be concerned with spreading the disease to their healthy contacts and adopt preemptive measures. Yet, in practice both pr… ▽ More

    Submitted 15 April, 2016; v1 submitted 12 April, 2016; originally announced April 2016.

    Comments: 27 pages, 9 figures, submitted for publication

  21. arXiv:1512.02160  [pdf, other

    cs.GT

    Learning Efficient Correlated Equilibria

    Authors: Holly P. Borowski, Jason R. Marden, Jeff S. Shamma

    Abstract: The majority of distributed learning literature focuses on convergence to Nash equilibria. Correlated equilibria, on the other hand, can often characterize more efficient collective behavior than even the best Nash equilibrium. However, there are no existing distributed learning algorithms that converge to specific correlated equilibria. In this paper, we provide one such algorithm which guarantee… ▽ More

    Submitted 7 December, 2015; originally announced December 2015.

    Comments: 11 pages, 1 figure

  22. arXiv:1510.08204  [pdf, other

    cs.SI cs.GT math.OC

    Global Games with Noisy Information Sharing

    Authors: Hessam Mahdavifar, Ahmad Beirami, Behrouz Touri, Jeff S. Shamma

    Abstract: Global games form a subclass of games with incomplete information where a set of agents decide actions against a regime with an underlying fundamental $θ$ representing its power. Each agent has access to an independent noisy observation of $θ$. In order to capture the behavior of agents in a social network of information exchange we assume that agents share their observation in a noisy environment… ▽ More

    Submitted 28 October, 2017; v1 submitted 28 October, 2015; originally announced October 2015.

    Comments: Accepted to IEEE Transactions on Signal and Information Processing over Networks

  23. arXiv:1509.00737  [pdf, other

    cs.MA cs.GT eess.SY

    A Game-theoretic Formulation of the Homogeneous Self-Reconfiguration Problem

    Authors: Daniel Pickem, Magnus Egerstedt, Jeff S. Shamma

    Abstract: In this paper we formulate the homogeneous two- and three-dimensional self-reconfiguration problem over discrete grids as a constrained potential game. We develop a game-theoretic learning algorithm based on the Metropolis-Hastings algorithm that solves the self-reconfiguration problem in a globally optimal fashion. Both a centralized and a fully distributed algorithm are presented and we show tha… ▽ More

    Submitted 2 September, 2015; originally announced September 2015.

    Comments: 8 pages, 5 figures, 2 algorithms

  24. arXiv:1505.06379  [pdf, other

    eess.SY cs.GT cs.MA cs.RO math.OC

    Communication-Free Distributed Coverage for Networked Systems

    Authors: A. Yasin Yazicioglu, Magnus Egerstedt, Jeff S. Shamma

    Abstract: In this paper, we present a communication-free algorithm for distributed coverage of an arbitrary network by a group of mobile agents with local sensing capabilities. The network is represented as a graph, and the agents are arbitrarily deployed on some nodes of the graph. Any node of the graph is covered if it is within the sensing range of at least one agent. The agents are mobile devices that a… ▽ More

    Submitted 23 May, 2015; originally announced May 2015.

  25. arXiv:1503.08131  [pdf, other

    cs.MA cs.SI eess.SY math.CO

    Formation of Robust Multi-Agent Networks Through Self-Organizing Random Regular Graphs

    Authors: A. Yasin Yazicioglu, Magnus Egerstedt, Jeff S. Shamma

    Abstract: Multi-agent networks are often modeled as interaction graphs, where the nodes represent the agents and the edges denote some direct interactions. The robustness of a multi-agent network to perturbations such as failures, noise, or malicious attacks largely depends on the corresponding graph. In many applications, networks are desired to have well-connected interaction graphs with relatively small… ▽ More

    Submitted 27 March, 2015; originally announced March 2015.

  26. arXiv:1110.4412  [pdf, ps, other

    cs.GT cs.LG

    Aspiration Learning in Coordination Games

    Authors: Georgios C. Chasparis, Ari Arapostathis, Jeff S. Shamma

    Abstract: We consider the problem of distributed convergence to efficient outcomes in coordination games through dynamics based on aspiration learning. Under aspiration learning, a player continues to play an action as long as the rewards received exceed a specified aspiration level. Here, the aspiration level is a fading memory average of past rewards, and these levels also are subject to occasional random… ▽ More

    Submitted 19 October, 2011; originally announced October 2011.

    Comments: 27 pages

    MSC Class: 68T05; 91A26; 91A22; 93E35; 60J05; 91A80

    Journal ref: SIAM J. Control Optim. 51 (2013), no. 1, 465-490