Skip to main content

Showing 1–29 of 29 results for author: Kamgarpour, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.01793  [pdf, other

    cs.LG cs.AI stat.ML

    Towards the Transferability of Rewards Recovered via Regularized Inverse Reinforcement Learning

    Authors: Andreas Schlaginhaufen, Maryam Kamgarpour

    Abstract: Inverse reinforcement learning (IRL) aims to infer a reward from expert demonstrations, motivated by the idea that the reward, rather than the policy, is the most succinct and transferable description of a task [Ng et al., 2000]. However, the reward corresponding to an optimal policy is not unique, making it unclear if an IRL-learned reward is transferable to new transition laws in the sense that… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  2. arXiv:2405.15497  [pdf, ps, other

    cs.MA

    Finite-time convergence to an $ε$-efficient Nash equilibrium in potential games

    Authors: Anna Maddux, Reda Ouhamma, Maryam Kamgarpour

    Abstract: This paper investigates the convergence time of log-linear learning to an $ε$-efficient Nash equilibrium (NE) in potential games. In such games, an efficient NE is defined as the maximizer of the potential function. Existing results are limited to potential games with stringent structural assumptions and entail exponential convergence times in $1/ε$. Unaddressed so far, we tackle general potential… ▽ More

    Submitted 17 June, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

    Comments: 9 main pages, 25 pages, 1 Table

  3. arXiv:2404.03314  [pdf, other

    cs.GT eess.SY

    Learning to Bid in Forward Electricity Markets Using a No-Regret Algorithm

    Authors: Arega Getaneh Abate, Dorsa Majdi, Jalal Kazempour, Maryam Kamgarpour

    Abstract: It is a common practice in the current literature of electricity markets to use game-theoretic approaches for strategic price bidding. However, they generally rely on the assumption that the strategic bidders have prior knowledge of rival bids, either perfectly or with some uncertainty. This is not necessarily a realistic assumption. This paper takes a different approach by relaxing such an assump… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

  4. arXiv:2403.16829  [pdf, ps, other

    cs.LG cs.AI

    Convergence of a model-free entropy-regularized inverse reinforcement learning algorithm

    Authors: Titouan Renard, Andreas Schlaginhaufen, Tingting Ni, Maryam Kamgarpour

    Abstract: Given a dataset of expert demonstrations, inverse reinforcement learning (IRL) aims to recover a reward for which the expert is optimal. This work proposes a model-free algorithm to solve entropy-regularized IRL problem. In particular, we employ a stochastic gradient descent update for the reward and a stochastic soft policy iteration update for the policy. Assuming access to a generative model, w… ▽ More

    Submitted 23 April, 2024; v1 submitted 25 March, 2024; originally announced March 2024.

  5. arXiv:2312.08008  [pdf, ps, other

    cs.GT cs.LG

    Learning Nash Equilibria in Zero-Sum Markov Games: A Single Time-scale Algorithm Under Weak Reachability

    Authors: Reda Ouhamma, Maryam Kamgarpour

    Abstract: We consider decentralized learning for zero-sum games, where players only see their payoff information and are agnostic to actions and payoffs of the opponent. Previous works demonstrated convergence to a Nash equilibrium in this setting using double time-scale algorithms under strong reachability assumptions. We address the open problem of achieving an approximate Nash equilibrium efficiently wit… ▽ More

    Submitted 24 May, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

    Comments: arXiv admin note: text overlap with arXiv:2303.03100 by other authors

  6. arXiv:2312.00561  [pdf, other

    cs.LG math.OC

    A safe exploration approach to constrained Markov decision processes

    Authors: Tingting Ni, Maryam Kamgarpour

    Abstract: We consider discounted infinite horizon constrained Markov decision processes (CMDPs) where the goal is to find an optimal policy that maximizes the expected cumulative reward subject to expected cumulative constraints. Motivated by the application of CMDPs in online learning of safety-critical systems, we focus on develo** a model-free and simulator-free algorithm that ensures constraint satisf… ▽ More

    Submitted 23 May, 2024; v1 submitted 1 December, 2023; originally announced December 2023.

    Comments: 37 pages, 3 figures

  7. arXiv:2310.14685  [pdf, other

    cs.GT eess.SY

    Multi-Agent Learning in Contextual Games under Unknown Constraints

    Authors: Anna M. Maddux, Maryam Kamgarpour

    Abstract: We consider the problem of learning to play a repeated contextual game with unknown reward and unknown constraints functions. Such games arise in applications where each agent's action needs to belong to a feasible set, but the feasible set is a priori unknown. For example, in constrained multi-agent reinforcement learning, the constraints on the agents' policies are a function of the unknown dyna… ▽ More

    Submitted 14 January, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

    Journal ref: International Conference on Artificial Intelligence and Statistics 2024

  8. arXiv:2306.00629  [pdf, other

    cs.LG cs.AI eess.SY math.OC

    Identifiability and Generalizability in Constrained Inverse Reinforcement Learning

    Authors: Andreas Schlaginhaufen, Maryam Kamgarpour

    Abstract: Two main challenges in Reinforcement Learning (RL) are designing appropriate reward functions and ensuring the safety of the learned policy. To address these challenges, we present a theoretical framework for Inverse Reinforcement Learning (IRL) in constrained Markov decision processes. From a convex-analytic perspective, we extend prior results on reward identifiability and generalizability to bo… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: Published at ICML 2023

  9. arXiv:2212.12724  [pdf, other

    cs.RO eess.SY math.OC

    Certification of Bottleneck Task Assignment with Shortest Path Criteria

    Authors: Tony A. Wood, Maryam Kamgarpour

    Abstract: Minimising the longest travel distance for a group of mobile robots with interchangeable goals requires knowledge of the shortest length paths between all robots and goal destinations. Determining the exact length of the shortest paths in an environment with obstacles is NP-hard however. In this paper, we investigate when polynomial-time approximations of the shortest path search are sufficient to… ▽ More

    Submitted 8 June, 2023; v1 submitted 24 December, 2022; originally announced December 2022.

  10. arXiv:2207.10415  [pdf, other

    math.OC cs.LG

    Log Barriers for Safe Black-box Optimization with Application to Safe Reinforcement Learning

    Authors: Ilnura Usmanova, Yarden As, Maryam Kamgarpour, Andreas Krause

    Abstract: Optimizing noisy functions online, when evaluating the objective requires experiments on a deployed system, is a crucial task arising in manufacturing, robotics and many others. Often, constraints on safe inputs are unknown ahead of time, and we only obtain noisy information, indicating how close we are to violating the constraints. Yet, safety must be guaranteed at all times, not only for the fin… ▽ More

    Submitted 2 June, 2023; v1 submitted 21 July, 2022; originally announced July 2022.

    Comments: 36 pages, 9 pages of appendix

  11. arXiv:2203.07322  [pdf, other

    cs.LG cs.MA

    Efficient Model-based Multi-agent Reinforcement Learning via Optimistic Equilibrium Computation

    Authors: Pier Giuseppe Sessa, Maryam Kamgarpour, Andreas Krause

    Abstract: We consider model-based multi-agent reinforcement learning, where the environment transition model is unknown and can only be learned via expensive interactions with the environment. We propose H-MARL (Hallucinated Multi-Agent Reinforcement Learning), a novel sample-efficient algorithm that can efficiently balance exploration, i.e., learning about the environment, and exploitation, i.e., achieve g… ▽ More

    Submitted 10 July, 2022; v1 submitted 14 March, 2022; originally announced March 2022.

  12. arXiv:2202.11147  [pdf, ps, other

    math.OC cs.MA

    On the Rate of Convergence of Payoff-based Algorithms to Nash Equilibrium in Strongly Monotone Games

    Authors: Tatiana Tatarenko, Maryam Kamgarpour

    Abstract: We derive the rate of convergence to Nash equilibria for the payoff-based algorithm proposed in \cite{tat_kam_TAC}. These rates are achieved under the standard assumption of convexity of the game, strong monotonicity and differentiability of the pseudo-gradient. In particular, we show the algorithm achieves $O(\frac{1}{T})$ in the two-point function evaluating setting and $O(\frac{1}{\sqrt{T}})$ i… ▽ More

    Submitted 22 February, 2022; originally announced February 2022.

  13. Safe Motion Planning against Multimodal Distributions based on a Scenario Approach

    Authors: Hee** Ahn, Colin Chen, Ian M. Mitchell, Maryam Kamgarpour

    Abstract: We present the design of a motion planning algorithm that ensures safety for an autonomous vehicle. In particular, we consider a multimodal distribution over uncertainties; for example, the uncertain predictions of future trajectories of surrounding vehicles reflect discrete decisions, such as turning or going straight at intersections. We develop a computationally efficient, scenario-based approa… ▽ More

    Submitted 5 August, 2021; originally announced August 2021.

    Comments: Published in IEEE Control Systems Letters

    Journal ref: in IEEE Control Systems Letters, vol. 6, pp. 1142-1147, 2022

  14. arXiv:2107.06327  [pdf, other

    cs.GT cs.LG

    Contextual Games: Multi-Agent Learning with Side Information

    Authors: Pier Giuseppe Sessa, Ilija Bogunovic, Andreas Krause, Maryam Kamgarpour

    Abstract: We formulate the novel class of contextual games, a type of repeated games driven by contextual information at each round. By means of kernel-based regularity assumptions, we model the correlation between different contexts and game outcomes and propose a novel online (meta) algorithm that exploits such correlations to minimize the contextual regret of individual players. We define game-theoretic… ▽ More

    Submitted 13 July, 2021; originally announced July 2021.

    Journal ref: Proc. of Neural Information Processing Systems (NeurIPS), 2020

  15. arXiv:2103.01840  [pdf, other

    cs.RO math.OC

    Multi-robot task allocation for safe planning against stochastic hazard dynamics

    Authors: Daniel Tihanyi, Yimeng Lu, Orcun Karaca, Maryam Kamgarpour

    Abstract: We address multi-robot safe mission planning in uncertain dynamic environments. This problem arises in several applications including safety-critical exploration, surveillance, and emergency rescue missions. Computation of a multi-robot optimal control policy is challenging not only because of the complexity of incorporating dynamic uncertainties while planning, but also because of the exponential… ▽ More

    Submitted 13 November, 2022; v1 submitted 2 March, 2021; originally announced March 2021.

  16. A market-based approach for enabling inter-area reserve exchange

    Authors: Orcun Karaca, Stefanos Delikaraoglou, Maryam Kamgarpour

    Abstract: Considering the sequential clearing of energy and reserves in Europe, enabling inter-area reserve exchange requires optimally allocating inter-area transmission capacities between these two markets. To achieve this, we provide a market-based allocation framework and derive payments with desirable properties. The proposed min-max least core selecting payments achieve individual rationality, budget… ▽ More

    Submitted 17 February, 2021; originally announced February 2021.

    Journal ref: Operations Research Letters, 49(4), 501-506, 2021

  17. arXiv:2007.05271  [pdf, other

    cs.LG cs.AI stat.ML

    Learning to Play Sequential Games versus Unknown Opponents

    Authors: Pier Giuseppe Sessa, Ilija Bogunovic, Maryam Kamgarpour, Andreas Krause

    Abstract: We consider a repeated sequential game between a learner, who plays first, and an opponent who responds to the chosen action. We seek to design strategies for the learner to successfully interact with the opponent. While most previous approaches consider known opponent models, we focus on the setting in which the opponent's model is unknown. To this end, we use kernel-based regularity assumptions… ▽ More

    Submitted 10 July, 2020; originally announced July 2020.

  18. arXiv:2003.02913  [pdf, other

    cs.RO cs.AI

    Safe Mission Planning under Dynamical Uncertainties

    Authors: Yimeng Lu, Maryam Kamgarpour

    Abstract: This paper considers safe robot mission planning in uncertain dynamical environments. This problem arises in applications such as surveillance, emergency rescue, and autonomous driving. It is a challenging problem due to modeling and integrating dynamical uncertainties into a safe planning framework, and finding a solution in a computationally tractable way. In this work, we first develop a probab… ▽ More

    Submitted 5 March, 2020; originally announced March 2020.

    Comments: This paper appears in ICRA 2020

  19. arXiv:2002.12613  [pdf, other

    cs.LG stat.ML

    Mixed Strategies for Robust Optimization of Unknown Objectives

    Authors: Pier Giuseppe Sessa, Ilija Bogunovic, Maryam Kamgarpour, Andreas Krause

    Abstract: We consider robust optimization problems, where the goal is to optimize an unknown objective function against the worst-case realization of an uncertain parameter. For this setting, we design a novel sample-efficient algorithm GP-MRO, which sequentially learns about the unknown objective from noisy point evaluations. GP-MRO seeks to discover a robust and randomized mixed strategy, that maximizes t… ▽ More

    Submitted 2 March, 2020; v1 submitted 28 February, 2020; originally announced February 2020.

  20. No-Regret Learning from Partially Observed Data in Repeated Auctions

    Authors: Orcun Karaca, Pier Giuseppe Sessa, Anna Leidi, Maryam Kamgarpour

    Abstract: We study a general class of repeated auctions, such as the ones found in electricity markets, as multi-agent games between the bidders. In such a repeated setting, bidders can adapt their strategies online based on the data observed in the previous auction rounds. Moreover, if no-regret algorithms are employed by the bidders to update their strategies, the game is known to converge to a coarse-cor… ▽ More

    Submitted 20 December, 2019; originally announced December 2019.

    Journal ref: IFAC-PapersOnLine, 53(2), 14-19, 2020

  21. arXiv:1909.08540  [pdf, other

    cs.LG cs.GT cs.MA stat.ML

    No-Regret Learning in Unknown Games with Correlated Payoffs

    Authors: Pier Giuseppe Sessa, Ilija Bogunovic, Maryam Kamgarpour, Andreas Krause

    Abstract: We consider the problem of learning to play a repeated multi-agent game with an unknown reward function. Single player online learning algorithms attain strong regret bounds when provided with full information feedback, which unfortunately is unavailable in many real-world scenarios. Bandit feedback alone, i.e., observing outcomes only for the selected action, yields substantially worse performanc… ▽ More

    Submitted 28 October, 2019; v1 submitted 18 September, 2019; originally announced September 2019.

  22. arXiv:1904.01882  [pdf, ps, other

    cs.MA cs.GT

    Learning Nash Equilibria in Monotone Games

    Authors: Tatiana Tatarenko, Maryam Kamgarpour

    Abstract: We consider multi-agent decision making where each agent's cost function depends on all agents' strategies. We propose a distributed algorithm to learn a Nash equilibrium, whereby each agent uses only obtained values of her cost function at each joint played action, lacking any information of the functional form of her cost or other agents' costs or strategy sets. In contrast to past work where co… ▽ More

    Submitted 3 April, 2019; originally announced April 2019.

  23. arXiv:1903.00950  [pdf, ps, other

    cs.GT

    Bounding Inefficiency of Equilibria in Continuous Actions Games using Submodularity and Curvature

    Authors: Pier Giuseppe Sessa, Maryam Kamgarpour, Andreas Krause

    Abstract: Games with continuous strategy sets arise in several machine learning problems (e.g. adversarial learning). For such games, simple no-regret learning algorithms exist in several cases and ensure convergence to coarse correlated equilibria (CCE). The efficiency of such equilibria with respect to a social function, however, is not well understood. In this paper, we define the class of valid utility… ▽ More

    Submitted 3 March, 2019; originally announced March 2019.

  24. Core-Selecting Mechanisms in Electricity Markets

    Authors: Orcun Karaca, Maryam Kamgarpour

    Abstract: Due to its theoretical virtues, several recent works propose the use of the incentive-compatible Vickrey-Clarke-Groves (VCG) mechanism for electricity markets. Coalitions of participants, however, can influence the VCG outcome to obtain higher collective profit. To address this issue, we propose core-selecting mechanisms for their coalition-proofness. We show that core-selecting mechanisms general… ▽ More

    Submitted 23 November, 2018; originally announced November 2018.

    Journal ref: IEEE Transactions on Smart Grid, 11(3), 2604 - 2614, 2020

  25. arXiv:1806.05069  [pdf, ps, other

    math.OC cs.LG

    Minimizing Regret of Bandit Online Optimization in Unconstrained Action Spaces

    Authors: Tatiana Tatarenko, Maryam Kamgarpour

    Abstract: We consider online convex optimization with a zero-order oracle feedback. In particular, the decision maker does not know the explicit representation of the time-varying cost functions, or their gradients. At each time step, she observes the value of the corresponding cost function evaluated at her chosen action (zero-order oracle). The objective is to minimize the regret, that is, the difference… ▽ More

    Submitted 2 May, 2020; v1 submitted 13 June, 2018; originally announced June 2018.

  26. arXiv:1803.11030  [pdf, other

    cs.GT math.OC

    Exploiting Weak Supermodularity for Coalition-Proof Mechanisms

    Authors: Orcun Karaca, Maryam Kamgarpour

    Abstract: Under the incentive-compatible Vickrey-Clarke-Groves mechanism, coalitions of participants can influence the auction outcome to obtain higher collective profit. These manipulations were proven to be eliminated if and only if the market objective is supermodular. Nevertheless, several auctions do not satisfy the stringent conditions for supermodularity. These auctions include electricity markets, w… ▽ More

    Submitted 23 November, 2018; v1 submitted 29 March, 2018; originally announced March 2018.

  27. Designing Coalition-Proof Reverse Auctions over Continuous Goods

    Authors: Orcun Karaca, Pier Giuseppe Sessa, Neil Walton, Maryam Kamgarpour

    Abstract: This paper investigates reverse auctions that involve continuous values of different types of goods, general nonconvex constraints, and second stage costs. We seek to design the payment rules and conditions under which coalitions of participants cannot influence the auction outcome in order to obtain higher collective utility. Under the incentive-compatible Vickrey-Clarke-Groves mechanism, we show… ▽ More

    Submitted 31 December, 2018; v1 submitted 17 November, 2017; originally announced November 2017.

    Journal ref: IEEE Transactions on Automatic Control, 64(11), 4803-4810, 2019

  28. arXiv:1702.08789  [pdf, other

    eess.SY cs.GT math.OC

    Nash and Wardrop equilibria in aggregative games with coupling constraints

    Authors: Dario Paccagnan, Basilio Gentile, Francesca Parise, Maryam Kamgarpour, John Lygeros

    Abstract: We consider the framework of aggregative games, in which the cost function of each agent depends on his own strategy and on the average population strategy. As first contribution, we investigate the relations between the concepts of Nash and Wardrop equilibrium. By exploiting a characterization of the two equilibria as solutions of variational inequalities, we bound their distance with a decreasin… ▽ More

    Submitted 30 April, 2018; v1 submitted 28 February, 2017; originally announced February 2017.

    Comments: IEEE Trans. on Automatic Control (Accepted without changes). The first three authors contributed equally

  29. arXiv:1611.03044  [pdf, other

    cs.GT

    Exploring Vickrey-Clarke-Groves Mechanism for Electricity Markets

    Authors: Pier Giuseppe Sessa, Neil Walton, Maryam Kamgarpour

    Abstract: Control reserves are power generation or consumption entities that ensure balance of supply and demand of electricity in real-time. In many countries, they are operated through a market mechanism in which entities provide bids. The system operator determines the accepted bids based on an optimization algorithm. We develop the Vickrey-Clarke-Groves (VCG) mechanism for these electricity markets. We… ▽ More

    Submitted 21 November, 2016; v1 submitted 9 November, 2016; originally announced November 2016.