Skip to main content

Showing 1–36 of 36 results for author: Marchesi, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.14372  [pdf, ps, other

    cs.LG

    Learning Constrained Markov Decision Processes With Non-stationary Rewards and Constraints

    Authors: Francesco Emanuele Stradi, Anna Lunghi, Matteo Castiglioni, Alberto Marchesi, Nicola Gatti

    Abstract: In constrained Markov decision processes (CMDPs) with adversarial rewards and constraints, a well-known impossibility result prevents any algorithm from attaining both sublinear regret and sublinear constraint violation, when competing against a best-in-hindsight policy that satisfies constraints on average. In this paper, we show that this negative result can be eased in CMDPs with non-stationary… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  2. arXiv:2405.06977  [pdf, ps, other

    cs.GT

    The Sample Complexity of Stackelberg Games

    Authors: Francesco Bacchiocchi, Matteo Bollini, Matteo Castiglioni, Alberto Marchesi, Nicola Gatti

    Abstract: Stackelberg games (SGs) constitute the most fundamental and acclaimed models of strategic interactions involving some form of commitment. Moreover, they form the basis of more elaborate models of this kind, such as, e.g., Bayesian persuasion and principal-agent problems. Addressing learning tasks in SGs and related models is crucial to operationalize them in practice, where model parameters are us… ▽ More

    Submitted 11 May, 2024; originally announced May 2024.

  3. arXiv:2403.03672  [pdf, ps, other

    cs.LG

    Learning Adversarial MDPs with Stochastic Hard Constraints

    Authors: Francesco Emanuele Stradi, Matteo Castiglioni, Alberto Marchesi, Nicola Gatti

    Abstract: We study online learning problems in constrained Markov decision processes (CMDPs) with adversarial losses and stochastic hard constraints. We consider two different scenarios. In the first one, we address general CMDPs, where we design an algorithm that attains sublinear regret and cumulative positive constraints violation. In the second scenario, under the mild assumption that a policy strictly… ▽ More

    Submitted 20 March, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

  4. arXiv:2402.13156  [pdf, ps, other

    cs.GT

    Regret-Minimizing Contracts: Agency Under Uncertainty

    Authors: Martino Bernasconi, Matteo Castiglioni, Alberto Marchesi

    Abstract: We study the fundamental problem of designing contracts in principal-agent problems under uncertainty. Previous works mostly addressed Bayesian settings in which principal's uncertainty is modeled as a probability distribution over agent's types. In this paper, we study a setting in which the principal has no distributional information about agent's type. In particular, in our setting, the princip… ▽ More

    Submitted 21 February, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

  5. arXiv:2402.03077  [pdf, ps, other

    cs.GT cs.LG

    Markov Persuasion Processes: Learning to Persuade from Scratch

    Authors: Francesco Bacchiocchi, Francesco Emanuele Stradi, Matteo Castiglioni, Alberto Marchesi, Nicola Gatti

    Abstract: In Bayesian persuasion, an informed sender strategically discloses information to a receiver so as to persuade them to undertake desirable actions. Recently, a growing attention has been devoted to settings in which sender and receivers interact sequentially. Recently, Markov persuasion processes (MPPs) have been introduced to capture sequential scenarios where a sender faces a stream of myopic re… ▽ More

    Submitted 6 March, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

  6. arXiv:2309.09801  [pdf, ps, other

    cs.GT cs.LG

    Learning Optimal Contracts: How to Exploit Small Action Spaces

    Authors: Francesco Bacchiocchi, Matteo Castiglioni, Alberto Marchesi, Nicola Gatti

    Abstract: We study principal-agent problems in which a principal commits to an outcome-dependent payment scheme -- called contract -- in order to induce an agent to take a costly, unobservable action leading to favorable outcomes. We consider a generalization of the classical (single-round) version of the problem in which the principal interacts with the agent by committing to contracts over multiple rounds… ▽ More

    Submitted 7 June, 2024; v1 submitted 18 September, 2023; originally announced September 2023.

  7. arXiv:2306.12221  [pdf, other

    cs.GT

    Persuading Farsighted Receivers in MDPs: the Power of Honesty

    Authors: Martino Bernasconi, Matteo Castiglioni, Alberto Marchesi, Mirco Mutti

    Abstract: Bayesian persuasion studies the problem faced by an informed sender who strategically discloses information to influence the behavior of an uninformed receiver. Recently, a growing attention has been devoted to settings where the sender and the receiver interact sequentially, in which the receiver's decision-making problem is usually modeled as a Markov decision process (MDP). However, previous wo… ▽ More

    Submitted 21 June, 2023; originally announced June 2023.

  8. arXiv:2304.14326  [pdf, ps, other

    cs.LG

    A Best-of-Both-Worlds Algorithm for Constrained MDPs with Long-Term Constraints

    Authors: Jacopo Germano, Francesco Emanuele Stradi, Gianmarco Genalti, Matteo Castiglioni, Alberto Marchesi, Nicola Gatti

    Abstract: We study online learning in episodic constrained Markov decision processes (CMDPs), where the goal of the learner is to collect as much reward as possible over the episodes, while guaranteeing that some long-term constraints are satisfied during the learning process. Rewards and constraints can be selected either stochastically or adversarially, and the transition function is not known to the lear… ▽ More

    Submitted 27 April, 2023; originally announced April 2023.

  9. arXiv:2303.01296  [pdf, ps, other

    cs.GT

    Optimal Rates and Efficient Algorithms for Online Bayesian Persuasion

    Authors: Martino Bernasconi, Matteo Castiglioni, Andrea Celli, Alberto Marchesi, Nicola Gatti, Francesco Trovò

    Abstract: Bayesian persuasion studies how an informed sender should influence beliefs of rational receivers who take decisions through Bayesian updating of a common prior. We focus on the online Bayesian persuasion framework, in which the sender repeatedly faces one or more receivers with unknown and adversarially selected types. First, we show how to obtain a tight $\tilde O(T^{1/2})$ regret bound in the c… ▽ More

    Submitted 2 March, 2023; originally announced March 2023.

  10. arXiv:2301.13790  [pdf, ps, other

    cs.GT

    Selling Information while Being an Interested Party

    Authors: Matteo Castiglioni, Francesco Bacchiocchi, Alberto Marchesi, Giulia Romano, Nicola Gatti

    Abstract: We study the algorithmic problem faced by an information holder (seller) who wants to optimally sell such information to a budged-constrained decision maker (buyer) that has to undertake some action. Differently from previous, we consider the case in which the seller is an interested party, as the action chosen by the buyer does not only influence their utility, but also seller's one. This happens… ▽ More

    Submitted 31 January, 2023; originally announced January 2023.

  11. arXiv:2301.13654  [pdf, ps, other

    cs.GT

    Multi-Agent Contract Design: How to Commission Multiple Agents with Individual Outcome

    Authors: Matteo Castiglioni, Alberto Marchesi, Nicola Gatti

    Abstract: We study hidden-action principal-agent problems with multiple agents. These are problems in which a principal commits to an outcome-dependent payment scheme in order to incentivize some agents to take costly, unobservable actions that lead to favorable outcomes. Previous works on multi-agent problems study models where the principal observes a single outcome determined by the actions of all the ag… ▽ More

    Submitted 31 January, 2023; originally announced January 2023.

  12. arXiv:2301.13600  [pdf, ps, other

    cs.GT

    Constrained Phi-Equilibria

    Authors: Martino Bernasconi, Matteo Castiglioni, Alberto Marchesi, Francesco Trovò, Nicola Gatti

    Abstract: The computational study of equilibria involving constraints on players' strategies has been largely neglected. However, in real-world applications, players are usually subject to constraints ruling out the feasibility of some of their strategies, such as, e.g., safety requirements and budget caps. Computational studies on constrained versions of the Nash equilibrium have lead to some results under… ▽ More

    Submitted 31 January, 2023; originally announced January 2023.

  13. arXiv:2209.07454  [pdf, ps, other

    cs.LG math.OC

    A Unifying Framework for Online Optimization with Long-Term Constraints

    Authors: Matteo Castiglioni, Andrea Celli, Alberto Marchesi, Giulia Romano, Nicola Gatti

    Abstract: We study online learning problems in which a decision maker has to take a sequence of decisions subject to $m$ long-term constraints. The goal of the decision maker is to maximize their total reward, while at the same time achieving small cumulative constraints violation across the $T$ rounds. We present the first best-of-both-world type algorithm for this general class of problems, with no-regret… ▽ More

    Submitted 15 September, 2022; originally announced September 2022.

  14. arXiv:2209.03927  [pdf, other

    cs.LG cs.AI cs.GT

    Sequential Information Design: Learning to Persuade in the Dark

    Authors: Martino Bernasconi, Matteo Castiglioni, Alberto Marchesi, Nicola Gatti, Francesco Trovo

    Abstract: We study a repeated information design problem faced by an informed sender who tries to influence the behavior of a self-interested receiver. We consider settings where the receiver faces a sequential decision making (SDM) problem. At each round, the sender observes the realizations of random events in the SDM problem. This begets the challenge of how to incrementally disclose such information to… ▽ More

    Submitted 8 September, 2022; originally announced September 2022.

  15. arXiv:2208.08238  [pdf, other

    cs.GT

    Last-iterate Convergence to Trembling-hand Perfect Equilibria

    Authors: Martino Bernasconi, Alberto Marchesi, Francesco Trovò

    Abstract: Designing efficient algorithms to find Nash equilibrium (NE) refinements in sequential games is of paramount importance in practice. Indeed, it is well known that the NE has several weaknesses, since it may prescribe to play sub-optimal actions in those parts of the game that are never reached at the equilibrium. NE refinements, such as the extensive-form perfect equilibrium (EFPE), amend such wea… ▽ More

    Submitted 17 August, 2022; originally announced August 2022.

  16. arXiv:2204.13772  [pdf, other

    cs.GT

    The Power of Media Agencies in Ad Auctions: Improving Utility through Coordinated Bidding

    Authors: Giulia Romano, Matteo Castiglioni, Alberto Marchesi, Nicola Gatti

    Abstract: The increasing competition in digital advertising induced a proliferation of media agencies playing the role of intermediaries between advertisers and platforms selling ad slots. When a group of competing advertisers is managed by a common agency, many forms of collusion, such as bid rigging, can be implemented by coordinating bidding strategies, dramatically increasing advertisers' value. We stud… ▽ More

    Submitted 28 April, 2022; originally announced April 2022.

  17. arXiv:2202.10966  [pdf, ps, other

    cs.GT

    Designing Menus of Contracts Efficiently: The Power of Randomization

    Authors: Matteo Castiglioni, Alberto Marchesi, Nicola Gatti

    Abstract: We study hidden-action principal-agent problems in which a principal commits to an outcome-dependent payment scheme (called contract) so as to incentivize the agent to take a costly, unobservable action leading to favorable outcomes. In particular, we focus on Bayesian settings where the agent has private information. This is collectively encoded by the agent's type, which is unknown to the princi… ▽ More

    Submitted 17 August, 2022; v1 submitted 22 February, 2022; originally announced February 2022.

  18. arXiv:2202.00605  [pdf, ps, other

    cs.GT

    Bayesian Persuasion Meets Mechanism Design: Going Beyond Intractability with Type Reporting

    Authors: Matteo Castiglioni, Alberto Marchesi, Nicola Gatti

    Abstract: Bayesian persuasion studies how an informed sender should partially disclose information so as to influence the behavior of self-interested receivers. In the last years, a growing attention has been devoted to relaxing the assumption that the sender perfectly knows receiver's payoffs. The first crucial step towards such an achievement is to study settings where each receiver's payoffs depend on th… ▽ More

    Submitted 1 September, 2022; v1 submitted 1 February, 2022; originally announced February 2022.

  19. arXiv:2201.12275  [pdf, other

    cs.GT

    Efficiency of Ad Auctions with Price Displaying

    Authors: Matteo Castiglioni, Diodato Ferraioli, Nicola Gatti, Alberto Marchesi, Giulia Romano

    Abstract: Most of the economic reports forecast that almost half of the worldwide market value unlocked by AI over the next decade (up to 6 trillion USD per year) will be in marketing&sales. In particular, AI will enable the optimization of more and more intricate economic settings, in which multiple different activities need to be jointly automated. This is the case of, e.g., Google Hotel Ads and Tripadvis… ▽ More

    Submitted 28 January, 2022; originally announced January 2022.

  20. arXiv:2201.12183  [pdf, other

    cs.GT

    Signaling in Posted Price Auctions

    Authors: Matteo Castiglioni, Giulia Romano, Alberto Marchesi, Nicola Gatti

    Abstract: We study single-item single-unit Bayesian posted price auctions, where buyers arrive sequentially and their valuations for the item being sold depend on a random, unknown state of nature. The seller has complete knowledge of the actual state and can send signals to the buyers so as to disclose information about it. For instance, the state of nature may reflect the condition and/or some particular… ▽ More

    Submitted 29 March, 2022; v1 submitted 28 January, 2022; originally announced January 2022.

  21. arXiv:2201.09728  [pdf, other

    cs.GT

    Public Signaling in Bayesian Ad Auctions

    Authors: Francesco Bacchiocchi, Matteo Castiglioni, Alberto Marchesi, Giulia Romano, Nicola Gatti

    Abstract: We study signaling in Bayesian ad auctions, in which bidders' valuations depend on a random, unknown state of nature. The auction mechanism has complete knowledge of the actual state of nature, and it can send signals to bidders so as to disclose information about the state and increase revenue. For instance, a state may collectively encode some features of the user that are known to the mechanism… ▽ More

    Submitted 24 January, 2022; originally announced January 2022.

  22. arXiv:2106.06480  [pdf, ps, other

    cs.GT cs.AI

    Multi-Receiver Online Bayesian Persuasion

    Authors: Matteo Castiglioni, Alberto Marchesi, Andrea Celli, Nicola Gatti

    Abstract: Bayesian persuasion studies how an informed sender should partially disclose information to influence the behavior of a self-interested receiver. Classical models make the stringent assumption that the sender knows the receiver's utility. This can be relaxed by considering an online learning framework in which the sender repeatedly faces a receiver of an unknown, adversarially selected type. We st… ▽ More

    Submitted 11 June, 2021; originally announced June 2021.

  23. arXiv:2106.00319  [pdf, ps, other

    cs.GT

    Bayesian Agency: Linear versus Tractable Contracts

    Authors: Matteo Castiglioni, Alberto Marchesi, Nicola Gatti

    Abstract: We study principal-agent problems in which a principal commits to an outcome-dependent payment scheme (a.k.a. contract) so as to induce an agent to take a costly, unobservable action. We relax the assumption that the principal perfectly knows the agent by considering a Bayesian setting where the agent's type is unknown and randomly selected according to a given probability distribution, which is k… ▽ More

    Submitted 1 June, 2021; originally announced June 2021.

  24. arXiv:2104.01520  [pdf, ps, other

    cs.GT cs.LG cs.MA

    Simple Uncoupled No-Regret Learning Dynamics for Extensive-Form Correlated Equilibrium

    Authors: Gabriele Farina, Andrea Celli, Alberto Marchesi, Nicola Gatti

    Abstract: The existence of simple uncoupled no-regret learning dynamics that converge to correlated equilibria in normal-form games is a celebrated result in the theory of multi-agent systems. Specifically, it has been known for more than 20 years that when all players seek to minimize their internal regret in a repeated normal-form game, the empirical frequency of play converges to a normal-form correlated… ▽ More

    Submitted 27 May, 2021; v1 submitted 3 April, 2021; originally announced April 2021.

    Comments: Extended version of our NeurIPS 2020 paper. Compared to the conference version, this preprint gives finer, in-high-probability regret bounds. We also better connected our work to the phi-regret minimization framework

  25. arXiv:2012.06528  [pdf, ps, other

    cs.GT

    Trembling-Hand Perfection and Correlation in Sequential Games

    Authors: Alberto Marchesi, Nicola Gatti

    Abstract: We initiate the study of trembling-hand perfection in sequential (i.e., extensive-form) games with correlation. We introduce the extensive-form perfect correlated equilibrium (EFPCE) as a refinement of the classical extensive-form correlated equilibrium (EFCE) that amends its weaknesses off the equilibrium path. This is achieved by accounting for the possibility that players may make mistakes whil… ▽ More

    Submitted 11 December, 2020; originally announced December 2020.

  26. arXiv:2012.05774  [pdf, other

    cs.GT

    Online Posted Pricing with Unknown Time-Discounted Valuations

    Authors: Giulia Romano, Gianluca Tartaglia, Alberto Marchesi, Nicola Gatti

    Abstract: We study the problem of designing posted-price mechanisms in order to sell a single unit of a single item within a finite period of time. Motivated by real-world problems, such as, e.g., long-term rental of rooms and apartments, we assume that customers arrive online according to a Poisson process, and their valuations are drawn from an unknown distribution and discounted over time. We evaluate ou… ▽ More

    Submitted 10 December, 2020; originally announced December 2020.

  27. arXiv:2004.00603  [pdf, other

    cs.GT cs.AI cs.LG cs.MA

    No-Regret Learning Dynamics for Extensive-Form Correlated Equilibrium

    Authors: Andrea Celli, Alberto Marchesi, Gabriele Farina, Nicola Gatti

    Abstract: The existence of simple, uncoupled no-regret dynamics that converge to correlated equilibria in normal-form games is a celebrated result in the theory of multi-agent systems. Specifically, it has been known for more than 20 years that when all players seek to minimize their internal regret in a repeated normal-form game, the empirical frequency of play converges to a normal-form correlated equilib… ▽ More

    Submitted 2 September, 2022; v1 submitted 1 April, 2020; originally announced April 2020.

  28. arXiv:2002.05190  [pdf, ps, other

    cs.GT cs.AI

    Signaling in Bayesian Network Congestion Games: the Subtle Power of Symmetry

    Authors: Matteo Castiglioni, Andrea Celli, Alberto Marchesi, Nicola Gatti

    Abstract: Network congestion games are a well-understood model of multi-agent strategic interactions. Despite their ubiquitous applications, it is not clear whether it is possible to design information structures to ameliorate the overall experience of the network users. We focus on Bayesian games with atomic players, where network vagaries are modeled via a (random) state of nature which determines the cos… ▽ More

    Submitted 12 February, 2020; originally announced February 2020.

  29. arXiv:1911.07755  [pdf, other

    cs.GT cs.LG

    Learning Probably Approximately Correct Maximin Strategies in Simulation-Based Games with Infinite Strategy Spaces

    Authors: Alberto Marchesi, Francesco Trovò, Nicola Gatti

    Abstract: We tackle the problem of learning equilibria in simulation-based games. In such games, the players' utility functions cannot be described analytically, as they are given through a black-box simulator that can be queried to obtain noisy estimates of the utilities. This is the case in many real-world games in which a complete description of the elements involved is not available upfront, such as com… ▽ More

    Submitted 25 February, 2020; v1 submitted 18 November, 2019; originally announced November 2019.

  30. arXiv:1910.06228  [pdf, other

    cs.GT

    Learning to Correlate in Multi-Player General-Sum Sequential Games

    Authors: Andrea Celli, Alberto Marchesi, Tommaso Bianchi, Nicola Gatti

    Abstract: In the context of multi-player, general-sum games, there is an increasing interest in solution concepts modeling some form of communication among players, since they can lead to socially better outcomes with respect to Nash equilibria, and may be reached through learning dynamics in a decentralized fashion. In this paper, we focus on coarse correlated equilibria (CCEs) in sequential games. First,… ▽ More

    Submitted 14 October, 2019; originally announced October 2019.

  31. arXiv:1905.13108  [pdf, other

    cs.GT

    Leadership in Congestion Games: Multiple User Classes and Non-Singleton Actions (Extended Version)

    Authors: Alberto Marchesi, Matteo Castiglioni, Nicola Gatti

    Abstract: We study the problem of finding Stackelberg equilibria in games with a massive number of players. So far, the only known game instances in which the problem is solved in polynomial time are some particular congestion games. However, a complete characterization of hard and easy instances is still lacking. In this paper, we extend the state of the art along two main directions. First, we focus on ga… ▽ More

    Submitted 30 May, 2019; originally announced May 2019.

  32. arXiv:1905.13106  [pdf, other

    cs.GT

    Be a Leader or Become a Follower: The Strategy to Commit to with Multiple Leaders (Extended Version)

    Authors: Matteo Castiglioni, Alberto Marchesi, Nicola Gatti

    Abstract: We study the problem of computing correlated strategies to commit to in games with multiple leaders and followers. To the best of our knowledge, this problem is widely unexplored so far, as the majority of the works in the literature focus on games with a single leader and one or more followers. The fundamental ingredient of our model is that a leader can decide whether to participate in the commi… ▽ More

    Submitted 30 May, 2019; originally announced May 2019.

  33. arXiv:1811.03871  [pdf, other

    cs.GT

    Quasi-Perfect Stackelberg Equilibrium

    Authors: Alberto Marchesi, Gabriele Farina, Christian Kroer, Nicola Gatti, Tuomas Sandholm

    Abstract: Equilibrium refinements are important in extensive-form (i.e., tree-form) games, where they amend weaknesses of the Nash equilibrium concept by requiring sequential rationality and other beneficial properties. One of the most attractive refinement concepts is quasi-perfect equilibrium. While quasi-perfection has been studied in extensive-form games, it is poorly understood in Stackelberg settings-… ▽ More

    Submitted 9 November, 2018; originally announced November 2018.

  34. arXiv:1808.10209  [pdf, other

    cs.GT

    Leadership in Singleton Congestion Games: What is Hard and What is Easy

    Authors: Matteo Castiglioni, Alberto Marchesi, Nicola Gatti, Stefano Coniglio

    Abstract: We study the problem of computing Stackelberg equilibria Stackelberg games whose underlying structure is in congestion games, focusing on the case where each player can choose a single resource (a.k.a. singleton congestion games) and one of them acts as leader. In particular, we address the cases where the players either have the same action spaces (i.e., the set of resources they can choose is th… ▽ More

    Submitted 30 August, 2018; originally announced August 2018.

  35. arXiv:1808.01438  [pdf, other

    cs.GT

    Computing a Pessimistic Leader-Follower Equilibrium with Multiple Followers: the Mixed-Pure Case

    Authors: Stefano Coniglio, Nicola Gatti, Alberto Marchesi

    Abstract: The search problem of computing a \textit{leader-follower equilibrium} has been widely investigated in the scientific literature in, almost exclusively, the single-follower setting. Although the \textit{optimistic} and \ textit{pessimistic} versions of the problem are solved with different methodologies, both cases allow for efficient, polynomial-time algorithms based on linear programming. The si… ▽ More

    Submitted 4 August, 2018; originally announced August 2018.

  36. arXiv:1807.11914  [pdf, other

    cs.AI cs.GT

    Computing the Strategy to Commit to in Polymatrix Games (Extended Version)

    Authors: Giuseppe De Nittis, Alberto Marchesi, Nicola Gatti

    Abstract: Leadership games provide a powerful paradigm to model many real-world settings. Most literature focuses on games with a single follower who acts optimistically, breaking ties in favour of the leader. Unfortunately, for real-world applications, this is unlikely. In this paper, we look for efficiently solvable games with multiple followers who play either optimistically or pessimistically, i.e., bre… ▽ More

    Submitted 31 July, 2018; originally announced July 2018.