Skip to main content

Showing 1–12 of 12 results for author: Oesterheld, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.15970  [pdf, ps, other

    cs.GT cs.AI cs.CC

    Imperfect-Recall Games: Equilibrium Concepts and Their Complexity

    Authors: Emanuel Tewolde, Brian Hu Zhang, Caspar Oesterheld, Manolis Zampetakis, Tuomas Sandholm, Paul W. Goldberg, Vincent Conitzer

    Abstract: We investigate optimal decision making under imperfect recall, that is, when an agent forgets information it once held before. An example is the absentminded driver game, as well as team games in which the members have limited communication capabilities. In the framework of extensive-form games with imperfect recall, we analyze the computational complexities of finding equilibria in multiplayer se… ▽ More

    Submitted 22 June, 2024; originally announced June 2024.

    Comments: Long version of the paper that got accepted to the Thirty-Third International Joint Conference on Artificial Intelligence (IJCAI 2024). 35 pages, 10 figures, 1 table

    MSC Class: 91A05; 91A06; 91A10; 91A11; 91A18; 91A35; 91A68; 68T37; 68Q17; 68Q25 ACM Class: I.2; J.4; F.2

  2. arXiv:2402.08128  [pdf, other

    cs.AI cs.GT

    Recursive Joint Simulation in Games

    Authors: Vojtech Kovarik, Caspar Oesterheld, Vincent Conitzer

    Abstract: Game-theoretic dynamics between AI agents could differ from traditional human-human interactions in various ways. One such difference is that it may be possible to accurately simulate an AI agent, for example because its source code is known. Our aim is to explore ways of leveraging this possibility to achieve more cooperative outcomes in strategic settings. In this paper, we study an interaction… ▽ More

    Submitted 1 March, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

  3. arXiv:2402.06626  [pdf, other

    cs.GT

    Computing Optimal Commitments to Strategies and Outcome-Conditional Utility Transfers

    Authors: Nathaniel Sauerberg, Caspar Oesterheld

    Abstract: Prior work has studied the computational complexity of computing optimal strategies to commit to in Stackelberg or leadership games, where a leader commits to a strategy which is observed by one or more followers. We extend this setting to one where the leader can additionally commit to outcome-conditional utility transfers. We characterize the computational complexity of finding optimal strategie… ▽ More

    Submitted 10 March, 2024; v1 submitted 9 February, 2024; originally announced February 2024.

    Comments: AAMAS 2024

  4. arXiv:2307.05068  [pdf, ps, other

    cs.AI cs.GT cs.LG

    A Theory of Bounded Inductive Rationality

    Authors: Caspar Oesterheld, Abram Demski, Vincent Conitzer

    Abstract: The dominant theories of rational choice assume logical omniscience. That is, they assume that when facing a decision problem, an agent can perform all relevant computations and determine the truth value of all relevant logical/mathematical claims. This assumption is unrealistic when, for example, we offer bets on remote digits of pi or when an agent faces a computationally intractable planning pr… ▽ More

    Submitted 11 July, 2023; originally announced July 2023.

    Comments: In Proceedings TARK 2023, arXiv:2307.04005

    ACM Class: I.2

    Journal ref: EPTCS 379, 2023, pp. 421-440

  5. arXiv:2305.17805  [pdf, other

    cs.GT cs.AI cs.CC

    The Computational Complexity of Single-Player Imperfect-Recall Games

    Authors: Emanuel Tewolde, Caspar Oesterheld, Vincent Conitzer, Paul W. Goldberg

    Abstract: We study single-player extensive-form games with imperfect recall, such as the Slee** Beauty problem or the Absentminded Driver game. For such games, two natural equilibrium concepts have been proposed as alternative solution concepts to ex-ante optimality. One equilibrium concept uses generalized double halving (GDH) as a belief system and evidential decision theory (EDT), and another one uses… ▽ More

    Submitted 28 May, 2023; originally announced May 2023.

    Comments: Long version of the paper that got accepted to the Thirty-Second International Joint Conference on Artificial Intelligence (IJCAI-23). 10 pages and 2 figures in the main body. 17 pages and 4 figures in the appendix

    MSC Class: 91A18; 68T37; 68Q17; 91A35 ACM Class: I.2; J.4; F.2

  6. arXiv:2305.17601  [pdf, other

    cs.AI

    Incentivizing honest performative predictions with proper scoring rules

    Authors: Caspar Oesterheld, Johannes Treutlein, Emery Cooper, Rubi Hudson

    Abstract: Proper scoring rules incentivize experts to accurately report beliefs, assuming predictions cannot influence outcomes. We relax this assumption and investigate incentives when predictions are performative, i.e., when they can influence the outcome of the prediction, such as when making public predictions about the stock market. We say a prediction is a fixed point if it accurately reflects the exp… ▽ More

    Submitted 30 May, 2023; v1 submitted 27 May, 2023; originally announced May 2023.

    Comments: Accepted for the 39th Conference on Uncertainty in Artificial Intelligence (UAI 2023)

  7. arXiv:2305.11261  [pdf, other

    cs.GT

    Game Theory with Simulation of Other Players

    Authors: Vojtech Kovarik, Caspar Oesterheld, Vincent Conitzer

    Abstract: Game-theoretic interactions with AI agents could differ from traditional human-human interactions in various ways. One such difference is that it may be possible to simulate an AI agent (for example because its source code is known), which allows others to accurately predict the agent's actions. This could lower the bar for trust and cooperation. In this paper, we formalize games in which one play… ▽ More

    Submitted 19 March, 2024; v1 submitted 18 May, 2023; originally announced May 2023.

    Comments: The latest version fixes some typos in the proof of Theorem 5

  8. arXiv:2211.14468  [pdf, other

    cs.GT cs.AI cs.LG cs.MA

    Similarity-based cooperative equilibrium

    Authors: Caspar Oesterheld, Johannes Treutlein, Roger Grosse, Vincent Conitzer, Jakob Foerster

    Abstract: As machine learning agents act more autonomously in the world, they will increasingly interact with each other. Unfortunately, in many social dilemmas like the one-shot Prisoner's Dilemma, standard game theory predicts that ML agents will fail to cooperate with each other. Prior work has shown that one way to enable cooperative outcomes in the one-shot Prisoner's Dilemma is to make the agents mutu… ▽ More

    Submitted 12 November, 2023; v1 submitted 25 November, 2022; originally announced November 2022.

    Comments: Published at NeurIPS 2023. 32 pages, 9 figures

    MSC Class: 91A10 (Primary) 91A05 91A26 91A35 (Secondary) ACM Class: I.2.11

  9. arXiv:2211.05057  [pdf, ps, other

    cs.GT

    A Note on the Compatibility of Different Robust Program Equilibria of the Prisoner's Dilemma

    Authors: Caspar Oesterheld

    Abstract: We study a program game version of the Prisoner's Dilemma, i.e., a two-player game in which each player submits a computer program, the programs are given read access to each other's source code and then choose whether to cooperate or defect. Prior work has introduced various programs that form cooperative equilibria against themselves in this game. For example, the $ε$-grounded Fair Bot cooperate… ▽ More

    Submitted 9 November, 2022; originally announced November 2022.

    Comments: 8 pages, 1 table

    MSC Class: 91A44

  10. arXiv:2207.03470  [pdf, other

    cs.GT cs.AI cs.LG cs.MA

    For Learning in Symmetric Teams, Local Optima are Global Nash Equilibria

    Authors: Scott Emmons, Caspar Oesterheld, Andrew Critch, Vincent Conitzer, Stuart Russell

    Abstract: Although it has been known since the 1970s that a globally optimal strategy profile in a common-payoff game is a Nash equilibrium, global optimality is a strict requirement that limits the result's applicability. In this work, we show that any locally optimal symmetric strategy profile is also a (global) Nash equilibrium. Furthermore, we show that this result is robust to perturbations to the comm… ▽ More

    Submitted 7 July, 2022; originally announced July 2022.

  11. arXiv:2106.06613  [pdf, other

    cs.AI cs.LG

    A New Formalism, Method and Open Issues for Zero-Shot Coordination

    Authors: Johannes Treutlein, Michael Dennis, Caspar Oesterheld, Jakob Foerster

    Abstract: In many coordination problems, independently reasoning humans are able to discover mutually compatible policies. In contrast, independently trained self-play policies are often mutually incompatible. Zero-shot coordination (ZSC) has recently been proposed as a new frontier in multi-agent reinforcement learning to address this fundamental issue. Prior work approaches the ZSC problem by assuming pla… ▽ More

    Submitted 12 July, 2023; v1 submitted 11 June, 2021; originally announced June 2021.

  12. Formalizing Preference Utilitarianism in Physical World Models

    Authors: Caspar Oesterheld

    Abstract: Most ethical work is done at a low level of formality. This makes practical moral questions inaccessible to formal and natural sciences and can lead to misunderstandings in ethical discussion. In this paper, we use Bayesian inference to introduce a formalization of preference utilitarianism in physical world models, specifically cellular automata. Even though our formalization is not immediately a… ▽ More

    Submitted 30 November, 2015; v1 submitted 21 April, 2015; originally announced April 2015.

    Comments: 14 pages, 3 figures