Skip to main content

Showing 1–8 of 8 results for author: Gerstgrasser, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.01413  [pdf, other

    cs.LG cs.AI cs.CL cs.ET stat.ML

    Is Model Collapse Inevitable? Breaking the Curse of Recursion by Accumulating Real and Synthetic Data

    Authors: Matthias Gerstgrasser, Rylan Schaeffer, Apratim Dey, Rafael Rafailov, Henry Sleight, John Hughes, Tomasz Korbak, Rajashree Agrawal, Dhruv Pai, Andrey Gromov, Daniel A. Roberts, Diyi Yang, David L. Donoho, Sanmi Koyejo

    Abstract: The proliferation of generative models, combined with pretraining on web-scale data, raises a timely question: what happens when these models are trained on their own generated outputs? Recent investigations into model-data feedback loops proposed that such loops would lead to a phenomenon termed model collapse, under which performance progressively degrades with each model-data feedback iteration… ▽ More

    Submitted 29 April, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

  2. arXiv:2311.09144  [pdf, other

    cs.CL cs.HC

    Grounding Gaps in Language Model Generations

    Authors: Omar Shaikh, Kristina Gligorić, Ashna Khetan, Matthias Gerstgrasser, Diyi Yang, Dan Jurafsky

    Abstract: Effective conversation requires common ground: a shared understanding between the participants. Common ground, however, does not emerge spontaneously in conversation. Speakers and listeners work together to both identify and construct a shared basis while avoiding misunderstanding. To accomplish grounding, humans rely on a range of dialogue acts, like clarification (What do you mean?) and acknowle… ▽ More

    Submitted 2 April, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

    Comments: NAACL 2024; 18 pages, 2 figures

  3. arXiv:2311.00865  [pdf, other

    cs.LG cs.AI cs.MA cs.RO

    Selectively Sharing Experiences Improves Multi-Agent Reinforcement Learning

    Authors: Matthias Gerstgrasser, Tom Danino, Sarah Keren

    Abstract: We present a novel multi-agent RL approach, Selective Multi-Agent Prioritized Experience Relay, in which agents share with other agents a limited number of transitions they observe during training. The intuition behind this is that even a small number of relevant experiences from other agents could help each agent learn. Unlike many other multi-agent RL algorithms, this approach allows for largely… ▽ More

    Submitted 23 April, 2024; v1 submitted 1 November, 2023; originally announced November 2023.

    Comments: published at NeurIPS 2023

    Journal ref: Advances in Neural Information Processing Systems, 36 (2023)

  4. arXiv:2210.11942  [pdf, other

    cs.GT cs.AI cs.LG cs.MA

    Oracles & Followers: Stackelberg Equilibria in Deep Multi-Agent Reinforcement Learning

    Authors: Matthias Gerstgrasser, David C. Parkes

    Abstract: Stackelberg equilibria arise naturally in a range of popular learning problems, such as in security games or indirect mechanism design, and have received increasing attention in the reinforcement learning literature. We present a general framework for implementing Stackelberg equilibria search as a multi-agent RL problem, allowing a wide range of algorithmic design choices. We discuss how previous… ▽ More

    Submitted 1 June, 2023; v1 submitted 19 October, 2022; originally announced October 2022.

  5. arXiv:2210.03852  [pdf, other

    cs.GT cs.MA

    Stackelberg POMDP: A Reinforcement Learning Approach for Economic Design

    Authors: Gianluca Brero, Alon Eden, Darshan Chakrabarti, Matthias Gerstgrasser, Amy Greenwald, Vincent Li, David C. Parkes

    Abstract: We introduce a reinforcement learning framework for economic design where the interaction between the environment designer and the participants is modeled as a Stackelberg game. In this game, the designer (leader) sets up the rules of the economic system, while the participants (followers) respond strategically. We integrate algorithms for determining followers' response strategies into the leader… ▽ More

    Submitted 9 November, 2023; v1 submitted 7 October, 2022; originally announced October 2022.

  6. arXiv:2111.06614  [pdf, other

    cs.LG cs.MA

    Collaboration Promotes Group Resilience in Multi-Agent AI

    Authors: Sarah Keren, Matthias Gerstgrasser, Ofir Abu, Jeffrey Rosenschein

    Abstract: AI agents need to be robust to unexpected changes in their environment in order to safely operate in real-world scenarios. While some work has been done on this type of robustness in the single-agent case, in this work we introduce the idea that collaboration with other agents can help agents adapt to environment perturbations in multi-agent reinforcement learning settings. We first formalize this… ▽ More

    Submitted 9 December, 2022; v1 submitted 12 November, 2021; originally announced November 2021.

    ACM Class: I.2.11, I.2.6

  7. arXiv:2010.01180  [pdf, other

    cs.GT cs.AI cs.LG

    Reinforcement Learning of Sequential Price Mechanisms

    Authors: Gianluca Brero, Alon Eden, Matthias Gerstgrasser, David C. Parkes, Duncan Rheingans-Yoo

    Abstract: We introduce the use of reinforcement learning for indirect mechanisms, working with the existing class of sequential price mechanisms, which generalizes both serial dictatorship and posted price mechanisms and essentially characterizes all strongly obviously strategyproof mechanisms. Learning an optimal mechanism within this class forms a partially-observable Markov decision process. We provide r… ▽ More

    Submitted 5 May, 2021; v1 submitted 2 October, 2020; originally announced October 2020.

  8. arXiv:1811.05130  [pdf, ps, other

    cs.GT

    Multi-unit Bilateral Trade

    Authors: Matthias Gerstgrasser, Paul W. Goldberg, Bart de Keijzer, Philip Lazos, Alexander Skopalik

    Abstract: We characterise the set of dominant strategy incentive compatible (DSIC), strongly budget balanced (SBB), and ex-post individually rational (IR) mechanisms for the multi-unit bilateral trade setting. In such a setting there is a single buyer and a single seller who holds a finite number k of identical items. The mechanism has to decide how many units of the item are transferred from the seller to… ▽ More

    Submitted 13 November, 2018; originally announced November 2018.