Showing 1–11 of 11 results for author: Grammel, N

  1. arXiv:2209.07520  [pdf, ps, other]

    cs.DS cs.DM math.CO

    On (Random-order) Online Contention Resolution Schemes for the Matching Polytope of (Bipartite) Graphs

    Authors: Calum MacRury, Will Ma, Nathaniel Grammel

    Abstract: Online Contention Resolution Schemes (OCRS's) represent a modern tool for selecting a subset of elements, subject to resource constraints, when the elements are presented to the algorithm sequentially. OCRS's have led to some of the best-known competitive ratio guarantees for online resource allocation problems, with the added benefit of treating different online decisions -- accept/reject, probin…

    Submitted 1 April, 2024; v1 submitted 15 September, 2022; originally announced September 2022.

    Comments: This version improves our previous bipartite RCRS lower bound from 0.4761 to 0.4789

    ACM Class: F.2.2; G.2.2

    Journal ref: SODA 2023

  2. The Stochastic Boolean Function Evaluation Problem for Symmetric Boolean Functions

    Authors: Dimitrios Gkenosis, Nathaniel Grammel, Lisa Hellerstein, Devorah Kletenik

    Abstract: We give two approximation algorithms solving the Stochastic Boolean Function Evaluation (SBFE) problem for symmetric Boolean functions. The first is an $O(\log n)$-approximation algorithm, based on the submodular goal-value approach of Deshpande, Hellerstein and Kletenik. Our second algorithm, which is simple, is based on the algorithm solving the SBFE problem for $k$-of-$n$ functions, due to Sall…

    Submitted 4 January, 2022; v1 submitted 16 November, 2021; originally announced November 2021.

    Comments: Preliminary versions of these results appeared on arXiv in arXiv:1806.10660. That paper contains results for both arbitrary costs and unit costs. This paper considers only arbitrary costs. Updated January 2022 to include journal information

    Journal ref: Discrete Applied Mathematics 309 (2022), 269-277

  3. arXiv:2106.06892  [pdf, ps, other]

    cs.DS cs.DM

    Improved Guarantees for Offline Stochastic Matching via new Ordered Contention Resolution Schemes

    Authors: Brian Brubach, Nathaniel Grammel, Will Ma, Aravind Srinivasan

    Abstract: Matching is one of the most fundamental and broadly applicable problems across many domains. In these diverse real-world applications, there is often a degree of uncertainty in the input which has led to the study of stochastic matching models. Here, each edge in the graph has a known, independent probability of existing derived from some prediction. Algorithms must probe edges to determine existe…

    Submitted 8 August, 2022; v1 submitted 12 June, 2021; originally announced June 2021.

    Comments: Full version of NeurIPS 2021 paper

  4. arXiv:2009.14471  [pdf, other]

    cs.LG cs.MA stat.ML

    PettingZoo: Gym for Multi-Agent Reinforcement Learning

    Authors: J. K. Terry, Benjamin Black, Nathaniel Grammel, Mario Jayakumar, Ananth Hari, Ryan Sullivan, Luis Santos, Rodrigo Perez, Caroline Horsch, Clemens Dieffendahl, Niall L. Williams, Yashas Lokesh, Praveen Ravi

    Abstract: This paper introduces the PettingZoo library and the accompanying Agent Environment Cycle ("AEC") games model. PettingZoo is a library of diverse sets of multi-agent environments with a universal, elegant Python API. PettingZoo was developed with the goal of accelerating research in Multi-Agent Reinforcement Learning ("MARL"), by making work more interchangeable, accessible and reproducible akin…

    Submitted 26 October, 2021; v1 submitted 30 September, 2020; originally announced September 2020.

  5. arXiv:2009.13051   

    cs.LG cs.AI cs.GT cs.MA stat.ML

    Agent Environment Cycle Games

    Authors: J K Terry, Nathaniel Grammel, Benjamin Black, Ananth Hari, Caroline Horsch, Luis Santos

    Abstract: Partially Observable Stochastic Games (POSGs) are the most general and common model of games used in Multi-Agent Reinforcement Learning (MARL). We argue that the POSG model is conceptually ill suited to software MARL environments, and offer case studies from the literature where this mismatch has led to severely unexpected behavior. In response to this, we introduce the Agent Environment Cycle G…

    Submitted 1 May, 2021; v1 submitted 28 September, 2020; originally announced September 2020.

    Comments: The work of this paper has been merged into the paper "PettingZoo: Gym for Multi-Agent Reinforcement Learning" arXiv:2009.14471

  6. arXiv:2008.03325  [pdf, ps, other]

    cs.DS

    Stochastic Optimization and Learning for Two-Stage Supplier Problems

    Authors: Brian Brubach, Nathaniel Grammel, David G. Harris, Aravind Srinivasan, Leonidas Tsepenekas, Anil Vullikanti

    Abstract: The main focus of this paper is radius-based (supplier) clustering in the two-stage stochastic setting with recourse, where the inherent stochasticity of the model comes in the form of a budget constraint. In addition to the standard (homogeneous) setting where all clients must be within a distance $R$ of the nearest facility, we provide results for the more general problem where the radius demand…

    Submitted 7 April, 2024; v1 submitted 7 August, 2020; originally announced August 2020.

  7. arXiv:2006.06870   

    cs.MA cs.AI cs.LG

    Multi-Agent Informational Learning Processes

    Authors: J. K. Terry, Nathaniel Grammel

    Abstract: We introduce a new mathematical model of multi-agent reinforcement learning, the Multi-Agent Informational Learning Processor "MAILP" model. The model is based on the notion that agents have policies for a certain amount of information, models how this information iteratively evolves and propagates through many agents. This model is very general, and the only meaningful assumption made is that l…

    Submitted 25 February, 2021; v1 submitted 11 June, 2020; originally announced June 2020.

    Comments: We are withdrawing this paper as section 2.1.1 implicitly assumes information gain at all points is homogenous. A researcher has provided us an example showing that this assumption causes our model to make unexpected and pathological predictions, and we are aware of no way to remove this assumption from our work

  8. arXiv:2005.13625  [pdf, other]

    cs.LG cs.AI cs.MA stat.ML

    Revisiting Parameter Sharing in Multi-Agent Deep Reinforcement Learning

    Authors: J. K. Terry, Nathaniel Grammel, Sanghyun Son, Benjamin Black, Aakriti Agrawal

    Abstract: Parameter sharing, where each agent independently learns a policy with fully shared parameters between all policies, is a popular baseline method for multi-agent deep reinforcement learning. Unfortunately, since all agents share the same policy network, they cannot learn different policies or tasks. This issue has been circumvented experimentally by adding an agent-specific indicator signal to obs…

    Submitted 31 October, 2023; v1 submitted 27 May, 2020; originally announced May 2020.

  9. arXiv:1907.03963  [pdf, ps, other]

    cs.DS cs.DM cs.MA math.CO math.PR

    Online Matching Frameworks under Stochastic Rewards, Product Ranking, and Unknown Patience

    Authors: Brian Brubach, Nathaniel Grammel, Will Ma, Aravind Srinivasan

    Abstract: We study generalizations of online bipartite matching in which each arriving vertex (customer) views a ranked list of offline vertices (products) and matches to (purchases) the first one they deem acceptable. The number of products that the customer has patience to view can be stochastic and dependent on the products seen. We develop a framework that views the interaction with each customer as an…

    Submitted 26 June, 2023; v1 submitted 8 July, 2019; originally announced July 2019.

  10. arXiv:1806.10660  [pdf, other]

    cs.DS

    The Stochastic Score Classification Problem

    Authors: Dimitrios Gkenosis, Nathaniel Grammel, Lisa Hellerstein, Devorah Kletenik

    Abstract: Consider the following Stochastic Score Classification Problem. A doctor is assessing a patient's risk of developing a certain disease, and can perform $n$ tests on the patient. Each test has a binary outcome, positive or negative. A positive test result is an indication of risk, and a patient's score is the total number of positive test results. The doctor needs to classify the patient into one o…

    Submitted 27 June, 2018; originally announced June 2018.

  11. arXiv:1603.03158  [pdf, other]

    cs.DS cs.LG

    Scenario Submodular Cover

    Authors: Nathaniel Grammel, Lisa Hellerstein, Devorah Kletenik, Patrick Lin

    Abstract: Many problems in Machine Learning can be modeled as submodular optimization problems. Recent work has focused on stochastic or adaptive versions of these problems. We consider the Scenario Submodular Cover problem, which is a counterpart to the Stochastic Submodular Cover problem studied by Golovin and Krause. In Scenario Submodular Cover, the goal is to produce a cover with minimum expected cost,…

    Submitted 10 March, 2016; originally announced March 2016.

    Comments: 32 pages, 1 figure