Skip to main content

Showing 1–4 of 4 results for author: Marcotte, P

.
  1. arXiv:2311.17190  [pdf, other

    cs.LG cs.AI cs.MA

    Minimax Exploiter: A Data Efficient Approach for Competitive Self-Play

    Authors: Daniel Bairamian, Philippe Marcotte, Joshua Romoff, Gabriel Robert, Derek Nowrouzezahrai

    Abstract: Recent advances in Competitive Self-Play (CSP) have achieved, or even surpassed, human level performance in complex game environments such as Dota 2 and StarCraft II using Distributed Multi-Agent Reinforcement Learning (MARL). One core component of these methods relies on creating a pool of learning agents -- consisting of the Main Agent, past versions of this agent, and Exploiter Agents -- where… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

  2. arXiv:2112.11731  [pdf, other

    cs.LG cs.AI

    Graph augmented Deep Reinforcement Learning in the GameRLand3D environment

    Authors: Edward Beeching, Maxim Peter, Philippe Marcotte, Jilles Debangoye, Olivier Simonin, Joshua Romoff, Christian Wolf

    Abstract: We address planning and navigation in challenging 3D video games featuring maps with disconnected regions reachable by agents using special actions. In this setting, classical symbolic planners are not applicable or difficult to adapt. We introduce a hybrid technique combining a low level policy trained with reinforcement learning and a graph based high level classical planner. In addition to prov… ▽ More

    Submitted 22 December, 2021; originally announced December 2021.

  3. arXiv:1601.05678  [pdf, other

    math.OC

    Achieving an optimal trade-off between revenue and energy peak within a smart grid environment

    Authors: Sezin Afsar, Luce Brotcorne, Patrice Marcotte, Gilles Savard

    Abstract: We consider an energy provider whose goal is to simultaneously set revenue-maximizing prices and meet a peak load constraint. In our bilevel setting, the provider acts as a leader (upper level) that takes into account a smart grid (lower level) that minimizes the sum of users' disutilities. The latter bases its decisions on the hourly prices set by the leader, as well as the schedule preferences s… ▽ More

    Submitted 21 January, 2016; originally announced January 2016.

  4. arXiv:cs/0409054  [pdf, ps, other

    cs.GT math.OC

    An Approximation Algorithm for Stackelberg Network Pricing

    Authors: S. Roch, P. Marcotte, G. Savard

    Abstract: We consider the problem of maximizing the revenue raised from tolls set on the arcs of a transportation network, under the constraint that users are assigned to toll-compatible shortest paths. We first prove that this problem is strongly NP-hard. We then provide a polynomial time algorithm with a worst-case precision guarantee of ${1/2}\log_2 m_T+1$, where $m_T$ denotes the number of toll arcs.… ▽ More

    Submitted 26 September, 2004; originally announced September 2004.

    Comments: 38 pages