Skip to main content

Showing 1–5 of 5 results for author: Bargiacchi, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2204.05036  [pdf, other

    cs.LG cs.AI

    Pareto Conditioned Networks

    Authors: Mathieu Reymond, Eugenio Bargiacchi, Ann Nowé

    Abstract: In multi-objective optimization, learning all the policies that reach Pareto-efficient solutions is an expensive process. The set of optimal policies can grow exponentially with the number of objectives, and recovering all solutions requires an exhaustive exploration of the entire state space. We propose Pareto Conditioned Networks (PCN), a method that uses a single neural network to encompass all… ▽ More

    Submitted 11 April, 2022; originally announced April 2022.

    Comments: Accepted at the International Conference on Autonomous Agents and Multiagent Systems (AAMAS) 2022

  2. A Practical Guide to Multi-Objective Reinforcement Learning and Planning

    Authors: Conor F. Hayes, Roxana Rădulescu, Eugenio Bargiacchi, Johan Källström, Matthew Macfarlane, Mathieu Reymond, Timothy Verstraeten, Luisa M. Zintgraf, Richard Dazeley, Fredrik Heintz, Enda Howley, Athirai A. Irissappane, Patrick Mannion, Ann Nowé, Gabriel Ramos, Marcello Restelli, Peter Vamplew, Diederik M. Roijers

    Abstract: Real-world decision-making tasks are generally complex, requiring trade-offs between multiple, often conflicting, objectives. Despite this, the majority of research in reinforcement learning and decision-theoretic planning either assumes only a single objective, or that multiple objectives can be adequately handled via a simple linear combination. Such approaches may oversimplify the underlying pr… ▽ More

    Submitted 17 March, 2021; originally announced March 2021.

    Journal ref: Auton Agent Multi-Agent Syst 36, 26 (2022)

  3. arXiv:2101.07844  [pdf, other

    cs.LG cs.AI eess.SY

    Scalable Optimization for Wind Farm Control using Coordination Graphs

    Authors: Timothy Verstraeten, Pieter-Jan Daems, Eugenio Bargiacchi, Diederik M. Roijers, Pieter J. K. Libin, Jan Helsen

    Abstract: Wind farms are a crucial driver toward the generation of ecological and renewable energy. Due to their rapid increase in capacity, contemporary wind farms need to adhere to strict constraints on power output to ensure stability of the electricity grid. Specifically, a wind farm controller is required to match the farm's power production with a power demand imposed by the grid operator. This is a n… ▽ More

    Submitted 19 January, 2021; originally announced January 2021.

  4. arXiv:2001.07527  [pdf, other

    cs.LG cs.AI cs.MA stat.ML

    Model-based Multi-Agent Reinforcement Learning with Cooperative Prioritized Swee**

    Authors: Eugenio Bargiacchi, Timothy Verstraeten, Diederik M. Roijers, Ann Nowé

    Abstract: We present a new model-based reinforcement learning algorithm, Cooperative Prioritized Swee**, for efficient learning in multi-agent Markov decision processes. The algorithm allows for sample-efficient learning on large problems by exploiting a factorization to approximate the value function. Our approach only requires knowledge about the structure of the problem in the form of a dynamic decisio… ▽ More

    Submitted 15 January, 2020; originally announced January 2020.

  5. arXiv:1911.10120  [pdf, other

    cs.LG cs.AI cs.MA stat.ML

    Multi-Agent Thompson Sampling for Bandit Applications with Sparse Neighbourhood Structures

    Authors: Timothy Verstraeten, Eugenio Bargiacchi, Pieter JK Libin, Jan Helsen, Diederik M Roijers, Ann Nowé

    Abstract: Multi-agent coordination is prevalent in many real-world applications. However, such coordination is challenging due to its combinatorial nature. An important observation in this regard is that agents in the real world often only directly affect a limited set of neighbouring agents. Leveraging such loose couplings among agents is key to making coordination in multi-agent systems feasible. In this… ▽ More

    Submitted 7 February, 2020; v1 submitted 22 November, 2019; originally announced November 2019.

    Journal ref: Sci Rep 10, 6728 (2020)