Skip to main content

Showing 1–12 of 12 results for author: Verstraeten, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2301.12822  [pdf, other

    cs.LG cs.AI

    Evaluating COVID-19 vaccine allocation policies using Bayesian $m$-top exploration

    Authors: Alexandra Cimpean, Timothy Verstraeten, Lander Willem, Niel Hens, Ann Nowé, Pieter Libin

    Abstract: Individual-based epidemiological models support the study of fine-grained preventive measures, such as tailored vaccine allocation policies, in silico. As individual-based models are computationally intensive, it is pivotal to identify optimal strategies within a reasonable computational budget. Moreover, due to the high societal impact associated with the implementation of preventive strategies,… ▽ More

    Submitted 30 January, 2023; originally announced January 2023.

  2. arXiv:2207.00368  [pdf, other

    cs.AI cs.LG

    Multi-Objective Coordination Graphs for the Expected Scalarised Returns with Generative Flow Models

    Authors: Conor F. Hayes, Timothy Verstraeten, Diederik M. Roijers, Enda Howley, Patrick Mannion

    Abstract: Many real-world problems contain multiple objectives and agents, where a trade-off exists between objectives. Key to solving such problems is to exploit sparse dependency structures that exist between agents. For example, in wind farm control a trade-off exists between maximising power and minimising stress on the systems components. Dependencies between turbines arise due to the wake effect. We m… ▽ More

    Submitted 1 July, 2022; originally announced July 2022.

  3. Expected Scalarised Returns Dominance: A New Solution Concept for Multi-Objective Decision Making

    Authors: Conor F. Hayes, Timothy Verstraeten, Diederik M. Roijers, Enda Howley, Patrick Mannion

    Abstract: In many real-world scenarios, the utility of a user is derived from the single execution of a policy. In this case, to apply multi-objective reinforcement learning, the expected utility of the returns must be optimised. Various scenarios exist where a user's preferences over objectives (also known as the utility function) are unknown or difficult to specify. In such scenarios, a set of optimal pol… ▽ More

    Submitted 1 July, 2022; v1 submitted 2 June, 2021; originally announced June 2021.

  4. A Practical Guide to Multi-Objective Reinforcement Learning and Planning

    Authors: Conor F. Hayes, Roxana Rădulescu, Eugenio Bargiacchi, Johan Källström, Matthew Macfarlane, Mathieu Reymond, Timothy Verstraeten, Luisa M. Zintgraf, Richard Dazeley, Fredrik Heintz, Enda Howley, Athirai A. Irissappane, Patrick Mannion, Ann Nowé, Gabriel Ramos, Marcello Restelli, Peter Vamplew, Diederik M. Roijers

    Abstract: Real-world decision-making tasks are generally complex, requiring trade-offs between multiple, often conflicting, objectives. Despite this, the majority of research in reinforcement learning and decision-theoretic planning either assumes only a single objective, or that multiple objectives can be adequately handled via a simple linear combination. Such approaches may oversimplify the underlying pr… ▽ More

    Submitted 17 March, 2021; originally announced March 2021.

    Journal ref: Auton Agent Multi-Agent Syst 36, 26 (2022)

  5. arXiv:2101.07844  [pdf, other

    cs.LG cs.AI eess.SY

    Scalable Optimization for Wind Farm Control using Coordination Graphs

    Authors: Timothy Verstraeten, Pieter-Jan Daems, Eugenio Bargiacchi, Diederik M. Roijers, Pieter J. K. Libin, Jan Helsen

    Abstract: Wind farms are a crucial driver toward the generation of ecological and renewable energy. Due to their rapid increase in capacity, contemporary wind farms need to adhere to strict constraints on power output to ensure stability of the electricity grid. Specifically, a wind farm controller is required to match the farm's power production with a power demand imposed by the grid operator. This is a n… ▽ More

    Submitted 19 January, 2021; originally announced January 2021.

  6. arXiv:2011.07290  [pdf, other

    cs.MA cs.AI cs.GT cs.LG

    Opponent Learning Awareness and Modelling in Multi-Objective Normal Form Games

    Authors: Roxana Rădulescu, Timothy Verstraeten, Yijie Zhang, Patrick Mannion, Diederik M. Roijers, Ann Nowé

    Abstract: Many real-world multi-agent interactions consider multiple distinct criteria, i.e. the payoffs are multi-objective in nature. However, the same multi-objective payoff vector may lead to different utilities for each participant. Therefore, it is essential for an agent to learn about the behaviour of other agents in the system. In this work, we present the first study of the effects of such opponent… ▽ More

    Submitted 14 November, 2020; originally announced November 2020.

    Comments: Under review since 14 November 2020

  7. arXiv:2003.13676  [pdf, other

    cs.LG cs.AI cs.MA

    Deep reinforcement learning for large-scale epidemic control

    Authors: Pieter Libin, Arno Moonens, Timothy Verstraeten, Fabian Perez-San**es, Niel Hens, Philippe Lemey, Ann Nowé

    Abstract: Epidemics of infectious diseases are an important threat to public health and global economies. Yet, the development of prevention strategies remains a challenging process, as epidemics are non-linear and complex processes. For this reason, we investigate a deep reinforcement learning approach to automatically learn prevention strategies in the context of pandemic influenza. Firstly, we construct… ▽ More

    Submitted 30 March, 2020; originally announced March 2020.

  8. arXiv:2001.07527  [pdf, other

    cs.LG cs.AI cs.MA stat.ML

    Model-based Multi-Agent Reinforcement Learning with Cooperative Prioritized Swee**

    Authors: Eugenio Bargiacchi, Timothy Verstraeten, Diederik M. Roijers, Ann Nowé

    Abstract: We present a new model-based reinforcement learning algorithm, Cooperative Prioritized Swee**, for efficient learning in multi-agent Markov decision processes. The algorithm allows for sample-efficient learning on large problems by exploiting a factorization to approximate the value function. Our approach only requires knowledge about the structure of the problem in the form of a dynamic decisio… ▽ More

    Submitted 15 January, 2020; originally announced January 2020.

  9. arXiv:1911.10121  [pdf, other

    cs.LG cs.AI eess.SY stat.ML

    Fleet Control using Coregionalized Gaussian Process Policy Iteration

    Authors: Timothy Verstraeten, Pieter JK Libin, Ann Nowé

    Abstract: In many settings, as for example wind farms, multiple machines are instantiated to perform the same task, which is called a fleet. The recent advances with respect to the Internet of Things allow control devices and/or machines to connect through cloud-based architectures in order to share information about their status and environment. Such an infrastructure allows seamless data sharing between f… ▽ More

    Submitted 22 November, 2019; originally announced November 2019.

  10. arXiv:1911.10120  [pdf, other

    cs.LG cs.AI cs.MA stat.ML

    Multi-Agent Thompson Sampling for Bandit Applications with Sparse Neighbourhood Structures

    Authors: Timothy Verstraeten, Eugenio Bargiacchi, Pieter JK Libin, Jan Helsen, Diederik M Roijers, Ann Nowé

    Abstract: Multi-agent coordination is prevalent in many real-world applications. However, such coordination is challenging due to its combinatorial nature. An important observation in this regard is that agents in the real world often only directly affect a limited set of neighbouring agents. Leveraging such loose couplings among agents is key to making coordination in multi-agent systems feasible. In this… ▽ More

    Submitted 7 February, 2020; v1 submitted 22 November, 2019; originally announced November 2019.

    Journal ref: Sci Rep 10, 6728 (2020)

  11. IPC-Net: 3D point-cloud segmentation using deep inter-point convolutional layers

    Authors: Felipe Gomez Marulanda, Pieter Libin, Timothy Verstraeten, Ann Nowé

    Abstract: Over the last decade, the demand for better segmentation and classification algorithms in 3D spaces has significantly grown due to the popularity of new 3D sensor technologies and advancements in the field of robotics. Point-clouds are one of the most popular representations to store a digital description of 3D shapes. However, point-clouds are stored in irregular and unordered structures, which l… ▽ More

    Submitted 30 September, 2019; originally announced September 2019.

    Journal ref: 2018 IEEE 30th International Conference on Tools with Artificial Intelligence (ICTAI),

  12. arXiv:1711.06299  [pdf, ps, other

    cs.LG cs.AI q-bio.PE

    Bayesian Best-Arm Identification for Selecting Influenza Mitigation Strategies

    Authors: Pieter Libin, Timothy Verstraeten, Diederik M. Roijers, Jelena Grujic, Kristof Theys, Philippe Lemey, Ann Nowé

    Abstract: Pandemic influenza has the epidemic potential to kill millions of people. While various preventive measures exist (i.a., vaccination and school closures), deciding on strategies that lead to their most effective and efficient use remains challenging. To this end, individual-based epidemiological models are essential to assist decision makers in determining the best strategy to curb epidemic spread… ▽ More

    Submitted 15 June, 2018; v1 submitted 16 November, 2017; originally announced November 2017.