Showing 1–14 of 14 results for author: Rădulescu, R

  1. arXiv:2402.07182

    cs.LG

    Divide and Conquer: Provably Unveiling the Pareto Front with Multi-Objective Reinforcement Learning

    Authors: Willem Röpke, Mathieu Reymond, Patrick Mannion, Diederik M. Roijers, Ann Nowé, Roxana Rădulescu

    Abstract: A significant challenge in multi-objective reinforcement learning is obtaining a Pareto front of policies that attain optimal performance under different preferences. We introduce Iterated Pareto Referent Optimisation (IPRO), a principled algorithm that decomposes the task of finding the Pareto front into a sequence of single-objective problems for which various solution methods exist. This enable…

    Submitted 11 February, 2024; originally announced February 2024.

  2. arXiv:2402.02665

    cs.LG

    Utility-Based Reinforcement Learning: Unifying Single-objective and Multi-objective Reinforcement Learning

    Authors: Peter Vamplew, Cameron Foale, Conor F. Hayes, Patrick Mannion, Enda Howley, Richard Dazeley, Scott Johnson, Johan Källström, Gabriel Ramos, Roxana Rădulescu, Willem Röpke, Diederik M. Roijers

    Abstract: Research in multi-objective reinforcement learning (MORL) has introduced the utility-based paradigm, which makes use of both environmental rewards and a function that defines the utility derived by the user from those rewards. In this paper we extend this paradigm to the context of single-objective reinforcement learning (RL), and outline multiple potential benefits including the ability to perfor…

    Submitted 4 February, 2024; originally announced February 2024.

    Comments: Accepted for the Blue Sky Track at AAMAS'24

  3. arXiv:2401.12646

    cs.MA cs.AI cs.GT

    Emergent Cooperation under Uncertain Incentive Alignment

    Authors: Nicole Orzan, Erman Acar, Davide Grossi, Roxana Rădulescu

    Abstract: Understanding the emergence of cooperation in systems of computational agents is crucial for the development of effective cooperative AI. Interactions among individuals in real-world settings are often sparse and occur within a broad spectrum of incentives, which often are only partially known. In this work, we explore how cooperation can arise among reinforcement learning agents in scenarios chara…

    Submitted 23 January, 2024; originally announced January 2024.

  4. arXiv:2306.05898

    cs.CR cs.CY cs.SI

    You Can Tell a Cybercriminal by the Company they Keep: A Framework to Infer the Relevance of Underground Communities to the Threat Landscape

    Authors: Michele Campobasso, Radu Rădulescu, Sylvan Brons, Luca Allodi

    Abstract: The criminal underground is populated with forum marketplaces where, allegedly, cybercriminals share and trade knowledge, skills, and cybercrime products. However, it is still unclear whether all marketplaces matter the same in the overall threat landscape. To effectively support trade and avoid degenerating into scams-for-scammers places, underground markets must address fundamental economic prob…

    Submitted 13 June, 2023; v1 submitted 9 June, 2023; originally announced June 2023.

    Comments: The 22nd Workshop on the Economics of Information Security (WEIS'23), July 05--08, 2023, Geneva, Switzerland

  5. arXiv:2301.05755

    cs.GT

    Bridging the Gap Between Single and Multi Objective Games

    Authors: Willem Röpke, Carla Groenland, Roxana Rădulescu, Ann Nowé, Diederik M. Roijers

    Abstract: A classic model to study strategic decision making in multi-agent systems is the normal-form game. This model can be generalised to allow for an infinite number of pure strategies leading to continuous games. Multi-objective normal-form games are another generalisation that model settings where players receive separate payoffs in more than one objective. We bridge the gap between the two models by…

    Submitted 1 March, 2023; v1 submitted 13 January, 2023; originally announced January 2023.

    Comments: Accepted to AAMAS 2023

  6. arXiv:2204.05027

    cs.LG cs.AI q-bio.PE

    Exploring the Pareto front of multi-objective COVID-19 mitigation policies using reinforcement learning

    Authors: Mathieu Reymond, Conor F. Hayes, Lander Willem, Roxana Rădulescu, Steven Abrams, Diederik M. Roijers, Enda Howley, Patrick Mannion, Niel Hens, Ann Nowé, Pieter Libin

    Abstract: Infectious disease outbreaks can have a disruptive impact on public health and societal processes. As decision making in the context of epidemic mitigation is hard, reinforcement learning provides a methodology to automatically learn prevention strategies in combination with complex epidemic models. Current research focuses on optimizing policies w.r.t. a single objective, such as the pathogen's a…

    Submitted 11 April, 2022; originally announced April 2022.

  7. arXiv:2112.15422

    cs.AI

    Scalar reward is not enough: A response to Silver, Singh, Precup and Sutton (2021)

    Authors: Peter Vamplew, Benjamin J. Smith, Johan Kallstrom, Gabriel Ramos, Roxana Radulescu, Diederik M. Roijers, Conor F. Hayes, Fredrik Heintz, Patrick Mannion, Pieter J. K. Libin, Richard Dazeley, Cameron Foale

    Abstract: The recent paper "Reward is Enough" by Silver, Singh, Precup and Sutton posits that the concept of reward maximisation is sufficient to underpin all intelligence, both natural and artificial. We contest the underlying assumption of Silver et al. that such reward can be scalar-valued. In this paper we explain why scalar rewards are insufficient to account for some aspects of both biological and co…

    Submitted 24 November, 2021; originally announced December 2021.

  8. arXiv:2112.06500

    cs.GT cs.MA

    On Nash Equilibria in Normal-Form Games With Vectorial Payoffs

    Authors: Willem Röpke, Diederik M. Roijers, Ann Nowé, Roxana Rădulescu

    Abstract: We provide an in-depth study of Nash equilibria in multi-objective normal form games (MONFGs), i.e., normal form games with vectorial payoffs. Taking a utility-based approach, we assume that each player's utility can be modelled with a utility function that maps a vector to a scalar utility. In the case of a mixed strategy, it is meaningful to apply such a scalarisation both before calculating the…

    Submitted 16 July, 2022; v1 submitted 13 December, 2021; originally announced December 2021.

  9. arXiv:2111.09191

    cs.GT cs.LG cs.MA

    Preference Communication in Multi-Objective Normal-Form Games

    Authors: Willem Röpke, Diederik M. Roijers, Ann Nowé, Roxana Rădulescu

    Abstract: We consider preference communication in two-player multi-objective normal-form games. In such games, the payoffs resulting from joint actions are vector-valued. Taking a utility-based approach, we assume there exists a utility function for each player which maps vectors to scalar utilities and consider agents that aim to maximise the utility of expected payoff vectors. As agents typically do not k…

    Submitted 10 June, 2022; v1 submitted 17 November, 2021; originally announced November 2021.

  10. A Practical Guide to Multi-Objective Reinforcement Learning and Planning

    Authors: Conor F. Hayes, Roxana Rădulescu, Eugenio Bargiacchi, Johan Källström, Matthew Macfarlane, Mathieu Reymond, Timothy Verstraeten, Luisa M. Zintgraf, Richard Dazeley, Fredrik Heintz, Enda Howley, Athirai A. Irissappane, Patrick Mannion, Ann Nowé, Gabriel Ramos, Marcello Restelli, Peter Vamplew, Diederik M. Roijers

    Abstract: Real-world decision-making tasks are generally complex, requiring trade-offs between multiple, often conflicting, objectives. Despite this, the majority of research in reinforcement learning and decision-theoretic planning either assumes only a single objective, or that multiple objectives can be adequately handled via a simple linear combination. Such approaches may oversimplify the underlying pr…

    Submitted 17 March, 2021; originally announced March 2021.

    Journal ref: Auton Agent Multi-Agent Syst 36, 26 (2022)

  11. arXiv:2011.07290

    cs.MA cs.AI cs.GT cs.LG

    Opponent Learning Awareness and Modelling in Multi-Objective Normal Form Games

    Authors: Roxana Rădulescu, Timothy Verstraeten, Yijie Zhang, Patrick Mannion, Diederik M. Roijers, Ann Nowé

    Abstract: Many real-world multi-agent interactions consider multiple distinct criteria, i.e. the payoffs are multi-objective in nature. However, the same multi-objective payoff vector may lead to different utilities for each participant. Therefore, it is essential for an agent to learn about the behaviour of other agents in the system. In this work, we present the first study of the effects of such opponent…

    Submitted 14 November, 2020; originally announced November 2020.

    Comments: Under review since 14 November 2020

  12. arXiv:2001.08177

    cs.GT cs.AI cs.LG cs.MA

    A utility-based analysis of equilibria in multi-objective normal form games

    Authors: Roxana Rădulescu, Patrick Mannion, Yijie Zhang, Diederik M. Roijers, Ann Nowé

    Abstract: In multi-objective multi-agent systems (MOMAS), agents explicitly consider the possible tradeoffs between conflicting objective functions. We argue that compromises between competing objectives in MOMAS should be analysed on the basis of the utility that these compromises have for the users of a system, where an agent's utility function maps their payoff vectors to scalar utility values. This util…

    Submitted 17 January, 2020; originally announced January 2020.

    Comments: Under review since 16 January 2020

  13. Multi-Objective Multi-Agent Decision Making: A Utility-based Analysis and Survey

    Authors: Roxana Rădulescu, Patrick Mannion, Diederik M. Roijers, Ann Nowé

    Abstract: The majority of multi-agent system (MAS) implementations aim to optimise agents' policies with respect to a single objective, despite the fact that many real-world problem domains are inherently multi-objective in nature. Multi-objective multi-agent systems (MOMAS) explicitly consider the possible trade-offs between conflicting objective functions. We argue that, in MOMAS, such compromises should…

    Submitted 6 September, 2019; originally announced September 2019.

    Comments: Under review since 15 May 2019

  14. arXiv:1702.08736

    cs.MA cs.AI

    Analysing Congestion Problems in Multi-agent Reinforcement Learning

    Authors: Roxana Rădulescu, Peter Vrancx, Ann Nowé

    Abstract: Congestion problems are omnipresent in today's complex networks and represent a challenge in many research domains. In the context of Multi-agent Reinforcement Learning (MARL), approaches like difference rewards and resource abstraction have shown promising results in tackling such problems. Resource abstraction was shown to be an ideal candidate for solving large-scale resource allocation problem…

    Submitted 30 March, 2017; v1 submitted 28 February, 2017; originally announced February 2017.

    Comments: Adaptive Learning Agents (ALA) Workshop at AAMAS 2017

    MSC Class: 68T05

    ACM Class: I.2.11