Skip to main content

Showing 1–10 of 10 results for author: Rens, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2303.03226  [pdf, other

    cs.AI cs.LG

    Safe Reinforcement Learning via Probabilistic Logic Shields

    Authors: Wen-Chi Yang, Giuseppe Marra, Gavin Rens, Luc De Raedt

    Abstract: Safe Reinforcement learning (Safe RL) aims at learning optimal policies while staying safe. A popular solution to Safe RL is shielding, which uses a logical safety specification to prevent an RL agent from taking unsafe actions. However, traditional shielding techniques are difficult to integrate with continuous, end-to-end deep RL methods. To this end, we introduce Probabilistic Logic Policy Grad… ▽ More

    Submitted 6 March, 2023; originally announced March 2023.

  2. arXiv:2211.03461  [pdf, ps, other

    cs.AI

    Learning Probabilistic Temporal Safety Properties from Examples in Relational Domains

    Authors: Gavin Rens, Wen-Chi Yang, Jean-François Raskin, Luc De Raedt

    Abstract: We propose a framework for learning a fragment of probabilistic computation tree logic (pCTL) formulae from a set of states that are labeled as safe or unsafe. We work in a relational setting and combine ideas from relational Markov Decision Processes with pCTL model-checking. More specifically, we assume that there is an unknown relational pCTL target formula that is satisfied by only safe states… ▽ More

    Submitted 7 November, 2022; originally announced November 2022.

    Comments: 25 pages, 3 figures, 5 tables, 2 algorithms, preprint

  3. arXiv:2009.12600  [pdf, other

    cs.AI

    Online Learning of Non-Markovian Reward Models

    Authors: Gavin Rens, Jean-François Raskin, Raphaël Reynouad, Giuseppe Marra

    Abstract: There are situations in which an agent should receive rewards only after having accomplished a series of previous tasks, that is, rewards are non-Markovian. One natural and quite general way to represent history-dependent rewards is via a Mealy machine, a finite state automaton that produces output sequences from input sequences. In our formal setting, we consider a Markov decision process (MDP) t… ▽ More

    Submitted 30 September, 2020; v1 submitted 26 September, 2020; originally announced September 2020.

    Comments: 24 pages, single column, 7 figures. arXiv admin note: substantial text overlap with arXiv:2001.09293

  4. arXiv:2008.11791  [pdf, other

    cs.AI cs.MA cs.SI

    Reputation-driven Decision-making in Networks of Stochastic Agents

    Authors: David Maoujoud, Gavin Rens

    Abstract: This paper studies multi-agent systems that involve networks of self-interested agents. We propose a Markov Decision Process-derived framework, called RepNet-MDP, tailored to domains in which agent reputation is a key driver of the interactions between agents. The fundamentals are based on the principles of RepNet-POMDP, a framework developed by Rens et al. in 2018, but addresses its mathematical… ▽ More

    Submitted 20 October, 2020; v1 submitted 26 August, 2020; originally announced August 2020.

    Comments: 19 pages, including bibliography

    MSC Class: 68T37 (Primary) 68T05 (Secondary)

  5. arXiv:2001.09293  [pdf, other

    cs.AI

    Learning Non-Markovian Reward Models in MDPs

    Authors: Gavin Rens, Jean-François Raskin

    Abstract: There are situations in which an agent should receive rewards only after having accomplished a series of previous tasks. In other words, the reward that the agent receives is non-Markovian. One natural and quite general way to represent history-dependent rewards is via a Mealy machine; a finite state automaton that produces output sequences (rewards in our case) from input sequences (state/action… ▽ More

    Submitted 25 January, 2020; originally announced January 2020.

    Comments: 18 pages, single column, 4 figures

  6. arXiv:1805.05230  [pdf, ps, other

    cs.AI cs.MA

    Maximizing Expected Impact in an Agent Reputation Network -- Technical Report

    Authors: Gavin Rens, Abhaya Nayak, Thomas Meyer

    Abstract: Many multi-agent systems (MASs) are situated in stochastic environments. Some such systems that are based on the partially observable Markov decision process (POMDP) do not take the benevolence of other agents for granted. We propose a new POMDP-based framework which is general enough for the specification of a variety of stochastic MAS domains involving the impact of agents on each other's reputa… ▽ More

    Submitted 14 May, 2018; originally announced May 2018.

    Comments: 18 pages including bibliography

  7. arXiv:1705.01172  [pdf, other

    cs.AI

    Imagining Probabilistic Belief Change as Imaging (Technical Report)

    Authors: Gavin Rens, Thomas Meyer

    Abstract: Imaging is a form of probabilistic belief change which could be employed for both revision and update. In this paper, we propose a new framework for probabilistic belief change based on imaging, called Expected Distance Imaging (EDI). EDI is sufficiently general to define Bayesian conditioning and other forms of imaging previously defined in the literature. We argue that, and investigate how, EDI… ▽ More

    Submitted 2 May, 2017; originally announced May 2017.

    Comments: 21 pages

  8. arXiv:1607.00656  [pdf, other

    cs.AI

    A Hybrid POMDP-BDI Agent Architecture with Online Stochastic Planning and Plan Caching

    Authors: Gavin Rens, Deshendran Moodley

    Abstract: This article presents an agent architecture for controlling an autonomous agent in stochastic environments. The architecture combines the partially observable Markov decision process (POMDP) model with the belief-desire-intention (BDI) framework. The Hybrid POMDP-BDI agent architecture takes the best features from the two approaches, that is, the online generation of reward-maximizing courses of a… ▽ More

    Submitted 3 July, 2016; originally announced July 2016.

    Comments: 26 pages, 3 figures, unpublished version

  9. arXiv:1604.02133  [pdf, other

    cs.AI

    Revising Incompletely Specified Convex Probabilistic Belief Bases

    Authors: Gavin Rens, Thomas Meyer, Giovanni Casini

    Abstract: We propose a method for an agent to revise its incomplete probabilistic beliefs when a new piece of propositional information is observed. In this work, an agent's beliefs are represented by a set of probabilistic formulae -- a belief base. The method involves determining a representative set of 'boundary' probability distributions consistent with the current belief base, revising each of these pr… ▽ More

    Submitted 7 April, 2016; originally announced April 2016.

    Comments: Presented at the Sixteenth International Workshop on Non-Monotonic Reasoning, 22-24 April 2016, Cape Town, South Africa. 9.25 pages

  10. arXiv:1604.02126  [pdf, other

    cs.AI

    On Stochastic Belief Revision and Update and their Combination

    Authors: Gavin Rens

    Abstract: I propose a framework for an agent to change its probabilistic beliefs when a new piece of propositional information $α$ is observed. Traditionally, belief change occurs by either a revision process or by an update process, depending on whether the agent is informed with $α$ in a static world or, respectively, whether $α$ is a 'signal' from the environment due to an event occurring. Boutilier sugg… ▽ More

    Submitted 7 April, 2016; originally announced April 2016.

    Comments: Presented at the Sixteenth International Workshop on Non-Monotonic Reasoning, 22-24 April 2016, Cape Town, South Africa. 10 pages