Skip to main content

Showing 1–3 of 3 results for author: Serban, A C

Searching in archive cs. Search in all archives.
.
  1. arXiv:1903.08428  [pdf, ps, other

    cs.AI cs.LG

    Counterexample-Guided Strategy Improvement for POMDPs Using Recurrent Neural Networks

    Authors: Steven Carr, Nils Jansen, Ralf Wimmer, Alexandru C. Serban, Bernd Becker, Ufuk Topcu

    Abstract: We study strategy synthesis for partially observable Markov decision processes (POMDPs). The particular problem is to determine strategies that provably adhere to (probabilistic) temporal logic constraints. This problem is computationally intractable and theoretically hard. We propose a novel method that combines techniques from machine learning and formal verification. First, we train a recurrent… ▽ More

    Submitted 21 March, 2019; v1 submitted 20 March, 2019; originally announced March 2019.

  2. arXiv:1810.01185  [pdf, ps, other

    cs.CV cs.CR cs.LG cs.NE

    Adversarial Examples - A Complete Characterisation of the Phenomenon

    Authors: Alexandru Constantin Serban, Erik Poll, Joost Visser

    Abstract: We provide a complete characterisation of the phenomenon of adversarial examples - inputs intentionally crafted to fool machine learning models. We aim to cover all the important concerns in this field of study: (1) the conjectures on the existence of adversarial examples, (2) the security, safety and robustness implications, (3) the methods used to generate and (4) protect against adversarial exa… ▽ More

    Submitted 17 February, 2019; v1 submitted 2 October, 2018; originally announced October 2018.

  3. arXiv:1807.06096  [pdf, other

    cs.AI

    Safe Reinforcement Learning via Probabilistic Shields

    Authors: Nils Jansen, Bettina Könighofer, Sebastian Junges, Alexandru C. Serban, Roderick Bloem

    Abstract: This paper targets the efficient construction of a safety shield for decision making in scenarios that incorporate uncertainty. Markov decision processes (MDPs) are prominent models to capture such planning problems. Reinforcement learning (RL) is a machine learning technique to determine near-optimal policies in MDPs that may be unknown prior to exploring the model. However, during exploration, R… ▽ More

    Submitted 25 November, 2019; v1 submitted 16 July, 2018; originally announced July 2018.