Skip to main content

Showing 1–50 of 55 results for author: Perez, G A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.02019  [pdf, other

    cs.FL cs.LG

    Active Learning of Mealy Machines with Timers

    Authors: Véronique Bruyère, Bharat Garhewal, Guillermo A. Pérez, Gaëtan Staquet, Frits W. Vaandrager

    Abstract: We present the first algorithm for query learning of a general class of Mealy machines with timers (MMTs) in a black-box context. Our algorithm is an extension of the L# algorithm of Vaandrager et al. to a timed setting. Like the algorithm for learning timed automata proposed by Waga, our algorithm is inspired by ideas of Maler & Pnueli. Based on the elementary languages of, both Waga's and our al… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: 77 pages, 19 figures

    MSC Class: 68Q45 ACM Class: F.4.3

  2. arXiv:2402.13785  [pdf, other

    cs.AI

    Synthesis of Hierarchical Controllers Based on Deep Reinforcement Learning Policies

    Authors: Florent Delgrange, Guy Avni, Anna Lukina, Christian Schilling, Ann Nowé, Guillermo A. Pérez

    Abstract: We propose a novel approach to the problem of controller design for environments modeled as Markov decision processes (MDPs). Specifically, we consider a hierarchical MDP a graph with each vertex populated by an MDP called a "room". We first apply deep reinforcement learning (DRL) to obtain low-level policies for each room, scaling to large rooms of unknown structure. We then apply reactive synthe… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: 19 pages main text, 17 pages Appendix (excluding references)

  3. arXiv:2402.13237  [pdf, other

    cs.LO cs.FL

    Continuous Pushdown VASS in One Dimension are Easy

    Authors: Guillermo A. Perez, Shrisha Rao

    Abstract: A pushdown vector addition system with states (PVASS) extends the model of vector addition systems with a pushdown stack. The algorithmic analysis of PVASS has applications such as static analysis of recursive programs manipulating integer variables. Unfortunately, reachability analysis, even for one-dimensional PVASS is not known to be decidable. We relax the model of one-dimensional PVASS to mak… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

    Comments: 2 tables, 6 figures, 12 pages

  4. arXiv:2402.09121  [pdf, ps, other

    cs.FL

    Inform: From Compartmental Models to Stochastic Bounded Counter Machines

    Authors: Tim Leys, Guillermo A. Perez

    Abstract: Compartmental models are used in epidemiology to capture the evolution of infectious diseases such as COVID-19 in a population by assigning members of it to compartments with labels such as susceptible, infected, and recovered. In a stochastic compartmental model the flow of individuals between compartments is determined probabilistically. We establish that certain stochastic compartment models ca… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

  5. arXiv:2310.17410  [pdf, ps, other

    cs.AI cs.LO

    Synthesizing Efficiently Monitorable Formulas in Metric Temporal Logic

    Authors: Ritam Raha, Rajarshi Roy, Nathanael Fijalkow, Daniel Neider, Guillermo A. Perez

    Abstract: In runtime verification, manually formalizing a specification for monitoring system executions is a tedious and error-prone process. To address this issue, we consider the problem of automatically synthesizing formal specifications from system executions. To demonstrate our approach, we consider the popular specification language Metric Temporal Logic (MTL), which is particularly tailored towards… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

  6. arXiv:2308.13609  [pdf, ps, other

    cs.LO math.NT

    Integer Programming with GCD Constraints

    Authors: Rémy Defossez, Christoph Haase, Alessio Mansutti, Guillermo A. Perez

    Abstract: We study the non-linear extension of integer programming with greatest common divisor constraints of the form $\gcd(f,g) \sim d$, where $f$ and $g$ are linear polynomials, $d$ is a positive integer, and $\sim$ is a relation among $\leq, =, \neq$ and $\geq$. We show that the feasibility problem for these systems is in NP, and that an optimal solution minimizing a linear objective function, if it ex… ▽ More

    Submitted 25 August, 2023; originally announced August 2023.

  7. arXiv:2308.07738  [pdf, other

    cs.AI

    Formally-Sharp DAgger for MCTS: Lower-Latency Monte Carlo Tree Search using Data Aggregation with Formal Methods

    Authors: Debraj Chakraborty, Damien Busatto-Gaston, Jean-François Raskin, Guillermo A. Pérez

    Abstract: We study how to efficiently combine formal methods, Monte Carlo Tree Search (MCTS), and deep learning in order to produce high-quality receding horizon policies in large Markov Decision processes (MDPs). In particular, we use model-checking techniques to guide the MCTS algorithm in order to generate offline samples of high-quality decisions on a representative set of states of the MDP. Those sampl… ▽ More

    Submitted 15 August, 2023; originally announced August 2023.

  8. arXiv:2305.09634  [pdf, other

    cs.GT

    Bi-Objective Lexicographic Optimization in Markov Decision Processes with Related Objectives

    Authors: Damien Busatto-Gaston, Debraj Chakraborty, Anirban Majumdar, Sayan Mukherjee, Guillermo A. Pérez, Jean-François Raskin

    Abstract: We consider lexicographic bi-objective problems on Markov Decision Processes (MDPs), where we optimize one objective while guaranteeing optimality of another. We propose a two-stage technique for solving such problems when the objectives are related (in a way that we formalize). We instantiate our technique for two natural pairs of objectives: minimizing the (conditional) expected number of steps… ▽ More

    Submitted 15 August, 2023; v1 submitted 16 May, 2023; originally announced May 2023.

  9. Automata with Timers

    Authors: Véronique Bruyère, Guillermo A. Pérez, Gaëtan Staquet, Frits W. Vaandrager

    Abstract: In this work, we study properties of deterministic finite-state automata with timers, a subclass of timed automata proposed by Vaandrager et al. as a candidate for an efficiently learnable timed model. We first study the complexity of the configuration reachability problem for such automata and establish that it is PSPACE-complete. Then, as simultaneous timeouts (we call these, races) can occur in… ▽ More

    Submitted 12 May, 2023; originally announced May 2023.

    Comments: 35 pages, 9 figures

    ACM Class: F.4.3

    Journal ref: Formal Modeling and Analysis of Timed Systems (FORMATS) 2023 pp. 33-49

  10. arXiv:2305.05739  [pdf, ps, other

    cs.LO cs.AI

    Graph-Based Reductions for Parametric and Weighted MDPs

    Authors: Kasper Engelen, Guillermo A. Pérez, Shrisha Rao

    Abstract: We study the complexity of reductions for weighted reachability in parametric Markov decision processes. That is, we say a state p is never worse than q if for all valuations of the polynomial indeterminates it is the case that the maximal expected weight that can be reached from p is greater than the same value from q. In terms of computational complexity, we establish that determining whether p… ▽ More

    Submitted 9 May, 2023; originally announced May 2023.

  11. arXiv:2303.12558  [pdf, other

    cs.LG cs.AI

    Wasserstein Auto-encoded MDPs: Formal Verification of Efficiently Distilled RL Policies with Many-sided Guarantees

    Authors: Florent Delgrange, Ann Nowé, Guillermo A. Pérez

    Abstract: Although deep reinforcement learning (DRL) has many success stories, the large-scale deployment of policies learned through these advanced techniques in safety-critical scenarios is hindered by their lack of formal guarantees. Variational Markov Decision Processes (VAE-MDPs) are discrete latent space models that provide a reliable framework for distilling formally verifiable controllers from any R… ▽ More

    Submitted 21 April, 2023; v1 submitted 22 March, 2023; originally announced March 2023.

    Comments: ICLR 2023, 10 pages main text, 14 pages appendix (excluding references)

  12. arXiv:2303.03839  [pdf, ps, other

    cs.LO

    The Temporal Logic Synthesis Format TLSF v1.2

    Authors: Swen Jacobs, Guillermo A. Perez, Philipp Schlehuber-Caissier

    Abstract: We present an extension of the Temporal Logic Synthesis Format (TLSF). TLSF builds on standard LTL, but additionally supports high-level constructs, such as sets and functions, as well as parameters that allow a specification to define a whole a family of problems. Our extension introduces operators and a new semantics option for LTLf , i.e., LTL on finite executions.

    Submitted 7 March, 2023; originally announced March 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:1604.02284, arXiv:1601.05228

  13. arXiv:2303.03284  [pdf, other

    cs.LG cs.AI

    The Wasserstein Believer: Learning Belief Updates for Partially Observable Environments through Reliable Latent Space Models

    Authors: Raphael Avalos, Florent Delgrange, Ann Nowé, Guillermo A. Pérez, Diederik M. Roijers

    Abstract: Partially Observable Markov Decision Processes (POMDPs) are used to model environments where the full state cannot be perceived by an agent. As such the agent needs to reason taking into account the past observations and actions. However, simply remembering the full history is generally intractable due to the exponential growth in the history space. Maintaining a probability distribution that mode… ▽ More

    Submitted 26 October, 2023; v1 submitted 6 March, 2023; originally announced March 2023.

  14. arXiv:2212.05337  [pdf, ps, other

    cs.LG

    Targeted Adversarial Attacks on Deep Reinforcement Learning Policies via Model Checking

    Authors: Dennis Gross, Thiago D. Simao, Nils Jansen, Guillermo A. Perez

    Abstract: Deep Reinforcement Learning (RL) agents are susceptible to adversarial noise in their observations that can mislead their policies and decrease their performance. However, an adversary may be interested not only in decreasing the reward, but also in modifying specific temporal logic properties of the policy. This paper presents a metric that measures the exact impact of adversarial attacks against… ▽ More

    Submitted 10 December, 2022; originally announced December 2022.

    Comments: ICAART 2023 Paper (Technical Report)

  15. Validating Streaming JSON Documents with Learned VPAs

    Authors: Véronique Bruyère, Guillermo A. Perez, Gaëtan Staquet

    Abstract: We present a new streaming algorithm to validate JSON documents against a set of constraints given as a JSON schema. Among the possible values a JSON document can hold, objects are unordered collections of key-value pairs while arrays are ordered collections of values. We prove that there always exists a visibly pushdown automaton (VPA) that accepts the same set of JSON documents as a JSON schema.… ▽ More

    Submitted 8 March, 2023; v1 submitted 16 November, 2022; originally announced November 2022.

    Comments: 46 pages, 10 figures, published at TACAS 2023

    ACM Class: F.4.3

    Journal ref: Tools and Algorithms for the Construction and Analysis of Systems (TACAS) 2023, pp. 271-289

  16. arXiv:2210.00785  [pdf, ps, other

    cs.LO

    The Geometry of Reachability in Continuous Vector Addition Systems with States

    Authors: Shaull Almagor, Arka Ghosh, Tim Leys, Guillermo A. Perez

    Abstract: We study the geometry of reachability sets of continuous vector addition systems with states (VASS). In particular we establish that they are almost Minkowski sums of convex cones and zonotopes generated by the vectors labelling the transitions of the VASS. We use the latter to prove that short so-called linear path schemes suffice as witnesses of reachability in continuous VASS of fixed dimension… ▽ More

    Submitted 14 November, 2022; v1 submitted 3 October, 2022; originally announced October 2022.

  17. arXiv:2209.07133  [pdf, other

    cs.LG cs.LO

    COOL-MC: A Comprehensive Tool for Reinforcement Learning and Model Checking

    Authors: Dennis Gross, Nils Jansen, Sebastian Junges, Guillermo A. Perez

    Abstract: This paper presents COOL-MC, a tool that integrates state-of-the-art reinforcement learning (RL) and model checking. Specifically, the tool builds upon the OpenAI gym and the probabilistic model checker Storm. COOL-MC provides the following features: (1) a simulator to train RL policies in the OpenAI gym for Markov decision processes (MDPs) that are defined as input for Storm, (2) a new model buil… ▽ More

    Submitted 15 September, 2022; originally announced September 2022.

  18. arXiv:2206.00251  [pdf, other

    cs.LO

    The Reactive Synthesis Competition (SYNTCOMP): 2018-2021

    Authors: Swen Jacobs, Guillermo A. Perez, Remco Abraham, Veronique Bruyere, Michael Cadilhac, Maximilien Colange, Charly Delfosse, Tom van Dijk, Alexandre Duret-Lutz, Peter Faymonville, Bernd Finkbeiner, Ayrat Khalimov, Felix Klein, Michael Luttenberger, Klara Meyer, Thibaud Michaud, Adrien Pommellet, Florian Renkin, Philipp Schlehuber-Caissier, Mouhammad Sakr, Salomon Sickert, Gaetan Staquet, Clement Tamines, Leander Tentrup, Adam Walker

    Abstract: We report on the last four editions of the reactive synthesis competition (SYNTCOMP 2018-2021). We briefly describe the evaluation scheme and the experimental setup of SYNTCOMP. Then, we introduce new benchmark classes that have been added to the SYNTCOMP library and give an overview of the participants of SYNTCOMP. Finally, we present and analyze the results of our experimental evaluations, inclu… ▽ More

    Submitted 6 May, 2024; v1 submitted 1 June, 2022; originally announced June 2022.

    Comments: accepted for publication in STTT

  19. arXiv:2204.06079  [pdf, other

    cs.LO cs.MS

    Acacia-Bonsai: A Modern Implementation of Downset-Based LTL Realizability

    Authors: Michaël Cadilhac, Guillermo A. Pérez

    Abstract: We describe our implementation of downset-manipulating algorithms used to solve the realizability problem for linear temporal logic (LTL). These algorithms were introduced by Filiot et al.~in the 2010s and implemented in the tools Acacia and Acacia+ in C and Python. We identify degrees of freedom in the original algorithms and provide a complete rewriting of Acacia in C++20 articulated around gene… ▽ More

    Submitted 12 April, 2022; originally announced April 2022.

    Comments: 10 pages, 1 figure

  20. arXiv:2112.09655  [pdf, other

    cs.LG cs.AI

    Distillation of RL Policies with Formal Guarantees via Variational Abstraction of Markov Decision Processes (Technical Report)

    Authors: Florent Delgrange, Ann Nowé, Guillermo A. Pérez

    Abstract: We consider the challenge of policy simplification and verification in the context of policies learned through reinforcement learning (RL) in continuous environments. In well-behaved settings, RL algorithms have convergence guarantees in the limit. While these guarantees are valuable, they are insufficient for safety-critical applications. Furthermore, they are lost when applying advanced techniqu… ▽ More

    Submitted 14 June, 2022; v1 submitted 17 December, 2021; originally announced December 2021.

    Comments: AAAI 2022, technical report including supplementary material (10 pages main text, 14 pages appendix)

  21. arXiv:2112.02976  [pdf, ps, other

    cs.AI cs.FL

    Lecture Notes on Partially Known MDPs

    Authors: Guillermo A. Perez

    Abstract: In these notes we will tackle the problem of finding optimal policies for Markov decision processes (MDPs) which are not fully known to us. Our intention is to slowly transition from an offline setting to an online (learning) setting. Namely, we are moving towards reinforcement learning.

    Submitted 20 June, 2022; v1 submitted 6 December, 2021; originally announced December 2021.

  22. Learning Realtime One-Counter Automata

    Authors: Véronique Bruyère, Guillermo A. Pérez, Gaëtan Staquet

    Abstract: We present a new learning algorithm for realtime one-counter automata. Our algorithm uses membership and equivalence queries as in Angluin's L* algorithm, as well as counter value queries and partial equivalence queries. In a partial equivalence query, we ask the teacher whether the language of a given finite-state automaton coincides with a counter-bounded subset of the target language. We evalua… ▽ More

    Submitted 18 October, 2021; originally announced October 2021.

    Comments: 55 pages, 9 figures, submitted to TACAS 2022

    ACM Class: F.4.3

    Journal ref: Tools and Algorithms for the Construction and Analysis of Systems (TACAS) 2022 pp. 271-289

  23. arXiv:2104.11758  [pdf, ps, other

    cs.FL cs.LG

    Active Learning of Sequential Transducers with Side Information about the Domain

    Authors: Raphaël Berthon, Adrien Boiret, Guillermo A. Perez, Jean-François Raskin

    Abstract: Active learning is a setting in which a student queries a teacher, through membership and equivalence queries, in order to learn a language. Performance on these algorithms is often measured in the number of queries required to learn a target, with an emphasis on costly equivalence queries. In graybox learning, the learning process is accelerated by foreknowledge of some information on the target.… ▽ More

    Submitted 23 April, 2021; originally announced April 2021.

  24. arXiv:2101.11996  [pdf, ps, other

    cs.FL cs.LO

    Continuous One-Counter Automata

    Authors: Michael Blondin, Tim Leys, Filip Mazowiecki, Philip Offtermatt, Guillermo A. Pérez

    Abstract: We study the reachability problem for continuous one-counter automata, COCA for short. In such automata, transitions are guarded by upper and lower bound tests against the counter value. Additionally, the counter updates associated with taking transitions can be (non-deterministically) scaled down by a nonzero factor between zero and one. Our three main results are as follows: (1) We prove that th… ▽ More

    Submitted 3 February, 2021; v1 submitted 28 January, 2021; originally announced January 2021.

  25. arXiv:2009.13128  [pdf, ps, other

    cs.LO

    The Complexity of Reachability in Parametric Markov Decision Processes

    Authors: Sebastian Junges, Joost-Pieter Katoen, Guillermo A. Pérez, Tobias Winkler

    Abstract: This article presents the complexity of reachability decision problems for parametric Markov decision processes (pMDPs), an extension to Markov decision processes (MDPs) where transitions probabilities are described by polynomials over a finite set of parameters. In particular, we study the complexity of finding values for these parameters such that the induced MDP satisfies some maximal or minima… ▽ More

    Submitted 28 September, 2020; originally announced September 2020.

    Comments: This is a preprint of an article under submission which follows our earlier CONCUR paper. It contains small corrections and new results regarding qualitative reachability queries

  26. arXiv:2005.09253  [pdf, other

    cs.AI cs.LO

    Safe Learning for Near Optimal Scheduling

    Authors: Damien Busatto-Gaston, Debraj Chakraborty, Shibashis Guha, Guillermo A. Pérez, Jean-François Raskin

    Abstract: In this paper, we investigate the combination of synthesis, model-based learning, and online sampling techniques to obtain safe and near-optimal schedulers for a preemptible task scheduling problem. Our algorithms can handle Markov decision processes (MDPs) that have 1020 states and beyond which cannot be handled with state-of-the art probabilistic model-checkers. We provide probably approximately… ▽ More

    Submitted 13 July, 2021; v1 submitted 19 May, 2020; originally announced May 2020.

  27. arXiv:2005.05587  [pdf, ps, other

    cs.LG stat.ML

    Robustness Verification for Classifier Ensembles

    Authors: Dennis Gross, Nils Jansen, Guillermo A. Pérez, Stephan Raaijmakers

    Abstract: We give a formal verification procedure that decides whether a classifier ensemble is robust against arbitrary randomized attacks. Such attacks consist of a set of deterministic attacks and a distribution over this set. The robustness-checking problem consists of assessing, given a set of classifiers and a labelled data set, whether there exists a randomized attack that induces a certain expected… ▽ More

    Submitted 9 July, 2020; v1 submitted 12 May, 2020; originally announced May 2020.

  28. arXiv:2005.01071  [pdf, other

    cs.LO cs.FL

    Revisiting Parameter Synthesis for One-Counter Automata

    Authors: Guillermo A. Pérez, Ritam Raha

    Abstract: We study the (parameter) synthesis problem for one-counter automata with parameters. One-counter automata are obtained by extending classical finite-state automata with a counter whose value can range over non-negative integers and be tested for zero. The updates and tests applicable to the counter can further be made parametric by introducing a set of integer-valued variables called parameters. T… ▽ More

    Submitted 19 October, 2021; v1 submitted 3 May, 2020; originally announced May 2020.

  29. arXiv:2004.02593  [pdf, ps, other

    cs.LG stat.ML

    Let's Agree to Degree: Comparing Graph Convolutional Networks in the Message-Passing Framework

    Authors: Floris Geerts, Filip Mazowiecki, Guillermo A. Pérez

    Abstract: In this paper we cast neural networks defined on graphs as message-passing neural networks (MPNNs) in order to study the distinguishing power of different classes of such models. We are interested in whether certain architectures are able to tell vertices apart based on the feature labels given as input with the graph. We consider two variants of MPNNS: anonymous MPNNs whose message functions depe… ▽ More

    Submitted 6 April, 2020; originally announced April 2020.

    Comments: 22 pages

  30. arXiv:1912.05793  [pdf, ps, other

    cs.LO cs.FL

    The Extended HOA Format for Synthesis

    Authors: Guillermo A. Perez

    Abstract: We propose a small extension to the Hanoi Omega-Automata format to define reactive-synthesis problems. Namely, we add a "controllable-AP" header item specifying the subset of atomic propositions which is controllable. We describe the semantics of the new format and propose an output format for synthesized strategies. Finally, we also comment on tool support meant to encourage fast adoption of the… ▽ More

    Submitted 13 May, 2020; v1 submitted 12 December, 2019; originally announced December 2019.

    Comments: Updated the link to a cited tool and made it explicit that parity automata can be assumed to be complete

  31. arXiv:1907.06913  [pdf, ps, other

    cs.GT

    Partial Solvers for Generalized Parity Games

    Authors: Véronique Bruyère, Guillermo A. Pérez, Jean-François Raskin, Clément Tamines

    Abstract: Parity games have been broadly studied in recent years for their applications to controller synthesis and verification. In practice, partial solvers for parity games that execute in polynomial time, while incomplete, can solve most games in publicly available benchmark suites. In this paper, we combine those partial solvers with the classical recursive algorithm for parity games due to Zielonka. W… ▽ More

    Submitted 20 July, 2019; v1 submitted 16 July, 2019; originally announced July 2019.

  32. arXiv:1904.01503  [pdf, other

    cs.LO cs.CC

    On the Complexity of Reachability in Parametric Markov Decision Processes

    Authors: Tobias Winkler, Sebastian Junges, Guillermo A. Pérez, Joost-Pieter Katoen

    Abstract: This paper studies parametric Markov decision processes (pMDPs), an extension to Markov decision processes (MDPs) where transitions probabilities are described by polynomials over a finite set of parameters. Fixing values for all parameters yields MDPs. In particular, this paper studies the complexity of finding values for these parameters such that the induced MDP satisfies some reachability cons… ▽ More

    Submitted 2 April, 2019; originally announced April 2019.

    Comments: Full version with proofs, 42 pages

  33. arXiv:1902.06576  [pdf, other

    cs.LO

    Coverability in 1-VASS with Disequality Tests

    Authors: Shaull Almagor, Nathann Cohen, Guillermo A. Pérez, Mahsa Shirmohammadi, James Worrell

    Abstract: We study a class of reachability problems in weighted graphs with constraints on the accumulated weight of paths. The problems we study can equivalently be formulated in the model of vector addition systems with states (VASS). We consider a version of the vertex-to-vertex reachability problem in which the accumulated weight of a path is required always to be non-negative. This is equivalent to the… ▽ More

    Submitted 7 September, 2020; v1 submitted 18 February, 2019; originally announced February 2019.

  34. arXiv:1811.07146  [pdf, ps, other

    cs.GT cs.AI cs.FL cs.LO

    The Impatient May Use Limited Optimism to Minimize Regret

    Authors: Michaël Cadilhac, Guillermo A. Pérez, Marie van den Bogaard

    Abstract: Discounted-sum games provide a formal model for the study of reinforcement learning, where the agent is enticed to get rewards early since later rewards are discounted. When the agent interacts with the environment, she may regret her actions, realizing that a previous choice was suboptimal given the behavior of the environment. The main contribution of this paper is a PSPACE algorithm for computi… ▽ More

    Submitted 17 November, 2018; originally announced November 2018.

  35. arXiv:1807.04920  [pdf, other

    cs.FL cs.AI cs.CC

    On the Complexity of Value Iteration

    Authors: Nikhil Balaji, Stefan Kiefer, Petr Novotný, Guillermo A. Pérez, Mahsa Shirmohammadi

    Abstract: Value iteration is a fundamental algorithm for solving Markov Decision Processes (MDPs). It computes the maximal $n$-step payoff by iterating $n$ times a recurrence equation which is naturally associated to the MDP. At the same time, value iteration provides a policy for the MDP that is optimal on a given finite horizon $n$. In this paper, we settle the computational complexity of value iteration.… ▽ More

    Submitted 27 April, 2019; v1 submitted 13 July, 2018; originally announced July 2018.

    Comments: Full version of an ICALP'19 paper

  36. arXiv:1804.09077  [pdf, ps, other

    cs.FL

    When is Containment Decidable for Probabilistic Automata?

    Authors: Laure Daviaud, Marcin Jurdziński, Ranko Lazić, Filip Mazowiecki, Guillermo A. Pérez, James Worrell

    Abstract: The emptiness and containment problems for probabilistic automata are natural quantitative generalisations of the classical language emptiness and inclusion problems for Boolean automata. It is well known that both problems are undecidable. In this paper we provide a more refined view of these problems in terms of the degree of ambiguity of probabilistic automata. We show that a gap version of the… ▽ More

    Submitted 29 March, 2020; v1 submitted 24 April, 2018; originally announced April 2018.

    ACM Class: F.3.1

  37. arXiv:1804.08924  [pdf, other

    cs.AI cs.LO

    Learning-Based Mean-Payoff Optimization in an Unknown MDP under Omega-Regular Constraints

    Authors: Jan Křetínský, Guillermo A. Pérez, Jean-François Raskin

    Abstract: We formalize the problem of maximizing the mean-payoff value with high probability while satisfying a parity objective in a Markov decision process (MDP) with unknown probabilistic transition function and unknown reward function. Assuming the support of the unknown transition function and a lower bound on the minimal transition probability are known in advance, we show that in MDPs consisting of a… ▽ More

    Submitted 23 August, 2018; v1 submitted 24 April, 2018; originally announced April 2018.

  38. arXiv:1804.06336  [pdf, other

    cs.FL

    Weak Cost Register Automata are Still Powerful

    Authors: Shaull Almagor, Michaël Cadilhac, Filip Mazowiecki, Guillermo A. Pérez

    Abstract: We consider one of the weakest variants of cost register automata over a tropical semiring, namely copyless cost register automata over $\mathbb{N}$ with updates using $\min$ and increments. We show that this model can simulate, in some sense, the runs of counter machines with zero-tests. We deduce that a number of problems pertaining to that model are undecidable, in particular equivalence, dispr… ▽ More

    Submitted 17 April, 2018; originally announced April 2018.

    Comments: 16 pages

  39. The 4th Reactive Synthesis Competition (SYNTCOMP 2017): Benchmarks, Participants & Results

    Authors: Swen Jacobs, Nicolas Basset, Roderick Bloem, Romain Brenguier, Maximilien Colange, Peter Faymonville, Bernd Finkbeiner, Ayrat Khalimov, Felix Klein, Thibaud Michaud, Guillermo A. Pérez, Jean-François Raskin, Ocan Sankur, Leander Tentrup

    Abstract: We report on the fourth reactive synthesis competition (SYNTCOMP 2017). We introduce two new benchmark classes that have been added to the SYNTCOMP library, and briefly describe the benchmark selection, evaluation scheme and the experimental setup of SYNTCOMP 2017. We present the participants of SYNTCOMP 2017, with a focus on changes with respect to the previous years and on the two completely new… ▽ More

    Submitted 28 November, 2017; originally announced November 2017.

    Comments: In Proceedings SYNT 2017, arXiv:1711.10224. arXiv admin note: text overlap with arXiv:1609.00507

    Journal ref: EPTCS 260, 2017, pp. 116-143

  40. arXiv:1710.07903  [pdf, ps, other

    cs.LO cs.AI

    The Complexity of Graph-Based Reductions for Reachability in Markov Decision Processes

    Authors: Stephane Le Roux, Guillermo A. Perez

    Abstract: We study the never-worse relation (NWR) for Markov decision processes with an infinite-horizon reachability objective. A state q is never worse than a state p if the maximal probability of reaching the target set of states from p is at most the same value from q, regard- less of the probabilities labelling the transitions. Extremal-probability states, end components, and essential states are all s… ▽ More

    Submitted 24 February, 2018; v1 submitted 22 October, 2017; originally announced October 2017.

  41. arXiv:1701.02903  [pdf, ps, other

    cs.FL cs.GT cs.LO

    On Delay and Regret Determinization of Max-Plus Automata

    Authors: Emmanuel Filiot, Ismaël Jecker, Nathan Lhote, Guillermo A. Pérez, Jean-François Raskin

    Abstract: Decidability of the determinization problem for weighted automata over the semiring $(\mathbb{Z} \cup {-\infty}, \max, +)$, WA for short, is a long-standing open question. We propose two ways of approaching it by constraining the search space of deterministic WA: k-delay and r-regret. A WA N is k-delay determinizable if there exists a deterministic automaton D that defines the same function as N a… ▽ More

    Submitted 3 March, 2017; v1 submitted 11 January, 2017; originally announced January 2017.

  42. arXiv:1611.08696  [pdf, other

    cs.AI cs.GT

    Optimizing Expectation with Guarantees in POMDPs (Technical Report)

    Authors: Krishnendu Chatterjee, Petr Novotný, Guillermo A. Pérez, Jean-François Raskin, Đorđe Žikelić

    Abstract: A standard objective in partially-observable Markov decision processes (POMDPs) is to find a policy that maximizes the expected discounted-sum payoff. However, such policies may still permit unlikely but highly undesirable outcomes, which is problematic especially in safety-critical applications. Recently, there has been a surge of interest in POMDPs where the goal is to maximize the probability t… ▽ More

    Submitted 29 January, 2017; v1 submitted 26 November, 2016; originally announced November 2016.

  43. arXiv:1611.08677  [pdf, other

    cs.LO cs.GT

    Admissibility in Quantitative Graph Games

    Authors: Romain Brenguier, Guillermo A. Pérez, Jean-François Raskin, Ocan Sankur

    Abstract: Admissibility has been studied for games of infinite duration with Boolean objectives. We extend here this study to games of infinite duration with quantitative objectives. First, we show that, un- der the assumption that optimal worst-case and cooperative strategies exist, admissible strategies are guaranteed to exist. Second, we give a characterization of admissible strategies using the no- tion… ▽ More

    Submitted 26 November, 2016; originally announced November 2016.

    Comments: This is the full version of an article that will be published in FSTTCS'16

    ACM Class: F.1.1; D.2.4

  44. The 3rd Reactive Synthesis Competition (SYNTCOMP 2016): Benchmarks, Participants & Results

    Authors: Swen Jacobs, Roderick Bloem, Romain Brenguier, Ayrat Khalimov, Felix Klein, Robert Könighofer, Jens Kreber, Alexander Legg, Nina Narodytska, Guillermo A. Pérez, Jean-François Raskin, Leonid Ryzhyk, Ocan Sankur, Martina Seidl, Leander Tentrup, Adam Walker

    Abstract: We report on the benchmarks, participants and results of the third reactive synthesis competition(SYNTCOMP 2016). The benchmark library of SYNTCOMP 2016 has been extended to benchmarks in the new LTL-based temporal logic synthesis format (TLSF), and 2 new sets of benchmarks for the existing AIGER-based format for safety specifications. The participants of SYNTCOMP 2016 can be separated according t… ▽ More

    Submitted 23 November, 2016; v1 submitted 2 September, 2016; originally announced September 2016.

    Comments: In Proceedings SYNT 2016, arXiv:1611.07178

    Journal ref: EPTCS 229, 2016, pp. 149-177

  45. Compositional Algorithms for Succinct Safety Games

    Authors: Romain Brenguier, Guillermo A. Pérez, Jean-François Raskin, Ocan Sankur

    Abstract: We study the synthesis of circuits for succinct safety specifications given in the AIG format. We show how AIG safety specifications can be decomposed automatically into sub specifications. Then we propose symbolic compositional algorithms to solve the synthesis problem compositionally starting for the sub-specifications. We have evaluated the compositional algorithms on a set of benchmarks includ… ▽ More

    Submitted 2 February, 2016; originally announced February 2016.

    Comments: In Proceedings SYNT 2015, arXiv:1602.00786

    ACM Class: F.3.1,B.1.2,B.6.3

    Journal ref: EPTCS 202, 2016, pp. 98-111

  46. The Second Reactive Synthesis Competition (SYNTCOMP 2015)

    Authors: Swen Jacobs, Roderick Bloem, Romain Brenguier, Robert Könighofer, Guillermo A. Pérez, Jean-François Raskin, Leonid Ryzhyk, Ocan Sankur, Martina Seidl, Leander Tentrup, Adam Walker

    Abstract: We report on the design and results of the second reactive synthesis competition (SYNTCOMP 2015). We describe our extended benchmark library, with 6 completely new sets of benchmarks, and additional challenging instances for 4 of the benchmark sets that were already used in SYNTCOMP 2014. To enhance the analysis of experimental results, we introduce an extension of our benchmark format with meta-i… ▽ More

    Submitted 2 February, 2016; originally announced February 2016.

    Comments: In Proceedings SYNT 2015, arXiv:1602.00786

    Journal ref: EPTCS 202, 2016, pp. 27-57

  47. arXiv:1512.05568  [pdf, ps, other

    cs.LO cs.FL cs.GT

    Non-Zero Sum Games for Reactive Synthesis

    Authors: Romain Brenguier, Lorenzo Clemente, Paul Hunter, Guillermo A. Pérez, Mickael Randour, Jean-François Raskin, Ocan Sankur, Mathieu Sassolas

    Abstract: In this invited contribution, we summarize new solution concepts useful for the synthesis of reactive systems that we have introduced in several recent publications. These solution concepts are developed in the context of non-zero sum games played on graphs. They are part of the contributions obtained in the inVEST project funded by the European Research Council.

    Submitted 17 December, 2015; originally announced December 2015.

    Comments: LATA'16 invited paper

  48. arXiv:1512.04255  [pdf, ps, other

    cs.LO cs.GT

    The Fixed Initial Credit Problem for Partial-Observation Energy Games is Ack-complete

    Authors: Guillermo A. Pérez

    Abstract: In this paper we study two-player games with asymmetric partial observation and an energy objective. Such games are played on a weighted automaton by Eve, choosing actions, and Adam, choosing a transition labelled with the given action. Eve attempts to maintain the sum of the weights (of the transitions taken) non-negative while Adam tries to do the opposite. Eve does not know the exact state of t… ▽ More

    Submitted 16 November, 2016; v1 submitted 14 December, 2015; originally announced December 2015.

  49. arXiv:1511.00523  [pdf, ps, other

    cs.GT cs.FL cs.LO

    Minimizing Regret in Discounted-Sum Games

    Authors: Paul Hunter, Guillermo A. Pérez, Jean-François Raskin

    Abstract: In this paper, we study the problem of minimizing regret in discounted-sum games played on weighted game graphs. We give algorithms for the general problem of computing the minimal regret of the controller (Eve) as well as several variants depending on which strategies the environment (Adam) is permitted to use. We also consider the problem of synthesizing regret-free strategies for Eve in each of… ▽ More

    Submitted 4 May, 2018; v1 submitted 2 November, 2015; originally announced November 2015.

    Comments: arXiv admin note: text overlap with arXiv:1504.01708; some typos have been removed in the proof of simple strategies being sufficient to minimize regret against any adversary

    ACM Class: F.1.1; D.2.4

  50. The First Reactive Synthesis Competition (SYNTCOMP 2014)

    Authors: Swen Jacobs, Roderick Bloem, Romain Brenguier, Rüdiger Ehlers, Timotheus Hell, Robert Könighofer, Guillermo A. Pérez, Jean-François Raskin, Leonid Ryzhyk, Ocan Sankur, Martina Seidl, Leander Tentrup, Adam Walker

    Abstract: We introduce the reactive synthesis competition (SYNTCOMP), a long-term effort intended to stimulate and guide advances in the design and application of synthesis procedures for reactive systems. The first iteration of SYNTCOMP is based on the controller synthesis problem for finite-state systems and safety specifications. We provide an overview of this problem and existing approaches to solve it,… ▽ More

    Submitted 13 April, 2016; v1 submitted 29 June, 2015; originally announced June 2015.

    Comments: 24 pages, published in STTT

    Journal ref: International Journal on Software Tools for Technology Transfer, Online First, 2016, pp 1-24