Showing 1–8 of 8 results for author: Bozkurt, A K

  1. arXiv:2404.05074  [pdf, ps, other]

    cs.AI cs.RO

    On the Uniqueness of Solution for the Bellman Equation of LTL Objectives

    Authors: Zetong Xuan, Alper Kamil Bozkurt, Miroslav Pajic, Yu Wang

    Abstract: Surrogate rewards for linear temporal logic (LTL) objectives are commonly utilized in planning problems for LTL objectives. In a widely-adopted surrogate reward approach, two discount factors are used to ensure that the expected return approximates the satisfaction probability of the LTL objective. The expected return then can be estimated by methods using the Bellman updates such as reinforcement…

    Submitted 7 April, 2024; originally announced April 2024.

    Comments: Accepted for the 2024 Learning for Dynamics and Control Conference (L4DC)

  2. arXiv:2104.01612  [pdf, other]

    cs.RO

    Reinforcement Learning with Temporal Logic Constraints for Partially-Observable Markov Decision Processes

    Authors: Yu Wang, Alper Kamil Bozkurt, Miroslav Pajic

    Abstract: This paper proposes a reinforcement learning method for controller synthesis of autonomous systems in unknown and partially-observable environments with subjective time-dependent safety constraints. Mathematically, we model the system dynamics by a partially-observable Markov decision process (POMDP) with unknown transition/observation probabilities. The time-dependent safety constraint is capture…

    Submitted 4 April, 2021; originally announced April 2021.

  3. arXiv:2103.14600  [pdf, other]

    cs.RO cs.FL cs.LG cs.LO

    Model-Free Learning of Safe yet Effective Controllers

    Authors: Alper Kamil Bozkurt, Yu Wang, Miroslav Pajic

    Abstract: We study the problem of learning safe control policies that are also effective; i.e., maximizing the probability of satisfying a linear temporal logic (LTL) specification of a task, and the discounted reward capturing the (classic) control performance. We consider unknown environments modeled as Markov decision processes. We propose a model-free reinforcement learning algorithm that learns a polic…

    Submitted 26 September, 2021; v1 submitted 26 March, 2021; originally announced March 2021.

  4. arXiv:2102.04307  [pdf, other]

    cs.AI cs.LG cs.LO cs.RO

    Learning Optimal Strategies for Temporal Tasks in Stochastic Games

    Authors: Alper Kamil Bozkurt, Yu Wang, Michael M. Zavlanos, Miroslav Pajic

    Abstract: Synthesis from linear temporal logic (LTL) specifications provides assured controllers for systems operating in stochastic and potentially adversarial environments. Automatic synthesis tools, however, require a model of the environment to construct controllers. In this work, we introduce a model-free reinforcement learning (RL) approach to derive controllers from given LTL specifications even when…

    Submitted 30 August, 2023; v1 submitted 8 February, 2021; originally announced February 2021.

  5. arXiv:2011.01882  [pdf, other]

    cs.RO cs.GT

    Secure Planning Against Stealthy Attacks via Model-Free Reinforcement Learning

    Authors: Alper Kamil Bozkurt, Yu Wang, Miroslav Pajic

    Abstract: We consider the problem of security-aware planning in an unknown stochastic environment, in the presence of attacks on control signals (i.e., actuators) of the robot. We model the attacker as an agent who has the full knowledge of the controller as well as the employed intrusion-detection system and who wants to prevent the controller from performing tasks while staying stealthy. We formulate the…

    Submitted 26 March, 2021; v1 submitted 3 November, 2020; originally announced November 2020.

  6. arXiv:2010.01050  [pdf, other]

    cs.RO cs.LO

    Model-Free Reinforcement Learning for Stochastic Games with Linear Temporal Logic Objectives

    Authors: Alper Kamil Bozkurt, Yu Wang, Michael Zavlanos, Miroslav Pajic

    Abstract: We study the problem of synthesizing control strategies for Linear Temporal Logic (LTL) objectives in unknown environments. We model this problem as a turn-based zero-sum stochastic game between the controller and the environment, where the transition probabilities and the model topology are fully unknown. The winning condition for the controller in this game is the satisfaction of the given LTL s…

    Submitted 2 October, 2020; originally announced October 2020.

  7. arXiv:1909.07299  [pdf, other]

    cs.RO cs.AI cs.LG

    Control Synthesis from Linear Temporal Logic Specifications using Model-Free Reinforcement Learning

    Authors: Alper Kamil Bozkurt, Yu Wang, Michael M. Zavlanos, Miroslav Pajic

    Abstract: We present a reinforcement learning (RL) framework to synthesize a control policy from a given linear temporal logic (LTL) specification in an unknown stochastic environment that can be modeled as a Markov Decision Process (MDP). Specifically, we learn a policy that maximizes the probability of satisfying the LTL formula without learning the transition probabilities. We introduce a novel rewarding…

    Submitted 5 March, 2020; v1 submitted 16 September, 2019; originally announced September 2019.

  8. arXiv:1904.03264  [pdf, other]

    cs.FL eess.SY

    Attack-Resilient Supervisory Control of Discrete-Event Systems: A Finite-State Transducer Approach

    Authors: Yu Wang, Alper Kamil Bozkurt, Nathan Smith, Miroslav Pajic

    Abstract: Resilience to sensor and actuator attacks is a major concern in the supervisory control of discrete events in cyber-physical systems (CPS). In this work, we propose a new framework to design supervisors for CPS under attacks using finite-state transducers (FSTs) to model the effects of the discrete events. FSTs can capture a general class of regular-rewriting attacks in which an attacker can nonde…

    Submitted 29 June, 2023; v1 submitted 5 April, 2019; originally announced April 2019.