Skip to main content

Showing 1–7 of 7 results for author: Kalagarla, K C

.
  1. arXiv:2305.14736  [pdf, other

    cs.AI cs.FL eess.SY

    Optimal Control of Logically Constrained Partially Observable and Multi-Agent Markov Decision Processes

    Authors: Krishna C. Kalagarla, Dhruva Kartik, Dongming Shen, Rahul Jain, Ashutosh Nayyar, Pierluigi Nuzzo

    Abstract: Autonomous systems often have logical constraints arising, for example, from safety, operational, or regulatory requirements. Such constraints can be expressed using temporal logic specifications. The system state is often partially observable. Moreover, it could encompass a team of multiple agents with a common objective but disparate information structures and constraints. In this paper, we firs… ▽ More

    Submitted 19 June, 2024; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2203.09038

  2. arXiv:2301.11547  [pdf, other

    cs.LG cs.AI eess.SY

    Safe Posterior Sampling for Constrained MDPs with Bounded Constraint Violation

    Authors: Krishna C Kalagarla, Rahul Jain, Pierluigi Nuzzo

    Abstract: Constrained Markov decision processes (CMDPs) model scenarios of sequential decision making with multiple objectives that are increasingly important in many applications. However, the model is often unknown and must be learned online while still ensuring the constraint is met, or at least the violation is bounded with time. Some recent papers have made progress on this very challenging problem but… ▽ More

    Submitted 27 January, 2023; originally announced January 2023.

  3. arXiv:2203.09038  [pdf, other

    eess.SY

    Optimal Control of Partially Observable Markov Decision Processes with Finite Linear Temporal Logic Constraints

    Authors: Krishna C. Kalagarla, Dhruva Kartik, Dongming Shen, Rahul Jain, Ashutosh Nayyar, Pierluigi Nuzzo

    Abstract: Autonomous agents often operate in scenarios where the state is partially observed. In addition to maximizing their cumulative reward, agents must execute complex tasks with rich temporal and logical structures. These tasks can be expressed using temporal logic languages like finite linear temporal logic (LTL_f). This paper, for the first time, provides a structured framework for designing agent p… ▽ More

    Submitted 16 March, 2022; originally announced March 2022.

  4. arXiv:2109.13377  [pdf, other

    eess.SY cs.AI cs.FL

    Model-Free Reinforcement Learning for Optimal Control of MarkovDecision Processes Under Signal Temporal Logic Specifications

    Authors: Krishna C. Kalagarla, Rahul Jain, Pierluigi Nuzzo

    Abstract: We present a model-free reinforcement learning algorithm to find an optimal policy for a finite-horizon Markov decision process while guaranteeing a desired lower bound on the probability of satisfying a signal temporal logic (STL) specification. We propose a method to effectively augment the MDP state space to capture the required state history and express the STL objective as a reachability obje… ▽ More

    Submitted 27 September, 2021; originally announced September 2021.

    Comments: Accepted for CDC 2021

  5. arXiv:2011.00632  [pdf, other

    eess.SY cs.FL

    Synthesis of Discounted-Reward Optimal Policies for Markov Decision Processes Under Linear Temporal Logic Specifications

    Authors: Krishna C. Kalagarla, Rahul Jain, Pierluigi Nuzzo

    Abstract: We present a method to find an optimal policy with respect to a reward function for a discounted Markov decision process under general linear temporal logic (LTL) specifications. Previous work has either focused on maximizing a cumulative reward objective under finite-duration tasks, specified by syntactically co-safe LTL, or maximizing an average reward for persistent (e.g., surveillance) tasks.… ▽ More

    Submitted 22 March, 2021; v1 submitted 1 November, 2020; originally announced November 2020.

    Comments: Accepted for ACC 2021

  6. arXiv:2010.14785  [pdf, other

    cs.LG cs.AI

    Designing Interpretable Approximations to Deep Reinforcement Learning

    Authors: Nathan Dahlin, Krishna Chaitanya Kalagarla, Nikhil Naik, Rahul Jain, Pierluigi Nuzzo

    Abstract: In an ever expanding set of research and application areas, deep neural networks (DNNs) set the bar for algorithm performance. However, depending upon additional constraints such as processing power and execution time limits, or requirements such as verifiable safety guarantees, it may not be feasible to actually use such high-performing DNNs in practice. Many techniques have been developed in rec… ▽ More

    Submitted 19 June, 2021; v1 submitted 28 October, 2020; originally announced October 2020.

  7. arXiv:2009.11348  [pdf, ps, other

    cs.LG cs.AI eess.SY stat.ML

    A Sample-Efficient Algorithm for Episodic Finite-Horizon MDP with Constraints

    Authors: Krishna C. Kalagarla, Rahul Jain, Pierluigi Nuzzo

    Abstract: Constrained Markov Decision Processes (CMDPs) formalize sequential decision-making problems whose objective is to minimize a cost function while satisfying constraints on various cost functions. In this paper, we consider the setting of episodic fixed-horizon CMDPs. We propose an online algorithm which leverages the linear programming formulation of finite-horizon CMDP for repeated optimistic plan… ▽ More

    Submitted 23 September, 2020; originally announced September 2020.