Showing 1–8 of 8 results for author: Kara, A D

Searching in archive eess.
  1. arXiv:2311.00123  [pdf, other]

    math.OC cs.AI eess.SY

    Q-Learning for Stochastic Control under General Information Structures and Non-Markovian Environments

    Authors: Ali Devran Kara, Serdar Yuksel

    Abstract: As a primary contribution, we present a convergence theorem for stochastic iterations, and in particular, Q-learning iterates, under a general, possibly non-Markovian, stochastic environment. Our conditions for convergence involve an ergodicity and a positivity criterion. We provide a precise characterization on the limit of the iterates and conditions on the environment and initializations for co…

    Submitted 4 March, 2024; v1 submitted 31 October, 2023; originally announced November 2023.

    Comments: 2 figures

  2. arXiv:2309.11744  [pdf, ps, other]

    math.OC eess.SY

    Infinite Horizon Average Cost Optimality Criteria for Mean-Field Control

    Authors: Erhan Bayraktar, Ali D. Kara

    Abstract: We study mean-field control problems in discrete-time under the infinite horizon average cost optimality criteria. We focus on both the finite population and the infinite population setups. We show the existence of a solution to the average cost optimality equation (ACOE) and the existence of optimal stationary Markov policies for finite population problems under (i) a minorization condition that…

    Submitted 17 April, 2024; v1 submitted 20 September, 2023; originally announced September 2023.

  3. arXiv:2308.07591  [pdf, other]

    math.OC eess.SY

    Q-Learning for Continuous State and Action MDPs under Average Cost Criteria

    Authors: Ali Devran Kara, Serdar Yuksel

    Abstract: For infinite-horizon average-cost criterion problems, there exist relatively few rigorous approximation and reinforcement learning results. In this paper, for such problems, we present several approximation and reinforcement learning results for Markov Decision Processes with standard Borel spaces. Toward this end, (i) we first provide a discretization based approximation method for fully observed…

    Submitted 19 March, 2024; v1 submitted 15 August, 2023; originally announced August 2023.

    Comments: 3 figures

  4. arXiv:2111.06781  [pdf, ps, other]

    cs.LG eess.SY

    Q-Learning for MDPs with General Spaces: Convergence and Near Optimality via Quantization under Weak Continuity

    Authors: Ali Devran Kara, Naci Saldi, Serdar Yüksel

    Abstract: Reinforcement learning algorithms often require finiteness of state and action spaces in Markov decision processes (MDPs) (also called controlled Markov chains) and various efforts have been made in the literature towards the applicability of such algorithms for continuous state and action spaces. In this paper, we show that under very mild regularity conditions (in particular, involving only weak…

    Submitted 7 September, 2023; v1 submitted 12 November, 2021; originally announced November 2021.

  5. arXiv:2103.12158  [pdf, other]

    cs.LG eess.SY

    Convergence of Finite Memory Q-Learning for POMDPs and Near Optimality of Learned Policies under Filter Stability

    Authors: Ali Devran Kara, Serdar Yuksel

    Abstract: In this paper, for POMDPs, we provide the convergence of a Q learning algorithm for control policies using a finite history of past observations and control actions, and, consequentially, we establish near optimality of such limit Q functions under explicit filter stability conditions. We present explicit error bounds relating the approximation error to the length of the finite history window. We…

    Submitted 25 October, 2022; v1 submitted 22 March, 2021; originally announced March 2021.

  6. arXiv:2003.05769  [pdf, ps, other]

    eess.SY

    Robustness to Incorrect Models and Data-Driven Learning in Average-Cost Optimal Stochastic Control

    Authors: Ali Devran Kara, Maxim Raginsky, Serdar Yuksel

    Abstract: We study continuity and robustness properties of infinite-horizon average expected cost problems with respect to (controlled) transition kernels, and applications of these results to the problem of robustness of control policies designed for approximate models applied to actual systems. We show that sufficient conditions presented in the literature for discounted-cost problems are in general not s…

    Submitted 20 December, 2020; v1 submitted 11 March, 2020; originally announced March 2020.

    Comments: Presented at Conference on Decision and Control 2019. arXiv admin note: text overlap with arXiv:1803.06046

  7. arXiv:1803.06046  [pdf, ps, other]

    eess.SY

    Robustness to incorrect system models in stochastic control

    Authors: Ali Devran Kara, Serdar Yüksel

    Abstract: In stochastic control applications, typically only an ideal model (controlled transition kernel) is assumed and the control design is based on the given model, raising the problem of performance loss due to the mismatch between the assumed model and the actual model. Toward this end, we study continuity properties of discrete-time stochastic control problems with respect to system models (i.e., co…

    Submitted 1 February, 2020; v1 submitted 15 March, 2018; originally announced March 2018.

    Comments: Conference version to appear at the 2018 IEEE CDC with title "Robustness to Incorrect System Models in Stochastic Control and Application to Data-Driven Learning". The paper is to appear in SIAM J. on Control and Optimization

  8. arXiv:1803.05103  [pdf, ps, other]

    eess.SY math.OC

    Robustness to incorrect priors in partially observed stochastic control

    Authors: Ali Devran Kara, Serdar Yüksel

    Abstract: We study the continuity properties of optimal solutions to stochastic control problems with respect to initial probability measures and applications of these to the robustness of optimal control policies applied to systems with incomplete or incorrect priors. It is shown that for single and multi-stage optimal cost problems, continuity and robustness cannot be established under weak convergence or…

    Submitted 13 April, 2019; v1 submitted 13 March, 2018; originally announced March 2018.