
Showing 1–2 of 2 results for author: Botto, M A

  1. arXiv:2011.02141  [pdf, other]

    cs.LG

    Control with adaptive Q-learning

    Authors: João Pedro Araújo, Mário A. T. Figueiredo, Miguel Ayala Botto

    Abstract: This paper evaluates adaptive Q-learning (AQL) and single-partition adaptive Q-learning (SPAQL), two algorithms for efficient model-free episodic reinforcement learning (RL), in two classical control problems (Pendulum and Cartpole). AQL adaptively partitions the state-action space of a Markov decision process (MDP) while learning the control policy, i.e., the mapping from states to actions. The…

    Submitted 3 November, 2020; originally announced November 2020.

    Comments: 29 pages, 13 figures. arXiv admin note: substantial text overlap with arXiv:2007.06741

  2. arXiv:2007.06741  [pdf, other]

    cs.LG stat.ML

    Single-partition adaptive Q-learning

    Authors: João Pedro Araújo, Mário Figueiredo, Miguel Ayala Botto

    Abstract: This paper introduces single-partition adaptive Q-learning (SPAQL), an algorithm for model-free episodic reinforcement learning (RL), which adaptively partitions the state-action space of a Markov decision process (MDP) while simultaneously learning a time-invariant policy (i.e., the mapping from states to actions does not depend explicitly on the episode time step) for maximizing the cumulative…

    Submitted 13 July, 2020; originally announced July 2020.

    Comments: 34 pages, 15 figures

    MSC Class: 68T05; ACM Class: I.2.6