Skip to main content

Showing 1–5 of 5 results for author: Palombarini, J A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2112.08094  [pdf, other

    cs.LG

    Automatic tuning of hyper-parameters of reinforcement learning algorithms using Bayesian optimization with behavioral cloning

    Authors: Juan Cruz Barsce, Jorge A. Palombarini, Ernesto C. Martínez

    Abstract: Optimal setting of several hyper-parameters in machine learning algorithms is key to make the most of available data. To this aim, several methods such as evolutionary strategies, random search, Bayesian optimization and heuristic rules of thumb have been proposed. In reinforcement learning (RL), the information content of data gathered by the learning agent while interacting with its environment… ▽ More

    Submitted 15 December, 2021; originally announced December 2021.

    Comments: Under review at Computational Intelligence

  2. arXiv:1909.08332  [pdf, other

    cs.LG cs.AI stat.ML

    A Hierarchical Two-tier Approach to Hyper-parameter Optimization in Reinforcement Learning

    Authors: Juan Cruz Barsce, Jorge A. Palombarini, Ernesto Martínez

    Abstract: Optimization of hyper-parameters in reinforcement learning (RL) algorithms is a key task, because they determine how the agent will learn its policy by interacting with its environment, and thus what data is gathered. In this work, an approach that uses Bayesian optimization to perform a two-step optimization is proposed: first, categorical RL structure hyper-parameters are taken as binary variabl… ▽ More

    Submitted 18 September, 2019; originally announced September 2019.

    Comments: Short paper presented in the Jornadas Argentinas de Informática (JAIIO) 2019 (Salta, Argentina), describing an ongoing research on RL hyper-parameter tuning

  3. arXiv:1805.04752  [pdf

    cs.AI

    Generating Rescheduling Knowledge using Reinforcement Learning in a Cognitive Architecture

    Authors: Jorge A. Palombarini, Juan Cruz Barsce, Ernesto C. Martínez

    Abstract: In order to reach higher degrees of flexibility, adaptability and autonomy in manufacturing systems, it is essential to develop new rescheduling methodologies which resort to cognitive capabilities, similar to those found in human beings. Artificial cognition is important for designing planning and control systems that generate and represent knowledge about heuristics for repair-based scheduling.… ▽ More

    Submitted 12 May, 2018; originally announced May 2018.

    Comments: Conference paper presented in the Jornadas Argentinas de Informática (JAIIO) 2014. arXiv admin note: text overlap with arXiv:1805.04749

  4. arXiv:1805.04749  [pdf

    cs.AI

    A Cognitive Approach to Real-time Rescheduling using SOAR-RL

    Authors: Juan Cruz Barsce, Jorge A. Palombarini, Ernesto C. Martínez

    Abstract: Ensuring flexible and efficient manufacturing of customized products in an increasing dynamic and turbulent environment without sacrificing cost effectiveness, product quality and on-time delivery has become a key issue for most industrial enterprises. A promising approach to cope with this challenge is the integration of cognitive capabilities in systems and processes with the aim of expanding th… ▽ More

    Submitted 12 May, 2018; originally announced May 2018.

    Comments: Conference paper presented in the Argentinian Congress of Computer Science 2013

  5. arXiv:1805.04748  [pdf, other

    cs.AI cs.LG

    Towards Autonomous Reinforcement Learning: Automatic Setting of Hyper-parameters using Bayesian Optimization

    Authors: Juan Cruz Barsce, Jorge A. Palombarini, Ernesto C. Martínez

    Abstract: With the increase of machine learning usage by industries and scientific communities in a variety of tasks such as text mining, image recognition and self-driving cars, automatic setting of hyper-parameter in learning algorithms is a key factor for achieving satisfactory performance regardless of user expertise in the inner workings of the techniques and methodologies. In particular, for a reinfor… ▽ More

    Submitted 12 May, 2018; originally announced May 2018.

    Comments: Paper submitted to CLEI Electronic Journal. This is an extended version of the conference paper presented at Latin American Computer Conference (CLEI), 2017