Skip to main content

Showing 1–4 of 4 results for author: H, J A M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2212.07123  [pdf, other

    cs.LG cs.AI eess.SY

    Reinforcement Learning in System Identification

    Authors: Jose Antonio Martin H., Oscar Fernandez Vicente, Sergio Perez, Anas Belfadil, Cristina Ibanez-Llano, Freddy Jose Perozo Rondon, Jose Javier Valle, Javier Arechalde Pelaz

    Abstract: System identification, also known as learning forward models, transfer functions, system dynamics, etc., has a long tradition both in science and engineering in different fields. Particularly, it is a recurring theme in Reinforcement Learning research, where forward models approximate the state transition function of a Markov Decision Process by learning a map** function from current state and a… ▽ More

    Submitted 14 December, 2022; originally announced December 2022.

    Comments: Accepted in Neurips Deep Reinforcement Learning Workshop 2022: https://openreview.net/forum?id=fGcbpWQIJZV

  2. arXiv:1104.0510  [pdf, other

    math.CO cs.CC cs.DM

    Minimal non-extensible precolorings and implicit-relations

    Authors: José Antonio Martín H

    Abstract: In this paper I study a variant of the general vertex coloring problem called precoloring. Specifically, I study graph precolorings, by develo** new theory, for characterizing the minimal non-extensible precolorings. It is interesting per se that, for graphs of arbitrarily large chromatic number, the minimal number of colored vertices, in a non-extensible precoloring, remains constant; only two… ▽ More

    Submitted 4 April, 2011; originally announced April 2011.

    MSC Class: Primary 05C15; 05C75; Secondary 05C90; 05C69

  3. arXiv:1101.6038  [pdf, other

    cs.DM cs.CC cs.DS math.CO

    A polynomial 3-colorability algorithm with automatic generation of NO 3-colorability (i.e. Co-NP) short proofs

    Authors: Jose Antonio Martin H

    Abstract: In this paper, an algorithm for determining 3-colorability, i.e. the decision problem (YES/NO), in planar graphs is presented. The algorithm, although not exact (it could produce false positives) has two very important features: (i) it has polynomial complexity and (ii) for every "NO" answer, a "short" proof is generated, which is of much interest since 3-colorability is a NP-complete problem and… ▽ More

    Submitted 31 January, 2011; originally announced January 2011.

  4. arXiv:1101.4003  [pdf, other

    cs.AI cs.LG eess.SY math.OC

    Dyna-H: a heuristic planning reinforcement learning algorithm applied to role-playing-game strategy decision systems

    Authors: Matilde Santos, Jose Antonio Martin H., Victoria Lopez, Guillermo Botella

    Abstract: In a Role-Playing Game, finding optimal trajectories is one of the most important tasks. In fact, the strategy decision system becomes a key component of a game engine. Determining the way in which decisions are taken (online, batch or simulated) and the consumed resources in decision making (e.g. execution time, memory) will influence, in mayor degree, the game performance. When classical search… ▽ More

    Submitted 30 July, 2011; v1 submitted 20 January, 2011; originally announced January 2011.

    MSC Class: 68T05 ACM Class: I.2