Skip to main content

Showing 1–4 of 4 results for author: Sterzing, V

Searching in archive cs. Search in all archives.
.
  1. Learning Control Policies for Variable Objectives from Offline Data

    Authors: Marc Weber, Phillip Swazinna, Daniel Hein, Steffen Udluft, Volkmar Sterzing

    Abstract: Offline reinforcement learning provides a viable approach to obtain advanced control strategies for dynamical systems, in particular when direct interaction with the environment is not available. In this paper, we introduce a conceptual extension for model-based policy search methods, called variable objective policy (VOP). With this approach, policies are trained to generalize efficiently over a… ▽ More

    Submitted 11 August, 2023; originally announced August 2023.

    Comments: 8 pages, 7 figures

    Journal ref: 2023 IEEE Symposium Series on Computational Intelligence

  2. arXiv:1709.09480  [pdf, other

    cs.AI cs.LG eess.SY

    A Benchmark Environment Motivated by Industrial Control Problems

    Authors: Daniel Hein, Stefan Depeweg, Michel Tokic, Steffen Udluft, Alexander Hentschel, Thomas A. Runkler, Volkmar Sterzing

    Abstract: In the research area of reinforcement learning (RL), frequently novel and promising methods are developed and introduced to the RL community. However, although many researchers are keen to apply their methods on real-world problems, implementing such methods in real industry environments often is a frustrating and tedious process. Generally, academic research groups have only limited access to rea… ▽ More

    Submitted 24 November, 2022; v1 submitted 27 September, 2017; originally announced September 2017.

    Journal ref: 2017 IEEE Symposium Series on Computational Intelligence (SSCI)

  3. arXiv:1705.07262  [pdf, ps, other

    cs.LG cs.AI cs.NE eess.SY

    Batch Reinforcement Learning on the Industrial Benchmark: First Experiences

    Authors: Daniel Hein, Steffen Udluft, Michel Tokic, Alexander Hentschel, Thomas A. Runkler, Volkmar Sterzing

    Abstract: The Particle Swarm Optimization Policy (PSO-P) has been recently introduced and proven to produce remarkable results on interacting with academic reinforcement learning benchmarks in an off-policy, batch-based setting. To further investigate the properties and feasibility on real-world applications, this paper investigates PSO-P on the so-called Industrial Benchmark (IB), a novel reinforcement lea… ▽ More

    Submitted 27 July, 2017; v1 submitted 20 May, 2017; originally announced May 2017.

    Journal ref: 2017 International Joint Conference on Neural Networks (IJCNN), Anchorage, AK, 2017, pp. 4214-4221

  4. arXiv:1610.03793  [pdf, ps, other

    cs.LG

    Introduction to the "Industrial Benchmark"

    Authors: Daniel Hein, Alexander Hentschel, Volkmar Sterzing, Michel Tokic, Steffen Udluft

    Abstract: A novel reinforcement learning benchmark, called Industrial Benchmark, is introduced. The Industrial Benchmark aims at being be realistic in the sense, that it includes a variety of aspects that we found to be vital in industrial applications. It is not designed to be an approximation of any real system, but to pose the same hardness and complexity.

    Submitted 28 September, 2017; v1 submitted 12 October, 2016; originally announced October 2016.

    Comments: 11 pages