Showing 1–10 of 10 results for author: Eimer, T

  1. arXiv:2306.08107  [pdf, other]

    cs.LG cs.CL

    AutoML in the Age of Large Language Models: Current Challenges, Future Opportunities and Risks

    Authors: Alexander Tornede, Difan Deng, Theresa Eimer, Joseph Giovanelli, Aditya Mohan, Tim Ruhkopf, Sarah Segel, Daphne Theodorakopoulos, Tanja Tornede, Henning Wachsmuth, Marius Lindauer

    Abstract: The fields of both Natural Language Processing (NLP) and Automated Machine Learning (AutoML) have achieved remarkable results over the past years. In NLP, especially Large Language Models (LLMs) have experienced a rapid series of breakthroughs very recently. We envision that the two fields can radically push the boundaries of each other through tight integration. To showcase this vision, we explor…

    Submitted 21 February, 2024; v1 submitted 13 June, 2023; originally announced June 2023.

    Comments: Submitted and accepted at TMLR: https://openreview.net/forum?id=cAthubStyG

  2. arXiv:2306.01324  [pdf, other]

    cs.LG

    Hyperparameters in Reinforcement Learning and How To Tune Them

    Authors: Theresa Eimer, Marius Lindauer, Roberta Raileanu

    Abstract: In order to improve reproducibility, deep reinforcement learning (RL) has been adopting better scientific practices such as standardized evaluation metrics and reporting. However, the process of hyperparameter optimization still varies widely across papers, which makes it challenging to compare RL algorithms fairly. In this paper, we show that hyperparameter choices in RL can significantly affect…

    Submitted 2 June, 2023; originally announced June 2023.

  3. arXiv:2212.10876  [pdf, other]

    cs.LG

    Hyperparameters in Contextual RL are Highly Situational

    Authors: Theresa Eimer, Carolin Benjamins, Marius Lindauer

    Abstract: Although Reinforcement Learning (RL) has shown impressive results in games and simulation, real-world application of RL suffers from its instability under changing environment conditions and hyperparameters. We give a first impression of the extent of this instability by showing that the hyperparameters found by automatic hyperparameter optimization (HPO) methods are not only dependent on the prob…

    Submitted 21 December, 2022; originally announced December 2022.

  4. arXiv:2205.13881  [pdf, other]

    cs.AI cs.LG cs.NE

    Automated Dynamic Algorithm Configuration

    Authors: Steven Adriaensen, André Biedenkapp, Gresa Shala, Noor Awad, Theresa Eimer, Marius Lindauer, Frank Hutter

    Abstract: The performance of an algorithm often critically depends on its parameter configuration. While a variety of automated algorithm configuration methods have been proposed to relieve users from the tedious and error-prone task of manually tuning parameters, there is still a lot of untapped potential as the learned configuration is static, i.e., parameter settings remain fixed throughout the run. Howe…

    Submitted 27 May, 2022; originally announced May 2022.

  5. arXiv:2202.04500  [pdf, other]

    cs.LG

    Contextualize Me -- The Case for Context in Reinforcement Learning

    Authors: Carolin Benjamins, Theresa Eimer, Frederik Schubert, Aditya Mohan, Sebastian Döhler, André Biedenkapp, Bodo Rosenhahn, Frank Hutter, Marius Lindauer

    Abstract: While Reinforcement Learning (RL) has made great strides towards solving increasingly complicated problems, many algorithms are still brittle to even slight environmental changes. Contextual Reinforcement Learning (cRL) provides a framework to model such changes in a principled manner, thereby enabling flexible, precise and interpretable task specification and generation. Our goal is to show how…

    Submitted 2 June, 2023; v1 submitted 9 February, 2022; originally announced February 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2110.02102

  6. Automated Reinforcement Learning (AutoRL): A Survey and Open Problems

    Authors: Jack Parker-Holder, Raghu Rajan, Xingyou Song, André Biedenkapp, Yingjie Miao, Theresa Eimer, Baohe Zhang, Vu Nguyen, Roberto Calandra, Aleksandra Faust, Frank Hutter, Marius Lindauer

    Abstract: The combination of Reinforcement Learning (RL) with deep learning has led to a series of impressive feats, with many believing (deep) RL provides a path towards generally capable agents. However, the success of RL agents is often highly sensitive to design choices in the training process, which may require tedious and error-prone manual tuning. This makes it challenging to use RL for new problems,…

    Submitted 2 June, 2022; v1 submitted 11 January, 2022; originally announced January 2022.

    Comments: Published in JAIR. Co-first authors and co-last authors are listed in alphabetical order

    MSC Class: 68T01 ACM Class: I.2.6

    Journal ref: Journal of Artificial Intelligence Research 74 (2022) 517-568

  7. arXiv:2110.02102  [pdf, other]

    cs.LG

    CARL: A Benchmark for Contextual and Adaptive Reinforcement Learning

    Authors: Carolin Benjamins, Theresa Eimer, Frederik Schubert, André Biedenkapp, Bodo Rosenhahn, Frank Hutter, Marius Lindauer

    Abstract: While Reinforcement Learning has made great strides towards solving ever more complicated tasks, many algorithms are still brittle to even slight changes in their environment. This is a limiting factor for real-world applications of RL. Although the research community continuously aims at improving both robustness and generalization of RL algorithms, unfortunately it still lacks an open-source set…

    Submitted 11 October, 2021; v1 submitted 5 October, 2021; originally announced October 2021.

    Journal ref: Workshop on Ecological Theory of Reinforcement Learning, NeurIPS 2021

  8. arXiv:2106.06317  [pdf, other]

    cs.LG

    Automatic Risk Adaptation in Distributional Reinforcement Learning

    Authors: Frederik Schubert, Theresa Eimer, Bodo Rosenhahn, Marius Lindauer

    Abstract: The use of Reinforcement Learning (RL) agents in practical applications requires the consideration of suboptimal outcomes, depending on the familiarity of the agent with its environment. This is especially important in safety-critical environments, where errors can lead to high costs or damage. In distributional RL, the risk-sensitivity can be controlled via different distortion measures of the es…

    Submitted 11 June, 2021; originally announced June 2021.

    Journal ref: Reinforcement Learning for Real Life Workshop, ICML 2021

  9. arXiv:2106.05110  [pdf, other]

    cs.LG

    Self-Paced Context Evaluation for Contextual Reinforcement Learning

    Authors: Theresa Eimer, André Biedenkapp, Frank Hutter, Marius Lindauer

    Abstract: Reinforcement learning (RL) has made a lot of advances for solving a single problem in a given environment; but learning policies that generalize to unseen variations of a problem remains challenging. To improve sample efficiency for learning on such instances of a problem domain, we present Self-Paced Context Evaluation (SPaCE). Based on self-paced learning, SPaCE automatically generates task cur…

    Submitted 9 June, 2021; originally announced June 2021.

    Journal ref: Proceedings of Machine Learning Research 139 (ICML 2021)

  10. arXiv:2105.08541  [pdf, other]

    cs.AI

    DACBench: A Benchmark Library for Dynamic Algorithm Configuration

    Authors: Theresa Eimer, André Biedenkapp, Maximilian Reimer, Steven Adriaensen, Frank Hutter, Marius Lindauer

    Abstract: Dynamic Algorithm Configuration (DAC) aims to dynamically control a target algorithm's hyperparameters in order to improve its performance. Several theoretical and empirical results have demonstrated the benefits of dynamically controlling hyperparameters in domains like evolutionary computation, AI Planning or deep learning. Replicating these results, as well as studying new methods for DAC, howe…

    Submitted 18 May, 2021; originally announced May 2021.

    Comments: Accepted at IJCAI 2021

    Journal ref: 30th International Joint Conference on Artificial Intelligence (IJCAI 2021)