Showing 1–50 of 62 results for author: Lindauer, M

  1. arXiv:2406.05088  [pdf, other]

    cs.LG

    Optimizing Time Series Forecasting Architectures: A Hierarchical Neural Architecture Search Approach

    Authors: Difan Deng, Marius Lindauer

    Abstract: The rapid development of time series forecasting research has brought many deep learning-based modules in this field. However, despite the increasing amount of new forecasting architectures, it is still unclear if we have leveraged the full potential of these existing modules within a properly designed architecture. In this work, we propose a novel hierarchical neural architecture search approach…

    Submitted 7 June, 2024; originally announced June 2024.

  2. arXiv:2406.03348  [pdf, other]

    cs.LG

    Position: A Call to Action for a Human-Centered AutoML Paradigm

    Authors: Marius Lindauer, Florian Karl, Anne Klier, Julia Moosbauer, Alexander Tornede, Andreas Mueller, Frank Hutter, Matthias Feurer, Bernd Bischl

    Abstract: Automated machine learning (AutoML) was formed around the fundamental objectives of automatically and efficiently configuring machine learning (ML) workflows, aiding the research of new ML algorithms, and contributing to the democratization of ML by making it accessible to a broader audience. Over the past decade, commendable achievements in AutoML have primarily focused on optimizing predictive p…

    Submitted 5 June, 2024; originally announced June 2024.

  3. arXiv:2405.07640  [pdf, other]

    cs.LG cs.AI

    Hyperparameter Importance Analysis for Multi-Objective AutoML

    Authors: Daphne Theodorakopoulos, Frederic Stahl, Marius Lindauer

    Abstract: Hyperparameter optimization plays a pivotal role in enhancing the predictive performance and generalization capabilities of ML models. However, in many applications, we do not only care about predictive performance but also about objectives such as inference time, memory, or energy consumption. In such MOO scenarios, determining the importance of hyperparameters poses a significant challenge due t…

    Submitted 15 May, 2024; v1 submitted 13 May, 2024; originally announced May 2024.

  4. arXiv:2404.01965  [pdf, other]

    cs.LG cs.AI

    Towards Leveraging AutoML for Sustainable Deep Learning: A Multi-Objective HPO Approach on Deep Shift Neural Networks

    Authors: Leona Hennig, Tanja Tornede, Marius Lindauer

    Abstract: Deep Learning (DL) has advanced various fields by extracting complex patterns from large datasets. However, the computational demands of DL models pose environmental and resource challenges. Deep shift neural networks (DSNNs) offer a solution by leveraging shift operations to reduce computational complexity at inference. Following the insights from standard DNNs, we are interested in leveraging th…

    Submitted 4 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

  5. arXiv:2312.08528  [pdf, other]

    cs.LG

    auto-sktime: Automated Time Series Forecasting

    Authors: Marc-André Zöller, Marius Lindauer, Marco F. Huber

    Abstract: In today's data-driven landscape, time series forecasting is pivotal in decision-making across various sectors. Yet, the proliferation of more diverse time series data, coupled with the expanding landscape of available forecasting methods, poses significant challenges for forecasters. To meet the growing demand for efficient forecasting, we introduce auto-sktime, a novel framework for automated ti…

    Submitted 30 April, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

    Comments: Accepted at LION18

  6. arXiv:2309.03581  [pdf, other]

    cs.LG cs.AI

    Interactive Hyperparameter Optimization in Multi-Objective Problems via Preference Learning

    Authors: Joseph Giovanelli, Alexander Tornede, Tanja Tornede, Marius Lindauer

    Abstract: Hyperparameter optimization (HPO) is important to leverage the full potential of machine learning (ML). In practice, users are often interested in multi-objective (MO) problems, i.e., optimizing potentially conflicting objectives, like accuracy and energy consumption. To tackle this, the vast majority of MO-ML algorithms return a Pareto front of non-dominated machine learning models to the user. O…

    Submitted 11 January, 2024; v1 submitted 7 September, 2023; originally announced September 2023.

  7. arXiv:2306.16913  [pdf, other]

    cs.LG cs.AI cs.DB

    AutoML in Heavily Constrained Applications

    Authors: Felix Neutatz, Marius Lindauer, Ziawasch Abedjan

    Abstract: Optimizing a machine learning pipeline for a task at hand requires careful configuration of various hyperparameters, typically supported by an AutoML system that optimizes the hyperparameters for the given training dataset. Yet, depending on the AutoML system's own second-order meta-configuration, the performance of the AutoML process can vary significantly. Current AutoML systems cannot automatic…

    Submitted 16 October, 2023; v1 submitted 29 June, 2023; originally announced June 2023.

  8. Structure in Deep Reinforcement Learning: A Survey and Open Problems

    Authors: Aditya Mohan, Amy Zhang, Marius Lindauer

    Abstract: Reinforcement Learning (RL), bolstered by the expressive capabilities of Deep Neural Networks (DNNs) for function approximation, has demonstrated considerable success in numerous applications. However, its practicality in addressing various real-world scenarios, characterized by diverse and unpredictable dynamics, noisy signals, and large state and action spaces, remains limited. This limitation s…

    Submitted 25 April, 2024; v1 submitted 28 June, 2023; originally announced June 2023.

    Comments: Published at the Journal of Artificial Intelligence Research, Volume 79, Pages 1167-1236

  9. arXiv:2306.12370  [pdf, other]

    cs.LG

    PriorBand: Practical Hyperparameter Optimization in the Age of Deep Learning

    Authors: Neeratyoy Mallik, Edward Bergman, Carl Hvarfner, Danny Stoll, Maciej Janowski, Marius Lindauer, Luigi Nardi, Frank Hutter

    Abstract: Hyperparameters of Deep Learning (DL) pipelines are crucial for their downstream performance. While a large number of methods for Hyperparameter Optimization (HPO) have been developed, their incurred costs are often untenable for modern DL. Consequently, manual experimentation is still the most prevalent approach to optimize hyperparameters, relying on the researcher's intuition, domain knowledge,…

    Submitted 15 November, 2023; v1 submitted 21 June, 2023; originally announced June 2023.

  10. Automated Machine Learning for Remaining Useful Life Predictions

    Authors: Marc-André Zöller, Fabian Mauthe, Peter Zeiler, Marius Lindauer, Marco F. Huber

    Abstract: Being able to predict the remaining useful life (RUL) of an engineering system is an important task in prognostics and health management. Recently, data-driven approaches to RUL predictions are becoming prevalent over model-based approaches since no underlying physical knowledge of the engineering system is required. Yet, this just replaces required expertise of the underlying physics with machine…

    Submitted 21 June, 2023; originally announced June 2023.

    Comments: Manuscript accepted at IEEE SMC 2023

  11. arXiv:2306.08107  [pdf, other]

    cs.LG cs.CL

    AutoML in the Age of Large Language Models: Current Challenges, Future Opportunities and Risks

    Authors: Alexander Tornede, Difan Deng, Theresa Eimer, Joseph Giovanelli, Aditya Mohan, Tim Ruhkopf, Sarah Segel, Daphne Theodorakopoulos, Tanja Tornede, Henning Wachsmuth, Marius Lindauer

    Abstract: The fields of both Natural Language Processing (NLP) and Automated Machine Learning (AutoML) have achieved remarkable results over the past years. In NLP, especially Large Language Models (LLMs) have experienced a rapid series of breakthroughs very recently. We envision that the two fields can radically push the boundaries of each other through tight integration. To showcase this vision, we explor…

    Submitted 21 February, 2024; v1 submitted 13 June, 2023; originally announced June 2023.

    Comments: Submitted and accepted at TMLR: https://openreview.net/forum?id=cAthubStyG

  12. arXiv:2306.04262  [pdf, other]

    cs.LG

    Self-Adjusting Weighted Expected Improvement for Bayesian Optimization

    Authors: Carolin Benjamins, Elena Raponi, Anja Jankovic, Carola Doerr, Marius Lindauer

    Abstract: Bayesian Optimization (BO) is a class of surrogate-based, sample-efficient algorithms for optimizing black-box problems with small evaluation budgets. The BO pipeline itself is highly configurable with many different design choices regarding the initial design, surrogate model, and acquisition function (AF). Unfortunately, our understanding of how to select suitable components for a problem at han…

    Submitted 30 June, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: AutoML Conference 2023

  13. arXiv:2306.01324  [pdf, other]

    cs.LG

    Hyperparameters in Reinforcement Learning and How To Tune Them

    Authors: Theresa Eimer, Marius Lindauer, Roberta Raileanu

    Abstract: In order to improve reproducibility, deep reinforcement learning (RL) has been adopting better scientific practices such as standardized evaluation metrics and reporting. However, the process of hyperparameter optimization still varies widely across papers, which makes it challenging to compare RL algorithms fairly. In this paper, we show that hyperparameter choices in RL can significantly affect…

    Submitted 2 June, 2023; originally announced June 2023.

  14. arXiv:2305.10964  [pdf, other]

    cs.LG cs.NE

    Learning Activation Functions for Sparse Neural Networks

    Authors: Mohammad Loni, Aditya Mohan, Mehdi Asadi, Marius Lindauer

    Abstract: Sparse Neural Networks (SNNs) can potentially demonstrate similar performance to their dense counterparts while saving significant energy and memory at inference. However, the accuracy drop incurred by SNNs, especially at high pruning ratios, can be an issue in critical deployment conditions. While recent works mitigate this issue through sophisticated pruning techniques, we shift our focus to an…

    Submitted 5 June, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

  15. arXiv:2304.02396  [pdf, other]

    cs.LG cs.AI cs.RO eess.SY

    AutoRL Hyperparameter Landscapes

    Authors: Aditya Mohan, Carolin Benjamins, Konrad Wienecke, Alexander Dockhorn, Marius Lindauer

    Abstract: Although Reinforcement Learning (RL) has shown to be capable of producing impressive results, its use is limited by the impact of its hyperparameters on performance. This often makes it difficult to achieve good results in practice. Automated RL (AutoRL) addresses this difficulty, yet little is known about the dynamics of the hyperparameter landscapes that hyperparameter optimization (HPO) methods…

    Submitted 5 June, 2023; v1 submitted 5 April, 2023; originally announced April 2023.

    Comments: Version updated after acceptance

  16. arXiv:2212.10876  [pdf, other]

    cs.LG

    Hyperparameters in Contextual RL are Highly Situational

    Authors: Theresa Eimer, Carolin Benjamins, Marius Lindauer

    Abstract: Although Reinforcement Learning (RL) has shown impressive results in games and simulation, real-world application of RL suffers from its instability under changing environment conditions and hyperparameters. We give a first impression of the extent of this instability by showing that the hyperparameters found by automatic hyperparameter optimization (HPO) methods are not only dependent on the prob…

    Submitted 21 December, 2022; originally announced December 2022.

  17. arXiv:2211.09678  [pdf, other]

    cs.LG

    Towards Automated Design of Bayesian Optimization via Exploratory Landscape Analysis

    Authors: Carolin Benjamins, Anja Jankovic, Elena Raponi, Koen van der Blom, Marius Lindauer, Carola Doerr

    Abstract: Bayesian optimization (BO) algorithms form a class of surrogate-based heuristics, aimed at efficiently computing high-quality solutions for numerical black-box optimization problems. The BO pipeline is highly modular, with different design choices for the initial sampling strategy, the surrogate model, the acquisition function (AF), the solver used to optimize the AF, etc. We demonstrate in this w…

    Submitted 17 November, 2022; originally announced November 2022.

    Comments: 6th Workshop on Meta-Learning at NeurIPS 2022, New Orleans

  18. arXiv:2211.01455  [pdf, other]

    cs.LG

    PI is back! Switching Acquisition Functions in Bayesian Optimization

    Authors: Carolin Benjamins, Elena Raponi, Anja Jankovic, Koen van der Blom, Maria Laura Santoni, Marius Lindauer, Carola Doerr

    Abstract: Bayesian Optimization (BO) is a powerful, sample-efficient technique to optimize expensive-to-evaluate functions. Each of the BO components, such as the surrogate model, the acquisition function (AF), or the initial design, is subject to a wide range of design choices. Selecting the right components for a given optimization task is a challenging task, which can have significant impact on the quali…

    Submitted 2 November, 2022; originally announced November 2022.

    Comments: 2022 NeurIPS Workshop on Gaussian Processes, Spatiotemporal Modeling, and Decision-making Systems

  19. arXiv:2206.05447  [pdf, other]

    cs.LG stat.ML

    Improving Accuracy of Interpretability Measures in Hyperparameter Optimization via Bayesian Algorithm Execution

    Authors: Julia Moosbauer, Giuseppe Casalicchio, Marius Lindauer, Bernd Bischl

    Abstract: Despite all the benefits of automated hyperparameter optimization (HPO), most modern HPO algorithms are black-boxes themselves. This makes it difficult to understand the decision process which leads to the selected configuration, reduces trust in HPO, and thus hinders its broad adoption. Here, we study the combination of HPO with interpretable machine learning (IML) methods such as partial depende…

    Submitted 12 February, 2023; v1 submitted 11 June, 2022; originally announced June 2022.

  20. arXiv:2206.03493  [pdf, other]

    cs.LG

    DeepCAVE: An Interactive Analysis Tool for Automated Machine Learning

    Authors: René Sass, Eddie Bergman, André Biedenkapp, Frank Hutter, Marius Lindauer

    Abstract: Automated Machine Learning (AutoML) is used more than ever before to support users in determining efficient hyperparameters, neural architectures, or even full machine learning pipelines. However, users tend to mistrust the optimization process and its results due to a lack of transparency, making manual tuning still widespread. We introduce DeepCAVE, an interactive framework to analyze and monito…

    Submitted 11 July, 2022; v1 submitted 7 June, 2022; originally announced June 2022.

    Comments: Workshop on Adaptive Experimental Design and Active Learning in the Real World (ReALML@ICML'22)

  21. arXiv:2206.03130  [pdf, other]

    cs.LG

    Towards Meta-learned Algorithm Selection using Implicit Fidelity Information

    Authors: Aditya Mohan, Tim Ruhkopf, Marius Lindauer

    Abstract: Automatically selecting the best performing algorithm for a given dataset or ranking multiple algorithms by their expected performance supports users in developing new machine learning applications. Most approaches for this problem rely on pre-computed dataset meta-features and landmarking performances to capture the salient topology of the datasets and those topologies that the algorithms attend…

    Submitted 13 July, 2022; v1 submitted 7 June, 2022; originally announced June 2022.

    Comments: Camera-ready version

  22. arXiv:2205.13881  [pdf, other]

    cs.AI cs.LG cs.NE

    Automated Dynamic Algorithm Configuration

    Authors: Steven Adriaensen, André Biedenkapp, Gresa Shala, Noor Awad, Theresa Eimer, Marius Lindauer, Frank Hutter

    Abstract: The performance of an algorithm often critically depends on its parameter configuration. While a variety of automated algorithm configuration methods have been proposed to relieve users from the tedious and error-prone task of manually tuning parameters, there is still a lot of untapped potential as the learned configuration is static, i.e., parameter settings remain fixed throughout the run. Howe…

    Submitted 27 May, 2022; originally announced May 2022.

  23. arXiv:2205.11357  [pdf, other]

    cs.LG cs.RO

    POLTER: Policy Trajectory Ensemble Regularization for Unsupervised Reinforcement Learning

    Authors: Frederik Schubert, Carolin Benjamins, Sebastian Döhler, Bodo Rosenhahn, Marius Lindauer

    Abstract: The goal of Unsupervised Reinforcement Learning (URL) is to find a reward-agnostic prior policy on a task domain, such that the sample-efficiency on supervised downstream tasks is improved. Although agents initialized with such a prior policy can achieve a significantly higher reward with fewer samples when finetuned on the downstream task, it is still an open question how an optimal pretrained pr…

    Submitted 15 December, 2023; v1 submitted 23 May, 2022; originally announced May 2022.

    Journal ref: Transactions on Machine Learning Research, 2023

  24. arXiv:2205.05511  [pdf, other]

    cs.LG

    Efficient Automated Deep Learning for Time Series Forecasting

    Authors: Difan Deng, Florian Karl, Frank Hutter, Bernd Bischl, Marius Lindauer

    Abstract: Recent years have witnessed tremendously improved efficiency of Automated Machine Learning (AutoML), especially Automated Deep Learning (AutoDL) systems, but recent work focuses on tabular, image, or NLP tasks. So far, little attention has been paid to general AutoDL frameworks for time series forecasting, despite the enormous success in applying different novel architectures to such tasks. In thi…

    Submitted 22 July, 2022; v1 submitted 11 May, 2022; originally announced May 2022.

  25. arXiv:2204.11051  [pdf, other]

    cs.LG stat.ML

    $π$BO: Augmenting Acquisition Functions with User Beliefs for Bayesian Optimization

    Authors: Carl Hvarfner, Danny Stoll, Artur Souza, Marius Lindauer, Frank Hutter, Luigi Nardi

    Abstract: Bayesian optimization (BO) has become an established framework and popular tool for hyperparameter optimization (HPO) of machine learning (ML) algorithms. While known for its sample-efficiency, vanilla BO can not utilize readily available prior beliefs the practitioner has on the potential location of the optimum. Thus, BO disregards a valuable source of information, reducing its appeal to ML prac…

    Submitted 23 April, 2022; originally announced April 2022.

    Comments: 9 pages, 4 figures, Accepted as poster for ICLR 2022

  26. arXiv:2203.01717  [pdf, other]

    cs.LG

    Practitioner Motives to Select Hyperparameter Optimization Methods

    Authors: Niklas Hasebrook, Felix Morsbach, Niclas Kannengießer, Marc Zöller, Jörg Franke, Marius Lindauer, Frank Hutter, Ali Sunyaev

    Abstract: Advanced programmatic hyperparameter optimization (HPO) methods, such as Bayesian optimization, have high sample efficiency in reproducibly finding optimal hyperparameter values of machine learning (ML) models. Yet, ML practitioners often apply less sample-efficient HPO methods, such as grid search, which often results in under-optimized ML models. As a reason for this behavior, we suspect practit…

    Submitted 26 June, 2023; v1 submitted 3 March, 2022; originally announced March 2022.

    Comments: submitted to JMLR; currently under review

  27. arXiv:2202.04500  [pdf, other]

    cs.LG

    Contextualize Me -- The Case for Context in Reinforcement Learning

    Authors: Carolin Benjamins, Theresa Eimer, Frederik Schubert, Aditya Mohan, Sebastian Döhler, André Biedenkapp, Bodo Rosenhahn, Frank Hutter, Marius Lindauer

    Abstract: While Reinforcement Learning (RL) has made great strides towards solving increasingly complicated problems, many algorithms are still brittle to even slight environmental changes. Contextual Reinforcement Learning (cRL) provides a framework to model such changes in a principled manner, thereby enabling flexible, precise and interpretable task specification and generation. Our goal is to show how…

    Submitted 2 June, 2023; v1 submitted 9 February, 2022; originally announced February 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2110.02102

  28. Automated Reinforcement Learning (AutoRL): A Survey and Open Problems

    Authors: Jack Parker-Holder, Raghu Rajan, Xingyou Song, André Biedenkapp, Yingjie Miao, Theresa Eimer, Baohe Zhang, Vu Nguyen, Roberto Calandra, Aleksandra Faust, Frank Hutter, Marius Lindauer

    Abstract: The combination of Reinforcement Learning (RL) with deep learning has led to a series of impressive feats, with many believing (deep) RL provides a path towards generally capable agents. However, the success of RL agents is often highly sensitive to design choices in the training process, which may require tedious and error-prone manual tuning. This makes it challenging to use RL for new problems,…

    Submitted 2 June, 2022; v1 submitted 11 January, 2022; originally announced January 2022.

    Comments: Published in JAIR. Co-first authors and co-last authors are listed in alphabetical order

    MSC Class: 68T01; ACM Class: I.2.6

    Journal ref: Journal of Artificial Intelligence Research 74 (2022) 517-568

  29. arXiv:2201.03801  [pdf, other]

    cs.LG cs.AI

    Winning solutions and post-challenge analyses of the ChaLearn AutoDL challenge 2019

    Authors: Zhengying Liu, Adrien Pavao, Zhen Xu, Sergio Escalera, Fabio Ferreira, Isabelle Guyon, Sirui Hong, Frank Hutter, Rongrong Ji, Julio C. S. Jacques Junior, Ge Li, Marius Lindauer, Zhipeng Luo, Meysam Madadi, Thomas Nierhoff, Kangning Niu, Chunguang Pan, Danny Stoll, Sebastien Treguer, ** Wang, Peng Wang, Chenglin Wu, Youcheng Xiong, Arbër Zela, Yang Zhang

    Abstract: This paper reports the results and post-challenge analyses of ChaLearn's AutoDL challenge series, which helped sorting out a profusion of AutoML solutions for Deep Learning (DL) that had been introduced in a variety of settings, but lacked fair comparisons. All input data modalities (time series, images, videos, text, tabular) were formatted as tensors and all tasks were multi-label classification…

    Submitted 11 January, 2022; originally announced January 2022.

    Comments: The first three authors contributed equally; This is only a draft version

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence (T-PAMI) 2021

  30. arXiv:2111.05834  [pdf, other]

    cs.LG stat.ML

    Searching in the Forest for Local Bayesian Optimization

    Authors: Difan Deng, Marius Lindauer

    Abstract: Because of its sample efficiency, Bayesian optimization (BO) has become a popular approach dealing with expensive black-box optimization problems, such as hyperparameter optimization (HPO). Recent empirical experiments showed that the loss landscapes of HPO problems tend to be more benign than previously assumed, i.e. in the best case uni-modal and convex, such that a BO framework could be more ef…

    Submitted 10 November, 2021; originally announced November 2021.

  31. arXiv:2111.04820  [pdf, other]

    cs.LG stat.ML

    Explaining Hyperparameter Optimization via Partial Dependence Plots

    Authors: Julia Moosbauer, Julia Herbinger, Giuseppe Casalicchio, Marius Lindauer, Bernd Bischl

    Abstract: Automated hyperparameter optimization (HPO) can support practitioners to obtain peak performance in machine learning models. However, there is often a lack of valuable insights into the effects of different hyperparameters on the final model performance. This lack of explainability makes it difficult to trust and understand the automated HPO process and its results. We suggest using interpretable…

    Submitted 26 January, 2022; v1 submitted 8 November, 2021; originally announced November 2021.

    Comments: to be published in proceedings of the 35th Conference on Neural Information Processing Systems (NeurIPS 2021); typos corrected, replaced N by N' in formula (6)

  32. arXiv:2110.02102  [pdf, other]

    cs.LG

    CARL: A Benchmark for Contextual and Adaptive Reinforcement Learning

    Authors: Carolin Benjamins, Theresa Eimer, Frederik Schubert, André Biedenkapp, Bodo Rosenhahn, Frank Hutter, Marius Lindauer

    Abstract: While Reinforcement Learning has made great strides towards solving ever more complicated tasks, many algorithms are still brittle to even slight changes in their environment. This is a limiting factor for real-world applications of RL. Although the research community continuously aims at improving both robustness and generalization of RL algorithms, unfortunately it still lacks an open-source set…

    Submitted 11 October, 2021; v1 submitted 5 October, 2021; originally announced October 2021.

    Journal ref: Workshop on Ecological Theory of Reinforcement Learning, NeurIPS 2021

  33. arXiv:2109.09831  [pdf, other]

    cs.LG stat.ML

    SMAC3: A Versatile Bayesian Optimization Package for Hyperparameter Optimization

    Authors: Marius Lindauer, Katharina Eggensperger, Matthias Feurer, André Biedenkapp, Difan Deng, Carolin Benjamins, Tim Ruhkopf, René Sass, Frank Hutter

    Abstract: Algorithm parameters, in particular hyperparameters of machine learning algorithms, can substantially impact their performance. To support users in determining well-performing hyperparameter configurations for their algorithms, datasets and applications at hand, SMAC3 offers a robust and flexible framework for Bayesian Optimization, which can improve performance within a few evaluations. It offers…

    Submitted 8 February, 2022; v1 submitted 20 September, 2021; originally announced September 2021.

    Journal ref: Journal of Machine Learning Research 23 (2022) 1-9

  34. arXiv:2109.06716  [pdf, other]

    cs.LG

    HPOBench: A Collection of Reproducible Multi-Fidelity Benchmark Problems for HPO

    Authors: Katharina Eggensperger, Philipp Müller, Neeratyoy Mallik, Matthias Feurer, René Sass, Aaron Klein, Noor Awad, Marius Lindauer, Frank Hutter

    Abstract: To achieve peak predictive performance, hyperparameter optimization (HPO) is a crucial component of machine learning and its applications. Over the last years, the number of efficient algorithms and tools for HPO grew substantially. At the same time, the community is still lacking realistic, diverse, computationally cheap, and standardized benchmarks. This is especially the case for multi-fidelity…

    Submitted 6 October, 2022; v1 submitted 14 September, 2021; originally announced September 2021.

    Comments: Published at NeurIPS Datasets and Benchmarks Track 2021. Updated version

  35. arXiv:2107.14330  [pdf, ps, other]

    cs.CY cs.LG

    Developing Open Source Educational Resources for Machine Learning and Data Science

    Authors: Ludwig Bothmann, Sven Strickroth, Giuseppe Casalicchio, David Rügamer, Marius Lindauer, Fabian Scheipl, Bernd Bischl

    Abstract: Education should not be a privilege but a common good. It should be openly accessible to everyone, with as few barriers as possible; even more so for key technologies such as Machine Learning (ML) and Data Science (DS). Open Educational Resources (OER) are a crucial factor for greater educational equity. In this paper, we describe the specific requirements for OER in ML and DS and argue that it is…

    Submitted 10 August, 2021; v1 submitted 28 July, 2021; originally announced July 2021.

    Comments: 6 pages

    Journal ref: Proceedings of the Third Teaching Machine Learning and Artificial Intelligence Workshop, PMLR 207:1-6, 2022

  36. arXiv:2107.05847  [pdf, other]

    stat.ML cs.LG

    Hyperparameter Optimization: Foundations, Algorithms, Best Practices and Open Challenges

    Authors: Bernd Bischl, Martin Binder, Michel Lang, Tobias Pielok, Jakob Richter, Stefan Coors, Janek Thomas, Theresa Ullmann, Marc Becker, Anne-Laure Boulesteix, Difan Deng, Marius Lindauer

    Abstract: Most machine learning algorithms are configured by one or several hyperparameters that must be carefully chosen and often considerably impact performance. To avoid a time consuming and unreproducible manual trial-and-error process to find well-performing hyperparameter configurations, various automatic hyperparameter optimization (HPO) methods, e.g., based on resampling error estimation for superv…

    Submitted 24 November, 2021; v1 submitted 13 July, 2021; originally announced July 2021.

  37. arXiv:2106.11189  [pdf, other]

    cs.LG

    Well-tuned Simple Nets Excel on Tabular Datasets

    Authors: Arlind Kadra, Marius Lindauer, Frank Hutter, Josif Grabocka

    Abstract: Tabular datasets are the last "unconquered castle" for deep learning, with traditional ML methods like Gradient-Boosted Decision Trees still performing strongly even against recent specialized neural architectures. In this paper, we hypothesize that the key to boosting the performance of neural networks lies in rethinking the joint and simultaneous application of a large set of modern regularizati…

    Submitted 5 November, 2021; v1 submitted 21 June, 2021; originally announced June 2021.

  38. arXiv:2106.06317  [pdf, other]

    cs.LG

    Automatic Risk Adaptation in Distributional Reinforcement Learning

    Authors: Frederik Schubert, Theresa Eimer, Bodo Rosenhahn, Marius Lindauer

    Abstract: The use of Reinforcement Learning (RL) agents in practical applications requires the consideration of suboptimal outcomes, depending on the familiarity of the agent with its environment. This is especially important in safety-critical environments, where errors can lead to high costs or damage. In distributional RL, the risk-sensitivity can be controlled via different distortion measures of the es…

    Submitted 11 June, 2021; originally announced June 2021.

    Journal ref: Reinforcement Learning for Real Life Workshop, ICML 2021

  39. arXiv:2106.05262  [pdf, other]

    cs.LG

    TempoRL: Learning When to Act

    Authors: André Biedenkapp, Raghu Rajan, Frank Hutter, Marius Lindauer

    Abstract: Reinforcement learning is a powerful approach to learn behaviour through interactions with an environment. However, behaviours are usually learned in a purely reactive fashion, where an appropriate action is selected based on an observation. In this form, it is challenging to learn when it is necessary to execute new decisions. This makes learning inefficient, especially in environments that need…

    Submitted 9 June, 2021; originally announced June 2021.

    Comments: Accepted at ICML'21

  40. arXiv:2106.05110  [pdf, other]

    cs.LG

    Self-Paced Context Evaluation for Contextual Reinforcement Learning

    Authors: Theresa Eimer, André Biedenkapp, Frank Hutter, Marius Lindauer

    Abstract: Reinforcement learning (RL) has made a lot of advances for solving a single problem in a given environment; but learning policies that generalize to unseen variations of a problem remains challenging. To improve sample efficiency for learning on such instances of a problem domain, we present Self-Paced Context Evaluation (SPaCE). Based on self-paced learning, SPaCE automatically generates task cur…

    Submitted 9 June, 2021; originally announced June 2021.

    Journal ref: Proceedings of Machine Learning Research 139 (ICML 2021)

  41. arXiv:2105.08541  [pdf, other]

    cs.AI

    DACBench: A Benchmark Library for Dynamic Algorithm Configuration

    Authors: Theresa Eimer, André Biedenkapp, Maximilian Reimer, Steven Adriaensen, Frank Hutter, Marius Lindauer

    Abstract: Dynamic Algorithm Configuration (DAC) aims to dynamically control a target algorithm's hyperparameters in order to improve its performance. Several theoretical and empirical results have demonstrated the benefits of dynamically controlling hyperparameters in domains like evolutionary computation, AI Planning or deep learning. Replicating these results, as well as studying new methods for DAC, howe…

    Submitted 18 May, 2021; originally announced May 2021.

    Comments: Accepted at IJCAI 2021

    Journal ref: 30th International Joint Conference on Artificial Intelligence (IJCAI 2021)

  42. arXiv:2105.01015  [pdf, other]

    cs.LG cs.AI stat.ML

    Bag of Baselines for Multi-objective Joint Neural Architecture Search and Hyperparameter Optimization

    Authors: Julia Guerrero-Viu, Sven Hauns, Sergio Izquierdo, Guilherme Miotto, Simon Schrodi, Andre Biedenkapp, Thomas Elsken, Difan Deng, Marius Lindauer, Frank Hutter

    Abstract: Neural architecture search (NAS) and hyperparameter optimization (HPO) make deep learning accessible to non-experts by automatically finding the architecture of the deep neural network to use and tuning the hyperparameters of the used training pipeline. While both NAS and HPO have been studied extensively in recent years, NAS methods typically assume fixed hyperparameters and vice versa - there ex…

    Submitted 3 May, 2021; originally announced May 2021.

  43. arXiv:2012.08180  [pdf, ps, other]

    cs.LG cs.NE stat.ML

    Squirrel: A Switching Hyperparameter Optimizer

    Authors: Noor Awad, Gresa Shala, Difan Deng, Neeratyoy Mallik, Matthias Feurer, Katharina Eggensperger, André Biedenkapp, Diederick Vermetten, Hao Wang, Carola Doerr, Marius Lindauer, Frank Hutter

    Abstract: In this short note, we describe our submission to the NeurIPS 2020 BBO challenge. Motivated by the fact that different optimizers work well on different problems, our approach switches between different optimizers. Since the team names on the competition's leaderboard were randomly generated "alliteration nicknames", consisting of an adjective and an animal with the same initial letter, we called…

    Submitted 16 December, 2020; v1 submitted 15 December, 2020; originally announced December 2020.

  44. arXiv:2009.13828  [pdf, other]

    cs.AI cs.LG

    Neural Model-based Optimization with Right-Censored Observations

    Authors: Katharina Eggensperger, Kai Haase, Philipp Müller, Marius Lindauer, Frank Hutter

    Abstract: In many fields of study, we only observe lower bounds on the true response value of some experiments. When fitting a regression model to predict the distribution of the outcomes, we cannot simply drop these right-censored observations, but need to properly model them. In this work, we focus on the concept of censored data in the light of model-based optimization where prematurely terminating evalu…

    Submitted 29 September, 2020; originally announced September 2020.

  45. arXiv:2007.04074  [pdf, other]

    cs.LG stat.ML

    Auto-Sklearn 2.0: Hands-free AutoML via Meta-Learning

    Authors: Matthias Feurer, Katharina Eggensperger, Stefan Falkner, Marius Lindauer, Frank Hutter

    Abstract: Automated Machine Learning (AutoML) supports practitioners and researchers with the tedious task of designing machine learning pipelines and has recently achieved substantial success. In this paper, we introduce new AutoML approaches motivated by our winning submission to the second ChaLearn AutoML challenge. We develop PoSH Auto-sklearn, which enables AutoML systems to work well on large datasets…

    Submitted 4 October, 2022; v1 submitted 8 July, 2020; originally announced July 2020.

    Comments: Final version as published at JMLR 23(261)

    Journal ref: Journal of Machine Learning Research 23(261), 2022

  46. arXiv:2006.14608  [pdf, other]

    cs.LG stat.ML

    Bayesian Optimization with a Prior for the Optimum

    Authors: Artur Souza, Luigi Nardi, Leonardo B. Oliveira, Kunle Olukotun, Marius Lindauer, Frank Hutter

    Abstract: While Bayesian Optimization (BO) is a very popular method for optimizing expensive black-box functions, it fails to leverage the experience of domain experts. This causes BO to waste function evaluations on bad design choices (e.g., machine learning hyperparameters) that the expert already knows to work poorly. To address this issue, we introduce Bayesian Optimization with a Prior for the Optimum…

    Submitted 19 April, 2021; v1 submitted 25 June, 2020; originally announced June 2020.

  47. arXiv:2006.13799  [pdf, other]

    cs.LG cs.AI stat.ML

    Auto-PyTorch Tabular: Multi-Fidelity MetaLearning for Efficient and Robust AutoDL

    Authors: Lucas Zimmer, Marius Lindauer, Frank Hutter

    Abstract: While early AutoML frameworks focused on optimizing traditional ML pipelines and their hyperparameters, a recent trend in AutoML is to focus on neural architecture search. In this paper, we introduce Auto-PyTorch, which brings the best of these two worlds together by jointly and robustly optimizing the architecture of networks and the training hyperparameters to enable fully automated deep learnin…

    Submitted 26 April, 2021; v1 submitted 24 June, 2020; originally announced June 2020.

  48. arXiv:2006.08246  [pdf, other]

    cs.AI cs.LG

    Learning Heuristic Selection with Dynamic Algorithm Configuration

    Authors: David Speck, André Biedenkapp, Frank Hutter, Robert Mattmüller, Marius Lindauer

    Abstract: A key challenge in satisficing planning is to use multiple heuristics within one heuristic search. An aggregation of multiple heuristic estimates, for example by taking the maximum, has the disadvantage that bad estimates of a single heuristic can negatively affect the whole search. Since the performance of a heuristic varies from instance to instance, approaches such as algorithm selection can be…

    Submitted 12 April, 2021; v1 submitted 15 June, 2020; originally announced June 2020.

    Comments: Long version of the paper at the International Conference on Automated Planning and Scheduling (ICAPS) 2021

  49. arXiv:1909.02453  [pdf, other]

    cs.LG stat.ML

    Best Practices for Scientific Research on Neural Architecture Search

    Authors: Marius Lindauer, Frank Hutter

    Abstract: Finding a well-performing architecture is often tedious for both DL practitioners and researchers, leading to tremendous interest in the automation of this task by means of neural architecture search (NAS). Although the community has made major strides in developing better NAS methods, the quality of scientific empirical evaluations in the young field of NAS is still lagging behind that of other a…

    Submitted 3 November, 2020; v1 submitted 5 September, 2019; originally announced September 2019.

  50. arXiv:1908.06756  [pdf, other]

    cs.LG cs.AI stat.ML

    BOAH: A Tool Suite for Multi-Fidelity Bayesian Optimization & Analysis of Hyperparameters

    Authors: Marius Lindauer, Katharina Eggensperger, Matthias Feurer, André Biedenkapp, Joshua Marben, Philipp Müller, Frank Hutter

    Abstract: Hyperparameter optimization and neural architecture search can become prohibitively expensive for regular black-box Bayesian optimization because the training and evaluation of a single model can easily take several hours. To overcome this, we introduce a comprehensive tool suite for effective multi-fidelity Bayesian optimization and the analysis of its runs. The suite, written in Python, provides…

    Submitted 16 August, 2019; originally announced August 2019.