Skip to main content

Showing 1–12 of 12 results for author: Falkner, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2312.00742  [pdf, other

    stat.ML cs.AI cs.LG

    Scalable Meta-Learning with Gaussian Processes

    Authors: Petru Tighineanu, Lukas Grossberger, Paul Baireuther, Kathrin Skubch, Stefan Falkner, Julia Vinogradska, Felix Berkenkamp

    Abstract: Meta-learning is a powerful approach that exploits historical data to quickly solve new tasks from the same distribution. In the low-data regime, methods based on the closed-form posterior of Gaussian processes (GP) together with Bayesian optimization have achieved high performance. However, these methods are either computationally expensive or introduce assumptions that hinder a principled propag… ▽ More

    Submitted 1 December, 2023; originally announced December 2023.

  2. arXiv:2307.03565  [pdf, other

    cs.LG stat.ML

    MALIBO: Meta-learning for Likelihood-free Bayesian Optimization

    Authors: Jiarong Pan, Stefan Falkner, Felix Berkenkamp, Joaquin Vanschoren

    Abstract: Bayesian optimization (BO) is a popular method to optimize costly black-box functions. While traditional BO optimizes each new target task from scratch, meta-learning has emerged as a way to leverage knowledge from related tasks to optimize new tasks faster. However, existing meta-learning BO methods rely on surrogate models that suffer from scalability issues and are sensitive to observations wit… ▽ More

    Submitted 28 June, 2024; v1 submitted 7 July, 2023; originally announced July 2023.

  3. arXiv:2302.08757  [pdf, other

    physics.comp-ph cond-mat.stat-mech cs.LG

    Enhanced Sampling of Configuration and Path Space in a Generalized Ensemble by Shooting Point Exchange

    Authors: Sebastian Falkner, Alessandro Coretti, Christoph Dellago

    Abstract: The computer simulation of many molecular processes is complicated by long time scales caused by rare transitions between long-lived states. Here, we propose a new approach to simulate such rare events, which combines transition path sampling with enhanced exploration of configuration space. The method relies on exchange moves between configuration and trajectory space, carried out based on a gene… ▽ More

    Submitted 22 March, 2023; v1 submitted 17 February, 2023; originally announced February 2023.

    Comments: Added Supplementary Information for simulation details and network parameters

  4. arXiv:2207.14530  [pdf, other

    physics.comp-ph cond-mat.stat-mech cs.LG

    Conditioning Normalizing Flows for Rare Event Sampling

    Authors: Sebastian Falkner, Alessandro Coretti, Salvatore Romano, Phillip Geissler, Christoph Dellago

    Abstract: Understanding the dynamics of complex molecular processes is often linked to the study of infrequent transitions between long-lived stable states. The standard approach to the sampling of such rare events is to generate an ensemble of transition paths using a random walk in trajectory space. This, however, comes with the drawback of strong correlations between subsequently sampled paths and with a… ▽ More

    Submitted 19 May, 2023; v1 submitted 29 July, 2022; originally announced July 2022.

  5. arXiv:2007.04074  [pdf, other

    cs.LG stat.ML

    Auto-Sklearn 2.0: Hands-free AutoML via Meta-Learning

    Authors: Matthias Feurer, Katharina Eggensperger, Stefan Falkner, Marius Lindauer, Frank Hutter

    Abstract: Automated Machine Learning (AutoML) supports practitioners and researchers with the tedious task of designing machine learning pipelines and has recently achieved substantial success. In this paper, we introduce new AutoML approaches motivated by our winning submission to the second ChaLearn AutoML challenge. We develop PoSH Auto-sklearn, which enables AutoML systems to work well on large datasets… ▽ More

    Submitted 4 October, 2022; v1 submitted 8 July, 2020; originally announced July 2020.

    Comments: Final version as published at JMLR 23(261)

    Journal ref: Journal of Machine Learning Research 23(261), 2022

  6. arXiv:1910.04522  [pdf, other

    cs.LG stat.ML

    Probabilistic Rollouts for Learning Curve Extrapolation Across Hyperparameter Settings

    Authors: Matilde Gargiani, Aaron Klein, Stefan Falkner, Frank Hutter

    Abstract: We propose probabilistic models that can extrapolate learning curves of iterative machine learning algorithms, such as stochastic gradient descent for training deep networks, based on training data with variable-length learning curves. We study instantiations of this framework based on random forests and Bayesian recurrent neural networks. Our experiments show that these models yield better predic… ▽ More

    Submitted 10 October, 2019; originally announced October 2019.

  7. arXiv:1904.02642  [pdf, other

    stat.ML cs.AI cs.LG

    Meta-Learning Acquisition Functions for Transfer Learning in Bayesian Optimization

    Authors: Michael Volpp, Lukas P. Fröhlich, Kirsten Fischer, Andreas Doerr, Stefan Falkner, Frank Hutter, Christian Daniel

    Abstract: Transferring knowledge across tasks to improve data-efficiency is one of the open key challenges in the field of global black-box optimization. Readily available algorithms are typically designed to be universal optimizers and, therefore, often suboptimal for specific tasks. We propose a novel transfer learning method to obtain customized optimizers within the well-established framework of Bayesia… ▽ More

    Submitted 14 February, 2020; v1 submitted 4 April, 2019; originally announced April 2019.

  8. arXiv:1812.11951  [pdf, other

    cs.LG q-bio.QM stat.ML

    Learning to Design RNA

    Authors: Frederic Runge, Danny Stoll, Stefan Falkner, Frank Hutter

    Abstract: Designing RNA molecules has garnered recent interest in medicine, synthetic biology, biotechnology and bioinformatics since many functional RNA molecules were shown to be involved in regulatory processes for transcription, epigenetics and translation. Since an RNA's function depends on its structural properties, the RNA Design problem is to find an RNA sequence which satisfies given structural con… ▽ More

    Submitted 12 April, 2019; v1 submitted 31 December, 2018; originally announced December 2018.

    Comments: Published as a conference paper at the International Conference on Learning Representations 2019 (ICLR19)

  9. arXiv:1807.06906  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Towards Automated Deep Learning: Efficient Joint Neural Architecture and Hyperparameter Search

    Authors: Arber Zela, Aaron Klein, Stefan Falkner, Frank Hutter

    Abstract: While existing work on neural architecture search (NAS) tunes hyperparameters in a separate post-processing step, we demonstrate that architectural choices and other hyperparameter settings interact in a way that can render this separation suboptimal. Likewise, we demonstrate that the common practice of using very few epochs during the main NAS and much larger numbers of epochs during a post-proce… ▽ More

    Submitted 18 July, 2018; originally announced July 2018.

    Comments: 11 pages, 3 figures, 3 tables, ICML 2018 AutoML Workshop

    Journal ref: ICML 2018 AutoML Workshop

  10. arXiv:1807.01774  [pdf, other

    cs.LG stat.ML

    BOHB: Robust and Efficient Hyperparameter Optimization at Scale

    Authors: Stefan Falkner, Aaron Klein, Frank Hutter

    Abstract: Modern deep learning methods are very sensitive to many hyperparameters, and, due to the long training times of state-of-the-art models, vanilla Bayesian hyperparameter optimization is typically computationally infeasible. On the other hand, bandit-based configuration evaluation approaches based on random search lack guidance and do not converge to the best configurations as quickly. Here, we prop… ▽ More

    Submitted 4 July, 2018; originally announced July 2018.

    Comments: published at ICML2018

  11. arXiv:1612.00767  [pdf, other

    stat.ML cs.AI cs.LG

    Asynchronous Stochastic Gradient MCMC with Elastic Coupling

    Authors: Jost Tobias Springenberg, Aaron Klein, Stefan Falkner, Frank Hutter

    Abstract: We consider parallel asynchronous Markov Chain Monte Carlo (MCMC) sampling for problems where we can leverage (stochastic) gradients to define continuous dynamics which explore the target distribution. We outline a solution strategy for this setting based on stochastic gradient Hamiltonian Monte Carlo sampling (SGHMC) which we alter to include an elastic coupling term that ties together multiple M… ▽ More

    Submitted 8 December, 2016; v1 submitted 2 December, 2016; originally announced December 2016.

  12. arXiv:1605.07079  [pdf, other

    cs.LG cs.AI stat.ML

    Fast Bayesian Optimization of Machine Learning Hyperparameters on Large Datasets

    Authors: Aaron Klein, Stefan Falkner, Simon Bartels, Philipp Hennig, Frank Hutter

    Abstract: Bayesian optimization has become a successful tool for hyperparameter optimization of machine learning algorithms, such as support vector machines or deep neural networks. Despite its success, for large datasets, training and validating a single configuration often takes hours, days, or even weeks, which limits the achievable performance. To accelerate hyperparameter optimization, we propose a gen… ▽ More

    Submitted 7 March, 2017; v1 submitted 23 May, 2016; originally announced May 2016.