Showing 1–11 of 11 results for author: Elsken, T

Searching in archive cs.
  1. arXiv:2301.08727

    cs.LG cs.AI stat.ML

    Neural Architecture Search: Insights from 1000 Papers

    Authors: Colin White, Mahmoud Safari, Rhea Sukthanker, Binxin Ru, Thomas Elsken, Arber Zela, Debadeepta Dey, Frank Hutter

    Abstract: In the past decade, advances in deep learning have resulted in breakthroughs in a variety of areas, including computer vision, natural language understanding, speech recognition, and reinforcement learning. Specialized, high-performing neural architectures are crucial to the success of deep learning in these areas. Neural architecture search (NAS), the process of automating the design of neural ar…

    Submitted 25 January, 2023; v1 submitted 20 January, 2023; originally announced January 2023.

  2. arXiv:2202.07242

    cs.CV cs.LG

    Neural Architecture Search for Dense Prediction Tasks in Computer Vision

    Authors: Thomas Elsken, Arber Zela, Jan Hendrik Metzen, Benedikt Staffler, Thomas Brox, Abhinav Valada, Frank Hutter

    Abstract: The success of deep learning in recent years has led to a rising demand for neural network architecture engineering. As a consequence, neural architecture search (NAS), which aims at automatically designing neural network architectures in a data-driven manner rather than manually, has evolved as a popular field of research. With the advent of weight sharing strategies across architectures, NAS ha…

    Submitted 15 February, 2022; originally announced February 2022.

  3. arXiv:2107.03719

    cs.LG cs.AI stat.ML

    Bag of Tricks for Neural Architecture Search

    Authors: Thomas Elsken, Benedikt Staffler, Arber Zela, Jan Hendrik Metzen, Frank Hutter

    Abstract: While neural architecture search methods have been successful in previous years and led to new state-of-the-art performance on various problems, they have also been criticized for being unstable, being highly sensitive with respect to their hyperparameters, and often not performing better than random search. To shed some light on this issue, we discuss some practical considerations that help impro…

    Submitted 8 July, 2021; originally announced July 2021.

  4. arXiv:2105.01015

    cs.LG cs.AI stat.ML

    Bag of Baselines for Multi-objective Joint Neural Architecture Search and Hyperparameter Optimization

    Authors: Julia Guerrero-Viu, Sven Hauns, Sergio Izquierdo, Guilherme Miotto, Simon Schrodi, Andre Biedenkapp, Thomas Elsken, Difan Deng, Marius Lindauer, Frank Hutter

    Abstract: Neural architecture search (NAS) and hyperparameter optimization (HPO) make deep learning accessible to non-experts by automatically finding the architecture of the deep neural network to use and tuning the hyperparameters of the used training pipeline. While both NAS and HPO have been studied extensively in recent years, NAS methods typically assume fixed hyperparameters and vice versa - there ex…

    Submitted 3 May, 2021; originally announced May 2021.

  5. arXiv:2006.08573

    cs.LG stat.ML

    Neural Ensemble Search for Uncertainty Estimation and Dataset Shift

    Authors: Sheheryar Zaidi, Arber Zela, Thomas Elsken, Chris Holmes, Frank Hutter, Yee Whye Teh

    Abstract: Ensembles of neural networks achieve superior performance compared to stand-alone networks in terms of accuracy, uncertainty calibration and robustness to dataset shift. Deep ensembles, a state-of-the-art method for uncertainty estimation, only ensemble random initializations of a fixed architecture. Instead, we propose two methods for automatically constructing ensembles with …

    Submitted 21 February, 2022; v1 submitted 15 June, 2020; originally announced June 2020.

    Comments: Accepted at NeurIPS 2021; earlier version of this work was accepted for oral presentation at ICML 2020 Workshop on Uncertainty & Robustness in Deep Learning

  6. arXiv:1911.11090

    cs.LG stat.ML

    Meta-Learning of Neural Architectures for Few-Shot Learning

    Authors: Thomas Elsken, Benedikt Staffler, Jan Hendrik Metzen, Frank Hutter

    Abstract: The recent progress in neural architecture search (NAS) has allowed scaling the automated design of neural architectures to real-world domains, such as object detection and semantic segmentation. However, one prerequisite for the application of NAS are large amounts of labeled data and compute resources. This renders its application challenging in few-shot learning scenarios, where many related ta…

    Submitted 14 June, 2021; v1 submitted 25 November, 2019; originally announced November 2019.

    Journal ref: 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

  7. arXiv:1909.13844

    cs.LG stat.ML

    Automated design of error-resilient and hardware-efficient deep neural networks

    Authors: Christoph Schorn, Thomas Elsken, Sebastian Vogel, Armin Runge, Andre Guntoro, Gerd Ascheid

    Abstract: Applying deep neural networks (DNNs) in mobile and safety-critical systems, such as autonomous vehicles, demands a reliable and efficient execution on hardware. Optimized dedicated hardware accelerators are being developed to achieve this. However, the design of efficient and reliable hardware has become increasingly difficult, due to the increased complexity of modern integrated circuit technolog…

    Submitted 30 September, 2019; originally announced September 2019.

  8. arXiv:1909.09656

    cs.LG cs.AI cs.CV stat.ML

    Understanding and Robustifying Differentiable Architecture Search

    Authors: Arber Zela, Thomas Elsken, Tonmoy Saikia, Yassine Marrakchi, Thomas Brox, Frank Hutter

    Abstract: Differentiable Architecture Search (DARTS) has attracted a lot of attention due to its simplicity and small search costs achieved by a continuous relaxation and an approximation of the resulting bi-level optimization problem. However, DARTS does not work robustly for new problems: we identify a wide range of search spaces for which DARTS yields degenerate architectures with very poor test performa…

    Submitted 28 January, 2020; v1 submitted 20 September, 2019; originally announced September 2019.

    Comments: In: International Conference on Learning Representations (ICLR 2020); 28 pages, 30 figures

  9. arXiv:1808.05377

    stat.ML cs.LG cs.NE

    Neural Architecture Search: A Survey

    Authors: Thomas Elsken, Jan Hendrik Metzen, Frank Hutter

    Abstract: Deep Learning has enabled remarkable progress over the last years on a variety of tasks, such as image recognition, speech recognition, and machine translation. One crucial aspect for this progress are novel neural architectures. Currently employed architectures have mostly been developed manually by human experts, which is a time-consuming and error-prone process. Because of this, there is growin…

    Submitted 26 April, 2019; v1 submitted 16 August, 2018; originally announced August 2018.

    Journal ref: Journal of Machine Learning Research 20 (2019) 1-21

  10. arXiv:1804.09081

    stat.ML cs.LG

    Efficient Multi-objective Neural Architecture Search via Lamarckian Evolution

    Authors: Thomas Elsken, Jan Hendrik Metzen, Frank Hutter

    Abstract: Neural Architecture Search aims at automatically finding neural architectures that are competitive with architectures designed by human experts. While recent approaches have achieved state-of-the-art predictive performance for image recognition, they are problematic under resource constraints for two reasons: (1) the neural architectures found are solely optimized for high predictive performance, w…

    Submitted 26 February, 2019; v1 submitted 24 April, 2018; originally announced April 2018.

    Comments: Published as a conference paper at ICLR, International Conference on Learning Representations, 2019

  11. arXiv:1711.04528

    stat.ML cs.AI cs.LG

    Simple And Efficient Architecture Search for Convolutional Neural Networks

    Authors: Thomas Elsken, Jan-Hendrik Metzen, Frank Hutter

    Abstract: Neural networks have recently had a lot of success for many tasks. However, neural network architectures that perform well are still typically designed manually by experts in a cumbersome trial-and-error process. We propose a new method to automatically search for well-performing CNN architectures based on a simple hill climbing procedure whose operators apply network morphisms, followed by short…

    Submitted 13 November, 2017; originally announced November 2017.

    Comments: Under review as a conference paper at ICLR 2018