Skip to main content

Showing 1–4 of 4 results for author: Staffler, B

Searching in archive stat. Search in all archives.
.
  1. arXiv:2402.18213  [pdf, other

    cs.LG cs.CV stat.ML

    Multi-objective Differentiable Neural Architecture Search

    Authors: Rhea Sanjay Sukthanker, Arber Zela, Benedikt Staffler, Samuel Dooley, Josif Grabocka, Frank Hutter

    Abstract: Pareto front profiling in multi-objective optimization (MOO), i.e. finding a diverse set of Pareto optimal solutions, is challenging, especially with expensive objectives like neural network training. Typically, in MOO neural architecture search (NAS), we aim to balance performance and hardware metrics across devices. Prior NAS approaches simplify this task by incorporating hardware constraints in… ▽ More

    Submitted 19 June, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

    Comments: 37 pages, 27 figures

  2. arXiv:2107.03719  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Bag of Tricks for Neural Architecture Search

    Authors: Thomas Elsken, Benedikt Staffler, Arber Zela, Jan Hendrik Metzen, Frank Hutter

    Abstract: While neural architecture search methods have been successful in previous years and led to new state-of-the-art performance on various problems, they have also been criticized for being unstable, being highly sensitive with respect to their hyperparameters, and often not performing better than random search. To shed some light on this issue, we discuss some practical considerations that help impro… ▽ More

    Submitted 8 July, 2021; originally announced July 2021.

  3. arXiv:2008.10293  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Bosch Deep Learning Hardware Benchmark

    Authors: Armin Runge, Thomas Wenzel, Dimitrios Bariamis, Benedikt Sebastian Staffler, Lucas Rego Drumond, Michael Pfeiffer

    Abstract: The widespread use of Deep Learning (DL) applications in science and industry has created a large demand for efficient inference systems. This has resulted in a rapid increase of available Hardware Accelerators (HWAs) making comparison challenging and laborious. To address this, several DL hardware benchmarks have been proposed aiming at a comprehensive comparison for many models, tasks, and hardw… ▽ More

    Submitted 24 August, 2020; originally announced August 2020.

    Comments: Presented in MLBench: Workshop on Benchmarking Machine Learning Workloads (https://sites.google.com/g.harvard.edu/mlbench/home)

  4. arXiv:1911.11090  [pdf, other

    cs.LG stat.ML

    Meta-Learning of Neural Architectures for Few-Shot Learning

    Authors: Thomas Elsken, Benedikt Staffler, Jan Hendrik Metzen, Frank Hutter

    Abstract: The recent progress in neural architecture search (NAS) has allowed scaling the automated design of neural architectures to real-world domains, such as object detection and semantic segmentation. However, one prerequisite for the application of NAS are large amounts of labeled data and compute resources. This renders its application challenging in few-shot learning scenarios, where many related ta… ▽ More

    Submitted 14 June, 2021; v1 submitted 25 November, 2019; originally announced November 2019.

    Journal ref: 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)