Skip to main content

Showing 1–4 of 4 results for author: Tiedemann, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2011.12043  [pdf, other

    cs.LG cs.AI cs.NE

    Efficient Sampling for Predictor-Based Neural Architecture Search

    Authors: Lukas Mauch, Stephen Tiedemann, Javier Alonso Garcia, Bac Nguyen Cong, Kazuki Yoshiyama, Fabien Cardinaux, Thomas Kemp

    Abstract: Recently, predictor-based algorithms emerged as a promising approach for neural architecture search (NAS). For NAS, we typically have to calculate the validation accuracy of a large number of Deep Neural Networks (DNNs), what is computationally complex. Predictor-based NAS algorithms address this problem. They train a proxy model that can infer the validation accuracy of DNNs directly from their n… ▽ More

    Submitted 24 November, 2020; originally announced November 2020.

  2. Iteratively Training Look-Up Tables for Network Quantization

    Authors: Fabien Cardinaux, Stefan Uhlich, Kazuki Yoshiyama, Javier Alonso Garcia, Lukas Mauch, Stephen Tiedemann, Thomas Kemp, Akira Nakamura

    Abstract: Operating deep neural networks (DNNs) on devices with limited resources requires the reduction of their memory as well as computational footprint. Popular reduction methods are network quantization or pruning, which either reduce the word length of the network parameters or remove weights from the network if they are not needed. In this article we discuss a general framework for network reduction… ▽ More

    Submitted 12 November, 2019; originally announced November 2019.

    Comments: Copyright 2019 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

  3. arXiv:1905.11452  [pdf

    cs.LG cs.CV stat.ML

    Mixed Precision DNNs: All you need is a good parametrization

    Authors: Stefan Uhlich, Lukas Mauch, Fabien Cardinaux, Kazuki Yoshiyama, Javier Alonso Garcia, Stephen Tiedemann, Thomas Kemp, Akira Nakamura

    Abstract: Efficient deep neural network (DNN) inference on mobile or embedded devices typically involves quantization of the network parameters and activations. In particular, mixed precision networks achieve better performance than networks with homogeneous bitwidth for the same size constraint. Since choosing the optimal bitwidths is not straight forward, training methods, which can learn them, are desira… ▽ More

    Submitted 22 May, 2020; v1 submitted 27 May, 2019; originally announced May 2019.

    Comments: International Conference on Learning Representations (ICLR) 2020; Source code at https://github.com/sony/ai-research-code

  4. arXiv:1811.05355  [pdf, ps, other

    cs.LG stat.ML

    Iteratively Training Look-Up Tables for Network Quantization

    Authors: Fabien Cardinaux, Stefan Uhlich, Kazuki Yoshiyama, Javier Alonso GarcĂ­a, Stephen Tiedemann, Thomas Kemp, Akira Nakamura

    Abstract: Operating deep neural networks on devices with limited resources requires the reduction of their memory footprints and computational requirements. In this paper we introduce a training method, called look-up table quantization, LUT-Q, which learns a dictionary and assigns each weight to one of the dictionary's values. We show that this method is very flexible and that many other techniques can be… ▽ More

    Submitted 13 November, 2018; originally announced November 2018.

    Comments: NIPS 2018 workshop on Compact Deep Neural Networks with industrial applications