Skip to main content

Showing 1–4 of 4 results for author: de Putter, F

.
  1. arXiv:2301.11810  [pdf, other

    cs.LG cs.CV

    BOMP-NAS: Bayesian Optimization Mixed Precision NAS

    Authors: David van Son, Floran de Putter, Sebastian Vogel, Henk Corporaal

    Abstract: Bayesian Optimization Mixed-Precision Neural Architecture Search (BOMP-NAS) is an approach to quantization-aware neural architecture search (QA-NAS) that leverages both Bayesian optimization (BO) and mixed-precision quantization (MP) to efficiently search for compact, high performance deep neural networks. The results show that integrating quantization-aware fine-tuning (QAFT) into the NAS loop is… ▽ More

    Submitted 27 January, 2023; originally announced January 2023.

  2. arXiv:2211.11331  [pdf, other

    cs.AR

    BrainTTA: A 35 fJ/op Compiler Programmable Mixed-Precision Transport-Triggered NN SoC

    Authors: Maarten Molendijk, Floran de Putter, Manil Gomony, Pekka Jääskeläinen, Henk Corporaal

    Abstract: Recently, accelerators for extremely quantized deep neural network (DNN) inference with operand widths as low as 1-bit have gained popularity due to their ability to largely cut down energy cost per inference. In this paper, a flexible SoC with mixed-precision support is presented. Contrary to the current trend of fixed-datapath accelerators, this architecture makes use of a flexible datapath base… ▽ More

    Submitted 21 November, 2022; originally announced November 2022.

  3. arXiv:2206.12358  [pdf, other

    cs.AR

    Low- and Mixed-Precision Inference Accelerators

    Authors: Maarten Molendijk, Floran de Putter, Henk Corporaal

    Abstract: With the surging popularity of edge computing, the need to efficiently perform neural network inference on battery-constrained IoT devices has greatly increased. While algorithmic developments enable neural networks to solve increasingly more complex tasks, the deployment of these networks on edge devices can be problematic due to the stringent energy, latency, and memory requirements. One way to… ▽ More

    Submitted 24 June, 2022; originally announced June 2022.

  4. How to train accurate BNNs for embedded systems?

    Authors: Floran de Putter, Henk Corporaal

    Abstract: A key enabler of deploying convolutional neural networks on resource-constrained embedded systems is the binary neural network (BNN). BNNs save on memory and simplify computation by binarizing both features and weights. Unfortunately, binarization is inevitably accompanied by a severe decrease in accuracy. To reduce the accuracy gap between binary and full-precision networks, many repair methods h… ▽ More

    Submitted 24 June, 2022; originally announced June 2022.

    Journal ref: Embedded Machine Learning for Cyber-Physical, IoT, and Edge Computing (2023)