Skip to main content

Showing 1–8 of 8 results for author: Waschneck, B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.07958  [pdf, other

    cs.LG cs.AI

    Temporal Decisions: Leveraging Temporal Correlation for Efficient Decisions in Early Exit Neural Networks

    Authors: Max Sponner, Lorenzo Servadei, Bernd Waschneck, Robert Wille, Akash Kumar

    Abstract: Deep Learning is becoming increasingly relevant in Embedded and Internet-of-things applications. However, deploying models on embedded devices poses a challenge due to their resource limitations. This can impact the model's inference accuracy and latency. One potential solution are Early Exit Neural Networks, which adjust model depth dynamically through additional classifiers attached between thei… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

  2. arXiv:2403.07957  [pdf, other

    cs.LG cs.AI

    Efficient Post-Training Augmentation for Adaptive Inference in Heterogeneous and Distributed IoT Environments

    Authors: Max Sponner, Lorenzo Servadei, Bernd Waschneck, Robert Wille, Akash Kumar

    Abstract: Early Exit Neural Networks (EENNs) present a solution to enhance the efficiency of neural network deployments. However, creating EENNs is challenging and requires specialized domain knowledge, due to the large amount of additional design choices. To address this issue, we propose an automated augmentation flow that focuses on converting an existing model into an EENN. It performs all required desi… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

  3. arXiv:2309.05686  [pdf, other

    cs.LG cs.NI eess.SP

    Temporal Patience: Efficient Adaptive Deep Learning for Embedded Radar Data Processing

    Authors: Max Sponner, Julius Ott, Lorenzo Servadei, Bernd Waschneck, Robert Wille, Akash Kumar

    Abstract: Radar sensors offer power-efficient solutions for always-on smart devices, but processing the data streams on resource-constrained embedded platforms remains challenging. This paper presents novel techniques that leverage the temporal correlation present in streaming radar data to enhance the efficiency of Early Exit Neural Networks for Deep Learning inference on embedded devices. These networks a… ▽ More

    Submitted 11 September, 2023; originally announced September 2023.

    Comments: CODAI 2023 Workshop Submission

  4. Convolutional Neural Networks Quantization with Attention

    Authors: Binyi Wu, Bernd Waschneck, Christian Georg Mayr

    Abstract: It has been proven that, compared to using 32-bit floating-point numbers in the training phase, Deep Convolutional Neural Networks (DCNNs) can operate with low precision during inference, thereby saving memory space and power consumption. However, quantizing networks is always accompanied by an accuracy decrease. Here, we propose a method, double-stage Squeeze-and-Threshold (double-stage ST). It u… ▽ More

    Submitted 30 September, 2022; originally announced September 2022.

    Comments: Preprint of an article published in International Journal of Neural Systems, [10.1142/S0129065722500514] \c{opyright} [copyright World Scientific Publishing Company] [https://www.worldscientific.com/doi/10.1142/S0129065722500514]

  5. arXiv:2208.07265  [pdf, other

    cs.LG

    Combining Gradients and Probabilities for Heterogeneous Approximation of Neural Networks

    Authors: Elias Trommer, Bernd Waschneck, Akash Kumar

    Abstract: This work explores the search for heterogeneous approximate multiplier configurations for neural networks that produce high accuracy and low energy consumption. We discuss the validity of additive Gaussian noise added to accurate neural network computations as a surrogate model for behavioral simulation of approximate multipliers. The continuous and differentiable properties of the solution space… ▽ More

    Submitted 15 August, 2022; originally announced August 2022.

    Comments: Accepted at International Conference on Computer-Aided Design (ICCAD) 2022

  6. arXiv:2111.12345  [pdf, other

    cs.DS

    dCSR: A Memory-Efficient Sparse Matrix Representation for Parallel Neural Network Inference

    Authors: Elias Trommer, Bernd Waschneck, Akash Kumar

    Abstract: Reducing the memory footprint of neural networks is a crucial prerequisite for deploying them in small and low-cost embedded devices. Network parameters can often be reduced significantly through pruning. We discuss how to best represent the indexing overhead of sparse networks for the coming generation of Single Instruction, Multiple Data (SIMD)-capable microcontrollers. From this, we develop Del… ▽ More

    Submitted 24 November, 2021; originally announced November 2021.

    Comments: Accepted at International Conference on Computer-Aided Design (ICCAD) 2021

  7. arXiv:2104.04576  [pdf, other

    cs.PL cs.LG

    Compiler Toolchains for Deep Learning Workloads on Embedded Platforms

    Authors: Max Sponner, Bernd Waschneck, Akash Kumar

    Abstract: As the usage of deep learning becomes increasingly popular in mobile and embedded solutions, it is necessary to convert the framework-specific network representations into executable code for these embedded platforms. This paper consists of two parts: The first section is made up of a survey and benchmark of the available open source deep learning compiler toolchains, which focus on the capabiliti… ▽ More

    Submitted 8 March, 2021; originally announced April 2021.

    Comments: tinyML 2021 conference

    ACM Class: I.2.0

  8. arXiv:1911.02086  [pdf, other

    eess.AS cs.CL cs.SD

    Small-Footprint Keyword Spotting on Raw Audio Data with Sinc-Convolutions

    Authors: Simon Mittermaier, Ludwig Kürzinger, Bernd Waschneck, Gerhard Rigoll

    Abstract: Keyword Spotting (KWS) enables speech-based user interaction on smart devices. Always-on and battery-powered application scenarios for smart devices put constraints on hardware resources and power consumption, while also demanding high accuracy as well as real-time capability. Previous architectures first extracted acoustic features and then applied a neural network to classify keyword probabiliti… ▽ More

    Submitted 3 May, 2020; v1 submitted 5 November, 2019; originally announced November 2019.

    Comments: Accepted at ICASSP 2020