Skip to main content

Showing 1–9 of 9 results for author: Wilton, S

.
  1. arXiv:2407.04964  [pdf, other

    cs.AI

    ZOBNN: Zero-Overhead Dependable Design of Binary Neural Networks with Deliberately Quantized Parameters

    Authors: Behnam Ghavami, Mohammad Shahidzadeh, Lesley Shannon, Steve Wilton

    Abstract: Low-precision weights and activations in deep neural networks (DNNs) outperform their full-precision counterparts in terms of hardware efficiency. When implemented with low-precision operations, specifically in the extreme case where network parameters are binarized (i.e. BNNs), the two most frequently mentioned benefits of quantization are reduced memory consumption and a faster inference process… ▽ More

    Submitted 6 July, 2024; originally announced July 2024.

  2. arXiv:2404.12599  [pdf, other

    cs.LG cs.CV

    QUTE: Quantifying Uncertainty in TinyML models with Early-exit-assisted ensembles

    Authors: Nikhil P Ghanathe, Steve Wilton

    Abstract: Existing methods for uncertainty quantification incur massive memory and compute overhead, often requiring multiple models/inferences. Hence they are impractical on ultra-low-power KB-sized TinyML devices. To reduce overhead, prior works have proposed the use of early-exit networks as ensembles to quantify uncertainty in a single forward-pass. However, they still have a prohibitive cost for tinyML… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

  3. arXiv:2404.02947  [pdf, other

    cs.LG cs.AI

    DNN Memory Footprint Reduction via Post-Training Intra-Layer Multi-Precision Quantization

    Authors: Behnam Ghavami, Amin Kamjoo, Lesley Shannon, Steve Wilton

    Abstract: The imperative to deploy Deep Neural Network (DNN) models on resource-constrained edge devices, spurred by privacy concerns, has become increasingly apparent. To facilitate the transition from cloud to edge computing, this paper introduces a technique that effectively reduces the memory footprint of DNNs, accommodating the limitations of resource-constrained edge devices while preserving model acc… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: The 25th International Symposium on Quality Electronic Design (ISQED'24)

  4. arXiv:2207.06613  [pdf, other

    cs.LG cs.CV eess.IV

    T-RECX: Tiny-Resource Efficient Convolutional neural networks with early-eXit

    Authors: Nikhil P Ghanathe, Steve Wilton

    Abstract: Deploying Machine learning (ML) on milliwatt-scale edge devices (tinyML) is gaining popularity due to recent breakthroughs in ML and Internet of Things (IoT). Most tinyML research focuses on model compression techniques that trade accuracy (and model capacity) for compact models to fit into the KB-sized tiny-edge devices. In this paper, we show how such models can be enhanced by the addition of an… ▽ More

    Submitted 26 April, 2023; v1 submitted 13 July, 2022; originally announced July 2022.

    Comments: Accepted at 20th ACM International Conference on Computing Frontiers

  5. arXiv:2107.03653  [pdf, ps, other

    cs.AR cs.DC cs.LG cs.PL

    MAFIA: Machine Learning Acceleration on FPGAs for IoT Applications

    Authors: Nikhil Pratap Ghanathe, Vivek Seshadri, Rahul Sharma, Steve Wilton, Aayan Kumar

    Abstract: Recent breakthroughs in ML have produced new classes of models that allow ML inference to run directly on milliwatt-powered IoT devices. On one hand, existing ML-to-FPGA compilers are designed for deep neural-networks on large FPGAs. On the other hand, general-purpose HLS tools fail to exploit properties specific to ML inference, thereby resulting in suboptimal performance. We propose MAFIA, a too… ▽ More

    Submitted 8 July, 2021; originally announced July 2021.

    Comments: Accepted at The International Conference on Field-Programmable Logic and Applications (FPL), 2021

  6. arXiv:1807.05317  [pdf

    cs.LG stat.ML

    LeFlow: Enabling Flexible FPGA High-Level Synthesis of Tensorflow Deep Neural Networks

    Authors: Daniel H. Noronha, Bahar Salehpour, Steven J. E. Wilton

    Abstract: Recent work has shown that Field-Programmable Gate Arrays (FPGAs) play an important role in the acceleration of Machine Learning applications. Initial specification of machine learning applications are often done using a high-level Python-oriented framework such as Tensorflow, followed by a manual translation to either C or RTL for synthesis using vendor tools. This manual translation step is time… ▽ More

    Submitted 13 July, 2018; originally announced July 2018.

    Comments: To be published in FPGA for Software Programmers (FSP 2018)

  7. arXiv:1709.04972  [pdf, other

    quant-ph

    Algorithms for Embedding Quantum-Dot Cellular Automata Networks onto a Quantum Annealing Processor

    Authors: Jacob Retallick, Michael Babcock, Miguel Aroca-Ouellette, Shane McNamara, Steve Wilton, Aidan Roy, Mark Johnson, Konrad Walus

    Abstract: Advancements in computing based on qubit networks, and in particular the flux-qubit processor architecture developed by D-Wave System's Inc., have enabled the physical simulation of quantum-dot cellular automata (QCA) networks beyond the limit of classical methods. However, the embedding of QCA networks onto the available processor architecture is a key challenge in preparing such simulations. In… ▽ More

    Submitted 14 September, 2017; originally announced September 2017.

  8. arXiv:1606.06457  [pdf

    cs.AR

    Enabling Effective FPGA Debug using Overlays: Opportunities and Challenges

    Authors: Fatemeh Eslami, Eddie Hung, Steven J. E. Wilton

    Abstract: FPGAs are going mainstream. Major companies that were not traditionally FPGA-focused are now seeking ways to exploit the benefits of reconfigurable technology and provide it to their customers. In order to do so, a debug ecosystem that provides for effective visibility into a working design and quick debug turn-around times is essential. Overlays have the opportunity to play a key role in this eco… ▽ More

    Submitted 21 June, 2016; originally announced June 2016.

    Comments: Presented at 2nd International Workshop on Overlay Architectures for FPGAs (OLAF 2016) arXiv:1605.08149

    Report number: OLAF/2016/08

  9. arXiv:1508.06805  [pdf

    cs.SE cs.AR

    Allowing Software Developers to Debug HLS Hardware

    Authors: Jeffrey Goeders, Steven J. E. Wilton

    Abstract: High-Level Synthesis (HLS) is emerging as a mainstream design methodology, allowing software designers to enjoy the benefits of a hardware implementation. Significant work has led to effective compilers that produce high-quality hardware designs from software specifications. However, in order to fully benefit from the promise of HLS, a complete ecosystem that provides the ability to analyze, debug… ▽ More

    Submitted 27 August, 2015; originally announced August 2015.

    Comments: Presented at Second International Workshop on FPGAs for Software Programmers (FSP 2015) (arXiv:1508.06320)

    Report number: FSP/2015/01