Skip to main content

Showing 1–6 of 6 results for author: Boland, D

.
  1. arXiv:2406.05999  [pdf, other

    cs.AR cs.AI cs.LG

    fSEAD: a Composable FPGA-based Streaming Ensemble Anomaly Detection Library

    Authors: Binglei Lou, David Boland, Philip H. W. Leong

    Abstract: Machine learning ensembles combine multiple base models to produce a more accurate output. They can be applied to a range of machine learning problems, including anomaly detection. In this paper, we investigate how to maximize the composability and scalability of an FPGA-based streaming ensemble anomaly detector (fSEAD). To achieve this, we propose a flexible computing architecture consisting of m… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: The source code for this paper is available at: https://github.com/bingleilou/fSEAD

    Journal ref: ACM Transactions on Reconfigurable Technology and Systems(TRETS),16, 3, Article 42 (2023). Journal Track of The International Conference on Field Programmable Technology (FPT'22), Hong Kong SAR, China

  2. arXiv:2406.04910  [pdf, other

    cs.LG cs.AI cs.AR

    PolyLUT-Add: FPGA-based LUT Inference with Wide Inputs

    Authors: Binglei Lou, Richard Rademacher, David Boland, Philip H. W. Leong

    Abstract: FPGAs have distinct advantages as a technology for deploying deep neural networks (DNNs) at the edge. Lookup Table (LUT) based networks, where neurons are directly modelled using LUTs, help maximize this promise of offering ultra-low latency and high area efficiency on FPGAs. Unfortunately, LUT resource usage scales exponentially with the number of inputs to the LUT, restricting PolyLUT to small L… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: To be published in the International Conference on Field-Programmable Logic and Applications (FPL) 2024

  3. LUXOR: An FPGA Logic Cell Architecture for Efficient Compressor Tree Implementations

    Authors: SeyedRamin Rasoulinezhad, Siddhartha, Hao Zhou, Lingli Wang, David Boland, Philip H. W. Leong

    Abstract: We propose two tiers of modifications to FPGA logic cell architecture to deliver a variety of performance and utilization benefits with only minor area overheads. In the irst tier, we augment existing commercial logic cell datapaths with a 6-input XOR gate in order to improve the expressiveness of each element, while maintaining backward compatibility. This new architecture is vendor-agnostic, and… ▽ More

    Submitted 6 March, 2020; originally announced March 2020.

    Comments: In Proceedings of the 2020 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays (FPGA'20), February 23-25, 2020, Seaside, CA, USA

    ACM Class: B.2.1; C.0

  4. MajorityNets: BNNs Utilising Approximate Popcount for Improved Efficiency

    Authors: Seyedramin Rasoulinezhad, Sean Fox, Hao Zhou, Lingli Wang, David Boland, Philip H. W. Leong

    Abstract: Binarized neural networks (BNNs) have shown exciting potential for utilising neural networks in embedded implementations where area, energy and latency constraints are paramount. With BNNs, multiply-accumulate (MAC) operations can be simplified to XnorPopcount operations, leading to massive reductions in both memory and computation resources. Furthermore, multiple efficient implementations of BNNs… ▽ More

    Submitted 26 February, 2020; originally announced February 2020.

    Comments: 4 pages

    Journal ref: International Conference on Field-Programmable Technology, {FPT} 2019,Tian**, China, December 9-13, 2019

  5. arXiv:1911.08097  [pdf

    eess.SP cs.AR cs.CV

    AddNet: Deep Neural Networks Using FPGA-Optimized Multipliers

    Authors: Julian Faraone, Martin Kumm, Martin Hardieck, Peter Zipf, Xueyuan Liu, David Boland, Philip H. W. Leong

    Abstract: Low-precision arithmetic operations to accelerate deep-learning applications on field-programmable gate arrays (FPGAs) have been studied extensively, because they offer the potential to save silicon area or increase throughput. However, these benefits come at the cost of a decrease in accuracy. In this article, we demonstrate that reconfigurable constant coefficient multipliers (RCCMs) offer a bet… ▽ More

    Submitted 19 November, 2019; originally announced November 2019.

    Comments: 14 pages

  6. arXiv:1909.04509  [pdf, other

    eess.SP cs.LG cs.NE eess.IV

    Unrolling Ternary Neural Networks

    Authors: Stephen Tridgell, Martin Kumm, Martin Hardieck, David Boland, Duncan Moss, Peter Zipf, Philip H. W. Leong

    Abstract: The computational complexity of neural networks for large scale or real-time applications necessitates hardware acceleration. Most approaches assume that the network architecture and parameters are unknown at design time, permitting usage in a large number of applications. This paper demonstrates, for the case where the neural network architecture and ternary weight values are known a priori, that… ▽ More

    Submitted 9 September, 2019; originally announced September 2019.

    Comments: Accepted in TRETS