Skip to main content

Showing 1–11 of 11 results for author: Merrett, G V

Searching in archive cs. Search in all archives.
.
  1. arXiv:2401.08965  [pdf, other

    cs.CV

    Dynamic DNNs and Runtime Management for Efficient Inference on Mobile/Embedded Devices

    Authors: Lei Xun, Jonathon Hare, Geoff V. Merrett

    Abstract: Deep neural network (DNN) inference is increasingly being executed on mobile and embedded platforms due to several key advantages in latency, privacy and always-on availability. However, due to limited computing resources, efficient DNN deployment on mobile and embedded platforms is challenging. Although many hardware accelerators and static model compression methods were proposed by previous work… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: Accepted at Design, Automation & Test in Europe Conference (DATE) 2024, PhD Forum

  2. arXiv:2401.08943  [pdf, other

    cs.CV

    Fluid Dynamic DNNs for Reliable and Adaptive Distributed Inference on Edge Devices

    Authors: Lei Xun, Mingyu Hu, Hengrui Zhao, Amit Kumar Singh, Jonathon Hare, Geoff V. Merrett

    Abstract: Distributed inference is a popular approach for efficient DNN inference at the edge. However, traditional Static and Dynamic DNNs are not distribution-friendly, causing system reliability and adaptability issues. In this paper, we introduce Fluid Dynamic DNNs (Fluid DyDNNs), tailored for distributed inference. Distinct from Static and Dynamic DNNs, Fluid DyDNNs utilize a novel nested incremental t… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: Accepted at Design, Automation & Test in Europe Conference (DATE) 2024

  3. arXiv:2206.02525  [pdf, other

    cs.AR

    Dynamic DNNs Meet Runtime Resource Management on Mobile and Embedded Platforms

    Authors: Lei Xun, Bashir M. Al-Hashimi, Jonathon Hare, Geoff V. Merrett

    Abstract: Deep neural network (DNN) inference is increasingly being executed on mobile and embedded platforms due to low latency and better privacy. However, efficient deployment on these platforms is challenging due to the intensive computation and memory access. We propose a holistic system design for DNN performance and energy optimisation, combining the trade-off opportunities in both algorithms and har… ▽ More

    Submitted 6 June, 2022; v1 submitted 17 May, 2022; originally announced June 2022.

    Comments: Accepted as a presentation at Fourth UK Mobile, Wearable and Ubiquitous Systems Research Symposium (MobiUK 2022)

  4. arXiv:2109.12047  [pdf, other

    cs.NI cs.PF

    Intermittent Opportunistic Routing Components for the INET Framework

    Authors: Edward Longman, Mohammed El-Hajjar, Geoff V. Merrett

    Abstract: Intermittently-powered wireless sensor networks (WSNs) use energy harvesting and small energy storage to remove the need for battery replacement and to extend the operational lifetime. However, an intermittently-powered forwarder regularly turns on or off, which requires alternative networking solutions. Opportunistic routing (OR) is a potential cross-layer solution for this novel application, but… ▽ More

    Submitted 24 September, 2021; originally announced September 2021.

    Comments: Published in: M. Marek, G. Nardini, V. Vesely (Eds.), Proceedings of the 8th OMNeT++ Community Summit, Virtual Summit, September 8-10, 2021

    Report number: OMNET/2021/05

  5. arXiv:2109.09495  [pdf, other

    cs.LG cs.NE

    GhostShiftAddNet: More Features from Energy-Efficient Operations

    Authors: Jia Bi, Jonathon Hare, Geoff V. Merrett

    Abstract: Deep convolutional neural networks (CNNs) are computationally and memory intensive. In CNNs, intensive multiplication can have resource implications that may challenge the ability for effective deployment of inference on resource-constrained edge devices. This paper proposes GhostShiftAddNet, where the motivation is to implement a hardware-efficient deep network: a multiplication-free CNN with few… ▽ More

    Submitted 3 February, 2022; v1 submitted 20 September, 2021; originally announced September 2021.

    Journal ref: The 32nd British Machine Vision Conference BMVC 2021

  6. arXiv:2107.08199  [pdf, other

    cs.CL cs.LG

    Dynamic Transformer for Efficient Machine Translation on Embedded Devices

    Authors: Hishan Parry, Lei Xun, Amin Sabet, Jia Bi, Jonathon Hare, Geoff V. Merrett

    Abstract: The Transformer architecture is widely used for machine translation tasks. However, its resource-intensive nature makes it challenging to implement on constrained embedded devices, particularly where available hardware resources can vary at run-time. We propose a dynamic machine translation model that scales the Transformer architecture based on the available resources at any particular time. The… ▽ More

    Submitted 30 July, 2021; v1 submitted 17 July, 2021; originally announced July 2021.

    Comments: Accepted at MLCAD 2021

  7. arXiv:2106.11208  [pdf, other

    cs.CV

    Temporal Early Exits for Efficient Video Object Detection

    Authors: Amin Sabet, Jonathon Hare, Bashir Al-Hashimi, Geoff V. Merrett

    Abstract: Transferring image-based object detectors to the domain of video remains challenging under resource constraints. Previous efforts utilised optical flow to allow unchanged features to be propagated, however, the overhead is considerable when working with very slowly changing scenes from applications such as surveillance. In this paper, we propose temporal early exits to reduce the computational com… ▽ More

    Submitted 21 June, 2021; originally announced June 2021.

  8. arXiv:2105.03608  [pdf, other

    cs.CV

    Optimising Resource Management for Embedded Machine Learning

    Authors: Lei Xun, Long Tran-Thanh, Bashir M Al-Hashimi, Geoff V. Merrett

    Abstract: Machine learning inference is increasingly being executed locally on mobile and embedded platforms, due to the clear advantages in latency, privacy and connectivity. In this paper, we present approaches for online resource management in heterogeneous multi-core systems and show how they can be applied to optimise the performance of machine learning workloads. Performance can be defined using platf… ▽ More

    Submitted 8 May, 2021; originally announced May 2021.

    Comments: Accepted at DATE 2020

  9. arXiv:2105.03600  [pdf, other

    cs.CV

    Incremental Training and Group Convolution Pruning for Runtime DNN Performance Scaling on Heterogeneous Embedded Platforms

    Authors: Lei Xun, Long Tran-Thanh, Bashir M Al-Hashimi, Geoff V. Merrett

    Abstract: Inference for Deep Neural Networks is increasingly being executed locally on mobile and embedded platforms due to its advantages in latency, privacy and connectivity. Since modern System on Chips typically execute a combination of different and dynamic workloads concurrently, it is challenging to consistently meet inference time/energy budget at runtime because of the local computing resources ava… ▽ More

    Submitted 8 May, 2021; originally announced May 2021.

    Comments: Accepted at ACM/IEEE Workshop on Machine Learning for CAD (MLCAD) 2019

  10. arXiv:2105.03596  [pdf, other

    cs.CV

    Dynamic-OFA: Runtime DNN Architecture Switching for Performance Scaling on Heterogeneous Embedded Platforms

    Authors: Wei Lou, Lei Xun, Amin Sabet, Jia Bi, Jonathon Hare, Geoff V. Merrett

    Abstract: Mobile and embedded platforms are increasingly required to efficiently execute computationally demanding DNNs across heterogeneous processing elements. At runtime, the available hardware resources to DNNs can vary considerably due to other concurrently running applications. The performance requirements of the applications could also change under different scenarios. To achieve the desired performa… ▽ More

    Submitted 11 May, 2021; v1 submitted 8 May, 2021; originally announced May 2021.

    Comments: Accepted at CVPR ECV Workshop 2021

  11. arXiv:1810.03333  [pdf

    cs.ET

    Practical Implementation of Memristor-Based Threshold Logic Gates

    Authors: Georgios Papandroulidakis, Alexantrou Serb, Ali Khiat, Geoff V. Merrett, Themistoklis Prodromakis

    Abstract: Current advances in emerging memory technologies enable novel and unconventional computing architectures for high-performance and low-power electronic systems, capable of carrying out massively parallel operations at the edge. One emerging technology, ReRAM, also known to belong in the family of memristors (memory resistors), is gathering attention due to its attractive features for logic and in-m… ▽ More

    Submitted 8 October, 2018; originally announced October 2018.