Showing 1–10 of 10 results for author: Xun, L

  1. arXiv:2401.08965  [pdf, other]

    cs.CV

    Dynamic DNNs and Runtime Management for Efficient Inference on Mobile/Embedded Devices

    Authors: Lei Xun, Jonathon Hare, Geoff V. Merrett

    Abstract: Deep neural network (DNN) inference is increasingly being executed on mobile and embedded platforms due to several key advantages in latency, privacy and always-on availability. However, due to limited computing resources, efficient DNN deployment on mobile and embedded platforms is challenging. Although many hardware accelerators and static model compression methods were proposed by previous work…

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: Accepted at Design, Automation & Test in Europe Conference (DATE) 2024, PhD Forum

  2. arXiv:2401.08943  [pdf, other]

    cs.CV

    Fluid Dynamic DNNs for Reliable and Adaptive Distributed Inference on Edge Devices

    Authors: Lei Xun, Mingyu Hu, Hengrui Zhao, Amit Kumar Singh, Jonathon Hare, Geoff V. Merrett

    Abstract: Distributed inference is a popular approach for efficient DNN inference at the edge. However, traditional Static and Dynamic DNNs are not distribution-friendly, causing system reliability and adaptability issues. In this paper, we introduce Fluid Dynamic DNNs (Fluid DyDNNs), tailored for distributed inference. Distinct from Static and Dynamic DNNs, Fluid DyDNNs utilize a novel nested incremental t…

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: Accepted at Design, Automation & Test in Europe Conference (DATE) 2024

  3. arXiv:2307.06261  [pdf, other]

    cs.RO

    Cosserat-Rod Based Dynamic Modeling of Soft Slender Robot Interacting with Environment

    Authors: Lingxiao Xun, Gang Zheng, Alexandre Kruszewski

    Abstract: Soft slender robots have attracted more and more research attentions in these years due to their continuity and compliance natures. However, mechanics modeling for soft robots interacting with environment is still an academic challenge because of the non-linearity of deformation and the non-smooth property of the contacts. In this work, starting from a piece-wise local strain field assumption, we…

    Submitted 12 July, 2023; originally announced July 2023.

  4. arXiv:2206.03546  [pdf, other]

    cs.RO

    Piecewise Linear Strain Cosserat Model for Soft Slender Manipulator

    Authors: Haihong Li, Lingxiao Xun, Gang Zheng

    Abstract: Recently soft robotics has rapidly become a novel and promising area of research with many designs and applications due to their flexible and compliant structure. However, it is more difficult to derive the nonlinear dynamic model of such soft robots. The differential kinematics and dynamics of the soft manipulator can be formulated as a set of highly nonlinear partial differential equations (PDEs…

    Submitted 7 June, 2022; originally announced June 2022.

  5. arXiv:2206.02525  [pdf, other]

    cs.AR

    Dynamic DNNs Meet Runtime Resource Management on Mobile and Embedded Platforms

    Authors: Lei Xun, Bashir M. Al-Hashimi, Jonathon Hare, Geoff V. Merrett

    Abstract: Deep neural network (DNN) inference is increasingly being executed on mobile and embedded platforms due to low latency and better privacy. However, efficient deployment on these platforms is challenging due to the intensive computation and memory access. We propose a holistic system design for DNN performance and energy optimisation, combining the trade-off opportunities in both algorithms and har…

    Submitted 6 June, 2022; v1 submitted 17 May, 2022; originally announced June 2022.

    Comments: Accepted as a presentation at Fourth UK Mobile, Wearable and Ubiquitous Systems Research Symposium (MobiUK 2022)

  6. arXiv:2202.07848  [pdf, other]

    cs.DC cs.AI

    Singularity: Planet-Scale, Preemptive and Elastic Scheduling of AI Workloads

    Authors: Dharma Shukla, Muthian Sivathanu, Srinidhi Viswanatha, Bhargav Gulavani, Rimma Nehme, Amey Agrawal, Chen Chen, Nipun Kwatra, Ramachandran Ramjee, Pankaj Sharma, Atul Katiyar, Vipul Modi, Vaibhav Sharma, Abhishek Singh, Shreshth Singhal, Kaustubh Welankar, Lu Xun, Ravi Anupindi, Karthik Elangovan, Hasibur Rahman, Zhou Lin, Rahul Seetharaman, Cheng Xu, Eddie Ailijiang, Suresh Krishnappa, et al. (1 additional author not shown)

    Abstract: Lowering costs by driving high utilization across deep learning workloads is a crucial lever for cloud providers. We present Singularity, Microsoft's globally distributed scheduling service for highly-efficient and reliable execution of deep learning training and inference workloads. At the heart of Singularity is a novel, workload-aware scheduler that can transparently preempt and elastically sca…

    Submitted 21 February, 2022; v1 submitted 15 February, 2022; originally announced February 2022.

    Comments: Revision: Fixed some typos

  7. arXiv:2107.08199  [pdf, other]

    cs.CL cs.LG

    Dynamic Transformer for Efficient Machine Translation on Embedded Devices

    Authors: Hishan Parry, Lei Xun, Amin Sabet, Jia Bi, Jonathon Hare, Geoff V. Merrett

    Abstract: The Transformer architecture is widely used for machine translation tasks. However, its resource-intensive nature makes it challenging to implement on constrained embedded devices, particularly where available hardware resources can vary at run-time. We propose a dynamic machine translation model that scales the Transformer architecture based on the available resources at any particular time. The…

    Submitted 30 July, 2021; v1 submitted 17 July, 2021; originally announced July 2021.

    Comments: Accepted at MLCAD 2021

  8. arXiv:2105.03608  [pdf, other]

    cs.CV

    Optimising Resource Management for Embedded Machine Learning

    Authors: Lei Xun, Long Tran-Thanh, Bashir M Al-Hashimi, Geoff V. Merrett

    Abstract: Machine learning inference is increasingly being executed locally on mobile and embedded platforms, due to the clear advantages in latency, privacy and connectivity. In this paper, we present approaches for online resource management in heterogeneous multi-core systems and show how they can be applied to optimise the performance of machine learning workloads. Performance can be defined using platf…

    Submitted 8 May, 2021; originally announced May 2021.

    Comments: Accepted at DATE 2020

  9. arXiv:2105.03600  [pdf, other]

    cs.CV

    Incremental Training and Group Convolution Pruning for Runtime DNN Performance Scaling on Heterogeneous Embedded Platforms

    Authors: Lei Xun, Long Tran-Thanh, Bashir M Al-Hashimi, Geoff V. Merrett

    Abstract: Inference for Deep Neural Networks is increasingly being executed locally on mobile and embedded platforms due to its advantages in latency, privacy and connectivity. Since modern System on Chips typically execute a combination of different and dynamic workloads concurrently, it is challenging to consistently meet inference time/energy budget at runtime because of the local computing resources ava…

    Submitted 8 May, 2021; originally announced May 2021.

    Comments: Accepted at ACM/IEEE Workshop on Machine Learning for CAD (MLCAD) 2019

  10. arXiv:2105.03596  [pdf, other]

    cs.CV

    Dynamic-OFA: Runtime DNN Architecture Switching for Performance Scaling on Heterogeneous Embedded Platforms

    Authors: Wei Lou, Lei Xun, Amin Sabet, Jia Bi, Jonathon Hare, Geoff V. Merrett

    Abstract: Mobile and embedded platforms are increasingly required to efficiently execute computationally demanding DNNs across heterogeneous processing elements. At runtime, the available hardware resources to DNNs can vary considerably due to other concurrently running applications. The performance requirements of the applications could also change under different scenarios. To achieve the desired performa…

    Submitted 11 May, 2021; v1 submitted 8 May, 2021; originally announced May 2021.

    Comments: Accepted at CVPR ECV Workshop 2021