Skip to main content

Showing 1–12 of 12 results for author: Sarwar, S S

.
  1. arXiv:2312.14750  [pdf, other

    cs.AR

    Siracusa: A 16 nm Heterogenous RISC-V SoC for Extended Reality with At-MRAM Neural Engine

    Authors: Arpan Suravi Prasad, Moritz Scherer, Francesco Conti, Davide Rossi, Alfio Di Mauro, Manuel Eggimann, Jorge Tómas Gómez, Ziyun Li, Syed Shakib Sarwar, Zhao Wang, Barbara De Salvo, Luca Benini

    Abstract: Extended reality (XR) applications are Machine Learning (ML)-intensive, featuring deep neural networks (DNNs) with millions of weights, tightly latency-bound (10-20 ms end-to-end), and power-constrained (low tens of mW average power). While ML performance and efficiency can be achieved by introducing neural engines within low-power systems-on-chip (SoCs), system-level power for nontrivial DNNs dep… ▽ More

    Submitted 14 April, 2024; v1 submitted 22 December, 2023; originally announced December 2023.

    Comments: Final accepted manuscript pre-print submitted to the IEEE Journal of Solid-State Circuits

  2. arXiv:2207.00670  [pdf, other

    cs.CV cs.LG

    DRESS: Dynamic REal-time Sparse Subnets

    Authors: Zhongnan Qu, Syed Shakib Sarwar, Xin Dong, Yuecheng Li, Ekin Sumbul, Barbara De Salvo

    Abstract: The limited and dynamically varied resources on edge devices motivate us to deploy an optimized deep neural network that can adapt its sub-networks to fit in different resource constraints. However, existing works often build sub-networks through searching different network architectures in a hand-crafted sampling space, which not only can result in a subpar performance but also may cause on-devic… ▽ More

    Submitted 1 July, 2022; originally announced July 2022.

    Comments: Published in Efficient Deep Learning for Computer Vision (ECV) CVPR Workshop 2022

  3. arXiv:2206.06780  [pdf, other

    cs.AR cs.AI

    Memory-Oriented Design-Space Exploration of Edge-AI Hardware for XR Applications

    Authors: Vivek Parmar, Syed Shakib Sarwar, Ziyun Li, Hsien-Hsin S. Lee, Barbara De Salvo, Manan Suri

    Abstract: Low-Power Edge-AI capabilities are essential for on-device extended reality (XR) applications to support the vision of Metaverse. In this work, we investigate two representative XR workloads: (i) Hand detection and (ii) Eye segmentation, for hardware design space exploration. For both applications, we train deep neural networks and analyze the impact of quantization and hardware specific bottlenec… ▽ More

    Submitted 28 March, 2023; v1 submitted 8 June, 2022; originally announced June 2022.

    Comments: Accepted as a full paper by the TinyML Research Symposium 2023

  4. arXiv:2203.07474  [pdf, other

    cs.AR cs.LG

    Distributed On-Sensor Compute System for AR/VR Devices: A Semi-Analytical Simulation Framework for Power Estimation

    Authors: Jorge Gomez, Saavan Patel, Syed Shakib Sarwar, Ziyun Li, Raffaele Capoccia, Zhao Wang, Reid Pinkham, Andrew Berkovich, Tsung-Hsun Tsai, Barbara De Salvo, Chiao Liu

    Abstract: Augmented Reality/Virtual Reality (AR/VR) glasses are widely foreseen as the next generation computing platform. AR/VR glasses are a complex "system of systems" which must satisfy stringent form factor, computing-, power- and thermal- requirements. In this paper, we will show that a novel distributed on-sensor compute architecture, coupled with new semiconductor technologies (such as dense 3D-IC i… ▽ More

    Submitted 14 March, 2022; originally announced March 2022.

    Comments: 6 pages, 5 figures, TinyML Research Symposium

  5. arXiv:2203.05025  [pdf, other

    cs.LG

    Power-of-Two Quantization for Low Bitwidth and Hardware Compliant Neural Networks

    Authors: Dominika Przewlocka-Rus, Syed Shakib Sarwar, H. Ekin Sumbul, Yuecheng Li, Barbara De Salvo

    Abstract: Deploying Deep Neural Networks in low-power embedded devices for real time-constrained applications requires optimization of memory and computational complexity of the networks, usually by quantizing the weights. Most of the existing works employ linear quantization which causes considerable degradation in accuracy for weight bit widths lower than 8. Since the distribution of weights is usually no… ▽ More

    Submitted 9 March, 2022; originally announced March 2022.

    Comments: TinyML Research Symposium

  6. arXiv:1905.02704  [pdf, other

    cs.NE cs.LG eess.SP

    A Comprehensive Analysis on Adversarial Robustness of Spiking Neural Networks

    Authors: Saima Sharmin, Priyadarshini Panda, Syed Shakib Sarwar, Chankyu Lee, Wachirawit Ponghiran, Kaushik Roy

    Abstract: In this era of machine learning models, their functionality is being threatened by adversarial attacks. In the face of this struggle for making artificial neural networks robust, finding a model, resilient to these attacks, is very important. In this work, we present, for the first time, a comprehensive analysis of the behavior of more bio-plausible networks, namely Spiking Neural Network (SNN) un… ▽ More

    Submitted 7 May, 2019; originally announced May 2019.

    Comments: Accepted in IJCNN2019

  7. Enabling Spike-based Backpropagation for Training Deep Neural Network Architectures

    Authors: Chankyu Lee, Syed Shakib Sarwar, Priyadarshini Panda, Gopalakrishnan Srinivasan, Kaushik Roy

    Abstract: Spiking Neural Networks (SNNs) have recently emerged as a prominent neural computing paradigm. However, the typical shallow SNN architectures have limited capacity for expressing complex representations while training deep SNNs using input spikes has not been successful so far. Diverse methods have been proposed to get around this issue such as converting off-the-shelf trained deep Artificial Neur… ▽ More

    Submitted 24 March, 2020; v1 submitted 15 March, 2019; originally announced March 2019.

    Comments: Chankyu Lee and Syed Shakib Sarwar contributed equally to the work

    Journal ref: Frontiers in Neuroscience, 14 (2020)

  8. Incremental Learning in Deep Convolutional Neural Networks Using Partial Network Sharing

    Authors: Syed Shakib Sarwar, Aayush Ankit, Kaushik Roy

    Abstract: Deep convolutional neural network (DCNN) based supervised learning is a widely practiced approach for large-scale image classification. However, retraining these large networks to accommodate new, previously unseen data demands high computational time and energy requirements. Also, previously seen training samples may not be available at the time of retraining. We propose an efficient training met… ▽ More

    Submitted 2 May, 2019; v1 submitted 7 December, 2017; originally announced December 2017.

    Comments: 18 pages, 13 figures. IEEE Access 2019

  9. Gabor Filter Assisted Energy Efficient Fast Learning Convolutional Neural Networks

    Authors: Syed Shakib Sarwar, Priyadarshini Panda, Kaushik Roy

    Abstract: Convolutional Neural Networks (CNN) are being increasingly used in computer vision for a wide range of classification and recognition problems. However, training these large networks demands high computational time and energy requirements; hence, their energy-efficient implementation is of great interest. In this work, we reduce the training complexity of CNNs by replacing certain weight kernels o… ▽ More

    Submitted 12 May, 2017; originally announced May 2017.

    Comments: Accepted in ISLPED 2017

    Journal ref: EEE/ACM International Symposium on Low Power Electronics and Design (ISLPED), Taipei, 2017, pp. 1-6

  10. arXiv:1602.08557  [pdf

    cs.NE

    Multiplier-less Artificial Neurons Exploiting Error Resiliency for Energy-Efficient Neural Computing

    Authors: Syed Shakib Sarwar, Swagath Venkataramani, Anand Raghunathan, Kaushik Roy

    Abstract: Large-scale artificial neural networks have shown significant promise in addressing a wide range of classification and recognition applications. However, their large computational requirements stretch the capabilities of computing platforms. The fundamental components of these neural networks are the neurons and its synapses. The core of a digital hardware neuron consists of multiplier, accumulato… ▽ More

    Submitted 27 February, 2016; originally announced February 2016.

    Comments: Accepted in Design, Automation and Test in Europe 2016 conference (DATE-2016)

    Journal ref: In Design, Automation & Test in Europe Conference & Exhibition (DATE), 2016, pp. 145-150

  11. arXiv:1602.08556  [pdf, other

    cs.NE

    Significance Driven Hybrid 8T-6T SRAM for Energy-Efficient Synaptic Storage in Artificial Neural Networks

    Authors: Gopalakrishnan Srinivasan, Parami Wijesinghe, Syed Shakib Sarwar, Akhilesh Jaiswal, Kaushik Roy

    Abstract: Multilayered artificial neural networks (ANN) have found widespread utility in classification and recognition applications. The scale and complexity of such networks together with the inadequacies of general purpose computing platforms have led to a significant interest in the development of efficient hardware implementations. In this work, we focus on designing energy efficient on-chip storage fo… ▽ More

    Submitted 27 February, 2016; originally announced February 2016.

    Comments: Accepted in Design, Automation and Test in Europe 2016 conference (DATE-2016)

    Journal ref: In Design, Automation & Test in Europe Conference & Exhibition (DATE), 2016, pp. 151-156

  12. Spin-Torque Sensors for Energy Efficient High Speed Long Interconnects

    Authors: Zubair Al Azim, Abhronil Sengupta, Syed Shakib Sarwar, Kaushik Roy

    Abstract: In this paper, we propose a Spin-Torque (ST) based sensing scheme that can enable energy efficient multi-bit long distance interconnect architectures. Current-mode interconnects have recently been proposed to overcome the performance degradations associated with conventional voltage mode Copper (Cu) interconnects. However, the performance of current mode interconnects are limited by analog current… ▽ More

    Submitted 2 December, 2015; originally announced December 2015.

    Comments: To appear in IEEE Transactions on Electron Devices