Showing 1–7 of 7 results for author: Langroudi, H F

  1. arXiv:2104.02233  [pdf, other]

    cs.LG cs.AI

    TENT: Efficient Quantization of Neural Networks on the tiny Edge with Tapered FixEd PoiNT

    Authors: Hamed F. Langroudi, Vedant Karia, Tej Pandit, Dhireesha Kudithipudi

    Abstract: In this research, we propose a new low-precision framework, TENT, to leverage the benefits of a tapered fixed-point numerical format in TinyML models. We introduce a tapered fixed-point quantization algorithm that matches the numerical format's dynamic range and distribution to that of the deep neural network model's parameter distribution at each layer. An accelerator architecture for the tapered…

    Submitted 5 April, 2021; originally announced April 2021.

    Comments: poster presented at the first tinyML Research Symposium, March 26, 2021

  2. arXiv:1908.02386  [pdf, other]

    cs.LG cs.NE stat.ML

    Cheetah: Mixed Low-Precision Hardware & Software Co-Design Framework for DNNs on the Edge

    Authors: Hamed F. Langroudi, Zachariah Carmichael, David Pastuch, Dhireesha Kudithipudi

    Abstract: Low-precision DNNs have been extensively explored in order to reduce the size of DNN models for edge devices. Recently, the posit numerical format has shown promise for DNN data representation and compute with ultra-low precision in [5..8]-bits. However, previous studies were limited to studying posit for DNN inference only. In this paper, we propose the Cheetah framework, which supports both DNN…

    Submitted 6 August, 2019; originally announced August 2019.

  3. arXiv:1907.13216  [pdf, other]

    cs.LG stat.ML

    Deep Learning Training on the Edge with Low-Precision Posits

    Authors: Hamed F. Langroudi, Zachariah Carmichael, Dhireesha Kudithipudi

    Abstract: Recently, the posit numerical format has shown promise for DNN data representation and compute with ultra-low precision ([5..8]-bit). However, the majority of studies focus only on DNN inference. In this work, we propose DNN training using posits and compare with the floating point training. We evaluate on both MNIST and Fashion MNIST corpuses, where 16-bit posits outperform 16-bit floating point for…

    Submitted 30 July, 2019; originally announced July 2019.

  4. Performance-Efficiency Trade-off of Low-Precision Numerical Formats in Deep Neural Networks

    Authors: Zachariah Carmichael, Hamed F. Langroudi, Char Khazanov, Jeffrey Lillie, John L. Gustafson, Dhireesha Kudithipudi

    Abstract: Deep neural networks (DNNs) have been demonstrated as effective prognostic models across various domains, e.g. natural language processing, computer vision, and genomics. However, modern-day DNNs demand high compute and memory storage for executing any reasonably complex task. To optimize the inference time and alleviate the power consumption of these networks, DNN accelerators with low-precision…

    Submitted 25 March, 2019; originally announced March 2019.

    Comments: 9 pages, Proceedings of the ACM Conference for Next Generation Arithmetic (CoNGA) 2019

  5. arXiv:1812.01762  [pdf, other]

    cs.DC cs.LG cs.NE

    Deep Positron: A Deep Neural Network Using the Posit Number System

    Authors: Zachariah Carmichael, Hamed F. Langroudi, Char Khazanov, Jeffrey Lillie, John L. Gustafson, Dhireesha Kudithipudi

    Abstract: The recent surge of interest in Deep Neural Networks (DNNs) has led to increasingly complex networks that tax computational and memory resources. Many DNNs presently use 16-bit or 32-bit floating point operations. Significant performance and power gains can be obtained when DNN accelerators support low-precision numerical formats. Despite considerable research, there is still a knowledge gap on ho…

    Submitted 18 January, 2019; v1 submitted 4 December, 2018; originally announced December 2018.

    Comments: 6 pages, Design, Automation and Test in Europe 2019

  6. arXiv:1805.08624  [pdf, other]

    cs.CV

    Deep Learning Inference on Embedded Devices: Fixed-Point vs Posit

    Authors: Seyed H. F. Langroudi, Tej Pandit, Dhireesha Kudithipudi

    Abstract: Performing the inference step of deep learning in resource constrained environments, such as embedded devices, is challenging. Success requires optimization at both software and hardware levels. Low precision arithmetic and specifically low precision fixed-point number systems have become the standard for performing deep learning inference. However, representing non-uniform data and distributed pa…

    Submitted 22 May, 2018; originally announced May 2018.

  7. arXiv:1711.01201  [pdf, ps, other]

    cs.CV cs.NE eess.IV

    Convolutional Drift Networks for Video Classification

    Authors: Dillon Graham, Seyed Hamed Fatemi Langroudi, Christopher Kanan, Dhireesha Kudithipudi

    Abstract: Analyzing spatio-temporal data like video is a challenging task that requires processing visual and temporal information effectively. Convolutional Neural Networks have shown promise as baseline fixed feature extractors through transfer learning, a technique that helps minimize the training cost on visual information. Temporal information is often handled using hand-crafted features or Recurrent N…

    Submitted 3 November, 2017; originally announced November 2017.

    Comments: Published in IEEE Rebooting Computing