Showing 1–4 of 4 results for author: Cheekatmalla, S K

  1. arXiv:2303.02284  [pdf, other]

    eess.AS cs.AI cs.LG eess.SP

    Fixed-point quantization aware training for on-device keyword-spotting

    Authors: Sashank Macha, Om Oza, Alex Escott, Francesco Caliva, Robbie Armitano, Santosh Kumar Cheekatmalla, Sree Hari Krishnan Parthasarathi, Yuzong Liu

    Abstract: Fixed-point (FXP) inference has proven suitable for embedded devices with limited computational resources, and yet model training is continually performed in floating-point (FLP). FXP training has not been fully explored and the non-trivial conversion from FLP to FXP presents unavoidable performance drop. We propose a novel method to train and obtain FXP convolutional keyword-spotting (KWS) models…

    Submitted 3 March, 2023; originally announced March 2023.

    Comments: 5 pages, 3 figures, 4 tables

    Journal ref: ICASSP 2023

  2. arXiv:2207.06920  [pdf, ps, other]

    cs.SD cs.LG eess.AS

    Sub 8-Bit Quantization of Streaming Keyword Spotting Models for Embedded Chipsets

    Authors: Lu Zeng, Sree Hari Krishnan Parthasarathi, Yuzong Liu, Alex Escott, Santosh Kumar Cheekatmalla, Nikko Strom, Shiv Vitaladevuni

    Abstract: We propose a novel 2-stage sub 8-bit quantization aware training algorithm for all components of a 250K parameter feedforward, streaming, state-free keyword spotting model. For the 1st-stage, we adapt a recently proposed quantization technique using a non-linear transformation with tanh(.) on dense layer weights. In the 2nd-stage, we use linear quantization methods on the rest of the network, incl…

    Submitted 8 September, 2022; v1 submitted 13 July, 2022; originally announced July 2022.

  3. arXiv:2109.14725  [pdf, other]

    cs.LG cs.SD eess.AS

    Tiny-CRNN: Streaming Wakeword Detection In A Low Footprint Setting

    Authors: Mohammad Omar Khursheed, Christin Jose, Rajath Kumar, Gengshen Fu, Brian Kulis, Santosh Kumar Cheekatmalla

    Abstract: In this work, we propose Tiny-CRNN (Tiny Convolutional Recurrent Neural Network) models applied to the problem of wakeword detection, and augment them with scaled dot product attention. We find that, compared to Convolutional Neural Network models, False Accepts in a 250k parameter budget can be reduced by 25% with a 10% reduction in parameter size by using models based on the Tiny-CRNN architectu…

    Submitted 29 September, 2021; originally announced September 2021.

    Comments: arXiv admin note: substantial text overlap with arXiv:2011.12941

    ACM Class: I.2.0

  4. arXiv:2011.12941  [pdf, other]

    eess.AS

    Small Footprint Convolutional Recurrent Networks for Streaming Wakeword Detection

    Authors: Mohammad Omar Khursheed, Christin Jose, Rajath Kumar, Gengshen Fu, Brian Kulis, Santosh Kumar Cheekatmalla

    Abstract: In this work, we propose small footprint Convolutional Recurrent Neural Network models applied to the problem of wakeword detection and augment them with scaled dot product attention. We find that false accepts compared to Convolutional Neural Network models in a 250k parameter budget can be reduced by 25% with a 10% reduction in parameter size by using CRNNs, and we can get up to 32% improvement…

    Submitted 25 November, 2020; originally announced November 2020.

    Comments: © 2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.