Skip to main content

Showing 1–13 of 13 results for author: Samragh, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2312.09299  [pdf, other

    cs.LG cs.CL cs.CV

    Weight subcloning: direct initialization of transformers using larger pretrained ones

    Authors: Mohammad Samragh, Mehrdad Farajtabar, Sachin Mehta, Raviteja Vemulapalli, Fartash Faghri, Devang Naik, Oncel Tuzel, Mohammad Rastegari

    Abstract: Training large transformer models from scratch for a target task requires lots of data and is computationally demanding. The usual practice of transfer learning overcomes this challenge by initializing the model with weights of a pretrained model of the same size and specification to increase the convergence and training speed. However, what if no pretrained model of the required size is available… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

  2. arXiv:2309.00140  [pdf, other

    cs.SD cs.CV cs.LG eess.AS

    Improving vision-inspired keyword spotting using dynamic module skip** in streaming conformer encoder

    Authors: Alexandre Bittar, Paul Dixon, Mohammad Samragh, Kumari Nishu, Devang Naik

    Abstract: Using a vision-inspired keyword spotting framework, we propose an architecture with input-dependent dynamic depth capable of processing streaming audio. Specifically, we extend a conformer encoder with trainable binary gates that allow us to dynamically skip network modules according to the input audio. Our approach improves detection and localization accuracy on continuous speech using Librispeec… ▽ More

    Submitted 31 August, 2023; originally announced September 2023.

    Journal ref: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

  3. arXiv:2210.13567  [pdf, ps, other

    cs.CV cs.LG cs.SD eess.AS

    I see what you hear: a vision-inspired method to localize words

    Authors: Mohammad Samragh, Arnav Kundu, Ting-Yao Hu, Minsik Cho, Aman Chadha, Ashish Shrivastava, Oncel Tuzel, Devang Naik

    Abstract: This paper explores the possibility of using visual object detection techniques for word localization in speech data. Object detection has been thoroughly studied in the contemporary literature for visual data. Noting that an audio can be interpreted as a 1-dimensional image, object localization techniques can be fundamentally useful for word localization. Building upon this idea, we propose a lig… ▽ More

    Submitted 24 October, 2022; originally announced October 2022.

  4. arXiv:2109.02836  [pdf, other

    cs.LG

    Trojan Signatures in DNN Weights

    Authors: Greg Fields, Mohammad Samragh, Mojan Javaheripi, Farinaz Koushanfar, Tara Javidi

    Abstract: Deep neural networks have been shown to be vulnerable to backdoor, or trojan, attacks where an adversary has embedded a trigger in the network at training time such that the model correctly classifies all standard inputs, but generates a targeted, incorrect classification on any input which contains the trigger. In this paper, we present the first ultra light-weight and highly effective trojan det… ▽ More

    Submitted 6 September, 2021; originally announced September 2021.

    Comments: 8 pages, 13 figures

  5. arXiv:2104.11413  [pdf, other

    cs.LG cs.IT

    Unsupervised Information Obfuscation for Split Inference of Neural Networks

    Authors: Mohammad Samragh, Hossein Hosseini, Aleksei Triastcyn, Kambiz Azarian, Joseph Soriaga, Farinaz Koushanfar

    Abstract: Splitting network computations between the edge device and a server enables low edge-compute inference of neural networks but might expose sensitive information about the test query to the server. To address this problem, existing techniques train the model to minimize information leakage for a given set of sensitive attributes. In practice, however, the test queries might contain attributes that… ▽ More

    Submitted 22 June, 2021; v1 submitted 23 April, 2021; originally announced April 2021.

  6. arXiv:2009.02326  [pdf, other

    cs.LG cs.AR cs.CR cs.CV stat.ML

    CLEANN: Accelerated Trojan Shield for Embedded Neural Networks

    Authors: Mojan Javaheripi, Mohammad Samragh, Gregory Fields, Tara Javidi, Farinaz Koushanfar

    Abstract: We propose CLEANN, the first end-to-end framework that enables online mitigation of Trojans for embedded Deep Neural Network (DNN) applications. A Trojan attack works by injecting a backdoor in the DNN while training; during inference, the Trojan can be activated by the specific backdoor trigger. What differentiates CLEANN from the prior work is its lightweight methodology which recovers the groun… ▽ More

    Submitted 4 September, 2020; originally announced September 2020.

  7. arXiv:2004.04249  [pdf, other

    cs.LG cs.CV cs.NE stat.ML

    GeneCAI: Genetic Evolution for Acquiring Compact AI

    Authors: Mojan Javaheripi, Mohammad Samragh, Tara Javidi, Farinaz Koushanfar

    Abstract: In the contemporary big data realm, Deep Neural Networks (DNNs) are evolving towards more complex architectures to achieve higher inference accuracy. Model compression techniques can be leveraged to efficiently deploy such compute-intensive architectures on resource-limited mobile devices. Such methods comprise various hyper-parameters that require per-layer customization to ensure high accuracy.… ▽ More

    Submitted 14 April, 2020; v1 submitted 8 April, 2020; originally announced April 2020.

  8. arXiv:1911.06471  [pdf, other

    cs.LG cs.NE stat.ML

    ASCAI: Adaptive Sampling for acquiring Compact AI

    Authors: Mojan Javaheripi, Mohammad Samragh, Tara Javidi, Farinaz Koushanfar

    Abstract: This paper introduces ASCAI, a novel adaptive sampling methodology that can learn how to effectively compress Deep Neural Networks (DNNs) for accelerated inference on resource-constrained platforms. Modern DNN compression techniques comprise various hyperparameters that require per-layer customization to ensure high accuracy. Choosing such hyperparameters is cumbersome as the pertinent search spac… ▽ More

    Submitted 14 November, 2019; originally announced November 2019.

  9. arXiv:1902.07342  [pdf, other

    cs.CR

    XONN: XNOR-based Oblivious Deep Neural Network Inference

    Authors: M. Sadegh Riazi, Mohammad Samragh, Hao Chen, Kim Laine, Kristin Lauter, Farinaz Koushanfar

    Abstract: Advancements in deep learning enable cloud servers to provide inference-as-a-service for clients. In this scenario, clients send their raw data to the server to run the deep learning model and send back the results. One standing challenge in this setting is to ensure the privacy of the clients' sensitive data. Oblivious inference is the task of running the neural network on the client's input with… ▽ More

    Submitted 13 September, 2019; v1 submitted 19 February, 2019; originally announced February 2019.

    Comments: To appear in USENIX Security 2019

  10. arXiv:1901.05582  [pdf, other

    cs.LG stat.ML

    CodeX: Bit-Flexible Encoding for Streaming-based FPGA Acceleration of DNNs

    Authors: Mohammad Samragh, Mojan Javaheripi, Farinaz Koushanfar

    Abstract: This paper proposes CodeX, an end-to-end framework that facilitates encoding, bitwidth customization, fine-tuning, and implementation of neural networks on FPGA platforms. CodeX incorporates nonlinear encoding to the computation flow of neural networks to save memory. The encoded features demand significantly lower storage compared to the raw full-precision activation values; therefore, the execut… ▽ More

    Submitted 16 January, 2019; originally announced January 2019.

  11. arXiv:1806.05794  [pdf, other

    cs.NE cs.AI cs.AR

    RAPIDNN: In-Memory Deep Neural Network Acceleration Framework

    Authors: Mohsen Imani, Mohammad Samragh, Yeseong Kim, Saransh Gupta, Farinaz Koushanfar, Tajana Rosing

    Abstract: Deep neural networks (DNN) have demonstrated effectiveness for various applications such as image processing, video segmentation, and speech recognition. Running state-of-the-art DNNs on current systems mostly relies on either generalpurpose processors, ASIC designs, or FPGA accelerators, all of which suffer from data movements due to the limited onchip memory and data transfer bandwidth. In this… ▽ More

    Submitted 11 April, 2019; v1 submitted 14 June, 2018; originally announced June 2018.

  12. arXiv:1711.01243  [pdf, other

    cs.LG cs.CV cs.NE

    ReBNet: Residual Binarized Neural Network

    Authors: Mohammad Ghasemzadeh, Mohammad Samragh, Farinaz Koushanfar

    Abstract: This paper proposes ReBNet, an end-to-end framework for training reconfigurable binary neural networks on software and develo** efficient accelerators for execution on FPGA. Binary neural networks offer an intriguing opportunity for deploying large-scale deep learning models on resource-constrained devices. Binarization reduces the memory footprint and replaces the power-hungry matrix-multiplica… ▽ More

    Submitted 27 March, 2018; v1 submitted 3 November, 2017; originally announced November 2017.

    Comments: To Appear In The 26th IEEE International Symposium on Field-Programmable Custom Computing Machines

  13. arXiv:1709.02538  [pdf, other

    cs.CR cs.LG stat.ML

    DeepFense: Online Accelerated Defense Against Adversarial Deep Learning

    Authors: Bita Darvish Rouhani, Mohammad Samragh, Mojan Javaheripi, Tara Javidi, Farinaz Koushanfar

    Abstract: Recent advances in adversarial Deep Learning (DL) have opened up a largely unexplored surface for malicious attacks jeopardizing the integrity of autonomous DL systems. With the wide-spread usage of DL in critical and time-sensitive applications, including unmanned vehicles, drones, and video surveillance systems, online detection of malicious inputs is of utmost importance. We propose DeepFense,… ▽ More

    Submitted 20 August, 2018; v1 submitted 8 September, 2017; originally announced September 2017.

    Comments: Adding hardware acceleration for real-time execution of defender modules