Showing 1–6 of 6 results for author: Neseem, M

  1. arXiv:2404.00103

    cs.LG cs.CV

    PikeLPN: Mitigating Overlooked Inefficiencies of Low-Precision Neural Networks

    Authors: Marina Neseem, Conor McCullough, Randy Hsin, Chas Leichner, Shan Li, In Suk Chong, Andrew G. Howard, Lukasz Lew, Sherief Reda, Ville-Mikko Rautio, Daniele Moro

    Abstract: Low-precision quantization is recognized for its efficacy in neural network optimization. Our analysis reveals that non-quantized elementwise operations which are prevalent in layers such as parameterized activation functions, batch normalization, and quantization scaling dominate the inference cost of low-precision models. These non-quantized elementwise operations are commonly overlooked in SOTA…

    Submitted 29 March, 2024; originally announced April 2024.

    Comments: Accepted in CVPR 2024. 10 Figures, 9 Tables

  2. arXiv:2403.20320

    cs.CV cs.AI cs.LG

    MTLoRA: A Low-Rank Adaptation Approach for Efficient Multi-Task Learning

    Authors: Ahmed Agiza, Marina Neseem, Sherief Reda

    Abstract: Adapting models pre-trained on large-scale datasets to a variety of downstream tasks is a common strategy in deep learning. Consequently, parameter-efficient fine-tuning methods have emerged as a promising way to adapt pre-trained models to different tasks while training only a minimal number of parameters. While most of these methods are designed for single-task adaptation, parameter-efficient tr…

    Submitted 29 March, 2024; originally announced March 2024.

    Comments: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024

  3. arXiv:2304.08594

    cs.CV

    AdaMTL: Adaptive Input-dependent Inference for Efficient Multi-Task Learning

    Authors: Marina Neseem, Ahmed Agiza, Sherief Reda

    Abstract: Modern Augmented reality applications require performing multiple tasks on each input frame simultaneously. Multi-task learning (MTL) represents an effective approach where multiple tasks share an encoder to extract representative features from the input frame, followed by task-specific decoders to generate predictions for each task. Generally, the shared encoder in MTL models needs to have a larg…

    Submitted 17 April, 2023; originally announced April 2023.

    Comments: This paper will appear in the proceedings of CVPR 2023

  4. BitTrain: Sparse Bitmap Compression for Memory-Efficient Training on the Edge

    Authors: Abdelrahman Hosny, Marina Neseem, Sherief Reda

    Abstract: Training on the Edge enables neural networks to learn continuously from new data after deployment on memory-constrained edge devices. Previous work is mostly concerned with reducing the number of model parameters which is only beneficial for inference. However, memory footprint from activations is the main bottleneck for training on the edge. Existing incremental training methods fine-tune the las…

    Submitted 29 October, 2021; originally announced October 2021.

    Comments: 12 pages, 13 figures, to appear in the proceedings of The Sixth ACM/IEEE Symposium on Edge Computing (SEC 2021)

  5. arXiv:2108.06850

    cs.CV cs.AI

    AdaCon: Adaptive Context-Aware Object Detection for Resource-Constrained Embedded Devices

    Authors: Marina Neseem, Sherief Reda

    Abstract: Convolutional Neural Networks achieve state-of-the-art accuracy in object detection tasks. However, they have large computational and energy requirements that challenge their deployment on resource-constrained edge devices. Object detection takes an image as an input, and identifies the existing object classes as well as their locations in the image. In this paper, we leverage the prior knowledge…

    Submitted 15 August, 2021; originally announced August 2021.

    Comments: 9 pages, 6 figures, 2021 IEEE/ACM International Conference on Computer-Aided Design (ICCAD 2021)

  6. arXiv:2006.05884

    eess.SP cs.HC cs.LG

    AdaSense: Adaptive Low-Power Sensing and Activity Recognition for Wearable Devices

    Authors: Marina Neseem, Jon Nelson, Sherief Reda

    Abstract: Wearable devices have strict power and memory limitations. As a result, there is a need to optimize the power consumption on those devices without sacrificing the accuracy. This paper presents AdaSense: a sensing, feature extraction and classification co-optimized framework for Human Activity Recognition. The proposed techniques reduce the power consumption by dynamically switching among different…

    Submitted 10 June, 2020; originally announced June 2020.

    Comments: 6 pages, 7 figures, To appear in DAC 2020