Skip to main content

Showing 1–3 of 3 results for author: Dutta, O

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.05276  [pdf, other

    cs.LG

    VTrans: Accelerating Transformer Compression with Variational Information Bottleneck based Pruning

    Authors: Oshin Dutta, Ritvik Gupta, Sumeet Agarwal

    Abstract: In recent years, there has been a growing emphasis on compressing large pre-trained transformer models for resource-constrained devices. However, traditional pruning methods often leave the embedding layer untouched, leading to model over-parameterization. Additionally, they require extensive compression time with large datasets to maintain performance in pruned models. To address these challenges… ▽ More

    Submitted 11 June, 2024; v1 submitted 7 June, 2024; originally announced June 2024.

  2. arXiv:2307.04443  [pdf, other

    cs.CV cs.LG

    Search-time Efficient Device Constraints-Aware Neural Architecture Search

    Authors: Oshin Dutta, Tanu Kanvar, Sumeet Agarwal

    Abstract: Edge computing aims to enable edge devices, such as IoT devices, to process data locally instead of relying on the cloud. However, deep learning techniques like computer vision and natural language processing can be computationally expensive and memory-intensive. Creating manual architectures specialized for each device is infeasible due to their varying memory and computational constraints. To ad… ▽ More

    Submitted 10 July, 2023; originally announced July 2023.

    Comments: Accepted to 10th International Conference on Pattern Recognition and Machine Intelligence (PReMI) 2023

  3. arXiv:2010.01343  [pdf, other

    cs.CV cs.IT cs.LG

    A Variational Information Bottleneck Based Method to Compress Sequential Networks for Human Action Recognition

    Authors: Ayush Srivastava, Oshin Dutta, Prathosh AP, Sumeet Agarwal, Jigyasa Gupta

    Abstract: In the last few years, compression of deep neural networks has become an important strand of machine learning and computer vision research. Deep models require sizeable computational complexity and storage, when used for instance for Human Action Recognition (HAR) from videos, making them unsuitable to be deployed on edge devices. In this paper, we address this issue and propose a method to effect… ▽ More

    Submitted 9 November, 2020; v1 submitted 3 October, 2020; originally announced October 2020.

    Comments: Accepted at IEEE WACV 2021