Showing 1–7 of 7 results for author: Nooralinejad, P

  1. arXiv:2406.19301  [pdf, other]

    cs.LG

    MCNC: Manifold Constrained Network Compression

    Authors: Chayne Thrash, Ali Abbasi, Parsa Nooralinejad, Soroush Abbasi Koohpayegani, Reed Andreas, Hamed Pirsiavash, Soheil Kolouri

    Abstract: The outstanding performance of large foundational models across diverse tasks, from computer vision to speech and natural language processing, has significantly increased their demand. However, storing and transmitting these models pose significant challenges due to their massive size (e.g., 350GB for GPT-3). Recent literature has focused on compressing the original weights or reducing the number of…

    Submitted 27 June, 2024; originally announced June 2024.

  2. arXiv:2311.11995  [pdf, other]

    cs.LG cs.AI cs.CR

    BrainWash: A Poisoning Attack to Forget in Continual Learning

    Authors: Ali Abbasi, Parsa Nooralinejad, Hamed Pirsiavash, Soheil Kolouri

    Abstract: Continual learning has gained substantial attention within the deep learning community, offering promising solutions to the challenging problem of sequential learning. Yet, a largely unexplored facet of this paradigm is its susceptibility to adversarial attacks, especially with the aim of inducing forgetting. In this paper, we introduce "BrainWash," a novel data poisoning method tailored to impose…

    Submitted 23 November, 2023; v1 submitted 20 November, 2023; originally announced November 2023.

  3. arXiv:2310.02556  [pdf, other]

    cs.CL cs.CV

    NOLA: Compressing LoRA using Linear Combination of Random Basis

    Authors: Soroush Abbasi Koohpayegani, KL Navaneet, Parsa Nooralinejad, Soheil Kolouri, Hamed Pirsiavash

    Abstract: Fine-tuning Large Language Models (LLMs) and storing them for each downstream task or domain is impractical because of the massive model size (e.g., 350GB in GPT-3). Current literature, such as LoRA, showcases the potential of low-rank modifications to the original weights of an LLM, enabling efficient adaptation and storage for task-specific models. These methods can reduce the number of paramete…

    Submitted 29 April, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: ICLR 2024. Our code is available here: https://github.com/UCDvision/NOLA

  4. arXiv:2206.08464  [pdf, other]

    cs.LG

    PRANC: Pseudo RAndom Networks for Compacting deep models

    Authors: Parsa Nooralinejad, Ali Abbasi, Soroush Abbasi Koohpayegani, Kossar Pourahmadi Meibodi, Rana Muhammad Shahroz Khan, Soheil Kolouri, Hamed Pirsiavash

    Abstract: We demonstrate that a deep model can be reparametrized as a linear combination of several randomly initialized and frozen deep models in the weight space. During training, we seek local minima that reside within the subspace spanned by these random models (i.e., 'basis' networks). Our framework, PRANC, enables significant compaction of a deep model. The model can be reconstructed using a single sc…

    Submitted 28 August, 2023; v1 submitted 16 June, 2022; originally announced June 2022.

  5. arXiv:2203.06514  [pdf, other]

    cs.LG cs.AI cs.CV

    Sparsity and Heterogeneous Dropout for Continual Learning in the Null Space of Neural Activations

    Authors: Ali Abbasi, Parsa Nooralinejad, Vladimir Braverman, Hamed Pirsiavash, Soheil Kolouri

    Abstract: Continual/lifelong learning from a non-stationary input data stream is a cornerstone of intelligence. Despite their phenomenal performance in a wide variety of applications, deep neural networks are prone to forgetting their previously learned information upon learning new ones. This phenomenon is called "catastrophic forgetting" and is deeply rooted in the stability-plasticity dilemma. Overcoming…

    Submitted 8 July, 2022; v1 submitted 12 March, 2022; originally announced March 2022.

  6. arXiv:2110.12033  [pdf, other]

    cs.CV cs.LG

    A Simple Baseline for Low-Budget Active Learning

    Authors: Kossar Pourahmadi, Parsa Nooralinejad, Hamed Pirsiavash

    Abstract: Active learning focuses on choosing a subset of unlabeled data to be labeled. However, most such methods assume that a large subset of the data can be annotated. We are interested in low-budget active learning where only a small subset (e.g., 0.2% of ImageNet) can be annotated. Instead of proposing a new query strategy to iteratively sample batches of unlabeled data given an initial pool, we learn…

    Submitted 1 April, 2022; v1 submitted 22 October, 2021; originally announced October 2021.

    Comments: 20 pages, 16 tables; additional experiments

  7. TaxoNN: A Light-Weight Accelerator for Deep Neural Network Training

    Authors: Reza Hojabr, Kamyar Givaki, Kossar Pourahmadi, Parsa Nooralinejad, Ahmad Khonsari, Dara Rahmati, M. Hassan Najafi

    Abstract: Emerging intelligent embedded devices rely on Deep Neural Networks (DNNs) to be able to interact with the real-world environment. This interaction comes with the ability to retrain DNNs, since environmental conditions change continuously in time. Stochastic Gradient Descent (SGD) is a widely used algorithm to train DNNs by optimizing the parameters over the training data iteratively. In this work,…

    Submitted 11 October, 2020; originally announced October 2020.

    Comments: Accepted to ISCAS 2020. 5 pages, 5 figures

    Journal ref: 2020 IEEE International Symposium on Circuits and Systems (ISCAS), 2020, pp. 1-5