Showing 1–5 of 5 results for author: Edalati, A

  1. arXiv:2405.15025  [pdf, other]

    cs.LG cs.CL

    OAC: Output-adaptive Calibration for Accurate Post-training Quantization

    Authors: Ali Edalati, Alireza Ghaffari, Masoud Asgharian, Lu Hou, Boxing Chen, Vahid Partovi Nia

    Abstract: Deployment of Large Language Models (LLMs) has major computational costs, due to their rapidly expanding size. Compression of LLMs reduces the memory footprint, latency, and energy required for their inference. Post-training Quantization (PTQ) techniques have been developed to compress LLMs while avoiding expensive re-training. Most PTQ approaches formulate the quantization error based on a layer-…

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: 20 pages, 4 figures

  2. arXiv:2401.13212  [pdf, other]

    cs.CV cs.AI cs.LG

    AdCorDA: Classifier Refinement via Adversarial Correction and Domain Adaptation

    Authors: Lulan Shen, Ali Edalati, Brett Meyer, Warren Gross, James J. Clark

    Abstract: This paper describes a simple yet effective technique for refining a pretrained classifier network. The proposed AdCorDA method is based on modification of the training set and making use of the duality between network weights and layer inputs. We call this input space training. The method consists of two stages - adversarial correction followed by domain adaptation. Adversarial correction uses ad…

    Submitted 23 January, 2024; originally announced January 2024.

  3. arXiv:2401.12014  [pdf, other]

    cs.LG cs.AI cs.CV

    Robustness to distribution shifts of compressed networks for edge devices

    Authors: Lulan Shen, Ali Edalati, Brett Meyer, Warren Gross, James J. Clark

    Abstract: It is necessary to develop efficient DNNs deployed on edge devices with limited computation resources. However, the compressed networks often execute new tasks in the target domain, which is different from the source domain where the original network is trained. It is important to investigate the robustness of compressed networks in two types of data distribution shifts: domain shifts and adversar…

    Submitted 22 January, 2024; originally announced January 2024.

  4. arXiv:2212.10650  [pdf, other]

    cs.CL

    KronA: Parameter Efficient Tuning with Kronecker Adapter

    Authors: Ali Edalati, Marzieh Tahaei, Ivan Kobyzev, Vahid Partovi Nia, James J. Clark, Mehdi Rezagholizadeh

    Abstract: Fine-tuning a Pre-trained Language Model (PLM) on a specific downstream task has been a well-known paradigm in Natural Language Processing. However, with the ever-growing size of PLMs, training the entire model on several downstream tasks becomes very expensive and resource-hungry. Recently, different Parameter Efficient Tuning (PET) techniques are proposed to improve the efficiency of fine-tuning…

    Submitted 20 December, 2022; originally announced December 2022.

  5. arXiv:2110.08152  [pdf, ps, other]

    cs.CL

    Kronecker Decomposition for GPT Compression

    Authors: Ali Edalati, Marzieh Tahaei, Ahmad Rashid, Vahid Partovi Nia, James J. Clark, Mehdi Rezagholizadeh

    Abstract: GPT is an auto-regressive Transformer-based pre-trained language model which has attracted a lot of attention in the natural language processing (NLP) domain due to its state-of-the-art performance in several downstream tasks. The success of GPT is mostly attributed to its pre-training on huge amount of data and its large number of parameters (from ~100M to billions of parameters). Despite the sup…

    Submitted 15 October, 2021; originally announced October 2021.