Skip to main content

Showing 1–18 of 18 results for author: Asadi, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.15414  [pdf, other

    cs.LG cs.CV

    Does Combining Parameter-efficient Modules Improve Few-shot Transfer Accuracy?

    Authors: Nader Asadi, Mahdi Beitollahi, Yasser Khalil, Yinchuan Li, Guojun Zhang, Xi Chen

    Abstract: Parameter-efficient fine-tuning stands as the standard for efficiently fine-tuning large language and vision models on downstream tasks. Specifically, the efficiency of low-rank adaptation has facilitated the creation and sharing of hundreds of custom LoRA modules, each trained on distinct data from various downstream tasks. In this paper, we explore the composability of LoRA modules, examining if… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

  2. arXiv:2402.01863  [pdf, other

    cs.LG cs.AI cs.DC

    DFML: Decentralized Federated Mutual Learning

    Authors: Yasser H. Khalil, Amir H. Estiri, Mahdi Beitollahi, Nader Asadi, Sobhan Hemati, Xu Li, Guojun Zhang, Xi Chen

    Abstract: In the realm of real-world devices, centralized servers in Federated Learning (FL) present challenges including communication bottlenecks and susceptibility to a single point of failure. Additionally, contemporary devices inherently exhibit model and data heterogeneity. Existing work lacks a Decentralized FL (DFL) framework capable of accommodating such heterogeneity without imposing architectural… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

  3. arXiv:2311.12091  [pdf, other

    cs.CV

    DAS: A Deformable Attention to Capture Salient Information in CNNs

    Authors: Farzad Salajegheh, Nader Asadi, Soroush Saryazdi, Sudhir Mudur

    Abstract: Convolutional Neural Networks (CNNs) excel in local spatial pattern recognition. For many vision tasks, such as object recognition and segmentation, salient information is also present outside CNN's kernel boundaries. However, CNNs struggle in capturing such relevant information due to their confined receptive fields. Self-attention can improve a model's access to global information but increases… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

  4. arXiv:2303.14771  [pdf, other

    cs.LG

    Prototype-Sample Relation Distillation: Towards Replay-Free Continual Learning

    Authors: Nader Asadi, MohammadReza Davari, Sudhir Mudur, Rahaf Aljundi, Eugene Belilovsky

    Abstract: In Continual learning (CL) balancing effective adaptation while combating catastrophic forgetting is a central challenge. Many of the recent best-performing methods utilize various forms of prior task data, e.g. a replay buffer, to tackle the catastrophic forgetting problem. Having access to previous task data can be restrictive in many real-world scenarios, for example when task data is sensitive… ▽ More

    Submitted 6 June, 2023; v1 submitted 26 March, 2023; originally announced March 2023.

    Comments: Accepted at ICML 2023

  5. arXiv:2301.01758  [pdf, other

    cs.DC cs.CV

    An Ensemble Mobile-Cloud Computing Method for Affordable and Accurate Glucometer Readout

    Authors: Navidreza Asadi, Maziar Goudarzi

    Abstract: Despite essential efforts towards advanced wireless medical devices for regular monitoring of blood properties, many such devices are not available or not affordable for everyone in many countries. Alternatively using ordinary devices, patients ought to log data into a mobile health-monitoring manually. It causes several issues: (1) clients reportedly tend to enter unrealistic data; (2) ty** val… ▽ More

    Submitted 4 January, 2023; originally announced January 2023.

    Comments: 12 pages, 12 figures, 8 tables

  6. arXiv:2211.06982  [pdf, ps, other

    cs.PF cs.LG cs.SD eess.AS

    FullPack: Full Vector Utilization for Sub-Byte Quantized Inference on General Purpose CPUs

    Authors: Hossein Katebi, Navidreza Asadi, Maziar Goudarzi

    Abstract: Although prior art has demonstrated negligible accuracy drop in sub-byte quantization -- where weights and/or activations are represented by less than 8 bits -- popular SIMD instructions of CPUs do not natively support these datatypes. While recent methods, such as ULPPACK, are already using sub-byte quantization on general-purpose CPUs with vector units, they leave out several empty bits between… ▽ More

    Submitted 20 November, 2022; v1 submitted 13 November, 2022; originally announced November 2022.

  7. Variant Parallelism: Lightweight Deep Convolutional Models for Distributed Inference on IoT Devices

    Authors: Navidreza Asadi, Maziar Goudarzi

    Abstract: Two major techniques are commonly used to meet real-time inference limitations when distributing models across resource-constrained IoT devices: (1) model parallelism (MP) and (2) class parallelism (CP). In MP, transmitting bulky intermediate data (orders of magnitude larger than input) between devices imposes huge communication overhead. Although CP solves this problem, it has limitations on the… ▽ More

    Submitted 11 June, 2023; v1 submitted 15 October, 2022; originally announced October 2022.

    Comments: 8 pages, 6 figures, 7 tables

  8. arXiv:2203.13381  [pdf, other

    cs.LG cs.AI cs.CV

    Probing Representation Forgetting in Supervised and Unsupervised Continual Learning

    Authors: MohammadReza Davari, Nader Asadi, Sudhir Mudur, Rahaf Aljundi, Eugene Belilovsky

    Abstract: Continual Learning research typically focuses on tackling the phenomenon of catastrophic forgetting in neural networks. Catastrophic forgetting is associated with an abrupt loss of knowledge previously learned by a model when the task, or more broadly the data distribution, being trained on changes. In supervised learning problems this forgetting, resulting from a change in the model's representat… ▽ More

    Submitted 5 April, 2022; v1 submitted 24 March, 2022; originally announced March 2022.

    Comments: Accepted at CVPR 2022

  9. arXiv:2203.13307  [pdf, other

    cs.LG cs.AI

    Tackling Online One-Class Incremental Learning by Removing Negative Contrasts

    Authors: Nader Asadi, Sudhir Mudur, Eugene Belilovsky

    Abstract: Recent work studies the supervised online continual learning setting where a learner receives a stream of data whose class distribution changes over time. Distinct from other continual learning settings the learner is presented new samples only once and must distinguish between all seen classes. A number of successful methods in this setting focus on storing and replaying a subset of samples along… ▽ More

    Submitted 24 March, 2022; originally announced March 2022.

    Comments: Accepted at NeurIPS 2021 Workshop on Distribution Shifts

  10. arXiv:2203.03798   

    cs.LG cs.AI

    New Insights on Reducing Abrupt Representation Change in Online Continual Learning

    Authors: Lucas Caccia, Rahaf Aljundi, Nader Asadi, Tinne Tuytelaars, Joelle Pineau, Eugene Belilovsky

    Abstract: In the online continual learning paradigm, agents must learn from a changing distribution while respecting memory and compute constraints. Experience Replay (ER), where a small subset of past data is stored and replayed alongside new data, has emerged as a simple and effective learning strategy. In this work, we focus on the change in representations of observed data that arises when previously un… ▽ More

    Submitted 25 April, 2022; v1 submitted 7 March, 2022; originally announced March 2022.

    Comments: This has been withdrawn as it is a new version of arXiv:2104.05025

  11. Identifying and correcting invalid citations due to DOI errors in Crossref data

    Authors: Alessia Cioffi, Sara Coppini, Arcangelo Massari, Arianna Moretti, Silvio Peroni, Cristian Santini, Nooshin Shahidzadeh Asadi

    Abstract: This work aims to identify classes of DOI mistakes by analysing the open bibliographic metadata available in Crossref, highlighting which publishers were responsible for such mistakes and how many of these incorrect DOIs could be corrected through automatic processes. By using a list of invalid cited DOIs gathered by OpenCitations while processing the OpenCitations Index of Crossref open DOI-to-DO… ▽ More

    Submitted 7 March, 2022; v1 submitted 22 November, 2021; originally announced November 2021.

    Journal ref: Scientometrics 127, 3593-3612 (2022)

  12. arXiv:2104.05025  [pdf, other

    cs.LG

    New Insights on Reducing Abrupt Representation Change in Online Continual Learning

    Authors: Lucas Caccia, Rahaf Aljundi, Nader Asadi, Tinne Tuytelaars, Joelle Pineau, Eugene Belilovsky

    Abstract: In the online continual learning paradigm, agents must learn from a changing distribution while respecting memory and compute constraints. Experience Replay (ER), where a small subset of past data is stored and replayed alongside new data, has emerged as a simple and effective learning strategy. In this work, we focus on the change in representations of observed data that arises when previously un… ▽ More

    Submitted 2 May, 2022; v1 submitted 11 April, 2021; originally announced April 2021.

    Comments: Accepted at ICLR 2022. Code available at www.github.com/pclucas14/AML

  13. arXiv:1909.08245  [pdf, other

    cs.CV

    Towards Shape Biased Unsupervised Representation Learning for Domain Generalization

    Authors: Nader Asadi, Amir M. Sarfi, Mehrdad Hosseinzadeh, Zahra Karimpour, Mahdi Eftekhari

    Abstract: It is known that, without awareness of the process, our brain appears to focus on the general shape of objects rather than superficial statistics of context. On the other hand, learning autonomously allows discovering invariant regularities which help generalization. In this work, we propose a learning framework to improve the shape bias property of self-supervised methods. Our method learns seman… ▽ More

    Submitted 29 March, 2020; v1 submitted 18 September, 2019; originally announced September 2019.

    Comments: Under review

  14. arXiv:1907.01023  [pdf, other

    cs.CV cs.LG

    Diminishing the Effect of Adversarial Perturbations via Refining Feature Representation

    Authors: Nader Asadi, AmirMohammad Sarfi, Mehrdad Hosseinzadeh, Sahba Tahsini, Mahdi Eftekhari

    Abstract: Deep neural networks are highly vulnerable to adversarial examples, which imposes severe security issues for these state-of-the-art models. Many defense methods have been proposed to mitigate this problem. However, a lot of them depend on modification or additional training of the target model. In this work, we analytically investigate each layer's representation of non-perturbed and perturbed ima… ▽ More

    Submitted 1 October, 2019; v1 submitted 1 July, 2019; originally announced July 2019.

    Comments: Accepted at NeuralIPS 2019 workshop on Safety and Robustness in Decision Making

  15. arXiv:1305.0699  [pdf, other

    cs.IR cs.DB

    Fast, Incremental Inverted Indexing in Main Memory for Web-Scale Collections

    Authors: Nima Asadi, Jimmy Lin

    Abstract: For text retrieval systems, the assumption that all data structures reside in main memory is increasingly common. In this context, we present a novel incremental inverted indexing algorithm for web-scale collections that directly constructs compressed postings lists in memory. Designing efficient in-memory algorithms requires understanding modern processor architectures and memory hierarchies: in… ▽ More

    Submitted 3 May, 2013; originally announced May 2013.

  16. arXiv:1302.5302  [pdf, other

    cs.IR cs.DB

    Dynamic Memory Allocation Policies for Postings in Real-Time Twitter Search

    Authors: Nima Asadi, Jimmy Lin, Michael Busch

    Abstract: We explore a real-time Twitter search application where tweets are arriving at a rate of several thousands per second. Real-time search demands that they be indexed and searchable immediately, which leads to a number of implementation challenges. In this paper, we focus on one aspect: dynamic postings allocation policies for index structures that are completely held in main memory. The core issue… ▽ More

    Submitted 21 February, 2013; originally announced February 2013.

  17. arXiv:1212.2287  [pdf, other

    cs.DB cs.IR cs.LG

    Runtime Optimizations for Prediction with Tree-Based Models

    Authors: Nima Asadi, Jimmy Lin, Arjen P. de Vries

    Abstract: Tree-based models have proven to be an effective solution for web ranking as well as other problems in diverse domains. This paper focuses on optimizing the runtime performance of applying such models to make predictions, given an already-trained model. Although exceedingly simple conceptually, most implementations of tree-based models do not efficiently utilize modern superscalar processor archit… ▽ More

    Submitted 26 April, 2013; v1 submitted 10 December, 2012; originally announced December 2012.

  18. arXiv:1107.3765  [pdf, other

    cs.AI cs.DC

    Using Variational Inference and MapReduce to Scale Topic Modeling

    Authors: Ke Zhai, Jordan Boyd-Graber, Nima Asadi

    Abstract: Latent Dirichlet Allocation (LDA) is a popular topic modeling technique for exploring document collections. Because of the increasing prevalence of large datasets, there is a need to improve the scalability of inference of LDA. In this paper, we propose a technique called ~\emph{MapReduce LDA} (Mr. LDA) to accommodate very large corpus collections in the MapReduce framework. In contrast to other t… ▽ More

    Submitted 19 July, 2011; originally announced July 2011.