Skip to main content

Showing 1–50 of 64 results for author: Rostami, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.16098  [pdf, other

    cs.CV

    Lateralization MLP: A Simple Brain-inspired Architecture for Diffusion

    Authors: Zizhao Hu, Mohammad Rostami

    Abstract: The Transformer architecture has dominated machine learning in a wide range of tasks. The specific characteristic of this architecture is an expensive scaled dot-product attention mechanism that models the inter-token interactions, which is known to be the reason behind its success. However, such a mechanism does not have a direct parallel to the human brain which brings the question if the scaled… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

  2. arXiv:2404.06856  [pdf, other

    cs.SE cs.AR cs.CR cs.LG

    Beyond Random Inputs: A Novel ML-Based Hardware Fuzzing

    Authors: Mohamadreza Rostami, Marco Chilese, Shaza Zeitouni, Rahul Kande, Jeyavijayan Rajendran, Ahmad-Reza Sadeghi

    Abstract: Modern computing systems heavily rely on hardware as the root of trust. However, their increasing complexity has given rise to security-critical vulnerabilities that cross-layer at-tacks can exploit. Traditional hardware vulnerability detection methods, such as random regression and formal verification, have limitations. Random regression, while scalable, is slow in exploring hardware, and formal… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

  3. arXiv:2403.16530  [pdf, other

    cs.CV cs.AI

    An Intermediate Fusion ViT Enables Efficient Text-Image Alignment in Diffusion Models

    Authors: Zizhao Hu, Shaochong Jia, Mohammad Rostami

    Abstract: Diffusion models have been widely used for conditional data cross-modal generation tasks such as text-to-image and text-to-video. However, state-of-the-art models still fail to align the generated visual concepts with high-level semantics in a language such as object count, spatial relationship, etc. We approach this problem from a multimodal data fusion perspective and investigate how different f… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

  4. arXiv:2403.16188  [pdf, other

    cs.CV

    Cross-domain Multi-modal Few-shot Object Detection via Rich Text

    Authors: Zeyu Shangguan, Daniel Seita, Mohammad Rostami

    Abstract: Cross-modal feature extraction and integration have led to steady performance improvements in few-shot learning tasks due to generating richer features. However, existing multi-modal object detection (MM-OD) methods degrade when facing significant domain-shift and are sample insufficient. We hypothesize that rich text information could more effectively help the model to build a knowledge relations… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

  5. arXiv:2403.12511  [pdf, other

    cs.LG math.OC

    Forward Gradient-Based Frank-Wolfe Optimization for Memory Efficient Deep Neural Network Training

    Authors: M. Rostami, S. S. Kia

    Abstract: Training a deep neural network using gradient-based methods necessitates the calculation of gradients at each level. However, using backpropagation or reverse mode differentiation, to calculate the gradients necessities significant memory consumption, rendering backpropagation an inefficient method for computing gradients. This paper focuses on analyzing the performance of the well-known Frank-Wol… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

  6. arXiv:2403.03018  [pdf, other

    cs.LG q-bio.GN

    CRISPR: Ensemble Model

    Authors: Mohammad Rostami, Amin Ghariyazi, Hamed Dashti, Mohammad Hossein Rohban, Hamid R. Rabiee

    Abstract: Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) is a gene editing technology that has revolutionized the fields of biology and medicine. However, one of the challenges of using CRISPR is predicting the on-target efficacy and off-target sensitivity of single-guide RNAs (sgRNAs). This is because most existing methods are trained on separate datasets with different genes and cells,… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

  7. arXiv:2402.18599  [pdf, other

    cs.LG cs.AI

    Meta-Tasks: An alternative view on Meta-Learning Regularization

    Authors: Mohammad Rostami, Atik Faysal, Huaxia Wang, Avimanyu Sahoo, Ryan Antle

    Abstract: Few-shot learning (FSL) is a challenging machine learning problem due to a scarcity of labeled data. The ability to generalize effectively on both novel and training tasks is a significant barrier to FSL. This paper proposes a novel solution that can generalize to both training and novel tasks while also utilizing unlabeled samples. The method refines the embedding model before updating the outer… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

  8. arXiv:2402.03704  [pdf, other

    cs.CR

    WhisperFuzz: White-Box Fuzzing for Detecting and Locating Timing Vulnerabilities in Processors

    Authors: Pallavi Borkar, Chen Chen, Mohamadreza Rostami, Nikhilesh Singh, Rahul Kande, Ahmad-Reza Sadeghi, Chester Rebeiro, Jeyavijayan Rajendran

    Abstract: Timing vulnerabilities in processors have emerged as a potent threat. As processors are the foundation of any computing system, identifying these flaws is imperative. Recently fuzzing techniques, traditionally used for detecting software vulnerabilities, have shown promising results for uncovering vulnerabilities in large-scale hardware designs, such as processors. Researchers have adapted black-b… ▽ More

    Submitted 14 March, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: Accepted to USENIX Sec'24

  9. arXiv:2402.00580  [pdf, other

    cs.LG cs.CV

    Continuous Unsupervised Domain Adaptation Using Stabilized Representations and Experience Replay

    Authors: Mohammad Rostami

    Abstract: We introduce an algorithm for tackling the problem of unsupervised domain adaptation (UDA) in continual learning (CL) scenarios. The primary objective is to maintain model generalization under domain shift when new domains arrive continually through updating a base model when only unlabeled data is accessible in subsequent tasks. While there are many existing UDA algorithms, they typically require… ▽ More

    Submitted 31 January, 2024; originally announced February 2024.

  10. arXiv:2401.15275  [pdf, other

    cs.CV

    Dynamic Transformer Architecture for Continual Learning of Multimodal Tasks

    Authors: Yuliang Cai, Mohammad Rostami

    Abstract: Transformer neural networks are increasingly replacing prior architectures in a wide range of applications in different data modalities. The increasing size and computational demands of fine-tuning large pre-trained transformer neural networks pose significant challenges for the widespread adoption of these models for applications that demand on-edge computing. To tackle this challenge, continual… ▽ More

    Submitted 26 January, 2024; originally announced January 2024.

  11. arXiv:2401.07207  [pdf, other

    cs.CV

    Unsupervised Domain Adaptation Using Compact Internal Representations

    Authors: Mohammad Rostami

    Abstract: A major technique for tackling unsupervised domain adaptation involves map** data points from both the source and target domains into a shared embedding space. The map** encoder to the embedding space is trained such that the embedding space becomes domain agnostic, allowing a classifier trained on the source domain to generalize well on the target domain. To further enhance the performance of… ▽ More

    Submitted 14 January, 2024; originally announced January 2024.

  12. arXiv:2401.02941  [pdf, other

    cs.CV cs.AI

    Unsupervised Federated Domain Adaptation for Segmentation of MRI Images

    Authors: Navapat Nananukul, Hamid Soltanian-zadeh, Mohammad Rostami

    Abstract: Automatic semantic segmentation of magnetic resonance imaging (MRI) images using deep neural networks greatly assists in evaluating and planning treatments for various clinical applications. However, training these models is conditioned on the availability of abundant annotated data to implement the end-to-end supervised learning procedure. Even if we annotate enough data, MRI images display consi… ▽ More

    Submitted 13 January, 2024; v1 submitted 1 January, 2024; originally announced January 2024.

  13. arXiv:2401.01042  [pdf, other

    cs.CV

    Relating Events and Frames Based on Self-Supervised Learning and Uncorrelated Conditioning for Unsupervised Domain Adaptation

    Authors: Mohammad Rostami, Dayuan Jian

    Abstract: Event-based cameras provide accurate and high temporal resolution measurements for performing computer vision tasks in challenging scenarios, such as high-dynamic range environments and fast-motion maneuvers. Despite their advantages, utilizing deep learning for event-based vision encounters a significant obstacle due to the scarcity of annotated data caused by the relatively recent emergence of e… ▽ More

    Submitted 2 January, 2024; originally announced January 2024.

  14. arXiv:2401.01035  [pdf, other

    cs.CV

    Online Continual Domain Adaptation for Semantic Image Segmentation Using Internal Representations

    Authors: Serban Stan, Mohammad Rostami

    Abstract: Semantic segmentation models trained on annotated data fail to generalize well when the input data distribution changes over extended time period, leading to requiring re-training to maintain performance. Classic Unsupervised domain adaptation (UDA) attempts to address a similar problem when there is target domain with no annotated data points through transferring knowledge from a source domain wi… ▽ More

    Submitted 1 January, 2024; originally announced January 2024.

  15. arXiv:2311.16488  [pdf, other

    cs.CV cs.AI

    Efficient Multimodal Diffusion Models Using Joint Data Infilling with Partially Shared U-Net

    Authors: Zizhao Hu, Shaochong Jia, Mohammad Rostami

    Abstract: Recently, diffusion models have been used successfully to fit distributions for cross-modal data translation and multimodal data generation. However, these methods rely on extensive scaling, overlooking the inefficiency and interference between modalities. We develop Partially Shared U-Net (PS-U-Net) architecture which is an efficient multimodal diffusion model that allows text and image inputs to… ▽ More

    Submitted 27 November, 2023; originally announced November 2023.

  16. arXiv:2310.13085  [pdf, other

    cs.LG cs.AI

    Unsupervised Representation Learning to Aid Semi-Supervised Meta Learning

    Authors: Atik Faysal, Mohammad Rostami, Huaxia Wang, Avimanyu Sahoo, Ryan Antle

    Abstract: Few-shot learning or meta-learning leverages the data scarcity problem in machine learning. Traditionally, training data requires a multitude of samples and labeling for supervised learning. To address this issue, we propose a one-shot unsupervised meta-learning to learn the latent representation of the training samples. We use augmented samples as the query set during the training phase of the un… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

  17. arXiv:2310.07925  [pdf, other

    math.OC cs.LG

    First-Order Dynamic Optimization for Streaming Convex Costs

    Authors: M. Rostami, H. Moradian, S. S. Kia

    Abstract: This paper proposes a set of novel optimization algorithms for solving a class of convex optimization problems with time-varying streaming cost function. We develop an approach to track the optimal solution with a bounded error. Unlike the existing results, our algorithm is executed only by using the first-order derivatives of the cost function which makes it computationally efficient for optimiza… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

  18. arXiv:2310.03898  [pdf, other

    cs.LG

    Class-Incremental Learning Using Generative Experience Replay Based on Time-aware Regularization

    Authors: Zizhao Hu, Mohammad Rostami

    Abstract: Learning new tasks accumulatively without forgetting remains a critical challenge in continual learning. Generative experience replay addresses this challenge by synthesizing pseudo-data points for past learned tasks and later replaying them for concurrent training along with the new tasks' data. Generative replay is the best strategy for continual learning under a strict class-incremental setting… ▽ More

    Submitted 5 October, 2023; originally announced October 2023.

  19. arXiv:2309.15522  [pdf, ps, other

    cs.LG cs.AI

    Robust Internal Representations for Domain Generalization

    Authors: Mohammad Rostami

    Abstract: This paper which is part of the New Faculty Highlights Invited Speaker Program of AAAI'23, serves as a comprehensive survey of my research in transfer learning by utilizing embedding spaces. The work reviewed in this paper specifically revolves around the inherent challenges associated with continual learning and limited availability of labeled data. By providing an overview of my past and ongoing… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

    Comments: to appear in AI Magazine Winter 2023 Issue

  20. arXiv:2308.07535  [pdf, other

    cs.CV

    Improved Region Proposal Network for Enhanced Few-Shot Object Detection

    Authors: Zeyu Shangguan, Mohammad Rostami

    Abstract: Despite significant success of deep learning in object detection tasks, the standard training of deep neural networks requires access to a substantial quantity of annotated images across all classes. Data annotation is an arduous and time-consuming endeavor, particularly when dealing with infrequent objects. Few-shot object detection (FSOD) methods have emerged as a solution to the limitations of… ▽ More

    Submitted 14 August, 2023; originally announced August 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2303.10422

  21. arXiv:2305.18675  [pdf, other

    cs.LG cs.AI cs.CL

    History Repeats: Overcoming Catastrophic Forgetting For Event-Centric Temporal Knowledge Graph Completion

    Authors: Mehrnoosh Mirtaheri, Mohammad Rostami, Aram Galstyan

    Abstract: Temporal knowledge graph (TKG) completion models typically rely on having access to the entire graph during training. However, in real-world scenarios, TKG data is often received incrementally as events unfold, leading to a dynamic non-stationary data distribution over time. While one could incorporate fine-tuning to existing methods to allow them to adapt to evolving TKG data, this can lead to fo… ▽ More

    Submitted 29 May, 2023; originally announced May 2023.

    Comments: 14 pages, 6 figures

    ACM Class: I.2.6; I.2.7

  22. arXiv:2305.18433  [pdf, other

    cs.LG cs.CV

    Cognitively Inspired Cross-Modal Data Generation Using Diffusion Models

    Authors: Zizhao Hu, Mohammad Rostami

    Abstract: Most existing cross-modal generative methods based on diffusion models use guidance to provide control over the latent space to enable conditional generation across different modalities. Such methods focus on providing guidance through separately-trained models, each for one modality. As a result, these methods suffer from cross-modal information loss and are limited to unidirectional conditional… ▽ More

    Submitted 28 May, 2023; originally announced May 2023.

  23. arXiv:2304.02769  [pdf, other

    cs.AI

    Low-Shot Learning for Fictional Claim Verification

    Authors: Viswanath Chadalapaka, Derek Nguyen, JoonWon Choi, Shaunak Joshi, Mohammad Rostami

    Abstract: In this paper, we study the problem of claim verification in the context of claims about fictional stories in a low-shot learning setting. To this end, we generate two synthetic datasets and then develop an end-to-end pipeline and model that is tested on both benchmarks. To test the efficacy of our pipeline and the difficulty of benchmarks, we compare our models' results against human and random a… ▽ More

    Submitted 5 April, 2023; originally announced April 2023.

    Comments: 6 pages

  24. arXiv:2304.02168  [pdf, other

    cs.CL cs.AI cs.CV cs.LG

    I2I: Initializing Adapters with Improvised Knowledge

    Authors: Tejas Srinivasan, Furong Jia, Mohammad Rostami, Jesse Thomason

    Abstract: Adapters present a promising solution to the catastrophic forgetting problem in continual learning. However, training independent Adapter modules for every new task misses an opportunity for cross-task knowledge transfer. We propose Improvise to Initialize (I2I), a continual learning algorithm that initializes Adapters for incoming tasks by distilling knowledge from previously-learned tasks' Adapt… ▽ More

    Submitted 10 July, 2023; v1 submitted 4 April, 2023; originally announced April 2023.

    Comments: Accepted at 2nd Conference on Lifelong Learning Agents (CoLLAs), 2023

  25. arXiv:2303.14615  [pdf, other

    cs.LG cs.CV

    Explainable Artificial Intelligence Architecture for Melanoma Diagnosis Using Indicator Localization and Self-Supervised Learning

    Authors: Ruitong Sun, Mohammad Rostami

    Abstract: Melanoma is a prevalent lethal type of cancer that is treatable if diagnosed at early stages of development. Skin lesions are a typical indicator for diagnosing melanoma but they often led to delayed diagnosis due to high similarities of cancerous and benign lesions at early stages of melanoma. Deep learning (DL) can be used as a solution to classify skin lesion pictures with a high accuracy, but… ▽ More

    Submitted 25 March, 2023; originally announced March 2023.

  26. arXiv:2303.14423  [pdf, other

    cs.LG

    Task-Attentive Transformer Architecture for Continual Learning of Vision-and-Language Tasks Using Knowledge Distillation

    Authors: Yuliang Cai, Jesse Thomason, Mohammad Rostami

    Abstract: The size and the computational load of fine-tuning large-scale pre-trained neural network are becoming two major obstacles in adopting machine learning in many applications. Continual learning (CL) can serve as a remedy through enabling knowledge-transfer across sequentially arriving tasks which relaxes the need to fine-tune all network weights from scratch. However, existing CL algorithms primari… ▽ More

    Submitted 25 March, 2023; originally announced March 2023.

  27. arXiv:2303.12424  [pdf, other

    cs.CV cs.LG

    Unsupervised Domain Adaptation for Training Event-Based Networks Using Contrastive Learning and Uncorrelated Conditioning

    Authors: Dayuan Jian, Mohammad Rostami

    Abstract: Event-based cameras offer reliable measurements for preforming computer vision tasks in high-dynamic range environments and during fast motion maneuvers. However, adopting deep learning in event-based vision faces the challenge of annotated data scarcity due to recency of event cameras. Transferring the knowledge that can be obtained from conventional camera annotated data offers a practical solut… ▽ More

    Submitted 22 March, 2023; originally announced March 2023.

  28. arXiv:2303.12255  [pdf, other

    cs.LG cs.CV

    Encoding Binary Concepts in the Latent Space of Generative Models for Enhancing Data Representation

    Authors: Zizhao Hu, Mohammad Rostami

    Abstract: Binary concepts are empirically used by humans to generalize efficiently. And they are based on Bernoulli distribution which is the building block of information. These concepts span both low-level and high-level features such as "large vs small" and "a neuron is active or inactive". Binary concepts are ubiquitous features and can be used to transfer knowledge to improve model generalization. We p… ▽ More

    Submitted 21 March, 2023; originally announced March 2023.

  29. arXiv:2303.10422  [pdf, other

    cs.CV cs.AI

    Identification of Novel Classes for Improving Few-Shot Object Detection

    Authors: Zeyu Shangguan, Mohammad Rostami

    Abstract: Conventional training of deep neural networks requires a large number of the annotated image which is a laborious and time-consuming task, particularly for rare objects. Few-shot object detection (FSOD) methods offer a remedy by realizing robust object detection using only a few training samples per class. An unexplored challenge for FSOD is that instances from unlabeled novel classes that do not… ▽ More

    Submitted 18 March, 2023; originally announced March 2023.

  30. arXiv:2301.12369  [pdf, other

    cs.LG

    Preserving Fairness in AI under Domain Shift

    Authors: Serban Stan, Mohammad Rostami

    Abstract: Existing algorithms for ensuring fairness in AI use a single-shot training strategy, where an AI model is trained on an annotated training dataset with sensitive attributes and then fielded for utilization. This training strategy is effective in problems with stationary distributions, where both training and testing data are drawn from the same distribution. However, it is vulnerable with respect… ▽ More

    Submitted 29 January, 2023; originally announced January 2023.

  31. arXiv:2301.12361  [pdf, other

    cs.LG

    Graph Harmony: Denoising and Nuclear-Norm Wasserstein Adaptation for Enhanced Domain Transfer in Graph-Structured Data

    Authors: Mengxi Wu, Mohammad Rostami

    Abstract: Graph-structured data can be found in numerous domains, yet the scarcity of labeled instances hinders its effective utilization of deep learning in many scenarios. Traditional unsupervised domain adaptation (UDA) strategies for graphs primarily hinge on adversarial learning and pseudo-labeling. These approaches fail to effectively leverage graph discriminative features, leading to class mismatchin… ▽ More

    Submitted 12 December, 2023; v1 submitted 29 January, 2023; originally announced January 2023.

  32. arXiv:2211.00807  [pdf, other

    cs.CV cs.LG

    Unsupervised Model Adaptation for Source-free Segmentation of Medical Images

    Authors: Serban Stan, Mohammad Rostami

    Abstract: The recent prevalence of deep neural networks has lead semantic segmentation networks to achieve human-level performance in the medical field when sufficient training data is provided. Such networks however fail to generalize when tasked with predicting semantic maps for out-of-distribution images, requiring model re-training on the new distributions. This expensive process necessitates expert kno… ▽ More

    Submitted 29 July, 2023; v1 submitted 1 November, 2022; originally announced November 2022.

  33. arXiv:2210.14362  [pdf, other

    cs.LG

    Federated Learning Using Variance Reduced Stochastic Gradient for Probabilistically Activated Agents

    Authors: M. R. Rostami, S. S. Kia

    Abstract: This paper proposes an algorithm for Federated Learning (FL) with a two-layer structure that achieves both variance reduction and a faster convergence rate to an optimal solution in the setting where each agent has an arbitrary probability of selection in each iteration. In distributed machine learning, when privacy matters, FL is a functional tool. Placing FL in an environment where it has some i… ▽ More

    Submitted 1 April, 2023; v1 submitted 25 October, 2022; originally announced October 2022.

  34. arXiv:2209.14644  [pdf, other

    cs.LG cs.CV

    Increasing Model Generalizability for Unsupervised Domain Adaptation

    Authors: Mohammad Rostami

    Abstract: A dominant approach for addressing unsupervised domain adaptation is to map data points for the source and the target domains into an embedding space which is modeled as the output-space of a shared deep encoder. The encoder is trained to make the embedding space domain-agnostic to make a source-trained classifier generalizable on the target domain. A secondary mechanism to improve UDA performance… ▽ More

    Submitted 29 September, 2022; originally announced September 2022.

    Comments: Presented 2022 Conference on Lifelong Learning Agents

  35. An Adaptive Image Encryption Scheme Guided by Fuzzy Models

    Authors: Mahdi Shariatzadeh, Mohammad Javad Rostami, Mahdi Eftekhari

    Abstract: A new image encryption scheme using the advanced encryption standard (AES), a chaotic map, a genetic operator, and a fuzzy inference system is proposed in this paper. In this work, plain images were used as input, and the required security level was achieved. Security criteria were computed after running a proposed encryption process. Then an adaptive fuzzy system decided whether to repeat the enc… ▽ More

    Submitted 16 August, 2022; originally announced August 2022.

    Comments: Iranian Journal of Fuzzy Systems (2023)

  36. arXiv:2208.07635  [pdf

    cs.MM eess.IV

    A New Scheme for Image Compression and Encryption Using ECIES, Henon Map, and AEGAN

    Authors: Mahdi Shariatzadeh, Mahdi Eftekhari, Mohammad Javad Rostami

    Abstract: Providing security in the transmission of images and other multimedia data has become one of the most important scientific and practical issues. In this paper, a method for compressing and encryption images is proposed, which can safely transmit images in low-bandwidth data transmission channels. At first, using the autoencoding generative adversarial network (AEGAN) model, the images are mapped t… ▽ More

    Submitted 24 August, 2022; v1 submitted 16 August, 2022; originally announced August 2022.

  37. arXiv:2207.04565  [pdf, other

    eess.IV cs.LG

    Automating Detection of Papilledema in Pediatric Fundus Images with Explainable Machine Learning

    Authors: Kleanthis Avramidis, Mohammad Rostami, Melinda Chang, Shrikanth Narayanan

    Abstract: Papilledema is an ophthalmic neurologic disorder in which increased intracranial pressure leads to swelling of the optic nerves. Undiagnosed papilledema in children may lead to blindness and may be a sign of life-threatening conditions, such as brain tumors. Robust and accurate clinical diagnosis of this syndrome can be facilitated by automated analysis of fundus images using deep learning, especi… ▽ More

    Submitted 10 July, 2022; originally announced July 2022.

    Comments: 5 pages, 4 figures, 2 tables, 2022 IEEE International Conference on Image Processing (ICIP)

  38. arXiv:2206.09059  [pdf, other

    cs.CL cs.AI cs.CV cs.LG

    CLiMB: A Continual Learning Benchmark for Vision-and-Language Tasks

    Authors: Tejas Srinivasan, Ting-Yun Chang, Leticia Leonor Pinto Alva, Georgios Chochlakis, Mohammad Rostami, Jesse Thomason

    Abstract: Current state-of-the-art vision-and-language models are evaluated on tasks either individually or in a multi-task setting, overlooking the challenges of continually learning (CL) tasks as they arrive. Existing CL benchmarks have facilitated research on task adaptation and mitigating "catastrophic forgetting", but are limited to vision-only and language-only tasks. We present CLiMB, a benchmark to… ▽ More

    Submitted 24 November, 2022; v1 submitted 17 June, 2022; originally announced June 2022.

    Comments: Accepted to NeurIPS 2022 Datasets and Benchmarks track

  39. arXiv:2110.04662  [pdf, other

    cs.LG cs.AI

    Cognitively Inspired Learning of Incremental Drifting Concepts

    Authors: Mohammad Rostami, Aram Galstyan

    Abstract: Humans continually expand their learned knowledge to new domains and learn new concepts without any interference with past learned experiences. In contrast, machine learning models perform poorly in a continual learning setting, where input data distribution changes over time. Inspired by the nervous system learning mechanisms, we develop a computational model that enables a deep neural network to… ▽ More

    Submitted 21 April, 2023; v1 submitted 9 October, 2021; originally announced October 2021.

    Comments: 2023 International Joint Conference on Artificial Intelligence

  40. arXiv:2109.02051  [pdf, other

    cs.SD cs.CL cs.CR eess.AS

    Efficient Attention Branch Network with Combined Loss Function for Automatic Speaker Verification Spoof Detection

    Authors: Amir Mohammad Rostami, Mohammad Mehdi Homayounpour, Ahmad Nickabadi

    Abstract: Many endeavors have sought to develop countermeasure techniques as enhancements on Automatic Speaker Verification (ASV) systems, in order to make them more robust against spoof attacks. As evidenced by the latest ASVspoof 2019 countermeasure challenge, models currently deployed for the task of ASV are, at their best, devoid of suitable degrees of generalization to unseen attacks. Upon further inve… ▽ More

    Submitted 19 September, 2021; v1 submitted 5 September, 2021; originally announced September 2021.

  41. arXiv:2108.12081  [pdf, other

    cs.CV

    Detection and Continual Learning of Novel Face Presentation Attacks

    Authors: Mohammad Rostami, Leonidas Spinoulas, Mohamed Hussein, Joe Mathai, Wael Abd-Almageed

    Abstract: Advances in deep learning, combined with availability of large datasets, have led to impressive improvements in face presentation attack detection research. However, state-of-the-art face antispoofing systems are still vulnerable to novel types of attacks that are never seen during training. Moreover, even if such attacks are correctly detected, these systems lack the ability to adapt to newly enc… ▽ More

    Submitted 26 August, 2021; originally announced August 2021.

    Journal ref: 2021 International Conference on Computer Vision

  42. arXiv:2107.01598  [pdf, other

    cs.CL cs.LG

    Domain Adaptation for Sentiment Analysis Using Increased Intraclass Separation

    Authors: Mohammad Rostami, Aram Galstyan

    Abstract: Sentiment analysis is a costly yet necessary task for enterprises to study the opinions of their customers to improve their products and to determine optimal marketing strategies. Due to the existence of a wide range of domains across different products and services, cross-domain sentiment analysis methods have received significant attention. These methods mitigate the domain gap between different… ▽ More

    Submitted 4 July, 2021; originally announced July 2021.

  43. arXiv:2106.12124  [pdf, other

    cs.LG

    Secure Domain Adaptation with Multiple Sources

    Authors: Serban Stan, Mohammad Rostami

    Abstract: Multi-source unsupervised domain adaptation (MUDA) is a framework to address the challenge of annotated data scarcity in a target domain via transferring knowledge from multiple annotated source domains. When the source domains are distributed, data privacy and security can become significant concerns and protocols may limit data sharing, yet existing MUDA methods overlook these constraints. We de… ▽ More

    Submitted 14 November, 2022; v1 submitted 22 June, 2021; originally announced June 2021.

  44. arXiv:2104.08808  [pdf, other

    cs.CL

    Learn Continually, Generalize Rapidly: Lifelong Knowledge Accumulation for Few-shot Learning

    Authors: Xisen **, Bill Yuchen Lin, Mohammad Rostami, Xiang Ren

    Abstract: The ability to continuously expand knowledge over time and utilize it to rapidly generalize to new tasks is a key feature of human linguistic intelligence. Existing models that pursue rapid generalization to new tasks (e.g., few-shot learning methods), however, are mostly trained in a single shot on fixed datasets, unable to dynamically expand their knowledge; while continual learning algorithms a… ▽ More

    Submitted 20 August, 2022; v1 submitted 18 April, 2021; originally announced April 2021.

    Comments: Accepted at Findings of EMNLP 2021; Fixed an error in Table 3 (see footnote 4); Updated Q3 in Sec. 4.2

  45. arXiv:2101.00522  [pdf, other

    cs.CV cs.CR cs.LG eess.IV

    Domain Adaptation for the Segmentation of Confidential Medical Images

    Authors: Serban Stan, Mohammad Rostami

    Abstract: Convolutional neural networks (CNNs) have led to significant improvements in the semantic segmentation of images. When source and target datasets come from different modalities, CNN performance suffers due to domain shift. In such cases data annotation in the target domain becomes necessary to maintain model performance. To circumvent the re-annotation process, unsupervised domain adaptation (UDA)… ▽ More

    Submitted 7 October, 2022; v1 submitted 2 January, 2021; originally announced January 2021.

  46. arXiv:2012.15695  [pdf, other

    cs.SD cs.CL eess.AS

    EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting

    Authors: Amir Mohammad Rostami, Ali Karimi, Mohammad Ali Akhaee

    Abstract: Keyword spotting is a process of finding some specific words or phrases in recorded speeches by computers. Deep neural network algorithms, as a powerful engine, can handle this problem if they are trained over an appropriate dataset. To this end, the football keyword dataset (FKD), as a new keyword spotting dataset in Persian, is collected with crowdsourcing. This dataset contains nearly 31000 sam… ▽ More

    Submitted 31 December, 2020; originally announced December 2020.

    Comments: 9 pages, 2 figures

  47. arXiv:2010.12144  [pdf, other

    cs.LG cs.AI

    One-shot Learning for Temporal Knowledge Graphs

    Authors: Mehrnoosh Mirtaheri, Mohammad Rostami, Xiang Ren, Fred Morstatter, Aram Galstyan

    Abstract: Most real-world knowledge graphs are characterized by a long-tail relation frequency distribution where a significant fraction of relations occurs only a handful of times. This observation has given rise to recent interest in low-shot learning methods that are able to generalize from only a few examples. The existing approaches, however, are tailored to static knowledge graphs and not easily gener… ▽ More

    Submitted 22 October, 2020; originally announced October 2020.

  48. arXiv:2009.12518  [pdf, other

    cs.LG cs.CV stat.ML

    Unsupervised Model Adaptation for Continual Semantic Segmentation

    Authors: Serban Stan, Mohammad Rostami

    Abstract: We develop an algorithm for adapting a semantic segmentation model that is trained using a labeled source domain to generalize well in an unlabeled target domain. A similar problem has been studied extensively in the unsupervised domain adaptation (UDA) literature, but existing UDA algorithms require access to both the source domain labeled data and the target domain unlabeled data for training a… ▽ More

    Submitted 9 January, 2021; v1 submitted 26 September, 2020; originally announced September 2020.

    Comments: 12 pages, 5 figures

  49. arXiv:2009.04825  [pdf

    cs.SI cs.LG

    Presentation a Trust Walker for rating prediction in Recommender System with Biased Random Walk: Effects of H-index Centrality, Similarity in Items and Friends

    Authors: Saman Forouzandeh, Mehrdad Rostami, Kamal Berahmand

    Abstract: The use of recommender systems has increased dramatically to assist online social network users in the decision-making process and selecting appropriate items. On the other hand, due to many different items, users cannot score a wide range of them, and usually, there is a scattering problem for the matrix created for users. To solve the problem, the trust-based recommender systems are applied to p… ▽ More

    Submitted 10 September, 2020; originally announced September 2020.

  50. arXiv:2008.04103  [pdf

    cs.LG cs.NE stat.ML

    Review of Swarm Intelligence-based Feature Selection Methods

    Authors: Mehrdad Rostami, Kamal Berahmand, Saman Forouzandeh

    Abstract: In the past decades, the rapid growth of computer and database technologies has led to the rapid growth of large-scale datasets. On the other hand, data mining applications with high dimensional datasets that require high speed and accuracy are rapidly increasing. An important issue with these applications is the curse of dimensionality, where the number of features is much higher than the number… ▽ More

    Submitted 7 August, 2020; originally announced August 2020.