-
Heteroskedasticity as a Signature of Association for Age-Related Genes
Authors:
Salman Mohamadi,
Donald A. Adjeroh
Abstract:
Human aging is a process controlled by both genetics and environment. Many studies have been conducted to identify a subset of genes related to aging from the human genome. Biologists implicitly categorize age-related genes into genes that cause aging and genes that are influenced by aging, which has resulted in both causal inference studies and inference-of-association studies. While inference of association is better explored, causal inference and computational causal inference remain less explored. In this work, we are primarily motivated to tackle the problem of identifying genes associated with aging, while having a brief look into genes with probable causal relations, both from a computational perspective. Specifically, we form a set of hypotheses and, accordingly, introduce a data-tailored framework for inference. First we perform linear modeling on the expression values of age-related genes, and then examine the presence of heteroskedastic properties in the residuals of the model. We evaluate this framework, and our results suggest that 1) the presence of heteroskedasticity in these residuals is a potential signature of association for age-related genes, and 2) consistent heteroskedasticity along the human life span could imply some sort of causality. To our knowledge, along with identifying age-associated genes, this is the first work to propose a framework for computational causal inference on age-related genes, using a dataset of human dermal fibroblast gene expression data. Hence the results of our simple yet effective approach can be used not only to assess future age-related genes, but also as a possible criterion to select new associative or potential causal genes with respect to aging.
Submitted 15 November, 2023;
originally announced November 2023.
-
ChatGPT in the Age of Generative AI and Large Language Models: A Concise Survey
Authors:
Salman Mohamadi,
Ghulam Mujtaba,
Ngan Le,
Gianfranco Doretto,
Donald A. Adjeroh
Abstract:
ChatGPT is a large language model (LLM) created by OpenAI that has been carefully trained on a large amount of data. It has revolutionized the field of natural language processing (NLP) and has pushed the boundaries of LLM capabilities. ChatGPT has played a pivotal role in enabling widespread public interaction with generative artificial intelligence (GAI) on a large scale. It has also sparked research interest in developing similar technologies and investigating their applications and implications. In this paper, our primary goal is to provide a concise survey on the current lines of research on ChatGPT and its evolution. We considered both the glass box and black box views of ChatGPT, encompassing the components and foundational elements of the technology, as well as its applications, impacts, and implications. The glass box approach focuses on understanding the inner workings of the technology, and the black box approach embraces it as a complex system, and thus examines its inputs, outputs, and effects. This paves the way for a comprehensive exploration of the technology and provides a road map for further research and experimentation. We also lay out essential foundational literature on LLMs and GAI in general and their connection with ChatGPT. This overview sheds light on existing and missing research lines in the emerging field of LLMs, benefiting both public users and developers. Furthermore, the paper delves into the broad spectrum of applications and significant concerns in fields such as education, research, healthcare, finance, etc.
Submitted 15 July, 2023; v1 submitted 9 July, 2023;
originally announced July 2023.
-
More Synergy, Less Redundancy: Exploiting Joint Mutual Information for Self-Supervised Learning
Authors:
Salman Mohamadi,
Gianfranco Doretto,
Donald A. Adjeroh
Abstract:
Self-supervised learning (SSL) is now a serious competitor for supervised learning, even though it does not require data annotation. Several baselines have attempted to make SSL models exploit information about data distribution and be less dependent on the augmentation effect. However, there is no clear consensus on whether maximizing or minimizing the mutual information between representations of augmentation views contributes in practice to improvement or degradation in the performance of SSL models. This paper is a fundamental work in which we investigate the role of mutual information in SSL, and reformulate the problem of SSL in the context of a new perspective on mutual information. To this end, we consider joint mutual information from the perspective of partial information decomposition (PID) as a key step in reliable multivariate information measurement. PID enables us to decompose joint mutual information into three important components, namely, unique information, redundant information and synergistic information. Our framework aims to minimize the redundant information between views and the desired target representation while maximizing the synergistic information at the same time. Our experiments lead to a re-calibration of two redundancy reduction baselines, and a proposal for a new SSL training protocol. Extensive experimental results on multiple datasets and two downstream tasks show the effectiveness of this framework.
Submitted 2 July, 2023;
originally announced July 2023.
-
FUSSL: Fuzzy Uncertain Self Supervised Learning
Authors:
Salman Mohamadi,
Gianfranco Doretto,
Donald A. Adjeroh
Abstract:
Self supervised learning (SSL) has become a very successful technique to harness the power of unlabeled data, with no annotation effort. A number of developed approaches are evolving with the goal of outperforming supervised alternatives, which have been relatively successful. One main issue in SSL is robustness of the approaches under different settings. In this paper, for the first time, we recognize the fundamental limits of SSL coming from the use of a single supervisory signal. To address this limitation, we leverage the power of uncertainty representation to devise a robust and general standard hierarchical learning/training protocol for any SSL baseline, regardless of its assumptions and approach. Essentially, using the information bottleneck principle, we decompose feature learning into a two-stage training procedure, each stage with a distinct supervision signal. This double supervision approach is captured in two key steps: 1) invariance enforcement to data augmentation, and 2) fuzzy pseudo labeling (both hard and soft annotation). This simple yet effective protocol, which enables cross-class/cluster feature learning, is instantiated via an initial training of an ensemble of models through invariance enforcement to data augmentation as the first training phase, and then assigning fuzzy labels to the original samples for the second training phase. We consider multiple alternative scenarios with double supervision and evaluate the effectiveness of our approach on recent baselines, covering four different SSL paradigms, including geometrical, contrastive, non-contrastive, and hard/soft whitening (redundancy reduction) baselines. Extensive experiments under multiple settings show that the proposed training protocol consistently improves the performance of the former baselines, independent of their respective underlying principles.
Submitted 27 October, 2022;
originally announced October 2022.
-
Deep Active Ensemble Sampling For Image Classification
Authors:
Salman Mohamadi,
Gianfranco Doretto,
Donald A. Adjeroh
Abstract:
Conventional active learning (AL) frameworks aim to reduce the cost of data annotation by actively requesting the labeling for the most informative data points. However, introducing AL to data-hungry deep learning algorithms has been a challenge. Some proposed approaches include uncertainty-based techniques, geometric methods, implicit combination of uncertainty-based and geometric approaches, and more recently, frameworks based on semi/self supervised techniques. In this paper, we address two specific problems in this area. The first is the need for an efficient exploitation/exploration trade-off in sample selection in AL. For this, we present an innovative integration of recent progress in both uncertainty-based and geometric frameworks to enable an efficient exploration/exploitation trade-off in the sample selection strategy. To this end, we build on a computationally efficient approximation of Thompson sampling, with key changes, as a posterior estimator for uncertainty representation. Our framework provides two advantages: (1) accurate posterior estimation, and (2) a tunable trade-off between computational overhead and higher accuracy. The second problem is the need for improved training protocols in deep AL. For this, we use ideas from semi/self supervised learning to propose a general approach that is independent of the specific AL technique being used. Taken together, our framework shows a significant improvement over the state-of-the-art, with results that are comparable to the performance of supervised learning under the same setting. We show empirical results of our framework, and comparative performance with the state-of-the-art on four datasets, namely, MNIST, CIFAR10, CIFAR100 and ImageNet, to establish a new baseline in two different settings.
Submitted 11 October, 2022;
originally announced October 2022.
-
Human Age Estimation from Gene Expression Data using Artificial Neural Networks
Authors:
Salman Mohamadi,
Gianfranco Doretto,
Nasser M. Nasrabadi,
Donald A. Adjeroh
Abstract:
The study of signatures of aging in terms of genomic biomarkers can be uniquely helpful in understanding the mechanisms of aging and developing models to accurately predict age. Prior studies have employed gene expression and DNA methylation data aiming at accurate prediction of age. In this line, we propose a new framework for human age estimation using information from human dermal fibroblast gene expression data. First, we propose a new spatial representation as well as a data augmentation approach for gene expression data. Next, in order to predict age, we design a neural network architecture and apply it to this new representation of the original and augmented data, as an ensemble classification approach. Our experimental results suggest the superiority of the proposed framework over state-of-the-art age estimation methods using DNA methylation and gene expression data.
Submitted 4 November, 2021; v1 submitted 4 November, 2021;
originally announced November 2021.
-
An Information-Theoretic Framework for Identifying Age-Related Genes Using Human Dermal Fibroblast Transcriptome Data
Authors:
Salman Mohamadi,
Donald Adjeroh
Abstract:
Investigation of age-related genes is of great importance for multiple purposes, for instance, improving our understanding of the mechanism of aging, increasing life expectancy, age prediction, and other healthcare applications. In this work, starting with a set of 27,142 genes, we develop an information-theoretic framework for identifying genes that are associated with aging by applying unsupervised and semi-supervised learning techniques on human dermal fibroblast gene expression data. First, we use unsupervised learning and apply information-theoretic measures to identify key features for effective representation of gene expression values in the transcriptome data. Using the identified features, we perform clustering on the data. Finally, we apply semi-supervised learning on the clusters using different distance measures to identify novel genes that are potentially associated with aging. Performance assessment for both unsupervised and semi-supervised methods shows the effectiveness of the framework.
Submitted 3 November, 2021;
originally announced November 2021.
-
Deep GAN-Based Cross-Spectral Cross-Resolution Iris Recognition
Authors:
Moktari Mostofa,
Salman Mohamadi,
Jeremy Dawson,
Nasser M. Nasrabadi
Abstract:
In recent years, cross-spectral iris recognition has emerged as a promising biometric approach to establish the identity of individuals. However, matching iris images acquired at different spectral bands (i.e., matching a visible (VIS) iris probe to a gallery of near-infrared (NIR) iris images or vice versa) shows a significant performance degradation when compared to intraband NIR matching. Hence, in this paper, we have investigated a range of deep convolutional generative adversarial network (DCGAN) architectures to further improve the accuracy of cross-spectral iris recognition methods. Moreover, unlike the existing works in the literature, we introduce a resolution difference into the classical cross-spectral matching problem domain. We have developed two different techniques using the conditional generative adversarial network (cGAN) as a backbone architecture for cross-spectral iris matching. In the first approach, we simultaneously address the cross-resolution and cross-spectral matching problem by training a cGAN that jointly translates cross-resolution as well as cross-spectral tasks to the same resolution and within the same spectrum. In the second approach, we design a coupled generative adversarial network (cpGAN) architecture consisting of a pair of cGAN modules that project the VIS and NIR iris images into a low-dimensional embedding domain to ensure maximum pairwise similarity between the feature vectors from the two iris modalities of the same subject.
Submitted 3 August, 2021;
originally announced August 2021.
-
Deep Bayesian Active Learning, A Brief Survey on Recent Advances
Authors:
Salman Mohamadi,
Hamidreza Amindavar
Abstract:
Active learning frameworks offer efficient data annotation without remarkable accuracy degradation. In other words, active learning starts training the model with a small amount of labeled data while exploring the space of unlabeled data in order to select the most informative samples to be labeled. Generally speaking, representing the uncertainty is crucial in any active learning framework; however, deep learning methods are not capable of either representing or manipulating model uncertainty. On the other hand, from the real-world application perspective, uncertainty representation is getting more and more attention in the machine learning community. Deep Bayesian active learning frameworks, and generally any Bayesian active learning settings, provide practical consideration in the model which allows training with small data while representing the model uncertainty for further efficient training. In this paper, we briefly survey recent advances in Bayesian active learning and in particular deep Bayesian active learning frameworks.
Submitted 21 April, 2022; v1 submitted 14 December, 2020;
originally announced December 2020.
-
A New Framework For Spatial Modeling And Synthesis of Genome Sequence
Authors:
Salman Mohamadi,
Farhang Yeganegi,
Hamidreza Amindavar
Abstract:
This paper provides a framework for statistically modeling sequences from the human genome, which allows a formulation for synthesizing gene sequences. We start by converting the alphabetic genome sequence to a decimal sequence using Huffman coding. Then, this decimal sequence is decomposed by the HP filter into two components, trend and cyclic. Next, a statistical model, ARIMA-GARCH, is applied to the trend component, which exhibits heteroskedasticity: autoregressive integrated moving average (ARIMA) captures the linear characteristics of the sequence, and generalized autoregressive conditional heteroskedasticity (GARCH) is then applied to capture the statistical nonlinearity of the genome sequence. This modeling approach synthesizes a given genome sequence with respect to its statistical features. Finally, the PDF of a given sequence is estimated using a Gaussian mixture model and, based on the estimated PDF, we determine a new PDF representing sequences that statistically counteract the original sequence. Our strategy is applied to several genes as well as an HIV nucleotide sequence, and the corresponding results are presented.
Submitted 9 August, 2019;
originally announced August 2019.
-
Detection and Statistical Modeling of Birth-Death Anomaly
Authors:
Salman Mohamadi,
Farhang Yeganegi,
Nasser M Nasrabadi
Abstract:
Anomaly detection is of great importance, particularly in applied statistical signal processing. Here we provide a general framework for detecting anomalies through statistical modeling. In this paper, it is assumed that a signal is corrupted by noise whose variance follows an ARMA model. The assumption on the signal is further relaxed to encompass the inherent nonstationarity associated with natural phenomena; hence, the signal of interest is assumed to follow an ARIMA model, and the noise is taken to denote an anomaly that is otherwise unknown. The anomaly is assumed to possess heteroskedastic properties; therefore, ARCH/GARCH modeling can extract the anomaly pattern given an additive model for the signal of interest and the anomaly.
Submitted 27 June, 2019;
originally announced June 2019.
-
Short Block-length Codes for Ultra-Reliable Low-Latency Communications
Authors:
Mahyar Shirvanimoghaddam,
Mohamad Sadegh Mohamadi,
Rana Abbas,
Aleksandar Minja,
Chentao Yue,
Balazs Matuz,
Guojun Han,
Zihuai Lin,
Yonghui Li,
Sarah Johnson,
Branka Vucetic
Abstract:
This paper reviews the state-of-the-art channel coding techniques for ultra-reliable low-latency communication (URLLC). The stringent requirements of URLLC services, such as ultra-high reliability and low latency, have made it the most challenging feature of the fifth generation (5G) mobile systems. The problem is even more challenging for the services beyond the 5G promise, such as tele-surgery and factory automation, which require latencies of less than 1 ms and failure rates as low as $10^{-9}$. The very low latency requirements of URLLC do not allow traditional approaches such as re-transmission to be used to increase the reliability. On the other hand, to guarantee the delay requirements, the block length needs to be small, so conventional channel codes, originally designed and optimised for moderate-to-long block lengths, show notable deficiencies for short blocks. This paper provides an overview of channel coding techniques for short block lengths and compares them in terms of performance and complexity. Several important research directions are identified and discussed in more detail with several possible solutions.
Submitted 5 September, 2018; v1 submitted 26 February, 2018;
originally announced February 2018.
-
The Impact of Data Replication on Job Scheduling Performance in Hierarchical Data Grid
Authors:
Somayeh Abdi,
Hossein Pedram,
Somayeh Mohamadi
Abstract:
In data-intensive applications, data transfer is a primary cause of job execution delay. Data access time depends on bandwidth. The major bottleneck to supporting fast data access in Grids is the high latency of Wide Area Networks and the Internet. Effective scheduling can reduce the amount of data transferred across the Internet by dispatching a job to where the needed data are present. Another solution is to use a data replication mechanism. The objective of dynamic replica strategies is to reduce file access time, which in turn reduces job runtime. In this paper we develop a job scheduling policy and a dynamic data replication strategy, called HRS (Hierarchical Replication Strategy), to improve data access efficiency. We study our approach and evaluate it through simulation. The results show that our algorithm improves on current strategies by 12%.
Submitted 4 October, 2010;
originally announced October 2010.