Search | arXiv e-print repository

doi 10.1088/1402-4896/ac7f62

Energy Transfer and Coherence in Coupled Oscillators with Delayed Coupling: A Classical Picture for Two-Level Systems

Authors: Fahhad H Alharbi, Abdelrahman S Abdelrahman, Abdullah M Alkathiry, Hussain M Al-Qahtan

Abstract: The Frimmer-Novotny model to simulate two-level systems by coupled oscillators is extended by incorporating a constant time delay in the coupling. The effects of the introduced delay on system dynamics and two-level modeling are then investigated and found to be substantial. Mathematically, introducing a delay converts the dynamical system from a finite-dimensional one into an infinite-dimensional one. The resulting system of delay differential equations is solved using the Krylov method with Chebyshev interpolation and post-processing refinement. The calculations and analyses reveal the critical role that a delay can play. It has oscillatory effects, as the main dynamical eigenmodes move around a circle with a radius proportional to the coupling strength and an angle linear in the delay. This alteration governs the energy transfer dynamics and coherence. Accordingly, both the delay and the coupling strength dictate the stability of the system. The delay is the main parameter in this respect: for certain intervals of the delay, the system remains stable regardless of the coupling. A significant effect occurs when one of the main modes crosses the imaginary axis, where it becomes purely imaginary and dampingless. Thus, the energies of the two states can persist and be exchanged for an extremely long time. Furthermore, it is found that the delay alters both the splitting and the linewidth in a way that further influences the energy transfer and coherence. It is also found that the delay need not be large to have a significant effect. For example, for an optical system with 500 nm wavelength, the critical delay can be in the tens of attoseconds.

Submitted 5 September, 2022; originally announced September 2022.

Journal ref: Phys. Scr. 97 085215, 2022

arXiv:2203.04015 [pdf, other]

A Compilation Flow for the Generation of CNN Inference Accelerators on FPGAs

Authors: Seung-Hun Chung, Tarek S. Abdelrahman

Abstract: We present a compilation flow for the generation of CNN inference accelerators on FPGAs. The flow translates a frozen model into OpenCL kernels with the TVM compiler and uses the Intel OpenCL SDK to compile to an FPGA bitstream. We improve the quality of the generated hardware with optimizations applied to the base OpenCL kernels generated by TVM. These optimizations increase parallelism, reduce memory access latency, increase concurrency and save on-chip resources. We automate these optimizations in TVM and evaluate them by generating accelerators for LeNet-5, MobileNetV1 and ResNet-34 on an Intel Stratix 10 SX. We show that the optimizations improve the performance of the generated accelerators by up to 846X over the base accelerators. The performance of the optimized accelerators is up to 4.57X better than TensorFlow on CPU and 3.83X better than single-threaded TVM, but only 0.34X that of TVM with 56 threads. Our optimized kernels also outperform ones generated by a similar approach (that also uses high-level synthesis) while providing more functionality and flexibility. However, our approach underperforms one that utilizes hand-optimized designs. Thus, we view our approach as useful in pre-production environments that benefit from increased performance and fast prototyping, realizing the benefits of FPGAs without hardware design expertise.

Submitted 8 March, 2022; originally announced March 2022.

Comments: 8 pages

arXiv:2009.06783 [pdf]

Learning Hidden Patterns from Patient Multivariate Time Series Data Using Convolutional Neural Networks: A Case Study of Healthcare Cost Prediction

Authors: Mohammad Amin Morid, Olivia R. Liu Sheng, Kensaku Kawamoto, Samir Abdelrahman

Abstract: Objective: To develop an effective and scalable individual-level patient cost prediction method by automatically learning hidden temporal patterns from multivariate time series data in patient insurance claims using a convolutional neural network (CNN) architecture. Methods: We used three years of medical and pharmacy claims data from 2013 to 2016 from a healthcare insurer, where data from the first two years were used to build the model to predict costs in the third year. The data consisted of the multivariate time series of cost, visit and medical features that were shaped as images of patients' health status (i.e., matrices with time windows on one dimension and the medical, visit and cost features on the other dimension). Patients' multivariate time series images were given to a CNN method with a proposed architecture. After hyper-parameter tuning, the proposed architecture consisted of three building blocks of convolution and pooling layers with an LReLU activation function and a customized kernel size at each layer for healthcare data. The temporal patterns learned by the proposed CNN became inputs to a fully connected layer. Conclusions: Feature learning through the proposed CNN configuration significantly improved individual-level healthcare cost prediction. The proposed CNN was able to outperform temporal pattern detection methods that look for a pre-defined set of pattern shapes, since it is capable of extracting a variable number of patterns with various shapes. Temporal patterns learned from medical, visit and cost data made significant contributions to the prediction performance. Hyper-parameter tuning showed that considering three-month data patterns yields the highest prediction accuracy. Our results showed that patients' images extracted from multivariate time series data are different from regular images, and hence require unique designs of CNN architectures.

Submitted 14 September, 2020; originally announced September 2020.

arXiv:2009.06780 [pdf]

doi 10.1016/j.jbi.2019.103113

Healthcare Cost Prediction: Leveraging Fine-grain Temporal Patterns

Authors: Mohammad Amin Morid, Olivia R. Liu Sheng, Kensaku Kawamoto, Travis Ault, Josette Dorius, Samir Abdelrahman

Abstract: Objective: To design and assess a method to leverage individuals' temporal data for predicting their healthcare cost. To achieve this goal, we first used patients' temporal data in their fine-grain form as opposed to coarse-grain form. Second, we devised novel spike detection features to extract temporal patterns that improve the performance of cost prediction. Third, we evaluated the effectiveness of different types of temporal features based on cost information, visit information and medical information for the prediction task. Materials and methods: We used three years of medical and pharmacy claims data from 2013 to 2016 from a healthcare insurer, where the first two years were used to build the model to predict the costs in the third year. To prepare the data for modeling and prediction, the time series data of cost, visit and medical information were extracted in the form of fine-grain features (i.e., segmenting each time series into a sequence of consecutive windows and representing each window by various statistics such as sum). Then, temporal patterns of the time series were extracted and added to fine-grain features using a novel set of spike detection features (i.e., the fluctuation of data points). Gradient Boosting was applied on the final set of extracted features. Moreover, the contribution of each type of data (i.e., cost, visit and medical) was assessed. Conclusions: Leveraging fine-grain temporal patterns for healthcare cost prediction significantly improves prediction performance. Enhancing fine-grain features with extraction of temporal cost and visit patterns significantly improved the performance. However, medical features did not have a significant effect on prediction performance. Gradient Boosting outperformed all other prediction models.

Submitted 14 September, 2020; originally announced September 2020.

Journal ref: Journal of biomedical informatics, 91 (2019)

arXiv:1912.12675 [pdf, other]

Pipelined Training with Stale Weights of Deep Convolutional Neural Networks

Authors: Lifu Zhang, Tarek S. Abdelrahman

Abstract: The growth in the complexity of Convolutional Neural Networks (CNNs) is increasing interest in partitioning a network across multiple accelerators during training and pipelining the backpropagation computations over the accelerators. Existing approaches avoid or limit the use of stale weights through techniques such as micro-batching or weight stashing. These techniques either underutilize accelerators or increase memory footprint. We explore the impact of stale weights on the statistical efficiency and performance in a pipelined backpropagation scheme that maximizes accelerator utilization and keeps memory overhead modest. We use 4 CNNs (LeNet-5, AlexNet, VGG and ResNet) and show that when pipelining is limited to early layers in a network, training with stale weights converges and results in models with inference accuracies comparable to those resulting from non-pipelined training on the MNIST and CIFAR-10 datasets, with a drop in accuracy of 0.4%, 4%, 0.83% and 1.45% for the 4 networks, respectively. However, when pipelining is deeper in the network, inference accuracies drop significantly. We propose combining pipelined and non-pipelined training in a hybrid scheme to address this drop. We demonstrate the implementation and performance of our pipelined backpropagation in PyTorch on 2 GPUs using ResNet, achieving speedups of up to 1.8X over a 1-GPU baseline, with a small drop in inference accuracy.

Submitted 29 December, 2019; originally announced December 2019.

arXiv:1811.04199 [pdf, other]

doi 10.1016/j.neucom.2019.08.063

Fast On-the-fly Retraining-free Sparsification of Convolutional Neural Networks

Authors: Amir H. Ashouri, Tarek S. Abdelrahman, Alwyn Dos Remedios

Abstract: Modern Convolutional Neural Networks (CNNs) are complex, encompassing millions of parameters. Their deployment exerts computational, storage and energy demands, particularly on embedded platforms. Existing approaches to prune or sparsify CNNs require retraining to maintain inference accuracy. Such retraining is not feasible in some contexts. In this paper, we explore the sparsification of CNNs by proposing three model-independent methods. Our methods are applied on-the-fly and require no retraining. We show that the state-of-the-art models' weights can be reduced by up to 73% (compression factor of 3.7x) without incurring more than 5% loss in Top-5 accuracy. Additional fine-tuning gains only 8% in sparsity, which indicates that our fast on-the-fly methods are effective.

Submitted 8 September, 2019; v1 submitted 10 November, 2018; originally announced November 2018.

Comments: Extended version of our accepted paper in NIPS 2018, CDNNRIA Workshop (https://nips.cc/Conferences/2018/Schedule?showEvent=10941). Reviews are available at OpenReview (https://openreview.net/forum?id=rkz1YD0vjm)

Journal ref: Elsevier Neurocomputing, 2019

arXiv:1705.00761 [pdf]

F-tree: an algorithm for clustering transactional data using frequency tree

Authors: Mahmoud Mahdi, Samir Abdelrahman, Reem Bahgat, Ismail Ismail

Abstract: Clustering is an important data mining technique that groups similar data records; recently, categorical transaction clustering has received more attention. In this research, we study the problem of categorical data clustering for transactional data characterized by high dimensionality and large volume. We propose a novel algorithm for clustering transactional data called F-Tree, which is based on the idea of the frequent pattern algorithm FP-tree, one of the fastest approaches to frequent item set mining. The simple idea behind F-Tree is to generate small, highly pure clusters and then merge them. That makes it fast and dynamic in clustering large transactional datasets with high dimensions. We also present a new solution to the overlapping problem between clusters by defining a new criterion function, which is based on the probability of overlapping between weighted items. Our experimental evaluation on real datasets shows the following. Firstly, F-Tree is effective in finding interesting clusters. Secondly, the use of the tree structure reduces the clustering time for large datasets with many attributes. Thirdly, the proposed evaluation metric, used to resolve the overlapping of transaction items, generates high-quality clustering results. Finally, we conclude that merging small, pure clusters increases the purity of the resulting clusters and reduces clustering time compared with generating clusters directly from the dataset and then refining them.

Submitted 1 May, 2017; originally announced May 2017.

Comments: Appeared at Al-Azhar University Engineering Journal, JAUES, Vol.5, No. 8, Dec 2010

arXiv:1704.07499 [pdf]

PPMF: A Patient-based Predictive Modeling Framework for Early ICU Mortality Prediction

Authors: Mohammad Amin Morid, Olivia R. Liu Sheng, Samir Abdelrahman

Abstract: To date, developing a good model for early intensive care unit (ICU) mortality prediction is still challenging. This paper presents a patient-based predictive modeling framework (PPMF) to improve the performance of ICU mortality prediction using data collected during the first 48 hours of ICU admission. PPMF consists of three main components verifying three related research hypotheses. The first component captures dynamic changes of patients' status in the ICU using their time series data (e.g., vital signs and laboratory tests). The second component is a local approximation algorithm that classifies patients based on their similarities. The third component is a Gradient Descent wrapper that updates feature weights according to the classification feedback. Experiments using data from MIMIC-III show that PPMF significantly outperforms: (1) the severity score systems, namely SAPS III, APACHE IV, and MPM0III, (2) the aggregation-based classifiers that utilize summarized time series, and (3) baseline feature selection methods.

Submitted 24 April, 2017; originally announced April 2017.

Comments: 10 pages, Healthcare Analytics and Medical Decision Making, INFORMS Workshop. Nashville, Tennessee, 2016

arXiv:1704.07498 [pdf]

Leveraging Patient Similarity and Time Series Data in Healthcare Predictive Models

Authors: Mohammad Amin Morid, Olivia R. Liu Sheng, Samir Abdelrahman

Abstract: Patient time series classification faces challenges in high degrees of dimensionality and missingness. In light of patient similarity theory, this study explores effective temporal feature engineering and reduction, missing value imputation, and change point detection methods that can afford similarity-based classification models with desirable accuracy enhancement. We select a piecewise aggregation approximation method to extract fine-grain temporal features and propose a minimalist method to impute missing values in temporal features. For dimensionality reduction, we adopt a gradient descent search method for feature weight assignment. We propose new patient status and directional change definitions based on medical knowledge or clinical guidelines about the value ranges for different patient status levels, and develop a method to detect change points indicating positive or negative patient status changes. We evaluate the effectiveness of the proposed methods in the context of early Intensive Care Unit mortality prediction. The evaluation results show that the k-Nearest Neighbor algorithm that incorporates the methods we select and propose significantly outperforms the relevant benchmarks for early ICU mortality prediction. This study makes contributions to time series classification and early ICU mortality prediction via identifying and enhancing temporal feature engineering and reduction methods for similarity-based time series classification.

Submitted 30 April, 2017; v1 submitted 24 April, 2017; originally announced April 2017.

Comments: To appear: Twenty-third Americas Conference on Information Systems, Boston, 2017

arXiv:1412.6986 [pdf, ps, other]

Automatic Tuning of Local Memory Use on GPGPUs

Authors: Tianyi David Han, Tarek S. Abdelrahman

Abstract: The use of local memory is important to improve the performance of OpenCL programs. However, its use may not always benefit performance, depending on various application characteristics, and there is no simple heuristic for deciding when to use it. We develop a machine learning model to decide if the optimization is beneficial or not. We train the model with millions of synthetic benchmarks and show that it can predict if the optimization should be applied for a single array, in both synthetic and real benchmarks, with high accuracy.

Submitted 22 December, 2014; originally announced December 2014.

Comments: Part of ADAPT Workshop proceedings, 2015 (arXiv:1412.2347)

Report number: ADAPT/2015/04

arXiv:1206.1011 [pdf]

doi 10.5121/ijaia.2012.3205

A Machine Learning Approach For Opinion Holder Extraction In Arabic Language

Authors: Mohamed Elarnaoty, Samir AbdelRahman, Aly Fahmy

Abstract: Opinion mining aims at extracting useful subjective information from reliable amounts of text. Opinion holder recognition is a task that has not yet been considered for the Arabic language. This task essentially requires a deep understanding of clause structures. Unfortunately, the lack of a robust, publicly available Arabic parser further complicates the research. This paper presents leading research on opinion holder extraction in Arabic news that is independent of any lexical parsers. We investigate constructing a comprehensive feature set to compensate for the lack of structural parsing outcomes. The proposed feature set is adapted from previous work on English, coupled with our proposed semantic field and named entity features. Our feature analysis is based on Conditional Random Fields (CRF) and semi-supervised pattern recognition techniques. Different research models are evaluated via cross-validation experiments, achieving 54.03 F-measure. We publicly release our own research outcome corpus and lexicon for the opinion mining community to encourage further research.

Submitted 6 April, 2012; originally announced June 2012.

Journal ref: Mohamed Elarnaoty, Samir AbdelRahman and Aly Fahmy. "A Machine Learning Approach for Opinion Holder Extraction in Arabic Language", ISSN:0976-2191, vol 3, March 2012

arXiv:1011.0502 [pdf]

doi 10.5121/ijcsit.2010.2504

A New Email Retrieval Ranking Approach

Authors: Samir AbdelRahman, Basma Hassan, Reem Bahgat

Abstract: The email retrieval task has recently received much attention as a way to help the user retrieve the email(s) related to the submitted query. To our knowledge, existing email retrieval ranking approaches sort the retrieved emails based on some heuristic rules, which are either search clues or some predefined user criteria rooted in email fields. Unfortunately, the user usually does not know the effective rule that yields the best ranking for his query. This paper presents a new email retrieval ranking approach to tackle this problem. It ranks the retrieved emails based on a scoring function that depends on crucial email fields, namely subject, content, and sender. The paper also proposes an architecture to allow every user in a network/group of users to be able, if permissible, to know the most important network senders who are interested in his submitted query words. The experimental evaluation on the Enron corpus shows that our approach outperforms known email retrieval ranking approaches.

Submitted 1 November, 2010; originally announced November 2010.

Report number: 100,101

Journal ref: International journal of computer science & information Technology (IJCSIT) Vol.2, No.5, October 2010

arXiv:1011.0404 [pdf]

doi 10.5121/ijcsit.2010.2504

A New Email Retrieval Ranking Approach

Authors: Samir AbdelRahman, Basma Hassan, Reem Bahgat

Abstract: The email retrieval task has recently received much attention as a way to help the user retrieve the email(s) related to the submitted query. To our knowledge, existing email retrieval ranking approaches sort the retrieved emails based on some heuristic rules, which are either search clues or some predefined user criteria rooted in email fields. Unfortunately, the user usually does not know the effective rule that yields the best ranking for his query. This paper presents a new email retrieval ranking approach to tackle this problem. It ranks the retrieved emails based on a scoring function that depends on crucial email fields, namely subject, content, and sender. The paper also proposes an architecture to allow every user in a network/group of users to be able, if permissible, to know the most important network senders who are interested in his submitted query words. The experimental evaluation on the Enron corpus shows that our approach outperforms known email retrieval ranking approaches.

Submitted 1 November, 2010; originally announced November 2010.

Comments: 20 pages

Journal ref: International journal of computer science & information Technology (IJCSIT), Vol.2, No.5, (October 2010) 44-63

Showing 1–13 of 13 results for author: Abdelrahman, S