-
Predicting the Performance-Cost Trade-off of Applications Across Multiple Systems
Authors:
Amir Nassereldine,
Safaa Diab,
Mohammed Baydoun,
Kenneth Leach,
Maxim Alt,
Dejan Milojicic,
Izzat El Hajj
Abstract:
In modern computing environments, users may have multiple systems accessible to them such as local clusters, private clouds, or public clouds. This abundance of choices makes it difficult for users to select the system and configuration for running an application that best meet their performance and cost objectives. To assist such users, we propose a prediction tool that predicts the full performa…
▽ More
In modern computing environments, users may have multiple systems accessible to them such as local clusters, private clouds, or public clouds. This abundance of choices makes it difficult for users to select the system and configuration for running an application that best meet their performance and cost objectives. To assist such users, we propose a prediction tool that predicts the full performance-cost trade-off space of an application across multiple systems. Our tool runs and profiles a submitted application on a small number of configurations from some of the systems, and uses that information to predict the application's performance on all configurations in all systems. The prediction models are trained offline with data collected from running a large number of applications on a wide variety of configurations. Notable aspects of our tool include: providing different scopes of prediction with varying online profiling requirements, automating the selection of the small number of configurations and systems used for online profiling, performing online profiling using partial runs thereby make predictions for applications without running them to completion, employing a classifier to distinguish applications that scale well from those that scale poorly, and predicting the sensitivity of applications to interference from other users. We evaluate our tool using 69 data analytics and scientific computing benchmarks executing on three different single-node CPU systems with 8-9 configurations each and show that it can achieve low prediction error with modest profiling overhead.
△ Less
Submitted 4 April, 2023;
originally announced April 2023.
-
A Framework for High-throughput Sequence Alignment using Real Processing-in-Memory Systems
Authors:
Safaa Diab,
Amir Nassereldine,
Mohammed Alser,
Juan Gómez-Luna,
Onur Mutlu,
Izzat El Hajj
Abstract:
Sequence alignment is a memory bound computation whose performance in modern systems is limited by the memory bandwidth bottleneck. Processing-in-memory architectures alleviate this bottleneck by providing the memory with computing competencies. We propose Alignment-in-Memory (AIM), a framework for high-throughput sequence alignment using processing-in-memory, and evaluate it on UPMEM, the first p…
▽ More
Sequence alignment is a memory bound computation whose performance in modern systems is limited by the memory bandwidth bottleneck. Processing-in-memory architectures alleviate this bottleneck by providing the memory with computing competencies. We propose Alignment-in-Memory (AIM), a framework for high-throughput sequence alignment using processing-in-memory, and evaluate it on UPMEM, the first publicly-available general-purpose programmable processing-in-memory system.
Our evaluation shows that a real processing-in-memory system can substantially outperform server-grade multi-threaded CPU systems running at full-scale when performing sequence alignment for a variety of algorithms, read lengths, and edit distance thresholds. We hope that our findings inspire more work on creating and accelerating bioinformatics algorithms for such real processing-in-memory systems.
Our code is available at https://github.com/safaad/aim.
△ Less
Submitted 27 March, 2023; v1 submitted 2 August, 2022;
originally announced August 2022.
-
High-throughput Pairwise Alignment with the Wavefront Algorithm using Processing-in-Memory
Authors:
Safaa Diab,
Amir Nassereldine,
Mohammed Alser,
Juan Gómez Luna,
Onur Mutlu,
Izzat El Hajj
Abstract:
We show that the wavefront algorithm can achieve higher pairwise read alignment throughput on a UPMEM PIM system than on a server-grade multi-threaded CPU system.
We show that the wavefront algorithm can achieve higher pairwise read alignment throughput on a UPMEM PIM system than on a server-grade multi-threaded CPU system.
△ Less
Submitted 23 April, 2022; v1 submitted 5 April, 2022;
originally announced April 2022.
-
Optimizing Stochastic Gradient Descent in Text Classification Based on Fine-Tuning Hyper-Parameters Approach. A Case Study on Automatic Classification of Global Terrorist Attacks
Authors:
Shadi Diab
Abstract:
The objective of this research is to enhance performance of Stochastic Gradient Descent (SGD) algorithm in text classification. In our research, we proposed using SGD learning with Grid-Search approach to fine-tuning hyper-parameters in order to enhance the performance of SGD classification. We explored different settings for representation, transformation and weighting features from the summary d…
▽ More
The objective of this research is to enhance performance of Stochastic Gradient Descent (SGD) algorithm in text classification. In our research, we proposed using SGD learning with Grid-Search approach to fine-tuning hyper-parameters in order to enhance the performance of SGD classification. We explored different settings for representation, transformation and weighting features from the summary description of terrorist attacks incidents obtained from the Global Terrorism Database as a pre-classification step, and validated SGD learning on Support Vector Machine (SVM), Logistic Regression and Perceptron classifiers by stratified 10-K-fold cross-validation to compare the performance of different classifiers embedded in SGD algorithm. The research concludes that using a grid-search to find the hyper-parameters optimize SGD classification, not in the pre-classification settings only, but also in the performance of the classifiers in terms of accuracy and execution time.
△ Less
Submitted 23 February, 2019; v1 submitted 18 February, 2019;
originally announced February 2019.
-
Generalization of a reduced Trefftz type approach
Authors:
Sulaiman Abo Diab
Abstract:
Summary This work presents variational concepts associated with reduced Trefftz type approaches and discusses the interrelationship between various concepts of the displacement, hybrid and Trefftz methods. The basic concept of the displacement version of the reduced Trefftz method operates on the natural boundary conditions enforced in an integral form whereas the stress version of the reduced Tre…
▽ More
Summary This work presents variational concepts associated with reduced Trefftz type approaches and discusses the interrelationship between various concepts of the displacement, hybrid and Trefftz methods. The basic concept of the displacement version of the reduced Trefftz method operates on the natural boundary conditions enforced in an integral form whereas the stress version of the reduced Trefftz type approach operates on the essential boundary conditions enforced in an integral sense. The application of the method proposed in the framework of the finite element method is briefly outlined. The methods used by the reduced Trefftz type approach for enforcing conformity and interelement continuity between neighboured elements are also discussed. Comparisons with other known methods for the same purpose are performed. General strategy for develo** finite elements of general geometric form such as quadrilateral elements with invariance properties is presented. The basic idea of this strategy consists in using the natural coordinate system only for defining the element geometry and performing the element integration in the biunit interval. For defining the approximation functions a local coordinate system defined from the directions of the covariant base vectors and the perpendicular contravariant base vectors computed in the geometric centre of the element is used. This strategy can also be used to implement other versions of finite elements and other forms of finite elements. Different numerical calculations and comparisons in the linear statics and kinetics are performed in order to assess the convergence and the numerical performance of finite elements developed by applying the reduced Trefftz type approach.
△ Less
Submitted 23 June, 2017;
originally announced June 2017.
-
Classification of Questions and Learning Outcome Statements (LOS) Into Blooms Taxonomy (BT) By Similarity Measurements Towards Extracting Of Learning Outcome from Learning Material
Authors:
Shadi Diab,
Badie Sartawi
Abstract:
Blooms Taxonomy (BT) have been used to classify the objectives of learning outcome by dividing the learning into three different domains; the cognitive domain, the effective domain and the psychomotor domain. In this paper, we are introducing a new approach to classify the questions and learning outcome statements (LOS) into Blooms taxonomy (BT) and to verify BT verb lists, which are being cited a…
▽ More
Blooms Taxonomy (BT) have been used to classify the objectives of learning outcome by dividing the learning into three different domains; the cognitive domain, the effective domain and the psychomotor domain. In this paper, we are introducing a new approach to classify the questions and learning outcome statements (LOS) into Blooms taxonomy (BT) and to verify BT verb lists, which are being cited and used by academicians to write questions and (LOS). An experiment was designed to investigate the semantic relationship between the action verbs used in both questions and LOS to obtain more accurate classification of the levels of BT. A sample of 775 different action verbs collected from different universities allows us to measure an accurate and clear-cut cognitive level for the action verb. It is worth mentioning that natural language processing techniques were used to develop our rules as to induce the questions into chunks in order to extract the action verbs. Our proposed solution was able to classify the action verb into a precise level of the cognitive domain. We, on our side, have tested and evaluated our proposed solution using confusion matrix. The results of evaluation tests yielded 97% for the macro average of precision and 90% for F1. Thus, the outcome of the research suggests that it is crucial to analyse and verify the action verbs cited and used by academicians to write LOS and classify their questions based on blooms taxonomy in order to obtain a definite and more accurate classification.
△ Less
Submitted 23 February, 2019; v1 submitted 10 June, 2017;
originally announced June 2017.