Showing 1–10 of 10 results for author: Helal, A

  1. arXiv:2406.08530  [pdf, other]

    cs.DB

    Validating Temporal Compliance Patterns: A Unified Approach with $MTL_f$ over various Data Models

    Authors: Nesma M. Zaki, Iman M. A. Helal, Ehab E. Hassanein, Ahmed Awad

    Abstract: Process mining extracts valuable insights from event data to help organizations improve their business processes, which is essential for their growth and success. By leveraging process mining techniques, organizations gain a comprehensive understanding of their processes' execution, enabling the discovery of process models, detection of deviations, identification of bottlenecks, and assessment of…

    Submitted 12 June, 2024; originally announced June 2024.

  2. arXiv:2403.06348  [pdf, other]

    cs.DC cs.DS cs.PF

    Accelerating Sparse Tensor Decomposition Using Adaptive Linearized Representation

    Authors: Jan Laukemann, Ahmed E. Helal, S. Isaac Geronimo Anderson, Fabio Checconi, Yongseok Soh, Jesmin Jahan Tithi, Teresa Ranadive, Brian J Gravelle, Fabrizio Petrini, Jee Choi

    Abstract: High-dimensional sparse data emerge in many critical application domains such as cybersecurity, healthcare, anomaly detection, and trend analysis. To quickly extract meaningful insights from massive volumes of these multi-dimensional data, scientists employ unsupervised analysis tools based on tensor decomposition (TD) methods. However, real-world sparse tensors exhibit highly irregular shapes, da…

    Submitted 10 March, 2024; originally announced March 2024.

    Comments: We extend the results of our previous ICS paper to significantly improve the parallel performance of the Canonical Polyadic Alternating Least Squares (CP-ALS) algorithm for normally distributed data and the Canonical Polyadic Alternating Poisson Regression (CP-APR) algorithm for non-negative count data

  3. arXiv:2303.02204  [pdf, other]

    cs.LG

    KGLiDS: A Platform for Semantic Abstraction, Linking, and Automation of Data Science

    Authors: Mossad Helali, Niki Monjazeb, Shubham Vashisth, Philippe Carrier, Ahmed Helal, Antonio Cavalcante, Khaled Ammar, Katja Hose, Essam Mansour

    Abstract: In recent years, we have witnessed the growing interest from academia and industry in applying data science technologies to analyze large amounts of data. In this process, a myriad of artifacts (datasets, pipeline scripts, etc.) are created. However, there has been no systematic attempt to holistically collect and exploit all the knowledge and experiences that are implicitly contained in those art…

    Submitted 12 June, 2024; v1 submitted 3 March, 2023; originally announced March 2023.

    Comments: 15 pages, 9 figures

  4. arXiv:2206.09336  [pdf, other]

    cs.DB

    Efficient Checking of Timed Order Compliance Rules over Graph-encoded Event Logs

    Authors: Nesma M. Zaki, Iman M. A. Helal, Ahmed Awad, Ehab E. Hassanein

    Abstract: Validation of compliance rules against process data is a fundamental functionality for business process management. Over the years, the problem has been addressed for different types of process data, i.e., process models, process event data at runtime, and event logs representing historical execution. Several approaches have been proposed to tackle compliance checking over process logs. These appr…

    Submitted 19 June, 2022; originally announced June 2022.

    Comments: 18 pages, 5 figures, 6 tables

    MSC Class: 68

  5. arXiv:2206.06251  [pdf]

    cs.SE cs.AI cs.CY

    A Methodology and Software Architecture to Support Explainability-by-Design

    Authors: Trung Dong Huynh, Niko Tsakalakis, Ayah Helal, Sophie Stalla-Bourdillon, Luc Moreau

    Abstract: Algorithms play a crucial role in many technological systems that control or affect various aspects of our lives. As a result, providing explanations for their decisions to address the needs of users and organisations is increasingly expected by laws, regulations, codes of conduct, and the public. However, as laws and regulations do not prescribe how to meet such expectations, organisations are of…

    Submitted 25 May, 2023; v1 submitted 13 June, 2022; originally announced June 2022.

  6. arXiv:2201.12523  [pdf, other]

    cs.DC cs.DS cs.PF

    Efficient, Out-of-Memory Sparse MTTKRP on Massively Parallel Architectures

    Authors: Andy Nguyen, Ahmed E. Helal, Fabio Checconi, Jan Laukemann, Jesmin Jahan Tithi, Yongseok Soh, Teresa Ranadive, Fabrizio Petrini, Jee W. Choi

    Abstract: Tensor decomposition (TD) is an important method for extracting latent information from high-dimensional (multi-modal) sparse data. This study presents a novel framework for accelerating fundamental TD operations on massively parallel GPU architectures. In contrast to prior work, the proposed Blocked Linearized Coordinate (BLCO) format enables efficient out-of-memory computation of tensor algorith…

    Submitted 27 June, 2022; v1 submitted 29 January, 2022; originally announced January 2022.

    Comments: Accepted to ICS 2022

  7. arXiv:2102.10245  [pdf, other]

    cs.DC cs.DS cs.PF

    ALTO: Adaptive Linearized Storage of Sparse Tensors

    Authors: Ahmed E. Helal, Jan Laukemann, Fabio Checconi, Jesmin Jahan Tithi, Teresa Ranadive, Fabrizio Petrini, Jeewhan Choi

    Abstract: The analysis of high-dimensional sparse data is becoming increasingly popular in many important domains. However, real-world sparse tensors are challenging to process due to their irregular shapes and data distributions. We propose the Adaptive Linearized Tensor Order (ALTO) format, a novel mode-agnostic (general) representation that keeps neighboring nonzero elements in the multi-dimensional spac…

    Submitted 27 April, 2021; v1 submitted 19 February, 2021; originally announced February 2021.

    Comments: Accepted to ICS 2021

  8. arXiv:2011.10970  [pdf, other]

    cs.AI

    DiaLex: A Benchmark for Evaluating Multidialectal Arabic Word Embeddings

    Authors: Muhammad Abdul-Mageed, Shady Elbassuoni, Jad Doughman, AbdelRahim Elmadany, El Moatez Billah Nagoudi, Yorgo Zoughby, Ahmad Shaher, Iskander Gaba, Ahmed Helal, Mohammed El-Razzaz

    Abstract: Word embeddings are a core component of modern natural language processing systems, making the ability to thoroughly evaluate them a vital task. We describe DiaLex, a benchmark for intrinsic evaluation of dialectal Arabic word embedding. DiaLex covers five important Arabic dialects: Algerian, Egyptian, Lebanese, Syrian, and Tunisian. Across these dialects, DiaLex provides a testbank for six syntac…

    Submitted 12 March, 2021; v1 submitted 22 November, 2020; originally announced November 2020.

    Comments: WANLP2021

  9. arXiv:2010.10343  [pdf, other]

    cs.LG cs.AI cs.DB

    Provenance Graph Kernel

    Authors: David Kohan Marzagão, Trung Dong Huynh, Ayah Helal, Sean Baccas, Luc Moreau

    Abstract: Provenance is a record that describes how entities, activities, and agents have influenced a piece of data; it is commonly represented as graphs with relevant labels on both their nodes and edges. With the growing adoption of provenance in a wide range of application domains, users are increasingly confronted with an abundance of graph data, which may prove challenging to process. Graph kernels, o…

    Submitted 14 September, 2021; v1 submitted 20 October, 2020; originally announced October 2020.

    Comments: 14 pages

    ACM Class: I.2.6

  10. arXiv:2004.09971  [pdf, other]

    cs.OH

    Correlating Unlabeled Events at Runtime

    Authors: Iman M. A. Helal, Ahmed Awad

    Abstract: Process mining is of great importance for both data-centric and process-centric systems. Process mining receives so-called process logs which are collections of partially-ordered events. An event has to possess at least three attributes, case ID, task ID and a timestamp for mining approaches to work. When a case ID is unknown, the event is called unlabeled. Traditionally, process mining is an offl…

    Submitted 19 April, 2020; originally announced April 2020.

    Comments: 10 pages, 3 figures