Skip to main content

Showing 1–11 of 11 results for author: Sakr, S

.
  1. arXiv:2204.08358  [pdf, other

    cs.LG cs.AI

    AutoMLBench: A Comprehensive Experimental Evaluation of Automated Machine Learning Frameworks

    Authors: Hassan Eldeeb, Mohamed Maher, Radwa Elshawi, Sherif Sakr

    Abstract: With the booming demand for machine learning applications, it has been recognized that the number of knowledgeable data scientists can not scale with the growing data volumes and application needs in our digital world. In response to this demand, several automated machine learning (AutoML) frameworks have been developed to fill the gap of human expertise by automating the process of building machi… ▽ More

    Submitted 12 April, 2023; v1 submitted 18 April, 2022; originally announced April 2022.

  2. arXiv:2108.13066  [pdf, other

    cs.LG cs.AI

    To tune or not to tune? An Approach for Recommending Important Hyperparameters

    Authors: Mohamadjavad Bahmani, Radwa El Shawi, Nshan Potikyan, Sherif Sakr

    Abstract: Novel technologies in automated machine learning ease the complexity of algorithm selection and hyperparameter optimization. Hyperparameters are important for machine learning models as they significantly influence the performance of machine learning models. Many optimization techniques have achieved notable success in hyperparameter tuning and surpassed the performance of human experts. However,… ▽ More

    Submitted 30 August, 2021; originally announced August 2021.

    Comments: Presented on The Fifth International Workshop on Automation in Machine Learning, A workshop to be held in conjunction with the KDD 2021 Conference

  3. arXiv:2012.06171  [pdf, other

    cs.DC cs.DB

    The Future is Big Graphs! A Community View on Graph Processing Systems

    Authors: Sherif Sakr, Angela Bonifati, Hannes Voigt, Alexandru Iosup, Khaled Ammar, Renzo Angles, Walid Aref, Marcelo Arenas, Maciej Besta, Peter A. Boncz, Khuzaima Daudjee, Emanuele Della Valle, Stefania Dumbrava, Olaf Hartig, Bernhard Haslhofer, Tim Hegeman, Jan Hidders, Katja Hose, Adriana Iamnitchi, Vasiliki Kalavri, Hugo Kapp, Wim Martens, M. Tamer Özsu, Eric Peukert, Stefan Plantikow , et al. (16 additional authors not shown)

    Abstract: Graphs are by nature unifying abstractions that can leverage interconnectedness to represent, explore, predict, and explain real- and digital-world phenomena. Although real users and consumers of graph instances and graph workloads understand these abstractions, future problems will require new abstractions and systems. What needs to happen in the next decade for big graph processing to continue t… ▽ More

    Submitted 11 December, 2020; originally announced December 2020.

    Comments: 12 pages, 3 figures, collaboration between the large-scale systems and data management communities, work started at the Dagstuhl Seminar 19491 on Big Graph Processing Systems, to be published in the Communications of the ACM

    ACM Class: C.3; E.0; H.2; J.0

  4. arXiv:2001.07906  [pdf, ps, other

    cs.DB cs.IR cs.SI

    Graph Generators: State of the Art and Open Challenges

    Authors: Angela Bonifati, Irena Holubová, Arnau Prat-Pérez, Sherif Sakr

    Abstract: The abundance of interconnected data has fueled the design and implementation of graph generators reproducing real-world linking properties, or gauging the effectiveness of graph algorithms, techniques and applications manipulating these data. We consider graph generation across multiple subfields, such as Semantic Web, graph databases, social networks, and community detection, along with general… ▽ More

    Submitted 22 January, 2020; originally announced January 2020.

    Comments: ACM Computing Surveys, 32 pages

  5. arXiv:1906.02287  [pdf, other

    cs.LG stat.ML

    Automated Machine Learning: State-of-The-Art and Open Challenges

    Authors: Radwa Elshawi, Mohamed Maher, Sherif Sakr

    Abstract: With the continuous and vast increase in the amount of data in our digital world, it has been acknowledged that the number of knowledgeable data scientists can not scale to address these challenges. Thus, there was a crucial need for automating the process of building good machine learning models. In the last few years, several techniques and frameworks have been introduced to tackle the challenge… ▽ More

    Submitted 11 June, 2019; v1 submitted 5 June, 2019; originally announced June 2019.

  6. arXiv:1709.07493  [pdf, other

    cs.DB

    Big Data Systems Meet Machine Learning Challenges: Towards Big Data Science as a Service

    Authors: Radwa Elshawi, Sherif Sakr

    Abstract: Recently, we have been witnessing huge advancements in the scale of data we routinely generate and collect in pretty much everything we do, as well as our ability to exploit modern technologies to process, analyze and understand this data. The intersection of these trends is what is called, nowadays, as Big Data Science. Cloud computing represents a practical and cost-effective solution for suppor… ▽ More

    Submitted 21 September, 2017; originally announced September 2017.

  7. arXiv:1702.08153  [pdf, other

    cs.DC

    HPDedup: A Hybrid Prioritized Data Deduplication Mechanism for Primary Storage in the Cloud

    Authors: Huijun Wu, Chen Wang, Yin** Fu, Sherif Sakr, Liming Zhu, Kai Lu

    Abstract: Eliminating duplicate data in primary storage of clouds increases the cost-efficiency of cloud service providers as well as reduces the cost of users for using cloud services. Existing primary deduplication techniques either use inline caching to exploit locality in primary workloads or use post-processing deduplication running in system idle time to avoid the negative impact on I/O performance. H… ▽ More

    Submitted 16 April, 2017; v1 submitted 27 February, 2017; originally announced February 2017.

    Comments: 14 pages, 11 figures, submitted to MSST2017

  8. arXiv:1302.2966  [pdf, other

    cs.DB

    The Family of MapReduce and Large Scale Data Processing Systems

    Authors: Sherif Sakr, Anna Liu, Ayman G. Fayoumi

    Abstract: In the last two decades, the continuous increase of computational power has produced an overwhelming flow of data which has called for a paradigm shift in the computing architecture and large scale data processing mechanisms. MapReduce is a simple and powerful programming model that enables easy development of scalable parallel applications to process vast amounts of data on large clusters of comm… ▽ More

    Submitted 12 February, 2013; originally announced February 2013.

    Comments: arXiv admin note: text overlap with arXiv:1105.4252 by other authors

  9. arXiv:1211.5817  [pdf, ps, other

    cs.DB

    Extending SPARQL to Support Entity Grou** and Path Queries

    Authors: Seyed-Mehdi-Reza Beheshti, Sherif Sakr, Boualem Benatallah, Hamid Reza Motahari-Nezhad

    Abstract: The ability to efficiently find relevant subgraphs and paths in a large graph to a given query is important in many applications including scientific data analysis, social networks, and business intelligence. Currently, there is little support and no efficient approaches for expressing and executing such queries. This paper proposes a data model and a query language to address this problem. The co… ▽ More

    Submitted 21 November, 2012; originally announced November 2012.

    Comments: 23 pages. arXiv admin note: text overlap with arXiv:1211.5009

    Report number: UNSW-CSE-TR-1019

  10. arXiv:1102.1064  [pdf, other

    cs.DL cs.DB

    A Decade of Database Research Publications

    Authors: Sherif Sakr, Mohammad Alomari

    Abstract: We analyze the database research publications of four major core database technology conferences (SIGMOD, VLDB, ICDE, EDBT), two main theoretical database conferences (PODS, ICDT) and three database journals (TODS, VLDB Journal, TKDE) over a period of 10 years (2001 - 2010). Our analysis considers only regular papers as we do not include short papers, demo papers, posters, tutorials or panels into… ▽ More

    Submitted 5 February, 2011; originally announced February 2011.

  11. arXiv:0806.0075  [pdf, other

    cs.DB

    An Experimental Investigation of XML Compression Tools

    Authors: Sherif Sakr

    Abstract: This paper presents an extensive experimental study of the state-of-the-art of XML compression tools. The study reports the behavior of nine XML compressors using a large corpus of XML documents which covers the different natures and scales of XML documents. In addition to assessing and comparing the performance characteristics of the evaluated XML compression tools, the study tries to assess th… ▽ More

    Submitted 31 May, 2008; originally announced June 2008.

    Comments: http://xmlcompbench.sourceforge.net/