
Showing 1–7 of 7 results for author: Ammar, K

Searching in archive cs.
  1. arXiv:2310.12032  [pdf, other]

    cs.LG stat.ML

    Exact and general decoupled solutions of the LMC Multitask Gaussian Process model

    Authors: Olivier Truffinet, Karim Ammar, Jean-Philippe Argaud, Bertrand Bouriquet

    Abstract: The Linear Model of Co-regionalization (LMC) is a very general model of multitask Gaussian process for regression or classification. While its expressivity and conceptual simplicity are appealing, naive implementations have cubic complexity in the number of datapoints and number of tasks, making approximations mandatory for most applications. However, recent work has shown that under some conditio…

    Submitted 21 March, 2024; v1 submitted 18 October, 2023; originally announced October 2023.

    Comments: 29 pages, 10 figures, submitted to UAI

    ACM Class: I.2.6

  2. arXiv:2303.02204  [pdf, other]

    cs.LG

    KGLiDS: A Platform for Semantic Abstraction, Linking, and Automation of Data Science

    Authors: Mossad Helali, Niki Monjazeb, Shubham Vashisth, Philippe Carrier, Ahmed Helal, Antonio Cavalcante, Khaled Ammar, Katja Hose, Essam Mansour

    Abstract: In recent years, we have witnessed the growing interest from academia and industry in applying data science technologies to analyze large amounts of data. In this process, a myriad of artifacts (datasets, pipeline scripts, etc.) are created. However, there has been no systematic attempt to holistically collect and exploit all the knowledge and experiences that are implicitly contained in those art…

    Submitted 12 June, 2024; v1 submitted 3 March, 2023; originally announced March 2023.

    Comments: 15 pages, 9 figures

  3. Optimizing Differentially-Maintained Recursive Queries on Dynamic Graphs

    Authors: Khaled Ammar, Siddhartha Sahu, Semih Salihoglu, M. Tamer Ozsu

    Abstract: Differential computation (DC) is a highly general incremental computation/view maintenance technique that can maintain the output of an arbitrary and possibly recursive dataflow computation upon changes to its base inputs. As such, it is a promising technique for graph database management systems (GDBMS) that support continuous recursive queries over dynamic graphs. Although differential computati…

    Submitted 30 July, 2022; originally announced August 2022.

    Journal ref: PVLDB, 15(11): 3186 - 3198, 2022

  4. arXiv:2012.06171  [pdf, other]

    cs.DC cs.DB

    The Future is Big Graphs! A Community View on Graph Processing Systems

    Authors: Sherif Sakr, Angela Bonifati, Hannes Voigt, Alexandru Iosup, Khaled Ammar, Renzo Angles, Walid Aref, Marcelo Arenas, Maciej Besta, Peter A. Boncz, Khuzaima Daudjee, Emanuele Della Valle, Stefania Dumbrava, Olaf Hartig, Bernhard Haslhofer, Tim Hegeman, Jan Hidders, Katja Hose, Adriana Iamnitchi, Vasiliki Kalavri, Hugo Kapp, Wim Martens, M. Tamer Özsu, Eric Peukert, Stefan Plantikow, et al. (16 additional authors not shown)

    Abstract: Graphs are by nature unifying abstractions that can leverage interconnectedness to represent, explore, predict, and explain real- and digital-world phenomena. Although real users and consumers of graph instances and graph workloads understand these abstractions, future problems will require new abstractions and systems. What needs to happen in the next decade for big graph processing to continue t…

    Submitted 11 December, 2020; originally announced December 2020.

    Comments: 12 pages, 3 figures, collaboration between the large-scale systems and data management communities, work started at the Dagstuhl Seminar 19491 on Big Graph Processing Systems, to be published in the Communications of the ACM

    ACM Class: C.3; E.0; H.2; J.0

  5. arXiv:1806.08082  [pdf, other]

    cs.DC

    Experimental Analysis of Distributed Graph Systems

    Authors: Khaled Ammar, Tamer Ozsu

    Abstract: This paper evaluates eight parallel graph processing systems: Hadoop, HaLoop, Vertica, Giraph, GraphLab (PowerGraph), Blogel, Flink Gelly, and GraphX (SPARK) over four very large datasets (Twitter, World Road Network, UK 200705, and ClueWeb) using four workloads (PageRank, WCC, SSSP and K-hop). The main objective is to perform an independent scale-out study by experimentally analyzing the performa…

    Submitted 21 June, 2018; originally announced June 2018.

    Comments: Volume 11 of Proc. VLDB Endowment

  6. arXiv:1805.11728  [pdf, other]

    cs.DB

    Sapphire: Querying RDF Data Made Simple

    Authors: Ahmed El-Roby, Khaled Ammar, Ashraf Aboulnaga, Jimmy Lin

    Abstract: RDF data in the linked open data (LOD) cloud is very valuable for many different applications. In order to unlock the full value of this data, users should be able to issue complex queries on the RDF datasets in the LOD cloud. SPARQL can express such complex queries, but constructing SPARQL queries can be a challenge to users since it requires knowing the structure and vocabulary of the datasets b…

    Submitted 13 September, 2018; v1 submitted 29 May, 2018; originally announced May 2018.

    Comments: 16 pages

  7. arXiv:1802.03760  [pdf, other]

    cs.DC cs.DB

    Distributed Evaluation of Subgraph Queries Using Worstcase Optimal LowMemory Dataflows

    Authors: Khaled Ammar, Frank McSherry, Semih Salihoglu, Manas Joglekar

    Abstract: We study the problem of finding and monitoring fixed-size subgraphs in a continually changing large-scale graph. We present the first approach that (i) performs worst-case optimal computation and communication, (ii) maintains a total memory footprint linear in the number of input edges, and (iii) scales down per-worker computation, communication, and memory requirements linearly as the number of w…

    Submitted 11 February, 2018; originally announced February 2018.