Skip to main content

Showing 1–17 of 17 results for author: Hose, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2303.11042  [pdf, other

    cs.LG cs.AI

    Hospitalization Length of Stay Prediction using Patient Event Sequences

    Authors: Emil Riis Hansen, Thomas Dyhre Nielsen, Thomas Mulvad, Mads Nibe Strausholm, Tomer Sagi, Katja Hose

    Abstract: Predicting patients hospital length of stay (LOS) is essential for improving resource allocation and supporting decision-making in healthcare organizations. This paper proposes a novel approach for predicting LOS by modeling patient information as sequences of events. Specifically, we present a transformer-based model, termed Medic-BERT (M-BERT), for LOS prediction using the unique features descri… ▽ More

    Submitted 20 March, 2023; originally announced March 2023.

    Comments: 11 pages, 5 figures

    MSC Class: 68T07 ACM Class: I.2.7; J.3

  2. arXiv:2303.02204  [pdf, other

    cs.LG

    KGLiDS: A Platform for Semantic Abstraction, Linking, and Automation of Data Science

    Authors: Mossad Helali, Niki Monjazeb, Shubham Vashisth, Philippe Carrier, Ahmed Helal, Antonio Cavalcante, Khaled Ammar, Katja Hose, Essam Mansour

    Abstract: In recent years, we have witnessed the growing interest from academia and industry in applying data science technologies to analyze large amounts of data. In this process, a myriad of artifacts (datasets, pipeline scripts, etc.) are created. However, there has been no systematic attempt to holistically collect and exploit all the knowledge and experiences that are implicitly contained in those art… ▽ More

    Submitted 12 June, 2024; v1 submitted 3 March, 2023; originally announced March 2023.

    Comments: 15 pages, 9 figures

  3. arXiv:2210.05781  [pdf, other

    cs.DB

    Transforming RDF-star to Property Graphs: A Preliminary Analysis of Transformation Approaches -- extended version

    Authors: Ghadeer Abuoda, Daniele Dell'Aglio, Arthur Keen, Katja Hose

    Abstract: RDF and property graph models have many similarities, such as using basic graph concepts like nodes and edges. However, such models differ in their modeling approach, expressivity, serialization, and the nature of applications. RDF is the de-facto standard model for knowledge graphs on the Semantic Web and supported by a rich ecosystem for inference and processing. The property graph model, in con… ▽ More

    Submitted 11 October, 2022; originally announced October 2022.

  4. arXiv:2209.04185  [pdf, other

    cs.IR

    Simple and Powerful Architecture for Inductive Recommendation Using Knowledge Graph Convolutions

    Authors: Theis E. Jendal, Matteo Lissandrini, Peter Dolog, Katja Hose

    Abstract: Using graph models with relational information in recommender systems has shown promising results. Yet, most methods are transductive, i.e., they are based on dimensionality reduction architectures. Hence, they require heavy retraining every time new items or users are added. Conversely, inductive methods promise to solve these issues. Nonetheless, all inductive methods rely only on interactions,… ▽ More

    Submitted 13 September, 2022; v1 submitted 9 September, 2022; originally announced September 2022.

  5. arXiv:2208.14692  [pdf, other

    cs.DB

    The Lothbrok approach for SPARQL Query Optimization over Decentralized Knowledge Graphs

    Authors: Christian Aebeloe, Gabriela Montoya, Katja Hose

    Abstract: While the Web of Data in principle offers access to a wide range of interlinked data, the architecture of the Semantic Web today relies mostly on the data providers to maintain access to their data through SPARQL endpoints. Several studies, however, have shown that such endpoints often experience downtime, meaning that the data they maintain becomes inaccessible. While decentralized systems based… ▽ More

    Submitted 31 August, 2022; originally announced August 2022.

  6. arXiv:2204.12270  [pdf, other

    q-bio.GN cs.LG q-bio.QM

    Graph Neural Networks for Microbial Genome Recovery

    Authors: Andre Lamurias, Alessandro Tibo, Katja Hose, Mads Albertsen, Thomas Dyhre Nielsen

    Abstract: Microbes have a profound impact on our health and environment, but our understanding of the diversity and function of microbial communities is severely limited. Through DNA sequencing of microbial communities (metagenomics), DNA fragments (reads) of the individual microbes can be obtained, which through assembly graphs can be combined into long contiguous DNA sequences (contigs). Given the complex… ▽ More

    Submitted 26 April, 2022; originally announced April 2022.

  7. arXiv:2111.13186  [pdf, other

    cs.LG

    Federated Data Science to Break Down Silos [Vision]

    Authors: Essam Mansour, Kavitha Srinivas, Katja Hose

    Abstract: Similar to Open Data initiatives, data science as a community has launched initiatives for sharing not only data but entire pipelines, derivatives, artifacts, etc. (Open Data Science). However, the few efforts that exist focus on the technical part on how to facilitate sharing, conversion, etc. This vision paper goes a step further and proposes KEK, an open federated data science platform that doe… ▽ More

    Submitted 25 November, 2021; originally announced November 2021.

    Comments: Accepted at SIGMOD Record

  8. MindReader: Recommendation over Knowledge Graph Entities with Explicit User Ratings

    Authors: Anders H. Brams, Anders L. Jakobsen, Theis E. Jendal, Matteo Lissandrini, Peter Dolog, Katja Hose

    Abstract: Knowledge Graphs (KGs) have been integrated in several models of recommendation to augment the informational value of an item by means of its related entities in the graph. Yet, existing datasets only provide explicit ratings on items and no information is provided about user opinions of other (non-recommendable) entities. To overcome this limitation, we introduce a new dataset, called the MindRea… ▽ More

    Submitted 8 June, 2021; originally announced June 2021.

  9. arXiv:2012.06171  [pdf, other

    cs.DC cs.DB

    The Future is Big Graphs! A Community View on Graph Processing Systems

    Authors: Sherif Sakr, Angela Bonifati, Hannes Voigt, Alexandru Iosup, Khaled Ammar, Renzo Angles, Walid Aref, Marcelo Arenas, Maciej Besta, Peter A. Boncz, Khuzaima Daudjee, Emanuele Della Valle, Stefania Dumbrava, Olaf Hartig, Bernhard Haslhofer, Tim Hegeman, Jan Hidders, Katja Hose, Adriana Iamnitchi, Vasiliki Kalavri, Hugo Kapp, Wim Martens, M. Tamer Özsu, Eric Peukert, Stefan Plantikow , et al. (16 additional authors not shown)

    Abstract: Graphs are by nature unifying abstractions that can leverage interconnectedness to represent, explore, predict, and explain real- and digital-world phenomena. Although real users and consumers of graph instances and graph workloads understand these abstractions, future problems will require new abstractions and systems. What needs to happen in the next decade for big graph processing to continue t… ▽ More

    Submitted 11 December, 2020; originally announced December 2020.

    Comments: 12 pages, 3 figures, collaboration between the large-scale systems and data management communities, work started at the Dagstuhl Seminar 19491 on Big Graph Processing Systems, to be published in the Communications of the ACM

    ACM Class: C.3; E.0; H.2; J.0

  10. High-Level ETL for Semantic Data Warehouses -- Full Version

    Authors: Rudra Pratap Deb Nath, Oscar Romero, Torben Bach Pedersen, Katja Hose

    Abstract: The popularity of the Semantic Web (SW) encourages organizations to organize and publish semantic data using the RDF model. This growth poses new requirements to Business Intelligence (BI) technologies to enable On-Line Analytical Processing (OLAP)-like analysis over semantic data. The incorporation of semantic data into a Data Warehouse (DW) is not supported by the traditional Extract-Transform-L… ▽ More

    Submitted 12 June, 2020; originally announced June 2020.

    Comments: 44 pages including reference, 13 figures and 4 tables. This paper is submitted to Semantic Web Journal and now it is under review

    Journal ref: Semantic Web, vol. 13, no. 1, pp. 85-132, 2022

  11. arXiv:2002.09172  [pdf, other

    cs.DB

    Star Pattern Fragments: Accessing Knowledge Graphs through Star Patterns

    Authors: Christian Aebeloe, Ilkcan Keles, Gabriela Montoya, Katja Hose

    Abstract: The Semantic Web offers access to a vast Web of interlinked information accessible via SPARQL endpoints. Such endpoints offer a well-defined interface to retrieve results for complex SPARQL queries. The computational load for processing such SPARQL endpoints offer access to a vast amount of interlinked information. While they offer a well-defined interface for efficiently retrieving results for co… ▽ More

    Submitted 9 November, 2021; v1 submitted 21 February, 2020; originally announced February 2020.

  12. arXiv:2002.06608  [pdf, other

    cs.DB

    Multidimensional Enrichment of Spatial RDF Data for SOLAP -- Full Version

    Authors: Nurefsan Gür, Torben Bach Pedersen, Katja Hose, Mikael Midtgaard

    Abstract: Large volumes of spatial data and multidimensional data are being published on the Semantic Web, which has led to new opportunities for advanced analysis, such as Spatial Online Analytical Processing (SOLAP). The RDF Data Cube (QB) and QB4OLAP vocabularies have been widely used for annotating and publishing statistical and multidimensional RDF data. Although such statistical data sets might have s… ▽ More

    Submitted 16 February, 2020; originally announced February 2020.

    Comments: 33 pages, 8 figures, 7 tables, 10 listings, 7 algorithms, under review in Semantic Web Journal, available on http://www.semantic-web-journal.net/content/multidimensional-enrichment-spatial-rdf-data-solap

  13. arXiv:1912.08010  [pdf, other

    cs.DB

    Querying Linked Data: An Experimental Evaluation of State-of-the-Art Interfaces

    Authors: Gabriela Montoya, Ilkcan Keles, Katja Hose

    Abstract: The adoption of Semantic Web technologies, and in particular the Open Data initiative, has contributed to the steady growth of the number of datasets and triples accessible on the Web. Most commonly, queries over RDF data are evaluated over SPARQL endpoints. Recently, however, alternatives such as TPF have been proposed with the goal of shifting query processing load from the server running the SP… ▽ More

    Submitted 17 December, 2019; originally announced December 2019.

    Comments: 18 pages, 14 figures

  14. arXiv:1902.05134  [pdf, other

    cs.DS

    Efficient Continuous Multi-Query Processing over Graph Streams

    Authors: Lefteris Zervakis, Vinay Setty, Christos Tryfonopoulos, Katja Hose

    Abstract: Graphs are ubiquitous and ever-present data structures that have a wide range of applications involving social networks, knowledge bases and biological interactions. The evolution of a graph in such scenarios can yield important insights about the nature and activities of the underlying network, which can then be utilized for applications such as news dissemination, network monitoring, and content… ▽ More

    Submitted 13 February, 2019; originally announced February 2019.

  15. The Odyssey Approach for Optimizing Federated SPARQL Queries

    Authors: Gabriela Montoya, Hala Skaf-Molli, Katja Hose

    Abstract: Answering queries over a federation of SPARQL endpoints requires combining data from more than one data source. Optimizing queries in such scenarios is particularly challenging not only because of (i) the large variety of possible query execution plans that correctly answer the query but also because (ii) there is only limited access to statistics about schema and instance data of remote sources.… ▽ More

    Submitted 2 November, 2017; v1 submitted 17 May, 2017; originally announced May 2017.

    Comments: 16 pages, 10 figures

  16. arXiv:1212.5636  [pdf, other

    cs.DB

    Partout: A Distributed Engine for Efficient RDF Processing

    Authors: Luis Galárraga, Katja Hose, Ralf Schenkel

    Abstract: The increasing interest in Semantic Web technologies has led not only to a rapid growth of semantic data on the Web but also to an increasing number of backend applications with already more than a trillion triples in some cases. Confronted with such huge amounts of data and the future growth, existing state-of-the-art systems for storing RDF and processing SPARQL queries are no longer sufficient.… ▽ More

    Submitted 21 December, 2012; originally announced December 2012.

  17. arXiv:1210.5403  [pdf, other

    cs.DB

    An Experience Report of Large Scale Federations

    Authors: Andreas Schwarte, Peter Haase, Michael Schmidt, Katja Hose, Ralf Schenkel

    Abstract: We present an experimental study of large-scale RDF federations on top of the Bio2RDF data sources, involving 29 data sets with more than four billion RDF triples deployed in a local federation. Our federation is driven by FedX, a highly optimized federation mediator for Linked Data. We discuss design decisions, technical aspects, and experiences made in setting up and optimizing the Bio2RDF feder… ▽ More

    Submitted 19 October, 2012; originally announced October 2012.

    ACM Class: H.2.3; H.2.4; H.3.4