-
Universal Knowledge Graph Embeddings
Authors:
N'Dah Jean Kouagou,
Caglar Demir,
Hamada M. Zahera,
Adrian Wilke,
Stefan Heindorf,
Jiayi Li,
Axel-Cyrille Ngonga Ngomo
Abstract:
A variety of knowledge graph embedding approaches have been developed. Most of them obtain embeddings by learning the structure of the knowledge graph within a link prediction setting. As a result, the embeddings reflect only the structure of a single knowledge graph, and embeddings for different knowledge graphs are not aligned, e.g., they cannot be used to find similar entities across knowledge graphs via nearest neighbor search. However, knowledge graph embedding applications such as entity disambiguation require a more global representation, i.e., a representation that is valid across multiple sources. We propose to learn universal knowledge graph embeddings from large-scale interlinked knowledge sources. To this end, we fuse large knowledge graphs based on the owl:sameAs relation such that every entity is represented by a unique identity. We instantiate our idea by computing universal embeddings based on DBpedia and Wikidata yielding embeddings for about 180 million entities, 15 thousand relations, and 1.2 billion triples. We believe our computed embeddings will support the emerging field of graph foundation models. Moreover, we develop a convenient API to provide embeddings as a service. Experiments on link prediction suggest that universal knowledge graph embeddings encode better semantics compared to embeddings computed on a single knowledge graph. For reproducibility purposes, we provide our source code and datasets open access.
Submitted 5 July, 2024; v1 submitted 23 October, 2023;
originally announced October 2023.
-
PSI/J: A Portable Interface for Submitting, Monitoring, and Managing Jobs
Authors:
Mihael Hategan-Marandiuc,
Andre Merzky,
Nicholson Collier,
Ketan Maheshwari,
Jonathan Ozik,
Matteo Turilli,
Andreas Wilke,
Justin M. Wozniak,
Kyle Chard,
Ian Foster,
Rafael Ferreira da Silva,
Shantenu Jha,
Daniel Laney
Abstract:
It is generally desirable for high-performance computing (HPC) applications to be portable between HPC systems, for example to make use of more performant hardware, make effective use of allocations, and co-locate compute jobs with large datasets. Unfortunately, moving scientific applications between HPC systems is challenging for various reasons, most notably that HPC systems have different HPC schedulers. We introduce PSI/J, a job management abstraction API intended to simplify the construction of software components and applications that are portable over various HPC scheduler implementations. We argue that such a system is necessary and that no viable alternative currently exists. We analyze similar notable APIs and attempt to determine the factors that influenced their evolution and adoption by the HPC community. We base the design of PSI/J on that analysis. We describe how PSI/J has been integrated in three workflow systems and one application, and also show via experiments that PSI/J imposes minimal overhead.
Submitted 20 September, 2023; v1 submitted 15 July, 2023;
originally announced July 2023.
-
Open Data Portal Germany (OPAL) Projektergebnisse
Authors:
Adrian Wilke,
Axel-Cyrille Ngonga Ngomo
Abstract:
In the Open Data Portal Germany (OPAL) project, a pipeline of the following data refinement steps has been developed: requirements analysis, data acquisition, analysis, conversion, integration and selection. 800,000 datasets in DCAT format have been produced.
Submitted 7 May, 2021;
originally announced May 2021.
-
Fingerprinting Analog IoT Sensors for Secret-Free Authentication
Authors:
Felix Lorenz,
Lauritz Thamsen,
Andreas Wilke,
Ilja Behnke,
Jens Waldmüller-Littke,
Ilya Komarov,
Odej Kao,
Manfred Paeschke
Abstract:
Especially in the context of critical urban infrastructures, trust in IoT data is of utmost importance. While most technology stacks provide means for authentication and encryption of device-to-cloud traffic, there are currently no mechanisms to rule out physical tampering with an IoT device's sensors. Addressing this gap, we introduce a new method for extracting a hardware fingerprint of an IoT sensor which can be used for secret-free authentication. By comparing the fingerprint against reference measurements recorded prior to deployment, we can tell whether the sensing hardware connected to the IoT device has been changed by environmental effects or with malicious intent. Our approach exploits the characteristic behavior of analog circuits, which is revealed by applying a fixed-frequency alternating current to the sensor while recording its output voltage. To demonstrate the general feasibility of our method, we apply it to four commercially available temperature sensors using laboratory equipment and evaluate the accuracy. The results indicate that with a sensible configuration of the two hyperparameters we can identify individual sensors with high probability, using only a few recordings from the target device.
Submitted 11 June, 2020;
originally announced June 2020.
-
Critical Incidents for Technology Enhanced Learning in Vocational Education and Training - Observations from the field of mechanical engineering
Authors:
Adrian Wilke,
Johannes Magenheim
Abstract:
In this study, Vocational Education and Training (VET) in mechanical engineering companies is observed. A Learning Management System (LMS) had been developed to assist in solving typical task structures that are used over a period of three and a half years in the apprenticeship. The Critical Incident Technique (CIT) is applied for the observations. For the subsequent analysis, the incidents are classified. The most important incidents, as well as conclusions for Technology Enhanced Learning (TEL) in similar domains, are presented.
Submitted 13 January, 2019;
originally announced January 2019.