-
Identifying patterns and recommendations of and for sustainable open data initiatives: a benchmarking-driven analysis of open government data initiatives among European countries
Authors:
Martin Lnenicka,
Anastasija Nikiforova,
Mariusz Luterek,
Petar Milic,
Daniel Rudmark,
Sebastian Neumaier,
Caterina Santoro,
Cesar Casiano Flores,
Marijn Janssen,
Manuel Pedro Rodríguez Bolívar
Abstract:
Open government and open (government) data are seen as tools to create new opportunities, eliminate or at least reduce information inequalities and improve public services. More than a decade of these efforts has provided much experience, practices, and perspectives to learn how to better deal with them. This paper focuses on benchmarking of open data initiatives over the years and attempts to identify patterns observed among European countries that could lead to disparities in the development, growth, and sustainability of open data ecosystems. To do this, we studied benchmarks and indices published over the last years (57 editions of 8 artifacts) and conducted a comparative case study of eight European countries, identifying patterns among them considering different potentially relevant contexts such as e-government, open government data, open data indices and rankings, and others relevant for the country under consideration. Using a Delphi method, we reached a consensus within a panel of experts and validated a final list of 94 patterns, including their frequency of occurrence among studied countries and their effects on the respective countries. Finally, we took a closer look at the developments in identified contexts over the years and defined 21 recommendations for more resilient and sustainable open government data initiatives and ecosystems and future steps in this area.
Submitted 9 December, 2023; v1 submitted 1 December, 2023;
originally announced December 2023.
-
Policy Patterns for Usage Control in Data Spaces
Authors:
Tobias Dam,
Andreas Krimbacher,
Sebastian Neumaier
Abstract:
Data-driven technologies have the potential to initiate a transportation-related revolution in the way we travel, commute and navigate within cities. Because a major part of this transformation relies on Mobility Data Spaces for the exchange of mobility data, it becomes necessary to protect valuable data and formulate conditions for data exchange. This paper presents key contributions to the development of automated contract negotiation and data usage policies in the Mobility Data Space. A comprehensive listing of policy patterns for usage control is provided, addressing common requirements and scenarios in data sharing and governance. The use of the Open Digital Rights Language (ODRL) is proposed to formalize the collected policies, along with an extension of the ODRL vocabulary for data space-specific properties.
Submitted 20 September, 2023;
originally announced September 2023.
-
A Survey of Dataspace Connector Implementations
Authors:
Tobias Dam,
Lukas Daniel Klausner,
Sebastian Neumaier,
Torsten Priebe
Abstract:
The concept of dataspaces aims to facilitate secure and sovereign data exchange among multiple stakeholders. Technical implementations known as "connectors" support the definition of usage control policies and the verifiable enforcement of such policies. This paper provides an overview of existing literature and reviews current open-source dataspace connector implementations that are compliant with the International Data Spaces (IDS) standard. To assess maturity and readiness, we review four implementations with regard to their architecture, underlying data model and usage control language.
Submitted 9 January, 2024; v1 submitted 20 September, 2023;
originally announced September 2023.
-
Towards a Critical Open-Source Software Database
Authors:
Tobias Dam,
Lukas Daniel Klausner,
Sebastian Neumaier
Abstract:
Open-source software (OSS) plays a vital role in the modern software ecosystem. However, the maintenance and sustainability of OSS projects can be challenging. In this paper, we present the CrOSSD project, which aims to build a database of OSS projects and measure their current project "health" status. In the project, we will use both quantitative and qualitative metrics to evaluate the health of OSS projects. The quantitative metrics will be gathered through automated crawling of meta information such as the number of contributors, commits and lines of code. Qualitative metrics will be gathered for selected "critical" projects through manual analysis and automated tools, including aspects such as sustainability, funding, community engagement and adherence to security policies. The results of the analysis will be presented on a user-friendly web platform, which will allow users to view the health of individual OSS projects as well as the overall health of the OSS ecosystem. With this approach, the CrOSSD project provides a comprehensive and up-to-date view of the health of OSS projects, making it easier for developers, maintainers and other stakeholders to understand the health of OSS projects and make informed decisions about their use and maintenance.
Submitted 2 May, 2023;
originally announced May 2023.
-
Building a Knowledge Graph of Distributed Ledger Technologies
Authors:
Lukas König,
Sebastian Neumaier
Abstract:
Distributed ledger systems have become more prominent and successful in recent years, with a focus on blockchains and cryptocurrency. This has led to various misunderstandings about both the technology itself and its capabilities, as in many cases blockchain and cryptocurrency are used synonymously and other applications are often overlooked. Therefore, as a whole, the view of distributed ledger technology beyond blockchains and cryptocurrencies is very limited. Existing vocabularies and ontologies often focus on single aspects of the technology, or in some cases even just on one product. This potentially leads to other types of distributed ledgers and their possible use cases being neglected. In this paper, we present a knowledge graph and an ontology for distributed ledger technologies, which includes security considerations to model aspects such as threats and vulnerabilities, application domains, as well as relevant standards and regulations. Such a knowledge graph improves the overall understanding of distributed ledgers, reveals their strengths, and supports the work of security personnel, i.e. analysts and system architects. We discuss potential uses and follow semantic web best practices to evaluate and publish the ontology and knowledge graph.
Submitted 29 March, 2023;
originally announced March 2023.
-
Von Data Warehouse bis Data Mesh: Ein Wegweiser durch den Dschungel analytischer Datenarchitekturen
Authors:
Torsten Priebe,
Sebastian Neumaier,
Stefan Markus
Abstract:
Data warehouse, data lake, data lakehouse, data mesh ... many new names for analytical data architectures are currently circulating in the scene. But are the various approaches really so different? This article attempts a structured comparison of the different architecture paradigms, methodically based on DAMA-DMBOK and ArchiMate. Differences, similarities and dependencies as well as overlapping architectural building blocks are worked out and illustrated. This results in a first orientation guide for the choice of the right analytical data architecture for the respective use case.
Submitted 7 December, 2022;
originally announced December 2022.
-
Towards Measuring Vulnerabilities and Exposures in Open-Source Packages
Authors:
Tobias Dam,
Sebastian Neumaier
Abstract:
Much of the current software depends on open-source components, which in turn have complex dependencies on other open-source libraries. Vulnerabilities in open source therefore have potentially huge impacts. The goal of this work is to get a quantitative overview of the frequency and evolution of existing vulnerabilities in popular software repositories and package managers. To this end, we provide an up-to-date overview of the open source landscape and its most popular package managers, we discuss approaches to map entries of the Common Vulnerabilities and Exposures (CVE) list to open-source libraries and we show the frequency and distribution of existing CVE entries with respect to popular programming languages.
Submitted 9 May, 2023; v1 submitted 29 June, 2022;
originally announced June 2022.
-
Finding Your Way Through the Jungle of Big Data Architectures
Authors:
Torsten Priebe,
Sebastian Neumaier,
Stefan Markus
Abstract:
This paper presents a systematic review of common analytical data architectures based on DAMA-DMBOK and ArchiMate. The paper is work in progress and provides a first view on Gartner's Logical Data Warehouse paradigm, Data Fabric and Dehghani's Data Mesh proposal as well as their interdependencies. It furthermore sketches the way forward how this work can be extended by covering more architecture paradigms (incl. classic Data Warehouse, Data Vault, Data Lake, Lambda and Kappa architectures) and introducing a template with among others "context", "problem" and "solution" descriptions, leading ultimately to a pattern system providing guidance for choosing the right architecture paradigm for the right situation.
Submitted 11 January, 2022;
originally announced January 2022.
-
Challenges of Linking Organizational Information in Open Government Data to Knowledge Graphs
Authors:
Jan Portisch,
Omaima Fallatah,
Sebastian Neumaier,
Mohamad Yaser Jaradeh,
Axel Polleres
Abstract:
Open Government Data (OGD) is being published by various public administration organizations around the globe. Within the metadata of OGD data catalogs, the publishing organizations (1) are not uniquely and unambiguously identifiable and, even worse, (2) change over time, by public administration units being merged or restructured. In order to enable fine-grained analyses or searches on Open Government Data on the level of publishing organizations, linking those from OGD portals to publicly available knowledge graphs (KGs) such as Wikidata and DBpedia seems like an obvious solution. Still, as we show in this position paper, organization linking faces significant challenges, both in terms of available (portal) metadata and KGs in terms of data quality and completeness. We herein specifically highlight five main challenges, namely regarding (1) temporal changes in organizations and in the portal metadata, (2) lack of a base ontology for describing organizational structures and changes in public knowledge graphs, (3) metadata and KG data quality, (4) multilinguality, and (5) disambiguating public sector organizations. Based on available OGD portal metadata from the Open Data Portal Watch, we provide an in-depth analysis of these issues, make suggestions for concrete starting points on how to tackle them along with a call to the community to jointly work on these open challenges.
Submitted 14 August, 2020;
originally announced August 2020.
-
Knowledge Graphs
Authors:
Aidan Hogan,
Eva Blomqvist,
Michael Cochez,
Claudia d'Amato,
Gerard de Melo,
Claudio Gutierrez,
José Emilio Labra Gayo,
Sabrina Kirrane,
Sebastian Neumaier,
Axel Polleres,
Roberto Navigli,
Axel-Cyrille Ngonga Ngomo,
Sabbir M. Rashid,
Anisa Rula,
Lukas Schmelzeisen,
Juan Sequeda,
Steffen Staab,
Antoine Zimmermann
Abstract:
In this paper we provide a comprehensive introduction to knowledge graphs, which have recently garnered significant attention from both industry and academia in scenarios that require exploiting diverse, dynamic, large-scale collections of data. After some opening remarks, we motivate and contrast various graph-based data models and query languages that are used for knowledge graphs. We discuss the roles of schema, identity, and context in knowledge graphs. We explain how knowledge can be represented and extracted using a combination of deductive and inductive techniques. We summarise methods for the creation, enrichment, quality assessment, refinement, and publication of knowledge graphs. We provide an overview of prominent open knowledge graphs and enterprise knowledge graphs, their applications, and how they use the aforementioned techniques. We conclude with high-level future research directions for knowledge graphs.
Submitted 11 September, 2021; v1 submitted 4 March, 2020;
originally announced March 2020.
-
Talking Open Data
Authors:
Sebastian Neumaier,
Vadim Savenkov,
Svitlana Vakulenko
Abstract:
Enticing users into exploring Open Data remains an important challenge for the whole Open Data paradigm. Standard stock interfaces often used by Open Data portals are anything but inspiring even for tech-savvy users, let alone those without an articulated interest in data science. To address a broader range of citizens, we designed an open data search interface supporting natural language interactions via popular platforms like Facebook and Skype. Our data-aware chatbot answers search requests and suggests relevant open datasets, bringing a fun factor and the potential for viral dissemination to Open Data exploration. The current system prototype is available for Facebook (https://m.me/OpenDataAssistant) and Skype (https://join.skype.com/bot/6db830ca-b365-44c4-9f4d-d423f728e741) users.
Submitted 2 May, 2017;
originally announced May 2017.
-
Precision half-life measurement of the 4-fold forbidden electron capture of V-50
Authors:
H. Dombrowski,
S. Neumaier,
K. Zuber
Abstract:
A sensitive search for the 4-fold forbidden non-unique beta decay of V-50 has been performed. A total exposure of 185.8 kg·d has been accumulated. A reliable half-life value of $(2.29 \pm 0.25) \cdot 10^{17}$ years, the most precise so far, could be obtained for the electron capture decay of V-50 into the first excited state of Ti-50. A photon emission line following the 4-fold forbidden beta decay into the first excited state of Cr-50 could not be observed, resulting in a lower limit on the half-life of the beta decay branch of $1.7 \cdot 10^{18}$ years. This is barely in agreement with a claimed observation of this decay branch.
Submitted 6 July, 2011; v1 submitted 30 March, 2011;
originally announced March 2011.