-
Detecting Data Type Inconsistencies in a Property Graph Database
Abstract: Some property graph databases do not have a fixed schema, which can result in data type inconsistencies for properties on nodes and relationships, especially when importing data into a running database. Here we present a tool which can rapidly produce a detailed report on every property in the graph. When executed on a large knowledge graph, it allowed us to debug a complex ETL process and enforce… ▽ More
Submitted 8 February, 2023; originally announced February 2023.
Comments: 5 pages, 3 figures, general approach applied to production databases
ACM Class: E.0
-
Importing Relationships into a Running Graph Database Using Parallel Processing
Abstract: Importing relationships into a running graph database using multiple threads running concurrently is a difficult task, as multiple threads cannot write information to the same node at the same time. Here we present an algorithm in which relationships are sorted into bins, then imported such that no two threads ever access the same node concurrently. When this algorithm was implemented as a procedu… ▽ More
Submitted 5 May, 2020; originally announced May 2020.
Comments: 5 pages, code provided on GitHub https://github.com/Lnofeisone/graph-iterateRelationship