Skip to main content

Showing 1–46 of 46 results for author: Bussche, J V d

.
  1. arXiv:2305.04656  [pdf, other

    cs.LO

    Preservation theorems for Tarski's relation algebra

    Authors: Bart Bogaerts, Balder ten Cate, Brett McLean, Jan Van den Bussche

    Abstract: We investigate a number of semantically defined fragments of Tarski's algebra of binary relations, including the function-preserving fragment. We address the question whether they are generated by a finite set of operations. We obtain several positive and negative results along these lines. Specifically, the homomorphism-safe fragment is finitely generated (both over finite and over arbitrary stru… ▽ More

    Submitted 11 April, 2024; v1 submitted 8 May, 2023; originally announced May 2023.

  2. arXiv:2304.14793  [pdf, other

    cs.LG cs.AI cs.DB

    Learning Graph Neural Networks using Exact Compression

    Authors: Jeroen Bollen, Jasper Steegmans, Jan Van den Bussche, Stijn Vansummeren

    Abstract: Graph Neural Networks (GNNs) are a form of deep learning that enable a wide range of machine learning applications on graph-structured data. The learning of GNNs, however, is known to pose challenges for memory-constrained devices such as GPUs. In this paper, we study exact compression as a way to reduce the memory requirements of learning GNNs on large graphs. In particular, we adopt a formal app… ▽ More

    Submitted 28 April, 2023; originally announced April 2023.

    Comments: Extended version of the paper to be published in the proceedings of the 6th Joint Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA), held in conjunction with SIGMOD/PODS 2023

  3. Expressiveness of SHACL Features and Extensions for Full Equality and Disjointness Tests

    Authors: Bart Bogaerts, Maxime Jakubowski, Jan Van den Bussche

    Abstract: SHACL is a W3C-proposed schema language for expressing structural constraints on RDF graphs. Recent work on formalizing this language has revealed a striking relationship to description logics. SHACL expressions can use three fundamental features that are not so common in description logics. These features are equality tests; disjointness tests; and closure constraints. Moreover, SHACL is peculiar… ▽ More

    Submitted 20 February, 2024; v1 submitted 7 December, 2022; originally announced December 2022.

    Journal ref: Logical Methods in Computer Science (February 28, 2024) lmcs:10442

  4. arXiv:2210.00240  [pdf, other

    cs.LO

    Executable First-Order Queries in the Logic of Information Flows

    Authors: Heba Aamer, Bart Bogaerts, Dimitri Surinx, Eugenia Ternovska, Jan Van den Bussche

    Abstract: The logic of information flows (LIF) has recently been proposed as a general framework in the field of knowledge representation. In this framework, tasks of procedural nature can still be modeled in a declarative, logic-based fashion. In this paper, we focus on the task of query processing under limited access patterns, a well-studied problem in the database literature. We show that LIF is well-su… ▽ More

    Submitted 21 May, 2024; v1 submitted 1 October, 2022; originally announced October 2022.

    ACM Class: H.2.3; I.2.4

  5. arXiv:2209.06448  [pdf, ps, other

    cs.LO

    Inputs, Outputs, and Composition in the Logic of Information Flows

    Authors: Heba Aamer, Bart Bogaerts, Dimitri Surinx, Eugenia Ternovska, Jan Van den Bussche

    Abstract: The logic of information flows (LIF) is a general framework in which tasks of a procedural nature can be modeled in a declarative, logic-based fashion. The first contribution of this paper is to propose semantic and syntactic definitions of inputs and outputs of LIF expressions. We study how the two relate and show that our syntactic definition is optimal in a sense that is made precise. The secon… ▽ More

    Submitted 14 September, 2022; originally announced September 2022.

    Comments: This paper is the extended version of a paper presented at KR 2020

  6. arXiv:2206.06754  [pdf, other

    cs.DB

    Expressiveness within Sequence Datalog

    Authors: Heba Aamer, Jan Hidders, Jan Paredaens, Jan Van den Bussche

    Abstract: Motivated by old and new applications, we investigate Datalog as a language for sequence databases. We reconsider classical features of Datalog programs, such as negation, recursion, intermediate predicates, and relations of higher arities. We also consider new features that are useful for sequences, notably, equations between path expressions, and "packing". Our goal is to clarify the relative ex… ▽ More

    Submitted 14 June, 2022; originally announced June 2022.

    Comments: This paper is the extended version of a paper presented at PODS 2021

  7. arXiv:2206.00140  [pdf, other

    cs.DB

    What Can Database Query Processing Do for Instance-Spanning Constraints?

    Authors: Heba Aamer, Marco Montali, Jan Van den Bussche

    Abstract: In the last decade, the term instance-spanning constraint has been introduced in the process mining field to refer to constraints that span multiple process instances of one or several processes. Of particular relevance, in this setting, is checking whether process executions comply with constraints of interest, which at runtime calls for suitable monitoring techniques. Even though event data are… ▽ More

    Submitted 31 May, 2022; originally announced June 2022.

  8. arXiv:2205.14269  [pdf, other

    cs.DB

    Temporal graph patterns by timed automata

    Authors: Amir Pouya Aghasadeghi, Jan Van den Bussche, Julia Stoyanovich

    Abstract: Temporal graphs represent graph evolution over time, and have been receiving considerable research attention. Work on expressing temporal graph patterns or discovering temporal motifs typically assumes relatively simple temporal constraints, such as journeys or, more generally, existential constraints, possibly with finite delays. In this paper we propose to use timed automata to express temporal… ▽ More

    Submitted 27 May, 2022; originally announced May 2022.

  9. On the expressive power of message-passing neural networks as global feature map transformers

    Authors: Floris Geerts, Jasper Steegmans, Jan Van den Bussche

    Abstract: We investigate the power of message-passing neural networks (MPNNs) in their capacity to transform the numerical features stored in the nodes of their input graphs. Our focus is on global expressive power, uniformly over all input graphs, or over graphs of bounded degree with features from a bounded domain. Accordingly, we introduce the notion of a global feature map transformer (GFMT). As a yards… ▽ More

    Submitted 17 March, 2022; originally announced March 2022.

    Comments: 17 pages, 1 figure

    Journal ref: Proceedings FoIKS 2022 conference

  10. arXiv:2112.11796  [pdf, other

    cs.DB cs.AI

    Shape Fragments

    Authors: Thomas Delva, Anastasia Dimou, Maxime Jakubowski, Jan Van den Bussche

    Abstract: In constraint languages for RDF graphs, such as ShEx and SHACL, constraints on nodes and their properties in RDF graphs are known as "shapes". Schemas in these languages list the various shapes that certain targeted nodes must satisfy for the graph to conform to the schema. Using SHACL, we propose in this paper a novel use of shapes, by which a set of shapes is used to extract a subgraph from an R… ▽ More

    Submitted 22 December, 2021; originally announced December 2021.

  11. arXiv:2108.06096  [pdf, ps, other

    cs.LO

    SHACL: A Description Logic in Disguise

    Authors: Bart Bogaerts, Maxime Jakubowski, Jan Van den Bussche

    Abstract: SHACL is a W3C-proposed language for expressing structural constraints on RDF graphs. In recent years, SHACL's popularity has risen quickly. This rise in popularity comes with questions related to its place in the semantic web, particularly about its relation to OWL (the de facto standard for expressing ontological information on the web) and description logics (which form the formal foundations o… ▽ More

    Submitted 28 June, 2022; v1 submitted 13 August, 2021; originally announced August 2021.

    Comments: Presented at LPNRM conference 2022

  12. arXiv:2006.04277  [pdf, ps, other

    cs.DB

    J-Logic: a Logic for Querying JSON

    Authors: Jan Hidders, Jan Paredaens, Jan Van den Bussche

    Abstract: We propose a logical framework, based on Datalog, to study the foundations of querying JSON data. The main feature of our approach, which we call J-Logic, is the emphasis on paths. Paths are sequences of keys and are used to access the tree structure of nested JSON objects. J-Logic also features ``packing'' as a means to generate a new key from a path or subpath. J-Logic with recursion is computat… ▽ More

    Submitted 7 June, 2020; originally announced June 2020.

  13. arXiv:2003.00644  [pdf, ps, other

    cs.LO cs.CC math.LO

    Descriptive complexity of real computation and probabilistic independence logic

    Authors: Miika Hannula, Juha Kontinen, Jan Van den Bussche, Jonni Virtema

    Abstract: We introduce a novel variant of BSS machines called Separate Branching BSS machines (S-BSS in short) and develop a Fagin-type logical characterisation for languages decidable in non-deterministic polynomial time by S-BSS machines. We show that NP on S-BSS machines is strictly included in NP on BSS machines and that every NP language on S-BSS machines is a countable union of closed sets in the usua… ▽ More

    Submitted 7 July, 2020; v1 submitted 1 March, 2020; originally announced March 2020.

    Journal ref: Proceedings of the 35th Annual ACM/IEEE Symposium on Logic in Computer Science (LICS), 2020. Association for Computing Machinery, New York, NY, USA, 550-563

  14. arXiv:1904.03934  [pdf, ps, other

    cs.DB cs.LO

    On matrices and $K$-relations

    Authors: Robert Brijder, Marc Gyssens, Jan Van den Bussche

    Abstract: We show that the matrix query language $\mathsf{MATLANG}$ corresponds to a natural fragment of the positive relational algebra on $K$-relations. The fragment is defined by introducing a composition operator and restricting $K$-relation arities to two. We then proceed to show that $\mathsf{MATLANG}$ can express all matrix queries expressible in the positive relational algebra on $K$-relations, when… ▽ More

    Submitted 8 April, 2019; originally announced April 2019.

    Comments: 17 pages, 3 figures

  15. arXiv:1903.03413  [pdf, ps, other

    cs.LO

    Descriptive Complexity of Deterministic Polylogarithmic Time and Space

    Authors: Flavio Ferrarotti, Senén González, José María Turull Torres, Jan Van den Bussche, Jonni Virtema

    Abstract: We propose logical characterizations of problems solvable in deterministic polylogarithmic time (PolylogTime) and polylogarithmic space (PolylogSpace). We introduce a novel two-sorted logic that separates the elements of the input domain from the bit positions needed to address these elements. We prove that the inflationary and partial fixed point vartiants of this logic capture PolylogTime and Po… ▽ More

    Submitted 1 December, 2019; v1 submitted 8 March, 2019; originally announced March 2019.

    Comments: Submitted to the Journal of Computer and System Sciences

  16. arXiv:1808.08822  [pdf, ps, other

    cs.DB

    A Monotone Preservation Result for Boolean Queries Expressed as a Containment of Conjunctive Queries

    Authors: Dimitri Surinx, Jan Van den Bussche

    Abstract: When a relational database is queried, the result is normally a relation. Some queries, however, only require a yes/no answer; such queries are often called boolean queries. It is customary in database theory to express boolean queries by testing nonemptiness of query expressions. Another interesting way for expressing boolean queries are containment statements of the form $Q_1 \subseteq Q_2$ wher… ▽ More

    Submitted 1 March, 2019; v1 submitted 27 August, 2018; originally announced August 2018.

  17. arXiv:1804.05926  [pdf, other

    cs.LO

    Expressivity within second-order transitive-closure logic

    Authors: Flavio Ferrarotti, Jan Van den Bussche, Jonni Virtema

    Abstract: Second-order transitive-closure logic, SO(TC), is an expressive declarative language that captures the complexity class PSPACE. Already its monadic fragment, MSO(TC), allows the expression of various NP-hard and even PSPACE-hard problems in a natural and elegant manner. As SO(TC) offers an attractive framework for expressing properties in terms of declaratively specified computations, it is intere… ▽ More

    Submitted 16 April, 2018; originally announced April 2018.

  18. arXiv:1803.01390  [pdf, ps, other

    cs.DB

    Comparing Downward Fragments of the Relational Calculus with Transitive Closure on Trees

    Authors: Jelle Hellings, Marc Gyssens, Yuqing Wu, Dirk Van Gucht, Jan Van den Bussche, Stijn Vansummeren, George H. L. Fletcher

    Abstract: Motivated by the continuing interest in the tree data model, we study the expressive power of downward navigational query languages on trees and chains. Basic navigational queries are built from the identity relation and edge relations using composition and union. We study the effects on relative expressiveness when we add transitive closure, projections, coprojections, intersection, and differenc… ▽ More

    Submitted 4 March, 2018; originally announced March 2018.

  19. arXiv:1709.08359  [pdf, ps, other

    cs.DB

    On the expressive power of query languages for matrices

    Authors: Robert Brijder, Floris Geerts, Jan Van den Bussche, Timmy Weerwag

    Abstract: We investigate the expressive power of $\mathsf{MATLANG}$, a formal language for matrix manipulation based on common matrix operations and linear algebra. The language can be extended with the operation $\mathsf{inv}$ of inverting a matrix. In $\mathsf{MATLANG}+\mathsf{inv}$ we can compute the transitive closure of directed graphs, whereas we show that this is not possible without inversion. Indee… ▽ More

    Submitted 25 September, 2017; originally announced September 2017.

    Comments: 21 pages, 3 figures

  20. arXiv:1608.03647  [pdf, other

    cs.LG

    Learning with Value-Ramp

    Authors: Tom J. Ameloot, Jan Van den Bussche

    Abstract: We study a learning principle based on the intuition of forming ramps. The agent tries to follow an increasing sequence of values until the agent meets a peak of reward. The resulting Value-Ramp algorithm is natural, easy to configure, and has a robust implementation with natural numbers.

    Submitted 23 April, 2017; v1 submitted 12 August, 2016; originally announced August 2016.

    Comments: Version 2: fixed notation in definition of transition + clarified a sentence in the Introduction

  21. arXiv:1511.08724  [pdf, other

    cs.LG cs.AI

    On the convergence of cycle detection for navigational reinforcement learning

    Authors: Tom J. Ameloot, Jan Van den Bussche

    Abstract: We consider a reinforcement learning framework where agents have to navigate from start states to goal states. We prove convergence of a cycle-detection learning algorithm on a class of tasks that we call reducible. Reducible tasks have an acyclic solution. We also syntactically characterize the form of the final policy. This characterization can be used to precisely detect the convergence point i… ▽ More

    Submitted 5 January, 2016; v1 submitted 27 November, 2015; originally announced November 2015.

  22. Putting Logic-Based Distributed Systems on Stable Grounds

    Authors: Tom J. Ameloot, Jan Van den Bussche, William R. Marczak, Peter Alvaro, Joseph M. Hellerstein

    Abstract: In the Declarative Networking paradigm, Datalog-like languages are used to express distributed computations. Whereas recently formal operational semantics for these languages have been developed, a corresponding declarative semantics has been lacking so far. The challenge is to capture precisely the amount of nondeterminism that is inherent to distributed computations due to concurrency, networkin… ▽ More

    Submitted 25 July, 2015; v1 submitted 20 July, 2015; originally announced July 2015.

    Comments: To appear in Theory and Practice of Logic Programming (TPLP)

    Journal ref: Theory and Practice of Logic Programming 16 (2016) 378-417

  23. arXiv:1503.01707  [pdf, ps, other

    cs.DB cs.AI cs.LO

    Map**-equivalence and oid-equivalence of single-function object-creating conjunctive queries

    Authors: Angela Bonifati, Werner Nutt, Riccardo Torlone, Jan Van den Bussche

    Abstract: Conjunctive database queries have been extended with a mechanism for object creation to capture important applications such as data exchange, data integration, and ontology-based data access. Object creation generates new object identifiers in the result, that do not belong to the set of constants in the source database. The new object identifiers can be also seen as Skolem terms. Hence, object-cr… ▽ More

    Submitted 12 January, 2016; v1 submitted 5 March, 2015; originally announced March 2015.

    Comments: This revised version has been accepted on 11 January 2016 for publication in The VLDB Journal

  24. Positive Neural Networks in Discrete Time Implement Monotone-Regular Behaviors

    Authors: Tom J. Ameloot, Jan Van den Bussche

    Abstract: We study the expressive power of positive neural networks. The model uses positive connection weights and multiple input neurons. Different behaviors can be expressed by varying the connection weights. We show that in discrete time, and in absence of noise, the class of positive neural networks captures the so-called monotone-regular behaviors, that are based on regular languages. A finer picture… ▽ More

    Submitted 1 December, 2015; v1 submitted 21 February, 2015; originally announced February 2015.

    Journal ref: Neural Computation, December 2015, Vol. 27, No. 12 , Pages 2623-2660

  25. arXiv:1406.1404  [pdf, ps, other

    cs.DB cs.AI

    On the satisfiability problem for SPARQL patterns

    Authors: Xiaowang Zhang, Jan Van den Bussche, François Picalausa

    Abstract: The satisfiability problem for SPARQL patterns is undecidable in general, since the expressive power of SPARQL 1.0 is comparable with that of the relational algebra. The goal of this paper is to delineate the boundary of decidability of satisfiability in terms of the constraints allowed in filter conditions. The classes of constraints considered are bound-constraints, negated bound-constraints, eq… ▽ More

    Submitted 1 June, 2016; v1 submitted 5 June, 2014; originally announced June 2014.

    Comments: Major revision, erroneous polynomial-time claims corrected, NP-completeness result added, detailed proofs added, experimental section added

  26. arXiv:1406.0349  [pdf, ps, other

    cs.LO cs.DB

    Undecidability of satisfiability in the algebra of finite binary relations with union, composition, and difference

    Authors: Tony Tan, Jan Van den Bussche, Xiaowang Zhang

    Abstract: We consider expressions built up from binary relation names using the operators union, composition, and set difference. We show that it is undecidable to test whether a given such expression $e$ is finitely satisfiable, i.e., whether there exist finite binary relations that can be substituted for the relation names so that $e$ evaluates to a nonempty result. This result already holds in restrictio… ▽ More

    Submitted 2 June, 2014; originally announced June 2014.

  27. arXiv:1405.1833  [pdf, ps, other

    cs.LO cs.AI

    FO(C): A Knowledge Representation Language of Causality

    Authors: Bart Bogaerts, Joost Vennekens, Marc Denecker, Jan Van den Bussche

    Abstract: Cause-effect relations are an important part of human knowledge. In real life, humans often reason about complex causes linked to complex effects. By comparison, existing formalisms for representing knowledge about causal relations are quite limited in the kind of specifications of causes and effects they allow. In this paper, we present the new language C-Log, which offers a significantly more ex… ▽ More

    Submitted 9 May, 2014; v1 submitted 8 May, 2014; originally announced May 2014.

    Comments: To appear in Theory and Practice of Logic Programming (TPLP)

  28. arXiv:1404.6394  [pdf, ps, other

    cs.LO

    FO(C) and Related Modelling Paradigms

    Authors: Bart Bogaerts, Joost Vennekens, Marc Denecker, Jan Van den Bussche

    Abstract: Recently, C-Log was introduced as a language for modelling causal processes. Its formal semantics has been defined together with introductory examples, but the study of this language is far from finished. In this paper, we compare C-Log to other declarative modelling languages. More specifically, we compare to first-order logic (FO), and argue that C-Log and FO are orthogonal and that their integr… ▽ More

    Submitted 25 April, 2014; originally announced April 2014.

    Comments: This paper appears in the Proceedings of the 15th International Workshop on Non-Monotonic Reasoning (NMR 2014)

  29. arXiv:1404.6368  [pdf, ps, other

    cs.LO

    Inference in the FO(C) Modelling Language

    Authors: Bart Bogaerts, Joost Vennekens, Marc Denecker, Jan Van den Bussche

    Abstract: Recently, FO(C), the integration of C-Log with classical logic, was introduced as a knowledge representation language. Up to this point, no systems exist that perform inference on FO(C), and very little is known about properties of inference in FO(C). In this paper, we study both of the above problems. We define normal forms for FO(C), one of which corresponds to FO(ID). We define transformations… ▽ More

    Submitted 25 April, 2014; originally announced April 2014.

    Comments: The paper appears in the Proceedings of the 15th International Workshop on Non-Monotonic Reasoning (NMR 2014)

  30. Relative Expressive Power of Navigational Querying on Graphs

    Authors: George H. L. Fletcher, Marc Gyssens, Dirk Leinders, Dimitri Surinx, Jan Van den Bussche, Dirk Van Gucht, Stijn Vansummeren, Yuqing Wu

    Abstract: Motivated by both established and new applications, we study navigational query languages for graphs (binary relations). The simplest language has only the two operators union and composition, together with the identity relation. We make more powerful languages by adding any of the following operators: intersection; set difference; projection; coprojection; converse; and the diversity relation. Al… ▽ More

    Submitted 28 November, 2014; v1 submitted 31 January, 2014; originally announced January 2014.

    Comments: An extended abstract announcing the results of this paper was presented at the 14th International Conference on Database Theory, Uppsala, Sweden, March 2011

    Journal ref: Information Sciences (2015), pp. 390-406

  31. Similarity and bisimilarity notions appropriate for characterizing indistinguishability in fragments of the calculus of relations

    Authors: George H. L. Fletcher, Marc Gyssens, Dirk Leinders, Jan Van den Bussche, Dirk Van Gucht, Stijn Vansummeren

    Abstract: Motivated by applications in databases, this paper considers various fragments of the calculus of binary relations. The fragments are obtained by leaving out, or kee** in, some of the standard operators, along with some derived operators such as set difference, projection, coprojection, and residuation. For each considered fragment, a characterization is obtained for when two given binary relati… ▽ More

    Submitted 28 March, 2014; v1 submitted 9 October, 2012; originally announced October 2012.

    Comments: 36 pages, Journal of Logic and Computation 2014

  32. Relational transducers for declarative networking

    Authors: Tom Ameloot, Frank Neven, Jan Van den Bussche

    Abstract: Motivated by a recent conjecture concerning the expressiveness of declarative networking, we propose a formal computation model for "eventually consistent" distributed querying, based on relational transducers. A tight link has been conjectured between coordination-freeness of computations, and monotonicity of the queries expressed by such computations. Indeed, we propose a formal definition of co… ▽ More

    Submitted 13 December, 2010; originally announced December 2010.

    Journal ref: 30th ACM Symposium on Principles of Database Systems, 2011

  33. arXiv:1008.2626  [pdf, ps, other

    cs.DB cs.AI

    Mining tree-query associations in graphs

    Authors: Eveline Hoekx, Jan Van den Bussche

    Abstract: New applications of data mining, such as in biology, bioinformatics, or sociology, are faced with large datasetsstructured as graphs. We introduce a novel class of tree-shapedpatterns called tree queries, and present algorithms for miningtree queries and tree-query associations in a large data graph. Novel about our class of patterns is that they can containconstants, and can contain existential n… ▽ More

    Submitted 16 August, 2010; originally announced August 2010.

    Comments: Full version of two earlier conference papers presented at KDD 2005 and ICDM 2006

    ACM Class: H.2.8

  34. arXiv:0709.4655  [pdf, ps, other

    cs.DB cs.AI

    Mining for trees in a graph is NP-complete

    Authors: Jan Van den Bussche

    Abstract: Mining for trees in a graph is shown to be NP-complete.

    Submitted 28 September, 2007; originally announced September 2007.

    ACM Class: H.2.8

  35. arXiv:cs/0608091  [pdf, ps, other

    cs.DS cs.DB

    On-line topological simplification of weighted graphs

    Authors: Floris Geerts, Peter Revesz, Jan Van den Bussche

    Abstract: We describe two efficient on-line algorithms to simplify weighted graphs by eliminating degree-two vertices. Our algorithms are on-line in that they react to updates on the data, kee** the simplification up-to-date. The supported updates are insertions of vertices and edges; hence, our algorithms are partially dynamic. We provide both analytical and empirical evaluations of the efficiency of o… ▽ More

    Submitted 23 August, 2006; originally announced August 2006.

    Comments: This is the full techreport corresponding to the paper "On-line maintenance of simplified weighted graphs for efficient distance queries" in the proceedings of ACM-GIS 2006

    Journal ref: Proceedings ACM-GIS 2006, ACM Press

  36. On the tree-transformation power of XSLT

    Authors: Wim Janssen, Alexandr Korlyukov, Jan Van den Bussche

    Abstract: XSLT is a standard rule-based programming language for expressing transformations of XML data. The language is currently in transition from version 1.0 to 2.0. In order to understand the computational consequences of this transition, we restrict XSLT to its pure tree-transformation capabilities. Under this focus, we observe that XSLT~1.0 was not yet a computationally complete tree-transformation… ▽ More

    Submitted 8 March, 2006; originally announced March 2006.

    ACM Class: D.3.1; H.2.3; F.1.1

    Journal ref: Acta Informatica, Volume 43, Number 6 / January, 2007

  37. arXiv:cs/0407007  [pdf, ps, other

    cs.DB cs.LO

    The semijoin algebra and the guarded fragment

    Authors: Dirk Leinders, Jerzy Tyszkiewicz, Jan Van den Bussche

    Abstract: The semijoin algebra is the variant of the relational algebra obtained by replacing the join operator by the semijoin operator. We discuss some interesting connections between the semijoin algebra and the guarded fragment of first-order logic. We also provide an Ehrenfeucht-Fraisse game, characterizing the discerning power of the semijoin algebra. This game gives a method for showing that certai… ▽ More

    Submitted 2 July, 2004; originally announced July 2004.

    Comments: 11 pages, 2 figures

    ACM Class: H.2.3; F.4.1

  38. arXiv:cs/0406060  [pdf, ps, other

    cs.DB cs.PL

    Well-Definedness and Semantic Type-Checking in the Nested Relational Calculus and XQuery

    Authors: Jan Van den Bussche, Dirk Van Gucht, Stijn Vansummeren

    Abstract: Two natural decision problems regarding the XML query language XQuery are well-definedness and semantic type-checking. We study these problems in the setting of a relational fragment of XQuery. We show that well-definedness and semantic type-checking are undecidable, even in the positive-existential case. Nevertheless, for a ``pure'' variant of XQuery, in which no identification is made between… ▽ More

    Submitted 29 June, 2004; originally announced June 2004.

  39. arXiv:cs/0308014  [pdf, ps, other

    cs.DB cs.LO

    On the expressive power of semijoin queries

    Authors: Dirk Leinders, Jerzy Tyszkiewicz, Jan Van den Bussche

    Abstract: The semijoin algebra is the variant of the relational algebra obtained by replacing the join operator by the semijoin operator. We provide an Ehrenfeucht-Fraissé game, characterizing the discerning power of the semijoin algebra. This game gives a method for showing that queries are not expressible in the semijoin algebra.

    Submitted 3 March, 2004; v1 submitted 6 August, 2003; originally announced August 2003.

    Comments: 9 pages, to appear in Information Processing Letters; added results that more clearly delineate the expressive power of SA, added a section that discusses the impact of order on the expressive power of SA, deemphasized the discussion on the relationship with GF

    ACM Class: H.2.3; F.4.1

  40. arXiv:cs/0308001  [pdf, ps, other

    cs.LO cs.CG cs.DB

    Two- versus three-dimensional connectivity testing of first-order queries to semi-algebraic sets

    Authors: Floris Geerts, Lieven Smits, Jan Van den Bussche

    Abstract: This paper addresses the question whether one can determine the connectivity of a semi-algebraic set in three dimensions by testing the connectivity of a finite number of two-dimensional ``samples'' of the set, where these samples are defined by first-order queries. The question is answered negatively for two classes of first-order queries: cartesian-product-free, and positive one-pass.

    Submitted 1 August, 2003; v1 submitted 31 July, 2003; originally announced August 2003.

    Comments: corrected minor confusion in Proof of Theorem 1

    ACM Class: F.4.1; F.2.2; H.2.8

    Journal ref: A revised version has been published online (21 July 2005) in Acta Informatica under the title "N- versus (N-1)-dimensional connectivity testing of first-order queries to semi-algebraic sets"

  41. arXiv:cs/0206023  [pdf, ps, other

    cs.DB cs.AI

    Relational Association Rules: getting WARMeR

    Authors: Bart Goethals, Jan Van den Bussche

    Abstract: In recent years, the problem of association rule mining in transactional data has been well studied. We propose to extend the discovery of classical association rules to the discovery of association rules of conjunctive queries in arbitrary relational data, inspired by the WARMR algorithm, developed by Dehaspe and Toivonen, that discovers association rules over a limited set of conjunctive queri… ▽ More

    Submitted 15 June, 2002; originally announced June 2002.

    ACM Class: H.2.8

  42. Towards practical meta-querying

    Authors: Jan Van den Bussche, Stijn Vansummeren, Gottfried Vossen

    Abstract: We describe a meta-querying system for databases containing queries in addition to ordinary data. In the context of such databases, a meta-query is a query about queries. Representing stored queries in XML, and using the standard XML manipulation language XSLT as a sublanguage, we show that just a few features need to be added to SQL to turn it into a fully-fledged meta-query language. The good… ▽ More

    Submitted 13 October, 2003; v1 submitted 25 February, 2002; originally announced February 2002.

    Comments: Includes a new section "Experimental performance evaluation"

    ACM Class: H.2.3

    Journal ref: Information Systems, Volume 30, Issue 4 , June 2005, Pages 317-332

  43. arXiv:cs/0112011  [pdf, ps, other

    cs.DB cs.AI

    Interactive Constrained Association Rule Mining

    Authors: Bart Goethals, Jan Van den Bussche

    Abstract: We investigate ways to support interactive mining sessions, in the setting of association rule mining. In such sessions, users specify conditions (queries) on the associations to be generated. Our approach is a combination of the integration of querying conditions inside the mining phase, and the incremental querying of already generated associations. We present several concrete algorithms and c… ▽ More

    Submitted 5 February, 2003; v1 submitted 10 December, 2001; originally announced December 2001.

    Comments: A preliminary report on this work was presented at the Second International Conference on Knowledge Discovery and Data Mining (DaWaK 2000)

    ACM Class: H.2.8

  44. arXiv:cs/0112007  [pdf, ps, other

    cs.DB cs.AI

    A Tight Upper Bound on the Number of Candidate Patterns

    Authors: Floris Geerts, Bart Goethals, Jan Van den Bussche

    Abstract: In the context of mining for frequent patterns using the standard levelwise algorithm, the following question arises: given the current level and the current set of frequent patterns, what is the maximal number of candidate patterns that can be generated on the next level? We answer this question by providing a tight upper bound, derived from a combinatorial result from the sixties by Kruskal an… ▽ More

    Submitted 29 November, 2002; v1 submitted 7 December, 2001; originally announced December 2001.

    ACM Class: H.2.8

  45. arXiv:cs/0106035  [pdf, ps, other

    cs.LO cs.DB

    Polymorphic type inference for the relational algebra

    Authors: Jan Van den Bussche, Emmanuel Waller

    Abstract: We give a polymorphic account of the relational algebra. We introduce a formalism of ``type formulas'' specifically tuned for relational algebra expressions, and present an algorithm that computes the ``principal'' type for a given expression. The principal type of an expression is a formula that specifies, in a clear and concise manner, all assignments of types (sets of attributes) to relation… ▽ More

    Submitted 14 June, 2001; originally announced June 2001.

    ACM Class: D.3.3, H.2.3

  46. arXiv:cs/0106034  [pdf, ps, other

    cs.LO cs.DB

    Solving equations in the relational algebra

    Authors: Joachim Biskup, Jan Paredaens, Thomas Schwentick, Jan Van den Bussche

    Abstract: Enumerating all solutions of a relational algebra equation is a natural and powerful operation which, when added as a query language primitive to the nested relational algebra, yields a query language for nested relational databases, equivalent to the well-known powerset algebra. We study \emph{sparse} equations, which are equations with at most polynomially many solutions. We look at their comp… ▽ More

    Submitted 10 December, 2003; v1 submitted 14 June, 2001; originally announced June 2001.

    Comments: Minor revision, accepted for publication in SIAM Journal on Computing

    ACM Class: F.4.2; H.2.3