Showing 1–25 of 25 results for author: Riveros, C

  1. arXiv:2405.12358  [pdf, other]

    cs.DB cs.LO

    Using Color Refinement to Boost Enumeration and Counting for Acyclic CQs of Binary Schemas

    Authors: Cristian Riveros, Benjamin Scheidt, Nicole Schweikardt

    Abstract: We present an index structure, called the color-index, to boost the evaluation of acyclic conjunctive queries (ACQs) over binary schemas. The color-index is based on the color refinement algorithm, a widely used subroutine for graph isomorphism testing algorithms. Given a database $D$, we use a suitable version of the color refinement algorithm to produce a stable coloring of $D$, an assignment fr…

    Submitted 20 May, 2024; originally announced May 2024.

  2. arXiv:2405.12350  [pdf, ps, other]

    cs.DB cs.FL

    A framework for extraction and transformation of documents

    Authors: Cristian Riveros, Markus L. Schmid, Nicole Schweikardt

    Abstract: We present a theoretical framework for the extraction and transformation of text documents. We propose to use a two-phase process where the first phase extracts span-tuples from a document, and the second phase maps the content of the span-tuples into new documents. We base the extraction phase on the framework of document spanners and the transformation phase on the theory of polyregular function…

    Submitted 20 May, 2024; originally announced May 2024.

  3. arXiv:2310.04118  [pdf, other]

    cs.CC cs.DB cs.DS

    Enumeration and updates for conjunctive linear algebra queries through expressibility

    Authors: Thomas Muñoz, Cristian Riveros, Stijn Vansummeren

    Abstract: Due to the importance of linear algebra and matrix operations in data analytics, there is significant interest in using relational query optimization and processing techniques for evaluating (sparse) linear algebra programs. In particular, in recent years close connections have been established between linear algebra programs and relational algebra that allow transferring optimization techniques o…

    Submitted 9 November, 2023; v1 submitted 6 October, 2023; originally announced October 2023.

    Comments: 80 pages total: 16 main body, 3 of references and 61 of appendix which contains detailed proofs

    ACM Class: H.2.1; H.2.5; F.2.2; F.4.3

  4. arXiv:2209.12301  [pdf, other]

    cs.DS cs.FL cs.LO

    Constant-delay enumeration for SLP-compressed documents

    Authors: Martín Muñoz, Cristian Riveros

    Abstract: We study the problem of enumerating results from a query over a compressed document. The model we use for compression are straight-line programs (SLPs), which are defined by a context-free grammar that produces a single string. For our queries, we use a model called Annotated Automata, an extension of regular automata that allows annotations on letters. This model extends the notion of Regular Spa…

    Submitted 30 October, 2023; v1 submitted 25 September, 2022; originally announced September 2022.

    Comments: 27 pages

  5. Probabilistic Automata of Bounded Ambiguity

    Authors: Nathanaël Fijalkow, Cristian Riveros, James Worrell

    Abstract: Probabilistic automata are an extension of nondeterministic finite automata in which transitions are annotated with probabilities. Despite its simplicity, this model is very expressive and many of the associated algorithmic questions are undecidable. In this work we focus on the emptiness problem (and its variant the value problem), which asks whether a given probabilistic automaton accepts some w…

    Submitted 19 May, 2022; v1 submitted 17 May, 2022; originally announced May 2022.

    Comments: Short version in CONCUR'17, Long version in Information and Computation (special issue on Weighted Automata)

    Journal ref: Information and Computation, Volume 282, January 2022, 104648

  6. arXiv:2204.00090  [pdf, ps, other]

    gr-qc astro-ph.CO hep-th math-ph

    Gravitational Waves in Einstein-Cartan Theory: On the Effects of Dark Matter Spin Tensor

    Authors: E. Elizalde, F. Izaurieta, C. Riveros, G. Salgado, O. Valdivia

    Abstract: This article studies the effects of an arbitrary dark matter spin tensor on the propagation of gravitational wave amplitude in the context of Einstein-Cartan theory. We choose to work with an arbitrary spin tensor because, given our ignorance of the nature of dark matter, it is sensible not to make further hypotheses on its spin and not to assume any particular model for its spin tensor (or its va…

    Submitted 20 February, 2023; v1 submitted 31 March, 2022; originally announced April 2022.

    Comments: 29 pages, 2 figures, accepted for publication in Physics of the Dark Universe

  7. Efficient Enumeration Algorithms for Annotated Grammars

    Authors: Antoine Amarilli, Louis Jachiet, Martín Muñoz, Cristian Riveros

    Abstract: We introduce annotated grammars, an extension of context-free grammars which allows annotations on terminals. Our model extends the standard notion of regular spanners, and is more expressive than the extraction grammars recently introduced by Peterfreund. We study the enumeration problem for annotated grammars: fixing a grammar, and given a string as input, enumerate all annotations of the string…

    Submitted 17 May, 2022; v1 submitted 3 January, 2022; originally announced January 2022.

    Comments: 54 pages. Full version with proofs of the article to appear at PODS'22. Except formatting and minor differences, this article contains all the contents of the PODS'22 article, plus the technical appendices

  8. arXiv:2112.13736  [pdf, other]

    cs.DB cs.DM cs.SI

    How do centrality measures choose the root of trees?

    Authors: Cristian Riveros, Jorge Salas, Oskar Skibski

    Abstract: Centrality measures are widely used to assign importance to graph-structured data. Recently, understanding the principles of such measures has attracted a lot of attention. Given that measures are diverse, this research has usually focused on classes of centrality measures. In this work, we provide a different approach by focusing on classes of graphs instead of classes of measures to understand t…

    Submitted 29 November, 2022; v1 submitted 27 December, 2021; originally announced December 2021.

    MSC Class: 68P15; 68P01; 68R10 ACM Class: E.0; G.2.2

  9. arXiv:2111.04635  [pdf, other]

    cs.DB cs.DS

    CORE: a Complex Event Recognition Engine

    Authors: Marco Bucchi, Alejandro Grez, Andrés Quintana, Cristian Riveros, Stijn Vansummeren

    Abstract: Complex Event Recognition (CER) systems are a prominent technology for finding user-defined query patterns over large data streams in real time. CER query evaluation is known to be computationally challenging, since it requires maintaining a set of partial matches, and this set quickly grows super-linearly in the number of processed events. We present CORE, a novel COmplex event Recognition Engine…

    Submitted 26 May, 2022; v1 submitted 8 November, 2021; originally announced November 2021.

    Comments: 30 pages, 11 figures

    MSC Class: 68W27 ACM Class: E.1; F.2.0

  10. arXiv:2111.01540  [pdf, other]

    cs.DB

    MillenniumDB: A Persistent, Open-Source, Graph Database

    Authors: Domagoj Vrgoc, Carlos Rojas, Renzo Angles, Marcelo Arenas, Diego Arroyuelo, Carlos Buil Aranda, Aidan Hogan, Gonzalo Navarro, Cristian Riveros, Juan Romero

    Abstract: In this systems paper, we present MillenniumDB: a novel graph database engine that is modular, persistent, and open source. MillenniumDB is based on a graph data model, which we call domain graphs, that provides a simple abstraction upon which a variety of popular graph models can be supported. The engine itself is founded on a combination of tried and tested techniques from relational data manage…

    Submitted 2 November, 2021; originally announced November 2021.

  11. arXiv:2012.04064  [pdf, ps, other]

    math.GT

    $ε$-isothermic surfaces in pseudo-Euclidean 3-space

    Authors: Armando M. V. Corro, Carlos M. C. Riveros, Marcelo L. Ferro

    Abstract: In this paper we describe the $ε$-isothermic surfaces in the pseudo-Euclidean 3-space and we obtain the pseudo-Calapso equation. In sequence, we classify the Dupin surfaces in pseudo-Euclidean 3-space having distinct principal curvatures and provide explicit coordinates for such surfaces. As application of the theory, we give explicit solutions to the pseudo-Calapso equation.

    Submitted 7 December, 2020; originally announced December 2020.

  12. arXiv:2010.13717  [pdf, other]

    cs.CC cs.DB cs.DS

    Expressive power of linear algebra query languages

    Authors: Floris Geerts, Thomas Muñoz, Cristian Riveros, Domagoj Vrgoč

    Abstract: Linear algebra algorithms often require some sort of iteration or recursion as is illustrated by standard algorithms for Gaussian elimination, matrix inversion, and transitive closure. A key characteristic shared by these algorithms is that they allow looping for a number of steps that is bounded by the matrix dimension. In this paper we extend the matrix query language MATLANG with this type of r…

    Submitted 26 October, 2020; originally announced October 2020.

  13. arXiv:2010.08042  [pdf, other]

    cs.FL cs.DS

    Ranked enumeration of MSO logic on words

    Authors: Pierre Bourhis, Alejandro Grez, Louis Jachiet, Cristian Riveros

    Abstract: In the last years, enumeration algorithms with bounded delay have attracted a lot of attention for several data management tasks. Given a query and the data, the task is to preprocess the data and then enumerate all the answers to the query one by one and without repetitions. This enumeration scheme is typically useful when the solutions are treated on the fly or when we want to stop the enumerati…

    Submitted 15 October, 2020; originally announced October 2020.

    Comments: 29 pages (with appendix), 2 figures, submitted to the ICDT21 conference

    MSC Class: 68W32 ACM Class: E.1; F.2.0

  14. arXiv:2010.06037  [pdf, other]

    cs.DB cs.DS cs.FL

    Streaming enumeration on nested documents

    Authors: Martín Muñoz, Cristian Riveros

    Abstract: Some of the most relevant document schemas used online, such as XML and JSON, have a nested format. In the last decade, the task of extracting data from nested documents over streams has become especially relevant. We focus on the streaming evaluation of queries with outputs of varied sizes over nested documents. We model queries of this kind as Visibly Pushdown Transducers (VPT), a computational…

    Submitted 7 January, 2022; v1 submitted 12 October, 2020; originally announced October 2020.

    Comments: 39 pages

  15. arXiv:2005.10029  [pdf, ps, other]

    cs.DS

    When is Approximate Counting for Conjunctive Queries Tractable?

    Authors: Marcelo Arenas, Luis Alberto Croquevielle, Rajesh Jayaram, Cristian Riveros

    Abstract: Conjunctive queries are one of the most common class of queries used in database systems, and the best studied in the literature. A seminal result of Grohe, Schwentick, and Segoufin (STOC 2001) demonstrates that for every class $G$ of graphs, the evaluation of all conjunctive queries whose underlying graph is in $G$ is tractable if, and only if, $G$ has bounded treewidth. In this work, we extend t…

    Submitted 20 November, 2020; v1 submitted 20 May, 2020; originally announced May 2020.

  16. arXiv:2002.07049  [pdf, other]

    cs.FL

    The monitoring problem for timed automata

    Authors: Alejandro Grez, Filip Mazowiecki, Michał Pilipczuk, Gabriele Puppis, Cristian Riveros

    Abstract: We study a variant of the classical membership problem in automata theory, which consists of deciding whether a given input word is accepted by a given automaton. We do so under a different perspective, that is, we consider a dynamic version of the problem, called monitoring problem, where the automaton is fixed and the input is revealed as in a stream, one symbol at a time following the natural o…

    Submitted 17 February, 2020; originally announced February 2020.

  17. Pumping lemmas for weighted automata

    Authors: Agnishom Chattopadhyay, Filip Mazowiecki, Anca Muscholl, Cristian Riveros

    Abstract: We present pumping lemmas for five classes of functions definable by fragments of weighted automata over the min-plus semiring, the max-plus semiring and the semiring of natural numbers. As a corollary we show that the hierarchy of functions definable by unambiguous, finitely-ambiguous, polynomially-ambiguous weighted automata, and the full class of weighted automata is strict for the min-plus and…

    Submitted 20 July, 2021; v1 submitted 17 January, 2020; originally announced January 2020.

    Journal ref: Logical Methods in Computer Science, Volume 17, Issue 3 (July 21, 2021) lmcs:6039

  18. arXiv:1906.09226  [pdf, ps, other]

    cs.DS cs.CC

    #NFA admits an FPRAS: Efficient Enumeration, Counting, and Uniform Generation for Logspace Classes

    Authors: Marcelo Arenas, Luis Alberto Croquevielle, Rajesh Jayaram, Cristian Riveros

    Abstract: In this work, we study two simple yet general complexity classes, based on logspace Turing machines, which provide a unifying framework for efficient query evaluation in areas like information extraction and graph databases, among others. We investigate the complexity of three fundamental algorithmic problems for these classes: enumeration, counting and uniform generation of solutions, and show th…

    Submitted 23 June, 2021; v1 submitted 21 June, 2019; originally announced June 2019.

  19. Descriptive Complexity for Counting Complexity Classes

    Authors: Marcelo Arenas, Martin Muñoz, Cristian Riveros

    Abstract: Descriptive Complexity has been very successful in characterizing complexity classes of decision problems in terms of the properties definable in some logics. However, descriptive complexity for counting complexity classes, such as FP and #P, has not been systematically studied, and it is not as developed as its decision counterpart. In this paper, we propose a framework based on Weighted Logics t…

    Submitted 7 February, 2020; v1 submitted 7 May, 2018; originally announced May 2018.

    Journal ref: Logical Methods in Computer Science, Volume 16, Issue 1 (February 10, 2020) lmcs:4493

  20. arXiv:1803.05277  [pdf, ps, other]

    cs.DB cs.FL

    Constant delay algorithms for regular document spanners

    Authors: Fernando Florenzano, Cristian Riveros, Martin Ugarte, Stijn Vansummeren, Domagoj Vrgoc

    Abstract: Regular expressions and automata models with capture variables are core tools in rule-based information extraction. These formalisms, also called regular document spanners, use regular languages in order to locate the data that a user wants to extract from a text document, and then store this data into variables. Since document spanners can easily generate large outputs, it is important to have go…

    Submitted 14 March, 2018; originally announced March 2018.

  21. arXiv:1712.01063  [pdf, other]

    cs.DB cs.FL cs.LO

    A Second-Order Approach to Complex Event Recognition

    Authors: Alejandro Grez, Cristian Riveros, Martin Ugarte, Stijn Vansummeren

    Abstract: Complex Event Recognition (CER for short) refers to the activity of detecting patterns in streams of continuously arriving data. This field has been traditionally approached from a practical point of view, resulting in heterogeneous implementations with fundamentally different capabilities. The main reason behind this is that defining formal semantics for a CER language is not trivial: they usuall…

    Submitted 4 December, 2017; originally announced December 2017.

  22. arXiv:1709.05369  [pdf, other]

    cs.DB cs.LO

    Foundations of Complex Event Processing

    Authors: Marco Bucchi, Alejandro Grez, Cristian Riveros, Martín Ugarte

    Abstract: Complex Event Processing (CEP) has emerged as the unifying field for technologies that require processing and correlating distributed data sources in real-time. CEP finds applications in diverse domains, which has resulted in a large number of proposals for expressing and processing complex events. However, existing CEP languages lack from a clear semantics, making them hard to understand and gene…

    Submitted 30 August, 2018; v1 submitted 15 September, 2017; originally announced September 2017.

    Comments: Conference version

  23. arXiv:1707.00827  [pdf, ps, other]

    cs.DB

    Document Spanners for Extracting Incomplete Information: Expressiveness and Complexity

    Authors: Francisco Maturana, Cristian Riveros, Domagoj Vrgoč

    Abstract: Rule-based information extraction has lately received a fair amount of attention from the database community, with several languages appearing in the last few years. Although information extraction systems are intended to deal with semistructured data, all language proposals introduced so far are designed to output relations, thus making them incapable of handling incomplete information. To remedy…

    Submitted 29 December, 2017; v1 submitted 4 July, 2017; originally announced July 2017.

  24. arXiv:1504.01709  [pdf, ps, other]

    cs.FL

    Copyless Cost-Register Automata: Structure, Expressiveness, and Closure Properties

    Authors: Filip Mazowiecki, Cristian Riveros

    Abstract: Cost register automata (CRA) and its subclass, copyless CRA, were recently proposed by Alur et al. as a new model for computing functions over strings. We study some structural properties, expressiveness, and closure properties of copyless CRA. We show that copyless CRA are strictly less expressive than weighted automata and are not closed under reverse operation. To find a better class we impose…

    Submitted 27 April, 2018; v1 submitted 7 April, 2015; originally announced April 2015.

  25. arXiv:0910.3372  [pdf, ps, other]

    cs.DB

    Composition and Inversion of Schema Mappings

    Authors: Marcelo Arenas, Jorge Perez, Juan Reutter, Cristian Riveros

    Abstract: In the recent years, a lot of attention has been paid to the development of solid foundations for the composition and inversion of schema mappings. In this paper, we review the proposals for the semantics of these crucial operators. For each of these proposals, we concentrate on the three following problems: the definition of the semantics of the operator, the language needed to express the oper…

    Submitted 4 March, 2010; v1 submitted 18 October, 2009; originally announced October 2009.