Skip to main content

Showing 1–22 of 22 results for author: Schmid, M L

.
  1. arXiv:2405.12350  [pdf, ps, other

    cs.DB cs.FL

    A framework for extraction and transformation of documents

    Authors: Cristian Riveros, Markus L. Schmid, Nicole Schweikardt

    Abstract: We present a theoretical framework for the extraction and transformation of text documents. We propose to use a two-phase process where the first phase extracts span-tuples from a document, and the second phase maps the content of the span-tuples into new documents. We base the extraction phase on the framework of document spanners and the transformation phase on the theory of polyregular function… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

  2. arXiv:2404.10497  [pdf, ps, other

    cs.DS

    Subsequences With Generalised Gap Constraints: Upper and Lower Complexity Bounds

    Authors: Florin Manea, Jonas Richardsen, Markus L. Schmid

    Abstract: For two strings u, v over some alphabet A, we investigate the problem of embedding u into w as a subsequence under the presence of generalised gap constraints. A generalised gap constraint is a triple (i, j, C_{i, j}), where 1 <= i < j <= |u| and C_{i, j} is a subset of A^*. Embedding u as a subsequence into v such that (i, j, C_{i, j}) is satisfied means that if u[i] and u[j] are mapped to v[k] a… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

  3. arXiv:2403.03067  [pdf, other

    cs.FL cs.DB

    Enumeration for MSO-Queries on Compressed Trees

    Authors: Markus Lohrey, Markus L. Schmid

    Abstract: We present a linear preprocessing and output-linear delay enumeration algorithm for MSO-queries over trees that are compressed in the well-established grammar-based framework. Time bounds are measured with respect to the size of the compressed representation of the tree. Our result extends previous work on the enumeration of MSO-queries over uncompressed trees and on the enumeration of document sp… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

  4. arXiv:2206.13918  [pdf, ps, other

    cs.FL cs.DB

    Extending Shinohara's Algorithm for Computing Descriptive (Angluin-Style) Patterns to Subsequence Patterns

    Authors: Markus L. Schmid

    Abstract: The introduction of pattern languages in the seminal work [Angluin, ``Finding Patterns Common to a Set of Strings'', JCSS 1980] has revived the classical model of inductive inference (learning in the limit, gold-style learning). In [Shinohara, ``Polynomial Time Inference of Pattern Languages and Its Application'', 7th IBM Symposium on Mathematical Foundations of Computer Science 1982] a simple and… ▽ More

    Submitted 28 June, 2022; originally announced June 2022.

  5. arXiv:2206.13896  [pdf, other

    cs.CC cs.DS cs.FL

    Subsequences With Gap Constraints: Complexity Bounds for Matching and Analysis Problems

    Authors: Joel D. Day, Maria Kosche, Florin Manea, Markus L. Schmid

    Abstract: We consider subsequences with gap constraints, i.e., length-k subsequences p that can be embedded into a string w such that the induced gaps (i.e., the factors of w between the positions to which p is mapped to) satisfy given gap constraints $gc = (C_1, C_2, ..., C_{k-1})$; we call p a gc-subsequence of w. In the case where the gap constraints gc are defined by lower and upper length bounds… ▽ More

    Submitted 28 June, 2022; originally announced June 2022.

  6. arXiv:2101.10890  [pdf, other

    cs.DS cs.FL

    Spanner Evaluation over SLP-Compressed Documents

    Authors: Markus L. Schmid, Nicole Schweikardt

    Abstract: We consider the problem of evaluating regular spanners over compressed documents, i.e., we wish to solve evaluation tasks directly on the compressed data, without decompression. As compressed forms of the documents we use straight-line programs (SLPs) -- a lossless compression scheme for textual data widely used in different areas of theoretical computer science and particularly well-suited for al… ▽ More

    Submitted 25 January, 2021; originally announced January 2021.

  7. arXiv:2101.01945  [pdf, other

    cs.DS cs.CC cs.DB cs.FL

    Fine-Grained Complexity of Regular Path Queries

    Authors: Katrin Casel, Markus L. Schmid

    Abstract: A regular path query (RPQ) is a regular expression q that returns all node pairs (u, v) from a graph database that are connected by an arbitrary path labelled with a word from L(q). The obvious algorithmic approach to RPQ-evaluation (called PG-approach), i.e., constructing the product graph between an NFA for q and the graph database, is appealing due to its simplicity and also leads to efficient… ▽ More

    Submitted 24 November, 2023; v1 submitted 6 January, 2021; originally announced January 2021.

    Journal ref: Logical Methods in Computer Science, Volume 19, Issue 4 (November 27, 2023) lmcs:8625

  8. arXiv:2010.13442  [pdf, ps, other

    cs.DB cs.FL cs.IR

    A Purely Regular Approach to Non-Regular Core Spanners

    Authors: Markus L. Schmid, Nicole Schweikardt

    Abstract: The regular spanners (characterised by vset-automata) are closed under the algebraic operations of union, join and projection, and have desirable algorithmic properties. The core spanners (introduced by Fagin, Kimelfeld, Reiss, and Vansummeren (PODS 2013, JACM 2015) as a formalisation of the core functionality of the query language AQL used in IBM's SystemT) additionally need string-equality selec… ▽ More

    Submitted 12 February, 2024; v1 submitted 26 October, 2020; originally announced October 2020.

  9. Shortest Distances as Enumeration Problem

    Authors: Katrin Casel, Tobias Friedrich, Stefan Neubert, Markus L. Schmid

    Abstract: We investigate the single source shortest distance (SSSD) and all pairs shortest distance (APSD) problems as enumeration problems (on unweighted and integer weighted graphs), meaning that the elements $(u, v, d(u, v))$ -- where $u$ and $v$ are vertices with shortest distance $d(u, v)$ -- are produced and listed one by one without repetition. The performance is measured in the RAM model of computat… ▽ More

    Submitted 17 February, 2021; v1 submitted 14 May, 2020; originally announced May 2020.

    Comments: Updated version adds the study of space complexity

  10. arXiv:1912.09326  [pdf, other

    cs.FL cs.DB

    Conjunctive Regular Path Queries with String Variables

    Authors: Markus L. Schmid

    Abstract: We introduce the class CXRPQ of conjunctive xregex path queries, which are obtained from conjunctive regular path queries (CRPQs) by adding string variables (also called backreferences) as found in practical implementations of regular expressions. CXRPQs can be considered user-friendly, since they combine two concepts that are well-established in practice: pattern-based graph queries and regular e… ▽ More

    Submitted 19 December, 2019; originally announced December 2019.

  11. arXiv:1906.06965  [pdf, ps, other

    cs.DS cs.CC cs.FL

    Matching Patterns with Variables

    Authors: Florin Manea, Markus L. Schmid

    Abstract: A pattern p (i.e., a string of variables and terminals) matches a word w, if w can be obtained by uniformly replacing the variables of p by terminal words. The respective matching problem, i.e., deciding whether or not a given pattern matches a given word, is generally NP-complete, but can be solved in polynomial-time for classes of patterns with restricted structure. In this paper we overview a s… ▽ More

    Submitted 29 July, 2019; v1 submitted 17 June, 2019; originally announced June 2019.

  12. arXiv:1903.05896  [pdf, other

    cs.FL cs.IR

    Regular Expressions with Backreferences: Polynomial-Time Matching Techniques

    Authors: Markus L. Schmid

    Abstract: Regular expressions with backreferences (regex, for short), as supported by most modern libraries for regular expression matching, have an NP-complete matching problem. We define a complexity parameter of regex, called active variable degree, such that regex with this parameter bounded by a constant can be matched in polynomial-time. Moreover, we formulate a novel type of determinism for regex (on… ▽ More

    Submitted 8 February, 2024; v1 submitted 14 March, 2019; originally announced March 2019.

    ACM Class: F.1.1; F.2.2; F.4.3; H.3.3

  13. arXiv:1902.10983  [pdf, other

    cs.DS

    Graph and String Parameters: Connections Between Pathwidth, Cutwidth and the Locality Number

    Authors: Katrin Casel, Joel D. Day, Pamela Fleischmann, Tomasz Kociumaka, Florin Manea, Markus L. Schmid

    Abstract: We investigate the locality number, a recently introduced structural parameter for strings (with applications in pattern matching with variables), and its connection to two important graph-parameters, cutwidth and pathwidth. These connections allow us to show that computing the locality number is NP-hard, but fixed-parameter tractable, if parameterised by the locality number or by the alphabet siz… ▽ More

    Submitted 25 April, 2024; v1 submitted 28 February, 2019; originally announced February 2019.

  14. arXiv:1802.01508  [pdf, ps, other

    cs.FL cs.DB

    Deterministic Regular Expressions With Back-References

    Authors: Dominik D. Freydenberger, Markus L. Schmid

    Abstract: Most modern libraries for regular expression matching allow back-references (i.e., repetition operators) that substantially increase expressive power, but also lead to intractability. In order to find a better balance between expressiveness and tractability, we combine these with the notion of determinism for regular expressions used in XML DTDs and XML Schema. This includes the definition of a su… ▽ More

    Submitted 5 February, 2018; originally announced February 2018.

    MSC Class: 68Q45; 68Q19; 68Q17

  15. arXiv:1707.04097  [pdf, ps, other

    cs.FL

    A Polynomial Time Match Test for Large Classes of Extended Regular Expressions

    Authors: Daniel Reidenbach, Markus L. Schmid

    Abstract: In the present paper, we study the match test for extended regular expressions. We approach this NP-complete problem by introducing a novel variant of two-way multihead automata, which reveals that the complexity of the match test is determined by a hidden combinatorial property of extended regular expressions, and it shows that a restriction of the corresponding parameter leads to rich classes wi… ▽ More

    Submitted 13 July, 2017; originally announced July 2017.

  16. arXiv:1707.04083  [pdf, ps, other

    cs.FL

    Two-Dimensional Pattern Languages

    Authors: Henning Fernau, Markus L. Schmid, K. G. Subramanian

    Abstract: We introduce several classes of array languages obtained by generalising Angluin's pattern languages to the two-dimensional case. These classes of two-dimensional pattern languages are compared with respect to their expressive power and their closure properties are investigated.

    Submitted 13 July, 2017; originally announced July 2017.

  17. arXiv:1706.05906  [pdf, ps, other

    cs.CG cs.CC

    Combinatorial Properties and Recognition of Unit Square Visibility Graphs

    Authors: Katrin Casel, Henning Fernau, Alexander Grigoriev, Markus L. Schmid, Sue Whitesides

    Abstract: Unit square (grid) visibility graphs (USV and USGV, resp.) are described by axis-parallel visibility between unit squares placed (on integer grid coordinates) in the plane. We investigate combinatorial properties of these graph classes and the hardness of variants of the recognition problem, i.e., the problem of representing USGV with fixed visibilities within small area and, for USV, the general… ▽ More

    Submitted 20 October, 2017; v1 submitted 19 June, 2017; originally announced June 2017.

    ACM Class: F.2.2

  18. arXiv:1512.00482  [pdf, ps, other

    cs.FL cs.CC

    Characterization and Complexity Results on Jum** Finite Automata

    Authors: Henning Fernau, Meenakshi Paramasivan, Markus L. Schmid, Vojtěch Vorel

    Abstract: In a jum** finite automaton, the input head can jump to an arbitrary position within the remaining input after reading and consuming a symbol. We characterize the corresponding class of languages in terms of special shuffle expressions and survey other equivalent notions from the existing literature. Moreover, we present several results concerning computational hardness and algorithms for pa… ▽ More

    Submitted 1 December, 2015; originally announced December 2015.

  19. Hyperquarks and bosonic preon bound states

    Authors: Michael L. Schmid, Alfons J. Buchmann

    Abstract: In a model in which leptons, quarks, and the recently introduced hyperquarks are built up from two fundamental spin 1/2 preons, the standard model weak gauge bosons emerge as preon bound states. In addition, the model predicts a host of new composite gauge bosons, in particular those responsible for hyperquark and proton decay. Their presence entails a left-right symmetric extension of the standar… ▽ More

    Submitted 23 July, 2013; originally announced July 2013.

    Comments: 19 pages, 6 figures

    Journal ref: Phys. Rev. D80, 095020 (2009)

  20. arXiv:1304.3685  [pdf, ps, other

    hep-ph astro-ph.CO

    Metaspin and dirishonic dark matter

    Authors: Alfons J. Buchmann, Michael L. Schmid

    Abstract: The antisymmetry requirement of rishon bound state wave functions suggests a new rishon quantum number called M spin. From M spin conservation and the Nussinov-Weingarten-Witten theorem we predict the existence of a stable pseudoscalar dirishonic meson, called zeta, that is lighter than the lightest neutrino. Its mass is estimated as m(zeta) = 10^{-9} eV. This particle could make up the major part… ▽ More

    Submitted 12 April, 2013; originally announced April 2013.

    Comments: 8 pages, 1 figure

    Journal ref: Phys. Rev D 86, 115021 (2012)

  21. Hyperquarks and generation number

    Authors: Alfons J. Buchmann, Michael L. Schmid

    Abstract: In a model in which quarks and leptons are built up from two spin 1/2 preons as fundamental entities, a new class of fermionic bound states (hyperquarks) arises. It turns out that these hyperquarks are necessary to fulfill the 't Hooft anomaly constraint, which then links the number of fermionic generations to the number of colors and hypercolors.

    Submitted 11 February, 2013; v1 submitted 8 February, 2013; originally announced February 2013.

    Comments: 4 pages, corrected formatting problem in Eq.(4.6)

    Journal ref: Phys. Rev. D 71, 055002 (2005)

  22. arXiv:hep-th/9911250  [pdf, ps, other

    hep-th gr-qc physics.class-ph

    Coupling of vector fields at high energies

    Authors: Michael L. Schmid

    Abstract: In general relativity and electrodynamics fields are always generated from static monopoles (like mass or electric charge) or their corresponding currents by surrounding them in a spherical configuration. We investigate a generation of fields from primary fields by a scalar coupling. The generated secondary fields fulfill the condition of source-freedom and therefore cannot occur in a spherical… ▽ More

    Submitted 22 April, 2001; v1 submitted 30 November, 1999; originally announced November 1999.

    Comments: 20 pages, no figures