Search | arXiv e-print repository

A framework for extraction and transformation of documents

Authors: Cristian Riveros, Markus L. Schmid, Nicole Schweikardt

Abstract: We present a theoretical framework for the extraction and transformation of text documents. We propose to use a two-phase process where the first phase extracts span-tuples from a document, and the second phase maps the content of the span-tuples into new documents. We base the extraction phase on the framework of document spanners and the transformation phase on the theory of polyregular function… ▽ More We present a theoretical framework for the extraction and transformation of text documents. We propose to use a two-phase process where the first phase extracts span-tuples from a document, and the second phase maps the content of the span-tuples into new documents. We base the extraction phase on the framework of document spanners and the transformation phase on the theory of polyregular functions, the class of regular string-to-string functions with polynomial growth. For supporting practical extract-transform scenarios, we propose an extension of document spanners described by regex formulas from span-tuples to so-called multispan-tuples, where variables are mapped to sets of spans. We prove that this extension, called regex multispanners, has the same desirable properties as standard spanners described by regex formulas. In our framework, an Extract-Transform (ET) program is given by a regex multispanner followed by a polyregular function. In this paper, we study the expressibility and evaluation problem of ET programs when the transformation function is linear, called linear ET programs. We show that linear ET programs are equally expressive as non-deterministic streaming string transducers under bag semantics. Moreover, we show that linear ET programs are closed under composition. Finally, we present an enumeration algorithm for evaluating every linear ET program over a document with linear time preprocessing and constant delay. △ Less

Submitted 20 May, 2024; originally announced May 2024.

arXiv:2404.10497 [pdf, ps, other]

Subsequences With Generalised Gap Constraints: Upper and Lower Complexity Bounds

Authors: Florin Manea, Jonas Richardsen, Markus L. Schmid

Abstract: For two strings u, v over some alphabet A, we investigate the problem of embedding u into w as a subsequence under the presence of generalised gap constraints. A generalised gap constraint is a triple (i, j, C_{i, j}), where 1 <= i < j <= |u| and C_{i, j} is a subset of A^*. Embedding u as a subsequence into v such that (i, j, C_{i, j}) is satisfied means that if u[i] and u[j] are mapped to v[k] a… ▽ More For two strings u, v over some alphabet A, we investigate the problem of embedding u into w as a subsequence under the presence of generalised gap constraints. A generalised gap constraint is a triple (i, j, C_{i, j}), where 1 <= i < j <= |u| and C_{i, j} is a subset of A^*. Embedding u as a subsequence into v such that (i, j, C_{i, j}) is satisfied means that if u[i] and u[j] are mapped to v[k] and v[l], respectively, then the induced gap v[k + 1..l - 1] must be a string from C_{i, j}. This generalises the setting recently investigated in [Day et al., ISAAC 2022], where only gap constraints of the form C_{i, i + 1} are considered, as well as the setting from [Kosche et al., RP 2022], where only gap constraints of the form C_{1, |u|} are considered. We show that subsequence matching under generalised gap constraints is NP-hard, and we complement this general lower bound with a thorough (parameterised) complexity analysis. Moreover, we identify several efficiently solvable subclasses that result from restricting the interval structure induced by the generalised gap constraints. △ Less

Submitted 16 April, 2024; originally announced April 2024.

arXiv:2403.03067 [pdf, other]

Enumeration for MSO-Queries on Compressed Trees

Authors: Markus Lohrey, Markus L. Schmid

Abstract: We present a linear preprocessing and output-linear delay enumeration algorithm for MSO-queries over trees that are compressed in the well-established grammar-based framework. Time bounds are measured with respect to the size of the compressed representation of the tree. Our result extends previous work on the enumeration of MSO-queries over uncompressed trees and on the enumeration of document sp… ▽ More We present a linear preprocessing and output-linear delay enumeration algorithm for MSO-queries over trees that are compressed in the well-established grammar-based framework. Time bounds are measured with respect to the size of the compressed representation of the tree. Our result extends previous work on the enumeration of MSO-queries over uncompressed trees and on the enumeration of document spanners over compressed text documents. △ Less

Submitted 5 March, 2024; originally announced March 2024.

arXiv:2206.13918 [pdf, ps, other]

Extending Shinohara's Algorithm for Computing Descriptive (Angluin-Style) Patterns to Subsequence Patterns

Authors: Markus L. Schmid

Abstract: The introduction of pattern languages in the seminal work [Angluin, ``Finding Patterns Common to a Set of Strings'', JCSS 1980] has revived the classical model of inductive inference (learning in the limit, gold-style learning). In [Shinohara, ``Polynomial Time Inference of Pattern Languages and Its Application'', 7th IBM Symposium on Mathematical Foundations of Computer Science 1982] a simple and… ▽ More The introduction of pattern languages in the seminal work [Angluin, ``Finding Patterns Common to a Set of Strings'', JCSS 1980] has revived the classical model of inductive inference (learning in the limit, gold-style learning). In [Shinohara, ``Polynomial Time Inference of Pattern Languages and Its Application'', 7th IBM Symposium on Mathematical Foundations of Computer Science 1982] a simple and elegant algorithm has been introduced that, based on membership queries, computes a pattern that is descriptive for a given sample of input strings (and, consequently, can be employed in strategies for inductive inference). In this paper, we give a brief survey of the recent work [Kleest-Meißner et al., ``Discovering Event Queries from Traces: Laying Foundations for Subsequence-Queries with Wildcards and Gap-Size Constraints'', ICDT 2022], where the classical concepts of Angluin-style (descriptive) patterns and the respective Shinohara's algorithm are extended to a query class with applications in complex event recognition -- a modern topic from databases. △ Less

Submitted 28 June, 2022; originally announced June 2022.

arXiv:2206.13896 [pdf, other]

Subsequences With Gap Constraints: Complexity Bounds for Matching and Analysis Problems

Authors: Joel D. Day, Maria Kosche, Florin Manea, Markus L. Schmid

Abstract: We consider subsequences with gap constraints, i.e., length-k subsequences p that can be embedded into a string w such that the induced gaps (i.e., the factors of w between the positions to which p is mapped to) satisfy given gap constraints $gc = (C_1, C_2, ..., C_{k-1})$; we call p a gc-subsequence of w. In the case where the gap constraints gc are defined by lower and upper length bounds… ▽ More We consider subsequences with gap constraints, i.e., length-k subsequences p that can be embedded into a string w such that the induced gaps (i.e., the factors of w between the positions to which p is mapped to) satisfy given gap constraints $gc = (C_1, C_2, ..., C_{k-1})$; we call p a gc-subsequence of w. In the case where the gap constraints gc are defined by lower and upper length bounds $C_i = (L^-_i, L^+_i) \in \mathbb{N}^2$ and/or regular languages $C_i \in REG$, we prove tight (conditional on the orthogonal vectors (OV) hypothesis) complexity bounds for checking whether a given p is a gc-subsequence of a string w. We also consider the whole set of all gc-subsequences of a string, and investigate the complexity of the universality, equivalence and containment problems for these sets of gc-subsequences. △ Less

Submitted 28 June, 2022; originally announced June 2022.

arXiv:2101.10890 [pdf, other]

Spanner Evaluation over SLP-Compressed Documents

Authors: Markus L. Schmid, Nicole Schweikardt

Abstract: We consider the problem of evaluating regular spanners over compressed documents, i.e., we wish to solve evaluation tasks directly on the compressed data, without decompression. As compressed forms of the documents we use straight-line programs (SLPs) -- a lossless compression scheme for textual data widely used in different areas of theoretical computer science and particularly well-suited for al… ▽ More We consider the problem of evaluating regular spanners over compressed documents, i.e., we wish to solve evaluation tasks directly on the compressed data, without decompression. As compressed forms of the documents we use straight-line programs (SLPs) -- a lossless compression scheme for textual data widely used in different areas of theoretical computer science and particularly well-suited for algorithmics on compressed data. In terms of data complexity, our results are as follows. For a regular spanner M and an SLP S that represents a document D, we can solve the tasks of model checking and of checking non-emptiness in time O(size(S)). Computing the set M(D) of all span-tuples extracted from D can be done in time O(size(S) size(M(D))), and enumeration of M(D) can be done with linear preprocessing O(size(S)) and a delay of O(depth(S)), where depth(S) is the depth of S's derivation tree. Note that size(S) can be exponentially smaller than the document's size |D|; and, due to known balancing results for SLPs, we can always assume that depth(S) = O(log(|D|)) independent of D's compressibility. Hence, our enumeration algorithm has a delay logarithmic in the size of the non-compressed data and a preprocessing time that is at best (i.e., in the case of highly compressible documents) also logarithmic, but at worst still linear. Therefore, in a big-data perspective, our enumeration algorithm for SLP-compressed documents may nevertheless beat the known linear preprocessing and constant delay algorithms for non-compressed documents. △ Less

Submitted 25 January, 2021; originally announced January 2021.

arXiv:2101.01945 [pdf, other]

doi 10.46298/lmcs-19(4:15)2023

Fine-Grained Complexity of Regular Path Queries

Authors: Katrin Casel, Markus L. Schmid

Abstract: A regular path query (RPQ) is a regular expression q that returns all node pairs (u, v) from a graph database that are connected by an arbitrary path labelled with a word from L(q). The obvious algorithmic approach to RPQ-evaluation (called PG-approach), i.e., constructing the product graph between an NFA for q and the graph database, is appealing due to its simplicity and also leads to efficient… ▽ More A regular path query (RPQ) is a regular expression q that returns all node pairs (u, v) from a graph database that are connected by an arbitrary path labelled with a word from L(q). The obvious algorithmic approach to RPQ-evaluation (called PG-approach), i.e., constructing the product graph between an NFA for q and the graph database, is appealing due to its simplicity and also leads to efficient algorithms. However, it is unclear whether the PG-approach is optimal. We address this question by thoroughly investigating which upper complexity bounds can be achieved by the PG-approach, and we complement these with conditional lower bounds (in the sense of the fine-grained complexity framework). A special focus is put on enumeration and delay bounds, as well as the data complexity perspective. A main insight is that we can achieve optimal (or near optimal) algorithms with the PG-approach, but the delay for enumeration is rather high (linear in the database). We explore three successful approaches towards enumeration with sub-linear delay: super-linear preprocessing, approximations of the solution sets, and restricted classes of RPQs. △ Less

Submitted 24 November, 2023; v1 submitted 6 January, 2021; originally announced January 2021.

Journal ref: Logical Methods in Computer Science, Volume 19, Issue 4 (November 27, 2023) lmcs:8625

arXiv:2010.13442 [pdf, ps, other]

A Purely Regular Approach to Non-Regular Core Spanners

Authors: Markus L. Schmid, Nicole Schweikardt

Abstract: The regular spanners (characterised by vset-automata) are closed under the algebraic operations of union, join and projection, and have desirable algorithmic properties. The core spanners (introduced by Fagin, Kimelfeld, Reiss, and Vansummeren (PODS 2013, JACM 2015) as a formalisation of the core functionality of the query language AQL used in IBM's SystemT) additionally need string-equality selec… ▽ More The regular spanners (characterised by vset-automata) are closed under the algebraic operations of union, join and projection, and have desirable algorithmic properties. The core spanners (introduced by Fagin, Kimelfeld, Reiss, and Vansummeren (PODS 2013, JACM 2015) as a formalisation of the core functionality of the query language AQL used in IBM's SystemT) additionally need string-equality selections and it has been shown by Freydenberger and Holldack (ICDT 2016, Theory of Computing Systems 2018) that this leads to high complexity and even undecidability of the typical problems in static analysis and query evaluation. We propose an alternative approach to core spanners: by incorporating the string-equality selections directly into the regular language that represents the underlying regular spanner (instead of treating it as an algebraic operation on the table extracted by the regular spanner), we obtain a fragment of core spanners that, while having slightly weaker expressive power than the full class of core spanners, arguably still covers the intuitive applications of string-equality selections for information extraction and has much better upper complexity bounds of the typical problems in static analysis and query evaluation. △ Less

Submitted 12 February, 2024; v1 submitted 26 October, 2020; originally announced October 2020.

arXiv:2005.06827 [pdf, other]

doi 10.1016/j.dam.2023.08.027

Shortest Distances as Enumeration Problem

Authors: Katrin Casel, Tobias Friedrich, Stefan Neubert, Markus L. Schmid

Abstract: We investigate the single source shortest distance (SSSD) and all pairs shortest distance (APSD) problems as enumeration problems (on unweighted and integer weighted graphs), meaning that the elements $(u, v, d(u, v))$ -- where $u$ and $v$ are vertices with shortest distance $d(u, v)$ -- are produced and listed one by one without repetition. The performance is measured in the RAM model of computat… ▽ More We investigate the single source shortest distance (SSSD) and all pairs shortest distance (APSD) problems as enumeration problems (on unweighted and integer weighted graphs), meaning that the elements $(u, v, d(u, v))$ -- where $u$ and $v$ are vertices with shortest distance $d(u, v)$ -- are produced and listed one by one without repetition. The performance is measured in the RAM model of computation with respect to preprocessing time and delay, i.e., the maximum time that elapses between two consecutive outputs. This point of view reveals that specific types of output (e.g., excluding the non-reachable pairs $(u, v, \infty)$, or excluding the self-distances $(u, u, 0)$) and the order of enumeration (e.g., sorted by distance, sorted row-wise with respect to the distance matrix) have a huge impact on the complexity of APSD while they appear to have no effect on SSSD. In particular, we show for APSD that enumeration without output restrictions is possible with delay in the order of the average degree. Excluding non-reachable pairs, or requesting the output to be sorted by distance, increases this delay to the order of the maximum degree. Further, for weighted graphs, a delay in the order of the average degree is also not possible without preprocessing or considering self-distances as output. In contrast, for SSSD we find that a delay in the order of the maximum degree without preprocessing is attainable and unavoidable for any of these requirements. △ Less

Submitted 17 February, 2021; v1 submitted 14 May, 2020; originally announced May 2020.

Comments: Updated version adds the study of space complexity

arXiv:1912.09326 [pdf, other]

Conjunctive Regular Path Queries with String Variables

Authors: Markus L. Schmid

Abstract: We introduce the class CXRPQ of conjunctive xregex path queries, which are obtained from conjunctive regular path queries (CRPQs) by adding string variables (also called backreferences) as found in practical implementations of regular expressions. CXRPQs can be considered user-friendly, since they combine two concepts that are well-established in practice: pattern-based graph queries and regular e… ▽ More We introduce the class CXRPQ of conjunctive xregex path queries, which are obtained from conjunctive regular path queries (CRPQs) by adding string variables (also called backreferences) as found in practical implementations of regular expressions. CXRPQs can be considered user-friendly, since they combine two concepts that are well-established in practice: pattern-based graph queries and regular expressions with backreferences. Due to the string variables, CXRPQs can express inter-path dependencies, which are not expressible by CRPQs. The evaluation complexity of CXRPQs, if not further restricted, is PSPACE-hard in data-complexity. We identify three natural fragments with more acceptable evaluation complexity: their data-complexity is in NL, while their combined complexity varies between EXPSPACE, PSPACE and NP. In terms of expressive power, we compare the CXRPQ-fragments with CRPQs and unions of CRPQs, and with extended conjunctive regular path queries (ECRPQs) and unions of ECRPQs. △ Less

Submitted 19 December, 2019; originally announced December 2019.

arXiv:1906.06965 [pdf, ps, other]

Matching Patterns with Variables

Authors: Florin Manea, Markus L. Schmid

Abstract: A pattern p (i.e., a string of variables and terminals) matches a word w, if w can be obtained by uniformly replacing the variables of p by terminal words. The respective matching problem, i.e., deciding whether or not a given pattern matches a given word, is generally NP-complete, but can be solved in polynomial-time for classes of patterns with restricted structure. In this paper we overview a s… ▽ More A pattern p (i.e., a string of variables and terminals) matches a word w, if w can be obtained by uniformly replacing the variables of p by terminal words. The respective matching problem, i.e., deciding whether or not a given pattern matches a given word, is generally NP-complete, but can be solved in polynomial-time for classes of patterns with restricted structure. In this paper we overview a series of recent results related to efficient matching for patterns with variables, as well as a series of extensions of this problem. △ Less

Submitted 29 July, 2019; v1 submitted 17 June, 2019; originally announced June 2019.

arXiv:1903.05896 [pdf, other]

Regular Expressions with Backreferences: Polynomial-Time Matching Techniques

Authors: Markus L. Schmid

Abstract: Regular expressions with backreferences (regex, for short), as supported by most modern libraries for regular expression matching, have an NP-complete matching problem. We define a complexity parameter of regex, called active variable degree, such that regex with this parameter bounded by a constant can be matched in polynomial-time. Moreover, we formulate a novel type of determinism for regex (on… ▽ More Regular expressions with backreferences (regex, for short), as supported by most modern libraries for regular expression matching, have an NP-complete matching problem. We define a complexity parameter of regex, called active variable degree, such that regex with this parameter bounded by a constant can be matched in polynomial-time. Moreover, we formulate a novel type of determinism for regex (on an automaton-theoretic level), which yields the class of memory-deterministic regex that can be matched in time O(|w|p(|r|)) for a polynomial p (where r is the regex and w the word). Natural extensions of these concepts lead to properties of regex that are intractable to check. △ Less

Submitted 8 February, 2024; v1 submitted 14 March, 2019; originally announced March 2019.

ACM Class: F.1.1; F.2.2; F.4.3; H.3.3

arXiv:1902.10983 [pdf, other]

Graph and String Parameters: Connections Between Pathwidth, Cutwidth and the Locality Number

Authors: Katrin Casel, Joel D. Day, Pamela Fleischmann, Tomasz Kociumaka, Florin Manea, Markus L. Schmid

Abstract: We investigate the locality number, a recently introduced structural parameter for strings (with applications in pattern matching with variables), and its connection to two important graph-parameters, cutwidth and pathwidth. These connections allow us to show that computing the locality number is NP-hard, but fixed-parameter tractable, if parameterised by the locality number or by the alphabet siz… ▽ More We investigate the locality number, a recently introduced structural parameter for strings (with applications in pattern matching with variables), and its connection to two important graph-parameters, cutwidth and pathwidth. These connections allow us to show that computing the locality number is NP-hard, but fixed-parameter tractable, if parameterised by the locality number or by the alphabet size, which has been formulated as open problems in the literature. Moreover, the locality number can be approximated with ratio O(sqrt(log(opt)) log(n)). An important aspect of our work -- that is relevant in its own right and of independent interest -- is that we identify connections between the string parameter of the locality number on the one hand, and the famous graph parameters of cutwidth and pathwidth, on the other hand. These two parameters have been jointly investigated in the literature and are arguably among the most central graph parameters that are based on "linearisations" of graphs. In this way, we also identify a direct approximation preserving reduction from cutwidth to pathwidth, which shows that any polynomial f(opt,|V|)-approximation algorithm for pathwidth yields a polynomial 2f(2 opt,h)-approximation algorithm for cutwidth on multigraphs (where h is the number of edges). In particular, this translates known approximation ratios for pathwidth into new approximation ratios for cutwidth, namely O(sqrt(log(opt)) log(h)) and O(sqrt(log(opt)) opt) for (multi) graphs with h edges. △ Less

Submitted 25 April, 2024; v1 submitted 28 February, 2019; originally announced February 2019.

arXiv:1802.01508 [pdf, ps, other]

Deterministic Regular Expressions With Back-References

Authors: Dominik D. Freydenberger, Markus L. Schmid

Abstract: Most modern libraries for regular expression matching allow back-references (i.e., repetition operators) that substantially increase expressive power, but also lead to intractability. In order to find a better balance between expressiveness and tractability, we combine these with the notion of determinism for regular expressions used in XML DTDs and XML Schema. This includes the definition of a su… ▽ More Most modern libraries for regular expression matching allow back-references (i.e., repetition operators) that substantially increase expressive power, but also lead to intractability. In order to find a better balance between expressiveness and tractability, we combine these with the notion of determinism for regular expressions used in XML DTDs and XML Schema. This includes the definition of a suitable automaton model, and a generalization of the Glushkov construction. We demonstrate that, compared to their non-deterministic superclass, these deterministic regular expressions with back-references have desirable algorithmic properties (i.e., efficiently solvable membership problem and some decidable problems in static analysis), while, at the same time, their expressive power exceeds that of deterministic regular expressions without back-references. △ Less

Submitted 5 February, 2018; originally announced February 2018.

MSC Class: 68Q45; 68Q19; 68Q17

arXiv:1707.04097 [pdf, ps, other]

A Polynomial Time Match Test for Large Classes of Extended Regular Expressions

Authors: Daniel Reidenbach, Markus L. Schmid

Abstract: In the present paper, we study the match test for extended regular expressions. We approach this NP-complete problem by introducing a novel variant of two-way multihead automata, which reveals that the complexity of the match test is determined by a hidden combinatorial property of extended regular expressions, and it shows that a restriction of the corresponding parameter leads to rich classes wi… ▽ More In the present paper, we study the match test for extended regular expressions. We approach this NP-complete problem by introducing a novel variant of two-way multihead automata, which reveals that the complexity of the match test is determined by a hidden combinatorial property of extended regular expressions, and it shows that a restriction of the corresponding parameter leads to rich classes with a polynomial time match test. For presentational reasons, we use the concept of pattern languages in order to specify extended regular expressions. While this decision, formally, slightly narrows the scope of our results, an extension of our concepts and results to more general notions of extended regular expressions is straightforward. △ Less

Submitted 13 July, 2017; originally announced July 2017.

arXiv:1707.04083 [pdf, ps, other]

Two-Dimensional Pattern Languages

Authors: Henning Fernau, Markus L. Schmid, K. G. Subramanian

Abstract: We introduce several classes of array languages obtained by generalising Angluin's pattern languages to the two-dimensional case. These classes of two-dimensional pattern languages are compared with respect to their expressive power and their closure properties are investigated. We introduce several classes of array languages obtained by generalising Angluin's pattern languages to the two-dimensional case. These classes of two-dimensional pattern languages are compared with respect to their expressive power and their closure properties are investigated. △ Less

Submitted 13 July, 2017; originally announced July 2017.

arXiv:1706.05906 [pdf, ps, other]

Combinatorial Properties and Recognition of Unit Square Visibility Graphs

Authors: Katrin Casel, Henning Fernau, Alexander Grigoriev, Markus L. Schmid, Sue Whitesides

Abstract: Unit square (grid) visibility graphs (USV and USGV, resp.) are described by axis-parallel visibility between unit squares placed (on integer grid coordinates) in the plane. We investigate combinatorial properties of these graph classes and the hardness of variants of the recognition problem, i.e., the problem of representing USGV with fixed visibilities within small area and, for USV, the general… ▽ More Unit square (grid) visibility graphs (USV and USGV, resp.) are described by axis-parallel visibility between unit squares placed (on integer grid coordinates) in the plane. We investigate combinatorial properties of these graph classes and the hardness of variants of the recognition problem, i.e., the problem of representing USGV with fixed visibilities within small area and, for USV, the general recognition problem. △ Less

Submitted 20 October, 2017; v1 submitted 19 June, 2017; originally announced June 2017.

ACM Class: F.2.2

arXiv:1512.00482 [pdf, ps, other]

Characterization and Complexity Results on Jum** Finite Automata

Authors: Henning Fernau, Meenakshi Paramasivan, Markus L. Schmid, Vojtěch Vorel

Abstract: In a jum** finite automaton, the input head can jump to an arbitrary position within the remaining input after reading and consuming a symbol. We characterize the corresponding class of languages in terms of special shuffle expressions and survey other equivalent notions from the existing literature. Moreover, we present several results concerning computational hardness and algorithms for pa… ▽ More In a jum** finite automaton, the input head can jump to an arbitrary position within the remaining input after reading and consuming a symbol. We characterize the corresponding class of languages in terms of special shuffle expressions and survey other equivalent notions from the existing literature. Moreover, we present several results concerning computational hardness and algorithms for parsing and other basic tasks concerning jum** finite automata. △ Less

Submitted 1 December, 2015; originally announced December 2015.

arXiv:1307.6133 [pdf, ps, other]

doi 10.1103/PhysRevD.80.095020

Hyperquarks and bosonic preon bound states

Authors: Michael L. Schmid, Alfons J. Buchmann

Abstract: In a model in which leptons, quarks, and the recently introduced hyperquarks are built up from two fundamental spin 1/2 preons, the standard model weak gauge bosons emerge as preon bound states. In addition, the model predicts a host of new composite gauge bosons, in particular those responsible for hyperquark and proton decay. Their presence entails a left-right symmetric extension of the standar… ▽ More In a model in which leptons, quarks, and the recently introduced hyperquarks are built up from two fundamental spin 1/2 preons, the standard model weak gauge bosons emerge as preon bound states. In addition, the model predicts a host of new composite gauge bosons, in particular those responsible for hyperquark and proton decay. Their presence entails a left-right symmetric extension of the standard model weak interactions and a scheme for a partial and grand unification of nongravitational interactions based on respectively the effective gauge groups SU(6)_P and SU(9)_G. This leads to a prediction of the Weinberg angle at low energies in good agreement with experiment. Furthermore, using evolution equations for the effective coupling strengths, we calculate the partial and grand unification scales, the hyperquark mass scale, as well as the mass and decay rate of the lightest hyperhadron. △ Less

Submitted 23 July, 2013; originally announced July 2013.

Comments: 19 pages, 6 figures

Journal ref: Phys. Rev. D80, 095020 (2009)

arXiv:1304.3685 [pdf, ps, other]

doi 10.1103/PhysRevD.86.115021

Metaspin and dirishonic dark matter

Authors: Alfons J. Buchmann, Michael L. Schmid

Abstract: The antisymmetry requirement of rishon bound state wave functions suggests a new rishon quantum number called M spin. From M spin conservation and the Nussinov-Weingarten-Witten theorem we predict the existence of a stable pseudoscalar dirishonic meson, called zeta, that is lighter than the lightest neutrino. Its mass is estimated as m(zeta) = 10^{-9} eV. This particle could make up the major part… ▽ More The antisymmetry requirement of rishon bound state wave functions suggests a new rishon quantum number called M spin. From M spin conservation and the Nussinov-Weingarten-Witten theorem we predict the existence of a stable pseudoscalar dirishonic meson, called zeta, that is lighter than the lightest neutrino. Its mass is estimated as m(zeta) = 10^{-9} eV. This particle could make up the major part of cold dark matter in the Universe. △ Less

Submitted 12 April, 2013; originally announced April 2013.

Comments: 8 pages, 1 figure

Journal ref: Phys. Rev D 86, 115021 (2012)

arXiv:1302.2118 [pdf, ps, other]

doi 10.1103/PhysRevD.71.055002

Hyperquarks and generation number

Authors: Alfons J. Buchmann, Michael L. Schmid

Abstract: In a model in which quarks and leptons are built up from two spin 1/2 preons as fundamental entities, a new class of fermionic bound states (hyperquarks) arises. It turns out that these hyperquarks are necessary to fulfill the 't Hooft anomaly constraint, which then links the number of fermionic generations to the number of colors and hypercolors. In a model in which quarks and leptons are built up from two spin 1/2 preons as fundamental entities, a new class of fermionic bound states (hyperquarks) arises. It turns out that these hyperquarks are necessary to fulfill the 't Hooft anomaly constraint, which then links the number of fermionic generations to the number of colors and hypercolors. △ Less

Submitted 11 February, 2013; v1 submitted 8 February, 2013; originally announced February 2013.

Comments: 4 pages, corrected formatting problem in Eq.(4.6)

Journal ref: Phys. Rev. D 71, 055002 (2005)

arXiv:hep-th/9911250 [pdf, ps, other]

Coupling of vector fields at high energies

Authors: Michael L. Schmid

Abstract: In general relativity and electrodynamics fields are always generated from static monopoles (like mass or electric charge) or their corresponding currents by surrounding them in a spherical configuration. We investigate a generation of fields from primary fields by a scalar coupling. The generated secondary fields fulfill the condition of source-freedom and therefore cannot occur in a spherical… ▽ More In general relativity and electrodynamics fields are always generated from static monopoles (like mass or electric charge) or their corresponding currents by surrounding them in a spherical configuration. We investigate a generation of fields from primary fields by a scalar coupling. The generated secondary fields fulfill the condition of source-freedom and therefore cannot occur in a spherical configuration. The coupling strength depends on the energies of the primary fields. In most cases these fields can be approximately considered as dipole fields. We discuss two applications of couplings for electromagnetic and gravitational spin-1 fields and for electric and magnetic fields. We calculate for both applications the threshold values of field energy for the maximum coupling strength. The proposed approach yields to a further step towards an unification of electromagnetism and gravitation and has important consequences for the discrete symmetries. △ Less

Submitted 22 April, 2001; v1 submitted 30 November, 1999; originally announced November 1999.

Comments: 20 pages, no figures

Showing 1–22 of 22 results for author: Schmid, M L