-
Constrained Multi-Tildes: Derived Term and Position Automata
Authors:
Samira Attou,
Ludovic Mignot,
Clément Miklarz,
Florent Nicart
Abstract:
Multi-tildes are regular operators that were introduced to enhance the factorization power of regular expressions, allowing us to add the empty word in several factors of a catenation product of languages. In addition to multi-bars, which dually remove the empty word, they allow representing any acyclic automaton by a linear-sized expression, whereas the lower bound is exponential in the classic c…
▽ More
Multi-tildes are regular operators that were introduced to enhance the factorization power of regular expressions, allowing us to add the empty word in several factors of a catenation product of languages. In addition to multi-bars, which dually remove the empty word, they allow representing any acyclic automaton by a linear-sized expression, whereas the lower bound is exponential in the classic case.
In this paper, we extend multi-tildes from disjunctive combinations to any Boolean combination, allowing us to exponentially enhance the factorization power of tildes expressions. Moreover, we show how to convert these expressions into finite automata and give a Haskell implementation of them using advanced techniques of functional programming.
△ Less
Submitted 22 January, 2024;
originally announced January 2024.
-
Monadic Expressions and their Derivatives [extended version]
Authors:
Samira Attou,
Ludovic Mignot,
Clément Miklarz,
Florent Nicart
Abstract:
We propose another interpretation of well-known derivatives computations from regular expressions, due to Brzozowski, Antimirov or Lombardy and Sakarovitch,
in order to abstract the underlying data structures (e.g. sets or linear combinations) using the notion of monad.
As an example of this generalization advantage, we first introduce a new derivation technique based on the graded module mona…
▽ More
We propose another interpretation of well-known derivatives computations from regular expressions, due to Brzozowski, Antimirov or Lombardy and Sakarovitch,
in order to abstract the underlying data structures (e.g. sets or linear combinations) using the notion of monad.
As an example of this generalization advantage, we first introduce a new derivation technique based on the graded module monad
and then show an application of this technique to generalize the parsing of expression with capture groups and back references.
We also extend operators defining expressions to any n-ary functions over value sets, such as
classical operations (like negation or intersection for Boolean weights) or more exotic ones (like algebraic mean for rational weights).
Moreover, we present how to compute a (non-necessarily finite) automaton from such an extended expression, using the Colcombet and Petrisan categorical definition of automata.
These category theory concepts allow us to perform this construction in a unified way, whatever the underlying monad.
Finally, to illustrate our work, we present a Haskell implementation of these notions using advanced techniques of functional programming,
and we provide a web interface to manipulate concrete examples.
△ Less
Submitted 30 January, 2023;
originally announced January 2023.
-
Monadic Expressions and their Derivatives
Authors:
Samira Attou,
Ludovic Mignot,
Clément Miklarz,
Florent Nicart
Abstract:
We propose another interpretation of well-known derivatives computations from regular expressions, due to Brzozowski, Antimirov or Lombardy and Sakarovitch, in order to abstract the underlying data structures (e.g. sets or linear combinations) using the notion of monad. As an example of this generalization advantage, we introduce a new derivation technique based on the graded module monad.
We al…
▽ More
We propose another interpretation of well-known derivatives computations from regular expressions, due to Brzozowski, Antimirov or Lombardy and Sakarovitch, in order to abstract the underlying data structures (e.g. sets or linear combinations) using the notion of monad. As an example of this generalization advantage, we introduce a new derivation technique based on the graded module monad.
We also extend operators defining expressions to any n-ary functions over value sets, such as classical operations (like negation or intersection for Boolean weights) or more exotic ones (like algebraic mean for rational weights).
Moreover, we present how to compute a (non-necessarily finite) automaton from such an extended expression, using the Colcombet and Petrisan categorical definition of automata. These category theory concepts allow us to perform this construction in a unified way, whatever the underlying monad.
Finally, to illustrate our work, we present a Haskell implementation of these notions using advanced techniques of functional programming, and we provide a web interface to manipulate concrete examples.
△ Less
Submitted 31 August, 2022;
originally announced August 2022.
-
Bottom-Up Derivatives of Tree Expressions
Authors:
Samira Attou,
Ludovic Mignot,
Djelloul Ziadi
Abstract:
In this paper, we extend the notion of (word) derivatives and partial derivatives due to (respectively) Brzozowski and Antimirov to tree derivatives using already known inductive formulae of quotients.
We define a new family of extended regular tree expressions (using negation or intersection operators), and we show how to compute a Brzozowski-like inductive tree automaton; the fixed point of th…
▽ More
In this paper, we extend the notion of (word) derivatives and partial derivatives due to (respectively) Brzozowski and Antimirov to tree derivatives using already known inductive formulae of quotients.
We define a new family of extended regular tree expressions (using negation or intersection operators), and we show how to compute a Brzozowski-like inductive tree automaton; the fixed point of this construction, when it exists, is the derivative tree automaton.
Such a deterministic tree automaton can be used to solve the membership test efficiently: the whole structure is not necessarily computed, and the derivative computations can be performed in parallel.
We also show how to solve the membership test using our (Bottom-Up) partial derivatives, without computing an automaton.
△ Less
Submitted 28 July, 2021;
originally announced July 2021.
-
The Bottom-Up Position Tree Automaton, the Father Automaton and their Compact Versions
Authors:
Samira Attou,
Ludovic Mignot,
Djelloul Ziadi
Abstract:
The conversion of a given regular tree expression into a tree automaton has been widely studied. However, classical interpretations are based upon a Top-Down interpretation of tree automata. In this paper, we propose new constructions based on the Gluskov's one and on the one of Ilie and Yu one using a Bottom-Up interpretation. One of the main goals of this technique is to consider as a next step…
▽ More
The conversion of a given regular tree expression into a tree automaton has been widely studied. However, classical interpretations are based upon a Top-Down interpretation of tree automata. In this paper, we propose new constructions based on the Gluskov's one and on the one of Ilie and Yu one using a Bottom-Up interpretation. One of the main goals of this technique is to consider as a next step the links with deterministic recognizers, consideration that cannot be performed with classical Top-Down approaches. Furthermore, we exhibit a method to factorize transitions of tree automata and show that this technique is particularly interesting for these constructions, by considering natural factorizations due to the structure of regular expression.
△ Less
Submitted 22 May, 2018;
originally announced May 2018.