Search | arXiv e-print repository

Improved Randomized Approximation of Hard Universality and Emptiness Problems

Authors: Pantelis Andreou, Stavros Konstantinidis, Taylor J. Smith

Abstract: We build on recent research on polynomial randomized approximation (PRAX) algorithms for the hard problems of NFA universality and NFA equivalence. Loosely speaking, PRAX algorithms use sampling of infinite domains within any desired accuracy $δ$. In the spirit of experimental mathematics, we extend the concept of PRAX algorithms to be applicable to the emptiness and universality problems in any d… ▽ More We build on recent research on polynomial randomized approximation (PRAX) algorithms for the hard problems of NFA universality and NFA equivalence. Loosely speaking, PRAX algorithms use sampling of infinite domains within any desired accuracy $δ$. In the spirit of experimental mathematics, we extend the concept of PRAX algorithms to be applicable to the emptiness and universality problems in any domain whose instances admit a tractable distribution as defined in this paper. A technical result here is that a linear (w.r.t. $1/δ$) number of samples is sufficient, as opposed to the quadratic number of samples in previous papers. We show how the improved and generalized PRAX algorithms apply to universality and emptiness problems in various domains: ordinary automata, tautology testing of propositions, 2D automata, and to solution sets of certain Diophantine equations. △ Less

Submitted 13 March, 2024; originally announced March 2024.

MSC Class: 68W25 (primary); 68W20; 68Q45 (secondary)

arXiv:2202.01594 [pdf, ps, other]

Approximate NFA Universality and Related Problems Motivated by Information Theory

Authors: Stavros Konstantinidis, Mitja Mastnak, Nelma Moreira, Rogério Reis

Abstract: In coding and information theory, it is desirable to construct maximal codes that can be either variable length codes or error control codes of fixed length. However deciding code maximality boils down to deciding whether a given NFA is universal, and this is a hard problem (including the case of whether the NFA accepts all words of a fixed length). On the other hand, it is acceptable to know whet… ▽ More In coding and information theory, it is desirable to construct maximal codes that can be either variable length codes or error control codes of fixed length. However deciding code maximality boils down to deciding whether a given NFA is universal, and this is a hard problem (including the case of whether the NFA accepts all words of a fixed length). On the other hand, it is acceptable to know whether a code is `approximately' maximal, which then boils down to whether a given NFA is `approximately' universal. Here we introduce the notion of a $(1-ε)$-universal automaton and present polynomial randomized approximation algorithms to test NFA universality and related hard automata problems, for certain natural probability distributions on the set of words. We also conclude that the randomization aspect is necessary, as approximate universality remains hard for any fixed polynomially computable $ε$. △ Less

Submitted 11 April, 2022; v1 submitted 3 February, 2022; originally announced February 2022.

Comments: 23 pages

MSC Class: 68Q45 (Primary) 68W25 (Secondary) ACM Class: F.1.1; F.4.3

arXiv:1903.10740 [pdf, ps, other]

Partitioning a Symmetric Rational Relation into Two Asymmetric Rational Relations

Authors: Stavros Konstantinidis, Mitja Mastnak, Juraj Sebej

Abstract: We consider the problem of partitioning effectively a given symmetric (and irreflexive) rational relation R into two asymmetric rational relations. This problem is motivated by a recent method of embedding an R-independent language into one that is maximal R-independent, where the method requires to use an asymmetric partition of R. We solve the problem when R is realized by a zero-avoiding transd… ▽ More We consider the problem of partitioning effectively a given symmetric (and irreflexive) rational relation R into two asymmetric rational relations. This problem is motivated by a recent method of embedding an R-independent language into one that is maximal R-independent, where the method requires to use an asymmetric partition of R. We solve the problem when R is realized by a zero-avoiding transducer (with some bound k): if the absolute value of the input-output length discrepancy of a computation exceeds k then the length discrepancy of the computation cannot become zero. This class of relations properly contains all recognizable, all left synchronous, and all right synchronous relations. We leave the asymmetric partition problem open when R is not realized by a zero-avoiding transducer. We also show examples of total wordorderings for which there is a relation R that cannot be partitioned into two asymmetric rational relations such that one of them is decreasing with respect to the given word-ordering. △ Less

Submitted 26 March, 2019; originally announced March 2019.

Comments: 19 pages, 4 figures. Submitted to the 24th International Conference on Implementation and Application of Automata, July 22-25, 2019, Kosice, Slovakia

arXiv:1805.01829 [pdf, ps, other]

Regular Expressions and Transducers over Alphabet-invariant and User-defined Labels

Authors: Stavros Konstantinidis, Nelma Moreira, Rogerio Reis, Joshua Young

Abstract: We are interested in regular expressions and transducers that represent word relations in an alphabet-invariant way---for example, the set of all word pairs u,v where v is a prefix of u independently of what the alphabet is. Current software systems of formal language objects do not have a mechanism to define such objects. We define transducers in which transition labels involve what we call set s… ▽ More We are interested in regular expressions and transducers that represent word relations in an alphabet-invariant way---for example, the set of all word pairs u,v where v is a prefix of u independently of what the alphabet is. Current software systems of formal language objects do not have a mechanism to define such objects. We define transducers in which transition labels involve what we call set specifications, some of which are alphabet invariant. In fact, we give a more broad definition of automata-type objects, called labelled graphs, where each transition label can be any string, as long as that string represents a subset of a certain monoid. Then, the behaviour of the labelled graph is a subset of that monoid. We do the same for regular expressions. We obtain extensions of a few classic algorithmic constructions on ordinary regular expressions and transducers at the broad level of labelled graphs and in such a way that the computational efficiency of the extended constructions is not sacrificed. For regular expressions with set specs we obtain the corresponding partial derivative automata. For transducers with set specs we obtain further algorithms that can be applied to questions about independent regular languages, in particular the witness version of the independent property satisfaction question. △ Less

Submitted 4 May, 2018; originally announced May 2018.

arXiv:1601.06312 [pdf, ps, other]

Channels with Synchronization/Substitution Errors and Computation of Error Control Codes

Authors: Stavros Konstantinidis, Nelma Moreira, Rogerio Reis

Abstract: We introduce the concept of an \ff-maximal error-detecting block code, for some parameter \ff{} between 0 and 1, in order to formalize the situation where a block code is close to maximal with respect to being error-detecting. Our motivation for this is that constructing a maximal error-detecting code is a computationally hard problem. We present a randomized algorithm that takes as input two posi… ▽ More We introduce the concept of an \ff-maximal error-detecting block code, for some parameter \ff{} between 0 and 1, in order to formalize the situation where a block code is close to maximal with respect to being error-detecting. Our motivation for this is that constructing a maximal error-detecting code is a computationally hard problem. We present a randomized algorithm that takes as input two positive integers $N,\ell$, a probability value \ff, and a specification of the errors permitted in some application, and generates an error-detecting, or error-correcting, block code having up to $N$ codewords of length $\ell$. If the algorithm finds less than $N$ codewords, then those codewords constitute a code that is \ff-maximal with high probability. The error specification (also called channel) is modelled as a transducer, which allows one to model any rational combination of substitution and synchronization errors. We also present some elements of our implementation of various error-detecting properties and their associated methods. Then, we show several tests of the implemented randomized algorithm on various channels. A methodological contribution is the presentation of how various desirable error combinations can be expressed formally and processed algorithmically. △ Less

Submitted 28 July, 2016; v1 submitted 23 January, 2016; originally announced January 2016.

MSC Class: 94B; 68P30 ACM Class: E.4

arXiv:1507.00600 [pdf, ps, other]

Embedding rationally independent languages into maximal ones

Authors: Stavros Konstantinidis, Mitja Mastnak

Abstract: We consider the embedding problem in coding theory: given an independence (a code-related property) and an independent language $L$, find a maximal independent language containing $L$. We consider the case where the code-related property is defined via a rational binary relation that is decreasing with respect to any fixed total order on the set of words. Our method works by iterating a max-min op… ▽ More We consider the embedding problem in coding theory: given an independence (a code-related property) and an independent language $L$, find a maximal independent language containing $L$. We consider the case where the code-related property is defined via a rational binary relation that is decreasing with respect to any fixed total order on the set of words. Our method works by iterating a max-min operator that has been used before for the embedding problem for properties defined by length-increasing-and-transitive binary relations. By going to order-decreasing rational relations, represented by input-decreasing transducers, we are able to include many known properties from both the noiseless and noisy domains of coding theory, as well as any combination of such properties. Moreover, in many cases the desired maximal embedding is effectively computable. △ Less

Submitted 2 July, 2015; originally announced July 2015.

Comments: 20 pages, 5 figures

MSC Class: 68Q45; 94A45; 94B99 ACM Class: F.4.3; E.4

arXiv:1504.04715 [pdf, other]

Symbolic Manipulation of Code Properties

Authors: Stavros Konstantinidis, Casey Meijer, Nelma Moreira, Rogério Reis

Abstract: The FAdo system is a symbolic manipulator of formal languages objects, implemented in Python. In this work, we extend its capabilities by implementing methods to manipulate transducers and we go one level higher than existing formal language systems and implement methods to manipulate objects representing classes of independent languages (widely known as code properties). Our methods allow users t… ▽ More The FAdo system is a symbolic manipulator of formal languages objects, implemented in Python. In this work, we extend its capabilities by implementing methods to manipulate transducers and we go one level higher than existing formal language systems and implement methods to manipulate objects representing classes of independent languages (widely known as code properties). Our methods allow users to define their own code properties and combine them between themselves or with fixed properties such as prefix codes, suffix codes, error detecting codes, etc. The satisfaction and maximality decision questions are solvable for any of the definable properties. The new online system LaSer allows to query about code properties and obtain the answer in a batch mode. Our work is founded on independence theory as well as the theory of rational relations and transducers and contributes with improveded algorithms on these objects. △ Less

Submitted 1 August, 2016; v1 submitted 18 April, 2015; originally announced April 2015.

Comments: Extended version of the CIAA 2016 paper, "Implementation of Code Properties via Transducers", LNCS 9705, pp. 189-201, Springer, 2016

arXiv:1503.00035 [pdf, ps, other]

Transducer Descriptions of DNA Code Properties and Undecidability of Antimorphic Problems

Authors: Lila Kari, Stavros Konstantinidis, Steffen Kopecki

Abstract: This work concerns formal descriptions of DNA code properties, and builds on previous work on transducer descriptions of classic code properties and on trajectory descriptions of DNA code properties. This line of research allows us to give a property as input to an algorithm, in addition to any regular language, which can then answer questions about the language and the property. Here we define DN… ▽ More This work concerns formal descriptions of DNA code properties, and builds on previous work on transducer descriptions of classic code properties and on trajectory descriptions of DNA code properties. This line of research allows us to give a property as input to an algorithm, in addition to any regular language, which can then answer questions about the language and the property. Here we define DNA code properties via transducers and show that this method is strictly more expressive than that of trajectories, without sacrificing the efficiency of deciding the satisfaction question. We also show that the maximality question can be undecidable. Our undecidability results hold not only for the fixed DNA involution but also for any fixed antimorphic permutation. Moreover, we also show the undecidability of the antimorphic version of the Post Corresponding Problem, for any fixed antimorphic permutation. △ Less

Submitted 27 February, 2015; originally announced March 2015.

arXiv:1406.1041 [pdf, ps, other]

An efficient algorithm for computing the edit distance of a regular language via input-altering transducers

Authors: Lila Kari, Stavros Konstantinidis, Steffen Kopecki, Meng Yang

Abstract: We revisit the problem of computing the edit distance of a regular language given via an NFA. This problem relates to the inherent maximal error-detecting capability of the language in question. We present an efficient algorithm for solving this problem which executes in time $O(r^2n^2d)$, where $r$ is the cardinality of the alphabet involved, $n$ is the number of transitions in the given NFA, and… ▽ More We revisit the problem of computing the edit distance of a regular language given via an NFA. This problem relates to the inherent maximal error-detecting capability of the language in question. We present an efficient algorithm for solving this problem which executes in time $O(r^2n^2d)$, where $r$ is the cardinality of the alphabet involved, $n$ is the number of transitions in the given NFA, and $d$ is the computed edit distance. We have implemented the algorithm and present here performance tests. The correctness of the algorithm is based on the result (also presented here) that the particular error-detection property related to our problem can be defined via an input-altering transducer. △ Less

Submitted 4 June, 2014; originally announced June 2014.

MSC Class: 68Q45

Showing 1–9 of 9 results for author: Konstantinidis, S