\newtheoremrep

thm[theorem]Theorem \newtheoremreplemma[theorem]Lemma \newtheoremrepobservation[theorem]Observation \newtheoremrepcorollary[theorem]Corollary \hideLIPIcs Computer Science, University of California–Davis, CA, USA and https://web.cs.ucdavis.edu/~doty/[email protected]://orcid.org/0000-0002-3922-172XNSF awards 2211793, 1900931, 1844976, and DoE EXPRESS award SC0024467 CIT, Technical University of Munich, Germany and Computer Science, University of California–Davis, CA, [email protected] award 1844976 \CopyrightDavid Doty and Ben Heckmann \ccsdescTheory of computation Models of computation

The computational power of discrete chemical reaction networks with bounded executions

David Doty Ben Heckmann

Abstract

Chemical reaction networks (CRNs) model systems where molecules interact according to a finite set of reactions such as $A+B\to C$ , representing that if a molecule of $A$ and $B$ collide, they disappear and a molecule of $C$ is produced. CRNs can compute Boolean-valued predicates $\phi:\mathbb{N}^{d}\to\{0,1\}$ and integer-valued functions $f:\mathbb{N}^{d}\to\mathbb{N}$ ; for instance $X_{1}+X_{2}\to Y$ computes the function $\min(x_{1},x_{2}).$

We study the computational power of execution bounded CRNs, in which only a finite number of reactions can occur from the initial configuration (e.g., ruling out reversible reactions such as $A\mathop{\rightleftharpoons}\limits B$ ). The power and composability of such CRNs depends crucially on some other modeling choices that do not affect the computational power of CRNs with unbounded executions, namely whether an initial leader is present, and whether (for predicates) all species are required to “vote” for the Boolean output. If the CRN starts with an initial leader, and can allow only the leader to vote, then all semilinear predicates and functions can be stably computed in $O(n\log n)$ parallel time by execution bounded CRNs.

However, if no initial leader is allowed, all species vote, and the CRN is “noncollapsing” (does not shrink from initially large to final $O(1)$ size configurations), then execution bounded CRNs are severely limited, able to compute only eventually constant predicates. A key tool is to characterize execution bounded CRNs as precisely those with a nonnegative linear potential function that is strictly decreased by every reaction, a result that may be of independent interest.

keywords:

chemical reaction networks, population protocols, stable computation

1 Introduction

Chemical reaction networks (CRNs) are a fundamental tool for understanding and designing molecular systems. By abstracting chemical reactions into a set of finite, rule-based transformations, CRNs allow us to model the behavior of complex systems. For instance, the CRN with a single reaction $2X\rightarrow Y$ , produces one $Y$ every time two $X$ molecules randomly react together, effectively calculating the function $f(x)=\lfloor x/2\rfloor$ if the initial count of $X$ molecules is interpreted as the input and $Y$ as the output. A commonly studied special case of CRNs is the population protocol model of distributed computing [Angluin2006ComputationalPower], in which each reaction has exactly two reactants and two products, e.g., $A+B\to C+D$ . This model assumes idealized conditions where reactions can proceed indefinitely, constrained only by the availability of reactants in the well-mixed solution.

Precisely the semilinear predicates $\phi:\mathbb{N}^{d}\to\{0,1\}$ and functions $f:\mathbb{N}^{d}\to\mathbb{N}$ can be computed stably, roughly meaning that the output is correct no matter the order in which reactions happen. In population protocols or other CRNs with a finite reachable configuration space, this means that the output is correct with probability 1 under a stochastic scheduler that picks the next molecules to react at random. However, existing constructions to compute semilinear predicates and functions use CRNs with unbounded executions, meaning that it is possible to execute infinitely many reactions from the initial configuration. CRNs with bounded executions have several advantages. With an absolute guarantee on how many reactions will happen before the CRN terminates, wet-lab implementations need only supply a bounded amount of fuel to power the reactions. Such CRNs are simpler to reason about: each reaction brings it “closer” to the answer. They also lead to a simpler definition of stable computation than is typically employed: an execution bounded CRN stably computes a predicate/function if it gets the correct answer after sufficiently many reactions.

To study this topic, we limit the classical, discrete CRN model to networks that must eventually reach a configuration where no further reactions can occur, regardless of the sequence of reactions executed. By guaranteeing a finite endpoint for CRN computations and later integrating the concept of decreasing potential, we aim to align our models more closely with their implementations in the physical world.

This restriction is nontrivial because the techniques in [Chen2012DeterministicFunction] and [Doty2013Leaderless] rely on reversible reactions catalyzed by species we expect to be depleted once a computational step has terminated. This trick seems to add computational power to our system by undoing certain reactions as long as a specific species is present. Consider the following CRN computing $f(x_{1},x_{2},x_{3})=\min\left(x_{1}-x_{2},x_{3}\right)$ . The input values $x_{i}$ are given as counts of copies of $X_{i}$ , and the count of $Z$ molecules in the stable output:

$\displaystyle X_{1}$	$\displaystyle\rightarrow Y$	(1)
$\displaystyle X_{2}+Y$	$\displaystyle\rightarrow\varnothing$	(2)
$\displaystyle Y+X_{3}$	$\displaystyle\rightarrow Z$	(3)
$\displaystyle Z+X_{2}$	$\displaystyle\rightarrow X_{2}+X_{3}+Y$	(4)

Reactions (1) and (2) compute $x_{1}-x_{2}$ , storing the result in the count of $Y$ . Next, reaction (3) can be applied exactly $\min(y,x_{3})$ times. But since the order of reactions is a stochastic process, we might consume copies of $Y$ in (3), before all of $x_{2}$ is subtracted from it. Therefore, we add reaction (4), using $X_{2}$ as a catalyst to undo reaction (3) as long as copies of $X_{2}$ are present, indicating that the first step of computation has not terminated. A similar technique is used in [Chen2012DeterministicFunction], where semilinear sets are understood as a finite union of linear sets, shown to be computable in parallel by CRNs. A reversible, catalyzed reaction finally converts the output of one of the CRNs to the global output. Among other questions, we explore how the constructions of [Chen2012DeterministicFunction] and [Doty2013Leaderless] can be modified to provide equal computational power while guaranteeing bounded execution.

Section 3 defines execution boundedness (Definition 3.1). Furthermore, we introduce alternative characterizations of the class for use in later proofs, such as the lack of self-covering execution paths. Section 4 and 5 contain the main positive results of the paper and provide the concrete constructions used to decide semilinear sets and functions using execution bounded CRNs whose initial configurations contain a single leader. Section 6 discusses the limitations of execution bounded CRNs, introducing the concept of a “linear potential function” as a core characterization of these systems. We demonstrate that entirely execution bounded CRNs that are leaderless and non-collapsing (such as all population protocols), can only stably decide trivial semilinear predicates: the eventually constant predicates (Definition 6.11).

2 Preliminaries

We use established notation from [Chen2012DeterministicFunction, Doty2013Leaderless] and stable computation definitions from [Angluin2006ComputationalPower] for (discrete) chemical reaction networks.

2.1 Notation

Let $\mathbb{N}$ denote the nonnegative integers. For any finite set $\Lambda$ , we write $\mathbb{N}^{\Lambda}$ to mean the set of functions $f:\Lambda\rightarrow\mathbb{N}$ . Equivalently, $\mathbb{N}^{\Lambda}$ can be interpreted as the set of vectors indexed by the elements of $\Lambda$ , and so $\mathbf{c}\in\mathbb{N}^{\Lambda}$ specifies nonnegative integer counts for all elements of $\Lambda$ . $\mathbf{c}(i)$ denotes the $i$ -th coordinate of $\mathbf{c}$ , and if $\mathbf{c}$ is indexed by elements of $\Lambda$ , then $\mathbf{c}(Y)$ denotes the count of species $Y\in\Lambda$ .

For two vectors $\mathbf{x},\mathbf{y}\in\mathbb{R}^{k}$ , we write $\mathbf{x}\geqq\mathbf{y}$ to denote that $\mathbf{x}(i)\geq\mathbf{y}(i)$ for all $1\leq i\leq k$ , $\mathbf{x}\geq\mathbf{y}$ to denote that $\mathbf{x}\geqq\mathbf{y}$ but $\mathbf{x}\neq\mathbf{y}$ , and $\mathbf{x}>\mathbf{y}$ to denote that $\mathbf{x}(i)>\mathbf{y}(i)$ for all $1\leq i\leq k$ . In the case that $\mathbf{y}=\mathbf{0}$ , we say that $\mathbf{x}$ is nonnegative, semipositive, and positive, respectively. Similarly define $\leqq,\leq,<$ .

For a matrix or vector $\mathbf{x}$ , define $\|\mathbf{x}\|=\|\mathbf{x}\|_{1}=\sum_{i}|\mathbf{x}(i)|$ , $i$ ranges over all the entries of $\mathbf{x}$ .

2.2 Chemical Reaction Networks

A chemical reaction network (CRN) is a pair $\mathcal{C}=(\Lambda,R)$ , where $\Lambda$ is a finite set of chemical species, and $R$ is a finite set of reactions over $\Lambda$ , where each reaction is a pair $(\mathbf{r},\mathbf{p})\in\mathbb{N}^{\Lambda}\times\mathbb{N}^{\Lambda}$ indicating the reactants $\mathbf{r}$ and products $\mathbf{p}$ . A population protocol [angluin2004computation] is a CRN in which all reactions $(\mathbf{r},\mathbf{p})$ obey $\|\mathbf{r}\|=\|\mathbf{p}\|=2$ . We write reactions such as $A+2B\mathop{\rightarrow}\limits A+3C$ to represent the reaction $(\{A,2B\},\{A,3C\})$ . A configuration $\mathbf{c}\in\mathbb{N}^{\Lambda}$ of a CRN assigns integer counts to every species $S\in\Lambda$ . When convenient, we use the notation $\left\{n_{1}S_{1},n_{2}S_{2},\ldots,n_{k}S_{k}\right\}$ to describe a configuration $\mathbf{c}$ with $n_{i}\in\mathbb{N}$ copies of species $S_{i}$ , i.e., $\mathbf{c}(S_{i})=n_{i}$ , and any species that is not listed is assumed to have a zero count. If some configuration $\mathbf{c}$ is understood from context, for a species $S$ , we write $\#S$ to denote $\mathbf{c}(S).$ A reaction $(\mathbf{r},\mathbf{p})$ is said to be applicable in configuration $\mathbf{c}$ if $\mathbf{r}\leqq\mathbf{c}$ . If the reaction $(\mathbf{r},\mathbf{p})$ is applicable, applying it results in configuration $\mathbf{c}^{\prime}=\mathbf{c}-\mathbf{r}+\mathbf{p}$ , and we write $\mathbf{c}\rightarrow\mathbf{c}^{\prime}$ .

An execution $\mathcal{E}$ is a finite or infinite sequence of one or more configurations $\mathcal{E}=(\mathbf{c}_{0},\mathbf{c}_{1},\mathbf{c}_{2},\ldots)$ such that, for all $i\in\{1,\ldots,|\mathcal{E}|-1\},\mathbf{c}_{i-1}\rightarrow\mathbf{c}_{i}$ and $\mathbf{c}_{i-1}\neq\mathbf{c}_{i}$ . $\mathbf{x}\Rightarrow_{P}\mathbf{y}$ denotes that $P$ is finite, starts at $\mathbf{x}$ , and ends at $\mathbf{y}$ . In this case we say $\mathbf{y}$ is reachable from $\mathbf{x}$ . Let $\mathsf{reach}(\mathbf{x})=\{\mathbf{y}\mid\mathbf{x}\Rightarrow\mathbf{y}\}$ . Note that the reachability relation is additive: if $\mathbf{x}\Rightarrow\mathbf{y}$ , then for all $\mathbf{c}\in\mathbb{N}^{\Lambda}$ , $\mathbf{x}+\mathbf{c}\Rightarrow\mathbf{y}+\mathbf{c}$ .

For a CRN $\mathcal{C}=(\Lambda,R)$ where $|\Lambda|=n$ and $|R|=m$ , define the $n\times m$ stoichiometric matrix $\mathbf{M}$ of $\mathcal{C}$ as follows. The species are ordered $S_{1},\dots,S_{n}$ , and the reactions are ordered $(\mathbf{r}_{1},\mathbf{p}_{1}),\dots,(\mathbf{r}_{m},\mathbf{p}_{m})$ , and $\mathbf{M}_{ij}=\mathbf{p}_{j}(S_{i})-\mathbf{r}_{j}(S_{i})$ . In other words, $\mathbf{M}_{ij}$ is the net amount of $S_{i}$ produced when executing the $j$ ’th reaction. For instance, if the CRN has two reactions $S_{1}\mathop{\rightarrow}\limits S_{2}+2S_{3}$ and $3S_{2}+S_{3}\mathop{\rightarrow}\limits S_{1}+S_{2}+S_{3}$ , then \optfull

\mathbf{M}=\begin{pmatrix}-1&1\\ 1&-2\\ 2&0\end{pmatrix}.

\opt

submission,final $\mathbf{M}=\begin{pmatrix}-1&1\\ 1&-2\\ 2&0\end{pmatrix}.$

Remark 2.1.

Let $\mathbf{u}\in\mathbb{N}^{R}$ . Then the vector $\mathbf{M}\mathbf{u}\in\mathbb{Z}^{\Lambda}$ represents the change in species counts that results from applying reactions by amounts described in $\mathbf{u}$ . In the above example, if $\mathbf{u}=(2,1)$ , then $\mathbf{M}\mathbf{u}=(-1,0,4)$ , meaning that executing the first reaction twice ( $\mathbf{u}_{1}=2$ ) and the second reaction once ( $\mathbf{u}_{2}=1$ ) causes $S_{1}$ to decrease by 1, $S_{2}$ to stay the same, and $S_{3}$ to increase by 4.

2.3 Stable computation with CRNs

To capture the result of computations done by a CRN, we generalize the definitions to include information about how to interpret the final configuration after letting the CRN run until the result cannot change anymore (characterized below as stable computation). Computation primarily involves two classes of functions: 1. evaluating predicates to determine properties of the input (akin to deciding a set defined by these properties), and 2. executing general functions that map an input configuration to an output, denoted as $f:\mathbb{N}^{k}\rightarrow\mathbb{N}$ .

A chemical reaction decider (CRD) is a tuple $\mathcal{D}=(\Lambda,R,\Sigma,\Upsilon_{\mathrm{Y}},\Upsilon_{\mathrm{N}},% \mathbf{s})$ , where $(\Lambda,R)$ is a CRN, $\Sigma\subseteq\Lambda$ is the set of input species, $\Upsilon_{\mathrm{Y}}\subseteq\Lambda$ is the set of yes voters, and $\Upsilon_{\mathrm{N}}\subseteq\Lambda$ is the set of no voters. If $\Upsilon_{\mathrm{Y}}\cup\Upsilon_{\mathrm{N}}=\Lambda$ , we say the CRD is all-voting. We define a global output partial function $\Phi:\mathbb{N}^{\Lambda}\longrightarrow\{0,1\}$ as follows. $\Phi(\mathbf{c})$ is undefined if either $\mathbf{c}=\mathbf{0}$ , or if there exist $S_{\mathrm{N}}\in\Upsilon_{\mathrm{N}}$ and $S_{\mathrm{Y}}\in\Upsilon_{\mathrm{Y}}$ such that $\mathbf{c}\left(S_{\mathrm{N}}\right)>0$ and $\mathbf{c}\left(S_{\mathrm{Y}}\right)>0$ . In other words, we require a unanimous vote as our output. We say $\mathbf{c}$ is stable if, for all $\mathbf{c}^{\prime}$ such that $\mathbf{c}\Rightarrow\mathbf{c}^{\prime}$ , $\Phi(\mathbf{c})=\Phi(\mathbf{c}^{\prime}).$ We say a CRD $\mathcal{D}$ stably decides the predicate $\psi:\mathbb{N}^{\Sigma}\rightarrow\{0,1\}$ if, for any valid initial configuration $\mathbf{i}\in\mathbb{N}^{\Lambda}$ with $\mathbf{i}\upharpoonright\Sigma=\mathbf{i}_{0}$ , for all configurations $\mathbf{c}\in\mathbb{N}^{\Lambda},\mathbf{i}\Rightarrow\mathbf{c}$ implies $\mathbf{c}\Rightarrow\mathbf{c}^{\prime}$ such that $\mathbf{c}^{\prime}$ is stable and $\Phi\left(\mathbf{c}^{\prime}\right)=\psi\left(\mathbf{i}_{0}\right)$ . We associate to a predicate $\psi$ the set $A=\psi^{-1}(1)$ of inputs on which $\psi$ outputs 1, so we can equivalently say the CRD stably decides the set $A.$

A chemical reaction computer $(CRC)$ is a tuple $\mathcal{C}=(\Lambda,R,\Sigma,Y,\mathbf{s})$ , where $(\Lambda,R)$ is a CRN, $\Sigma\subset\Lambda$ is the set of input species, $Y\in\Lambda\backslash\Sigma$ is the output species, and $\mathbf{s}\in\mathbb{N}^{\Lambda\backslash\Sigma}$ is the initial context. A configuration $\mathbf{o}\in\mathbb{N}^{\Lambda}$ is stable if, for every $\mathbf{c}$ such that $\mathbf{o}\Rightarrow\mathbf{c},\mathbf{o}(Y)=\mathbf{c}(Y)$ , i.e. the output can never change again. We say that $\mathcal{C}$ stably computes a function $f:\mathbb{N}^{k}\rightarrow\mathbb{N}$ if for any valid initial configuration $\mathbf{i}\in\mathbb{N}^{\Sigma}$ and any $\mathbf{c}\in\mathbb{N}^{\Lambda},\mathbf{i}\Rightarrow\mathbf{c}$ implies $\mathbf{c}\Rightarrow\mathbf{o}$ such that $\mathbf{o}$ is stable and $f(\mathbf{i}\upharpoonright\Sigma)=\mathbf{o}(Y)$ where $\mathbf{i}\upharpoonright\Sigma$ denotes restriction of $\mathbf{i}$ to $\Sigma$ .

For a CRD or CRC with initial context $\mathbf{s}$ and input species $\Sigma$ , we say a $\mathbf{i}$ is a valid initial configuration if $\mathbf{i}=\mathbf{s}+\mathbf{x}$ , where $\mathbf{x}(S)=0$ for all $S\in\Lambda\setminus\Sigma$ ; i.e., $\mathbf{i}$ is the initial context plus only input species.table configuration.

2.4 Time model

The following model of stochastic chemical kinetics is widely used in quantitative biology and other fields dealing with chemical reactions between species present in small counts [Gillespie77]. It ascribes probabilities to execution sequences, and also defines the time of reactions, allowing us to study the computational complexity of the CRN computation in Sections 4 and 5. If the volume is defined to be $n$ , the total number of molecules, then the time model is essentially equivalent to the notion of parallel time studied in population protocols [AngluinAE2008Fast]. In this paper, the rate constants of all reactions are $1$ , and we define the kinetic model with this assumption. A reaction is unimolecular if it has one reactant and bimolecular if it has two reactants. We use no higher-order reactions in this paper.

The kinetics of a CRN is described by a continuous-time Markov process as follows. Given a fixed volume $v>0$ , the propensity of a unimolecular reaction $\alpha:X\to\ldots$ in configuration $\mathbf{c}$ is $\rho(\mathbf{c},\alpha)=\mathbf{c}(X)$ . The propensity of a bimolecular reaction $\alpha:X+Y\to\ldots$ , where $X\neq Y$ , is $\rho(\mathbf{c},\alpha)=\frac{\mathbf{c}(X)\mathbf{c}(Y)}{v}$ . The propensity of a bimolecular reaction $\alpha:X+X\to\ldots$ is $\rho(\mathbf{c},\alpha)=\frac{1}{2}\frac{\mathbf{c}(X)(\mathbf{c}(X)-1)}{v}$ . The propensity function determines the evolution of the system as follows. The time until the next reaction occurs is an exponential random variable with rate $\rho(\mathbf{c})=\sum_{\alpha\in R}\rho(\mathbf{c},\alpha)$ (note that $\rho(\mathbf{c})=0$ if no reactions are applicable to $\mathbf{c}$ ). The probability that next reaction will be a particular $\alpha_{\text{next}}$ is $\frac{\rho(\mathbf{c},\alpha_{\text{next}})}{\rho(\mathbf{c})}$ .

The kinetic model is based on the physical assumption of well-mixedness that is valid in a dilute solution. Thus, we assume the finite density constraint, which stipulates that a volume required to execute a CRN must be proportional to the maximum molecular count obtained during execution [SolCooWinBru08]. In other words, the total concentration (molecular count per volume) is bounded. This realistically constrains the speed of the computation achievable by CRNs.

For a CRD or CRC stably computing a predicate/function, the stabilization time is the function $t:\mathbb{N}\to\mathbb{N}$ defined for all $n\in\mathbb{N}$ as $t(n)=$ the worst-case expected time to reach from any valid initial configuration of size $n$ to a stable configuration.

2.5 Semilinear sets, predicates, functions

Definition 2.2.

A set $L\subseteq\mathbb{N}^{d}$ is linear if there are vectors $\mathbf{b},\mathbf{p}_{1},\dots,\mathbf{p}_{k}$ such that $L=\{\mathbf{b}+n_{1}\mathbf{p}_{1}+\dots+n_{k}\mathbf{p}_{k}\mid n_{1},\dots,n% _{k}\in\mathbb{N}\}$ . A set is semilinear if it is a finite union of linear sets. A predicate $\phi:\mathbb{N}^{d}\to\{0,1\}$ is semilinear if the set $\phi^{-1}(1)$ is semilinear. A function $f:\mathbb{N}^{d}\to\mathbb{N}$ is semilinear if its graph $\{(\mathbf{x},y)\in\mathbb{N}^{d+1}\mid f(\mathbf{x})=y\}$ is semilinear.

The following is a famous characterization of the computational power of CRNs [Angluin2006ComputationalPower, chen2023rate].

Theorem 2.3 ([Angluin2006ComputationalPower, chen2023rate]).

A predicate/function is stably computable by a CRD/CRC if and only if it is semilinear.

Definition 2.4.

$T\subseteq\mathbb{N}^{d}$ is a threshold set is if there are constants $c,w_{1},\dots,w_{d}\in\mathbb{Z}$ such that $T=\{\mathbf{x}\in\mathbb{N}^{d}\mid w_{1}\mathbf{x}(1)+\dots+w_{d}\mathbf{x}(d% )\leq c\}.$ $M\subseteq\mathbb{N}^{d}$ is a mod set if there are constants $c,m,w_{1},\dots,w_{d}\in\mathbb{N}$ such that $M=\{\mathbf{x}\in\mathbb{N}^{d}\mid w_{1}\mathbf{x}(1)+\dots+w_{d}\mathbf{x}(d% )\equiv c\mod m\}.$

The following well-known characterization of semilinear sets is useful.

Theorem 2.5 ([Ginsburg1966Semigroups]).

A set is semilinear if and only if it is a Boolean combination (union, intersection, complement) of threshold and mod sets.

3 Execution bounded chemical reaction networks

In this section, we define execution bounded CRNs and state a few alternate characterizations of the definition. \optsubmissionProofs are in the appendix. \optfinalconfProofs are in the full version of this paper.

Definition 3.1.

A CRN $\mathcal{C}$ is execution bounded from configuration $\mathbf{x}$ if all executions $\mathcal{E}=(\mathbf{x},\ldots)$ starting at $\mathbf{x}$ are finite. A CRD or CRC $\mathcal{C}$ is execution bounded if it is execution bounded from every valid initial configuration. $\mathcal{C}$ is entirely execution bounded if it is execution bounded from every configuration.

This is a distinct concept from the notion of “bounded” CRNs studied by Rackoff [Rackoff1978CoveringBoundedness] (studied under the equivlaent formalism of vector addition systems). That paper defines a CRN to be bounded from a configuration $\mathbf{x}$ if $|\mathsf{reach}(\mathbf{x})|$ is finite (and shows that the decision problem of determining whether this is true is $\mathsf{EXPSPACE}$ -complete.) We use the term execution bounded to avoid confusion with this concept.

{toappendix}

We first observe that being execution bounded from $\mathbf{x}$ implies a slightly stronger condition: there is a uniform upper bound on the length of all executions from $\mathbf{x}$ .¹¹1 In other words, this rules out the possibility that, although all executions from $\mathbf{x}$ are finite, there are infinitely many of them $\mathcal{E}_{1},\mathcal{E}_{2},\dots$ , each longer than the previous.

{observation}

A CRN is execution bounded from $\mathbf{x}$ if and only if there is a constant $N\in\mathbb{N}$ such that all executions from $\mathbf{x}$ have length at most $N$ . Equivalently, there are finitely many executions from $\mathbf{x}$ .

Proof 3.2.

We use Kőnig’s lemma to show that in the absence of an infinite path, the number of all possible paths must be finite, which directly implies a global bound on the length of all executions. We represent the set of all executions for $\mathcal{C}$ as a tree where each edge represents a single reaction applied and each node stores the complete execution sequence starting from configuration $\mathbf{x}$ . Note that this construction is slightly different from a more straightforward graph with the reachable states as nodes, which would not give us a tree, since the same state can be reached by different executions. Formally, we generate the tree as follows: $T_{\mathcal{C}}(\mathbf{x})=(V,E)$ where $V\triangleq\{\mathcal{E}\in\{\mathbb{N}^{\Lambda}\}^{*}\mid\mathcal{E}\text{ % is a valid execution sequence starting from }\mathbf{x}\}$ , $E\triangleq\{(\mathcal{E}_{1},\mathcal{E}_{2})\mid\mathcal{E}_{1}\preceq% \mathcal{E}_{2}\land|\mathcal{E}_{2}|=|\mathcal{E}_{1}|+1$ }. In other words, all the executions from $\mathbf{x}$ of length $d$ are the nodes at depth $d$ of this tree. One can think of the nodes as being labeled by configurations rather than executions (specifically the final configuration of the execution, with the tree rooted at $\mathbf{x}$ ), but the same configuration can label multiple nodes if it can be reached from $\mathbf{x}$ via different executions. In this case the children of a configuration are those that are reachable from it by applying a single reaction.

This tree is finitely branching, as we can only choose from a finite number of reactions at any node. By definition of execution bounded, there is no execution sequence with an infinite length. Due to the bijection between paths in $T_{\mathcal{C}}(\mathbf{x})$ and executions possible in $\mathcal{C}$ , there is no infinite path in the tree. By Kőnig’s Lemma, the tree has a finite number of nodes, guaranteeing a single bound $N$ (the depth of the tree) on the length of every execution.

The next lemma characterizes execution boundedness as equivalent to having a finite reachable state space with no cycles.

Lemma 3.3.

A CRN is execution bounded from $\mathbf{x}$ if and only if $\mathsf{reach}(x)=\{\mathbf{y}\mid\mathbf{x}\Rightarrow\mathbf{y}\}$ is finite and, for all $\mathbf{y}\in\mathsf{reach}(x)$ , $\mathbf{y}\not\Rightarrow\mathbf{y}$ except by the zero-length execution.

Proof 3.4.

Every configuration reachable from $\mathbf{x}$ is reached through some execution contained in $T_{\mathcal{C}}(\mathbf{x})$ as a node and there exists only a finite number of them (Section 3). Multiple unique executions can produce the same configuration but one execution cannot produce multiple configurations. Thus, there exists a surjection from the nodes of $T_{\mathcal{C}}(\mathbf{x})$ into $\mathsf{reach}(x)$ and $\mathsf{reach}(x)$ must also be finite. For the second part of the condition, we prove its contrapositive and assume there exists $\mathbf{y}\in\mathsf{reach}(x)$ , $\mathbf{y}\Rightarrow_{P}\mathbf{y}\land|P|>0$ . Let $P=(\mathbf{p}_{1},\mathbf{p}_{2},\dots,\mathbf{p}_{n})$ . It holds that $\mathbf{p}_{1}=\mathbf{p}_{n}$ and $\mathbf{p}_{n-1}\Rightarrow\mathbf{p}_{1}$ . We can construct an infinite-length execution $P^{\prime}=(\mathbf{x},\dots,\mathbf{p}_{1},\dots,\mathbf{p}_{n-1},\mathbf{p}_% {1},\dots)$ , which must also be a valid under the reactions of $\mathcal{C}$ , making $\mathcal{C}$ execution unbounded from $\mathbf{x}$ .

If $\mathsf{reach}(x)$ is finite and contains no such $\mathbf{y}$ , then we can construct a finite, directed, acyclic graph $G_{\mathcal{C}}(\mathbf{x})=(V,E)$ where $V=\mathsf{reach}(x)$ , $E=\{(\mathbf{x},\mathbf{y})\mid\mathbf{x}\Rightarrow_{P}\mathbf{y}\land|P|>0\}$ . The longest path in the graph has length of at most $|\mathsf{reach}(x)|-1$ . A bijection exists between paths in $G_{\mathcal{C}}(\mathbf{x})$ and executions possible in $\mathcal{C}$ starting from $\mathbf{x}$ . We set $n=|\mathsf{reach}(x)|$ satisfying that each execution has length of at most $n$ , making $\mathcal{C}$ execution bounded.

The following result is used frequently in impossibility proofs for CRNs and population protocols, and it will help us prove another characterization of execution bounded CRNs in Section 3.

Lemma 3.5.

(Dickson’s Lemma) For every infinite sequence of nonnegative integer vectors $\mathbf{x}_{1},\mathbf{x}_{2},\dots\in\mathbb{N}^{k}$ , there are $i<j$ such that $\mathbf{x}_{i}\leqq\mathbf{x}_{j}$ .

We first observe an equivalent characterization of execution bounded that will be useful in the negative results of Section 6. \optsubmissionA proof is in the appendix.

Definition 3.6.

A execution $\mathcal{E}=(\mathbf{x}_{1},\mathbf{x}_{2},\dots)$ is self-covering if for some $i<j$ , $\mathbf{x}_{i}\leqq\mathbf{x}_{j}$ . It is strictly self-covering if $\mathbf{x}_{i}\leq\mathbf{x}_{j}$ . We also refer to these as (strict) self-covering paths.²²2 Rackoff [Rackoff1978CoveringBoundedness] uses the term “self-covering” to mean what we call strictly self-covering here, and points out that Karp and Miller [karp1969parallel] showed that $|\mathsf{reach}(\mathbf{x})|$ is infinite if and only if there is a strictly self-covering path from $\mathbf{x}$ . The distinction between these concepts is illustrated by the CRN $A\mathop{\rightleftharpoons}\limits B$ . From any configuration $\mathbf{x}$ , $\mathsf{reach}(\mathbf{x})$ is finite ( $|\mathsf{reach}(\mathbf{x})|=\mathbf{x}(A)+\mathbf{x}(B)+1$ ), and there is no strict self-covering path. However, from (say) $\{A\}$ , there is a (nonstrict) self-covering path $\{A\}\Rightarrow\{B\}\Rightarrow\{A\}$ , and by repeating, this CRN has an infinite cycling execution within its finite configuration space $\mathsf{reach}(\{A\})=\{\{A\},\{B\}\}$ .

{lemmarep}

A CRN is execution bounded from $\mathbf{x}$ if and only if there is no self-covering path from $\mathbf{x}$ .

Proof 3.7.

For the forward direction, assume there is a self-covering path from $\mathbf{x}$ , which reaches to $\mathbf{x}_{i}$ and later to $\mathbf{x}_{j}\geqq\mathbf{x}_{i}$ . Then the reactions going from $\mathbf{x}_{i}$ to $\mathbf{x}_{j}$ can be repeated indefinitely (in a cycle if $\mathbf{x}_{i}=\mathbf{x}_{j}$ , and increasing some molecular counts unboundedly if $\mathbf{x}_{i}\leq\mathbf{x}_{j}$ ), so $\mathcal{C}$ is not execution bounded from $\mathbf{x}$ .

For the reverse direction, assume $\mathcal{C}$ is not execution bounded from $\mathbf{x}$ . Then there is an infinite execution $\mathcal{E}=(\mathbf{x}=\mathbf{x}_{1},\mathbf{x}_{2},\mathbf{x}_{3},\dots).$ By Dickson’s Lemma there are $i<j$ such that $\mathbf{x}_{i}\leqq\mathbf{x}_{j}$ , i.e., $\mathcal{E}$ is self-covering.

4 Execution bounded CRDs stably decide all semilinear sets

In this section, we will show the computational equivalence between execution bounded and execution unbounded CRNs by a construction. The following is the main result of this section.

Theorem 4.1.

Exactly the semilinear sets are stably decidable by execution bounded CRDs. Furthermore, each can be stably decided with expected stabilization time $\Theta(n\log n)$ .

Since semilinear sets are Boolean combinations of mod and threshold predicates, we prove this theorem by showing that execution bounded CRDs can decide mod and threshold sets individually as well as any Boolean combination in the following lemmas. To ensure execution boundedness in the last step, we require the following property.

Definition 4.2.

Let $\mathcal{D}$ be a CRD with voting species $\Upsilon$ . We say $\mathcal{D}$ is single-voting if for any valid initial configuration $\mathbf{i}\in\mathbb{N}^{\Sigma}$ and any $\mathbf{c}\in\mathbb{N}^{\Lambda}$ s.t. $\mathbf{i}\Rightarrow\mathbf{c}$ , $\sum_{V\in\Upsilon}\mathbf{c}(V)=1$ , i.e., exactly one voter is present in every reachable configuration.

{lemmarep}

Every mod set $M=\big{\{}\left(x_{1},\ldots,x_{d}\right)\mid\sum_{i=1}^{d}w_{i}x_{i}\equiv c% \bmod m\big{\}}$ is stably decidable by an execution bounded, single-voting CRD with stabilization time $\Theta(n\log n)$ .

We design a CRD $\mathcal{D}$ with exactly one leader present at all times, cycling through $m$ “states” while consuming the input and accepting on state $c$ . Let $\Sigma=\{X_{1},\dots,X_{d}\}$ be the set of input species and start with only one $L_{0}$ leader, i.e. set the initial context $\mathbf{s}(L_{0})=1$ and $\mathbf{s}(S)=0$ for all other species. For each $i\in\{1,\dots,d\},j\in\{0,\dots,m-1\}$ add the following reaction: $X_{i}+L_{j}\rightarrow L_{j+w_{i}\bmod m}.$ Let only $L_{c}$ vote yes and all other species no, i.e. $\Upsilon=\{L_{c}\}$ . For any valid initial configuration, $\mathcal{D}$ reaches a stable configuration which votes yes if and only if the input is in the mod set, and no otherwise. \optsubmissionThe time and execution boundedness are proven in the appendix.

Proof 4.3.

$\mathcal{D}$ terminates with the correct output value: At any point in time, there is a single leader $L_{j}$ present (the initial configuration contains a single leader and each reaction produces and consumes one). Every reaction satisfies the following invariant (for the leader’s subscript $j$ ): $j\equiv\sum_{i=1}^{d}w_{i}x_{i}^{\prime}\bmod m$ where $x_{i}^{\prime}$ is the updated count of species $X_{i}$ in the current configuration. By design of $\mathcal{D}$ , there will be a reaction applicable as long as there are copies of $X_{i}$ (a leader with any subscript can react with any $X_{i}$ ). After applying this reaction as often as possible, we have reached a stable configuration with $L_{\sum_{i=1}^{d}w_{i}x_{i}\bmod m}$ as the only species present.

$\mathcal{D}$ is execution bounded: Every reaction reduces the count of chemicals by one. Every possible execution contains exactly $\|\mathbf{i}\|$ configurations, where $\|\mathbf{i}\|$ is the number of all molecules in the starting configuration.

$\mathcal{D}$ is single-voting: Initially, $L_{0}$ is present and the only voter. Every valid input contains no voter and every reaction results in no change to the count of copies of $L_{i}$ .

$\mathcal{D}$ stabilizes in $\Theta(n\log n)$ time: We start with $\#L=1,\#X=n$ in volume $n$ . $n$ reactions must occur before $\mathcal{D}$ terminates. For the first reaction, we have a rate of $\lambda=\frac{n\cdot 1}{n}$ , for the last (with only the leader and one $X$ present), our rate will be $\lambda=\frac{1\cdot 1}{n}$ . Thus, the expected time for all $n$ reactions to complete is

\sum_{i=1}^{n}\frac{n}{i}=n\sum_{i=1}^{n}\frac{1}{i}=\Theta(n\log n).

{lemmarep}

Every threshold set $T=\big{\{}(x_{1},\ldots,x_{d})\mid\sum_{i=1}^{d}w_{i}x_{i}\geq t\big{\}}$ is stably decidable by an execution bounded, single-voting CRD with stabilization time $\Theta(n\log n)$ .

We design a CRD $\mathcal{D}$ which multiplies the input molecules according to their weight and consumes positive and negative units alternatingly using a single leader. Once no more reaction is applicable, the leader’s state will indicate whether or not there are positive units left and the threshold is met. Let $\Sigma=\{X_{1},\dots,X_{d}\}$ be the set of input species and $\Upsilon=\{L_{Y}\}$ the yes voter. We first add reactions to multiply the input species by their respective weights. For all $i\in\{1,\dots,d\}$ , add the reaction:

\displaystyle X_{i}\rightarrow\begin{cases}w_{i}P&\text{if }w_{i}>0\\ -w_{i}N&\text{if }w_{i}<0\\ \emptyset&\text{otherwise}\end{cases}

(5)

$P$ and $N$ represent “positive” and “negative” units respectively. Now add reactions to consume $P$ and $N$ alternatingly using a leader until we run out of one species:

	$\displaystyle L_{Y}+N$	$\displaystyle\rightarrow L_{N}$		(6)
	$\displaystyle L_{N}+P$	$\displaystyle\rightarrow L_{Y}$		(7)

Finally, initialize the CRD with one $L_{Y}$ and the threshold number $t$ copies of $P$ (or $-tN$ if $t$ is negative), i.e. $\mathbf{s}(L_{Y})=1$ , $\mathbf{s}(P)=t$ if $t>0$ , or $\mathbf{s}(N)=-t$ if $t<0$ , and $\mathbf{s}(S)=0$ for all other species. For any valid initial configuration, $\mathcal{D}$ reaches a stable configuration which votes yes if and only if the weighted sum of inputs is above the threshold, and no otherwise. \optsubmissionThe execution time is proven in the appendix.

Proof 4.4.

$\mathcal{D}$ is single-voting since it starts with a single leader and no reaction changes the count of $L_{B}$ molecules.

$\mathcal{D}$ stabilizes in $\Theta(n\log n)$ time: First, all input species will be converted to $w_{i}$ instances of $P$ or $N$ . We run these reactions until no $X_{i}$ . As they are independent of molecules other than the reactant, these reactions have a rate of $\lambda=i$ , so the expected time until the next reaction is $\frac{1}{i}$ . The total time for reactions (5) to complete is therefore $\sum_{i=1}^{n}\frac{1}{i}=\mathit{\Theta}(\log n)$ . The time for reactions (6) and (7) on the other hand is asymptotically dominated by the last reaction, where $\#L=1$ and $\#B=1$ , where $B\in\{P,N\}$ , so $\lambda=\frac{1\cdot 1}{n}$ . Let $n_{P},n_{N}$ be the counts of $P,N$ and assume without loss of generality $n_{P}\geq n_{N}$ . We get:

\sum_{i=0}^{n_{N}-1}\frac{n}{(n_{P}-i)}+\frac{n}{(n_{N}-i)}\leq\sum_{i=0}^{n_{% N}-1}\frac{2n}{(n_{N}-i)}=2n\sum_{i=1}^{n_{N}}\frac{1}{i}=\Theta(n\log n).

Lemma 4.5.

If sets $X_{1},X_{2}\subseteq\mathbb{N}^{d}$ are stably decided by some execution bounded, single-voting CRD, then so are $X_{1}\cup X_{2},X_{1}\cap X_{2}$ , and $\overline{X_{1}}$ with stabilization time $O(n\log n)$ .

Proof 4.6.

To stably decide $\overline{X_{1}}$ , swap the yes and no voters.

For $\cup$ and $\cap$ , consider a construction where we decide both sets separately and record both of their votes in a new voter species. For this, we allow the set of all voters to be a strict subset of all species. We first add reactions to duplicate our input with reactions of the form

X_{i}\rightarrow X_{i,1}+X_{i,2}

(8)

by two separate CRDs. Subsequently, we add reactions to record the separate votes in one of four new voter species: $V_{NN},V_{NY},V_{YN},V_{YY}$ . The first and second CRN determine the first and second subscript respectively. For $b\in\{Y,N\}$ and if $S_{b},T_{b}$ are leaders of $\mathcal{C}_{1}$ and $\mathcal{C}_{2}$ respectively, add the reactions:

	$\displaystyle S_{b}+V_{\overline{b}?}\rightarrow S_{b}+V_{b?}$		(9)
	$\displaystyle T_{b}+V_{?\overline{b}}\rightarrow T_{b}+V_{?b}$		(10)

E.g. if $N_{1}$ is the no voter of the first CRD, we would add $N_{1}+L_{YN}\rightarrow N_{1}+L_{NN}$ and $N_{1}+L_{YY}\rightarrow N_{1}+L_{NY}$ . We let the yes voters be: $\Upsilon=\{V_{NY},V_{YN},V_{YY}\}$ to stably decide $X_{1}\cup X_{2}$ or $\Upsilon=\{V_{YY}\}$ to stably decide $X_{1}\cap X_{2}$ .

Reaction (8) will complete in $O(\log n)$ time and is clearly execution bounded since the input $X_{i}$ is finite and not produced in any reaction. Consequently, two separate CRNs run in $\mathit{\Theta}(n\log n)$ time as shown in Section 4 and Section 4. After stabilization of the parallel CRNs, we expect reaction (9) and (10) to happen exactly once. Each molecule involved is a leader and has count $1$ in volume $n$ . This leads to a rate of $\lambda=\frac{1\cdot 1}{n}$ , so the expected time for one reaction to happen is $O(n)$ . It is important to note that reactions (9) and (10) do not result in unbounded executions due to the unanimous vote in parallel CRDs. In both mod sets and threshold sets, the leader changes its vote a maximum of $|\mathbf{i}|$ times, with only ever one leader present at any time. Again, we start with only one $V_{bb}$ voter present initially and no reaction changes the count of voters, making our construction single-voting.

Since semilinear predicates are exactly Boolean combinations of threshold and mod predicates, Sections 4, 4 and 4.5 imply the following.

Theorem 4.7.

Every semilinear set is stably decidable by an execution bounded, single-voting CRD, with stabilization time $O(n\log n).$

We can also prove the same result for all-voting CRDs. Note, however, that such CRDs cannot be “composed” using the constructions of LABEL:{lem:boolean-closure-crd} and 5.5, which crucially relied on the assumption that the CRDs being used as “subroutines” are single-voting. \optsubmissionA proof is in the appendix.

{thmrep}

Every semilinear set is stably decidable by an execution bounded, all-voting CRD, with stabilization time $O(n\log n).$

Proof 4.8.

By Theorem 4.7, every semilinear set is stably decided by a single-voting CRD. We convert this to an all-voting CRD, where every species is required to vote yes or no, by “propagating” the final vote (recorded in the single voter $V^{0}$ voting no or $V^{1}$ voting yes) back to all other molecules. A superscript indicates the “global” decision. The execution boundedness proven in Lemma 4.5 ensures that the leader propagates the final vote only a finite amount of times. For each vote $b\in\{0,1\}$ and each voter $V^{b}$ voting $b$ , and all other species $S\in\Lambda\backslash\{V\}$ , replace species $S$ with two versions $S^{0}$ and $S^{1}$ , and add reactions:

\displaystyle V^{b}+S^{\overline{b}}\mathop{\rightarrow}\limits V^{b}+S^{b}

(11)

The original reactions of the CRD must also be replaced with “functionally identical” reactions for the new versions of species. For example, the reaction $A+B\mathop{\rightarrow}\limits C+D$ becomes

	$\displaystyle A^{0}+B^{0}$	$\displaystyle\mathop{\rightarrow}\limits C^{0}+D^{0}$
	$\displaystyle A^{0}+B^{1}$	$\displaystyle\mathop{\rightarrow}\limits C^{0}+D^{0}$
	$\displaystyle A^{1}+B^{0}$	$\displaystyle\mathop{\rightarrow}\limits C^{0}+D^{0}$
	$\displaystyle A^{1}+B^{1}$	$\displaystyle\mathop{\rightarrow}\limits C^{1}+D^{1}$

In the middle two cases we can pick the superscripts of the products arbitrarily, whereas in the first and last case, we must choose the product votes to match those of the reactants to ensure stable states remain stable.

A vote change of the $V^{b}$ leader leads to the propagation of the vote to at most $n$ molecules once using reaction (11). This reaction dominates the runtime, as a single molecule is required to interact with each other molecule. We cannot speed this process up using an epidemic style process as conflicting votes would make the CRN execution unbounded. The original CRD takes time $O(n\log n)$ to converge on a correct output for the single voter $V^{b}$ . At that point, a standard coupon collector argument shows that the voter $V^{b}$ takes expected time $O(n\log n)$ to correct the votes of all other species via reaction (11).

5 Execution bounded CRCs stably compute all semilinear functions

In this section we shift focus from computing Boolean-valued predicates $\phi:\mathbb{N}^{d}\to\{0,1\}$ to integer-valued functions $f:\mathbb{N}^{d}\to\mathbb{N}$ , showing that execution bounded CRCs can stably compute the same class of functions (semilinear) as unrestricted CRCs.

Similar to [Chen2012DeterministicFunction, Doty2013Leaderless], we compute semilinear functions by decomposing them into “affine pieces”, which we will show can be computed by execution bounded CRNs and combined by using semilinear predicates to decide which linear function to apply for a given input.³³3 While this proof generalizes to multivariate output functions as in [Chen2012DeterministicFunction, Doty2013Leaderless], to simplify notation we focus on single output functions. Multi-valued functions $f:\mathbb{N}^{d}\to\mathbb{N}^{l}$ can be equivalently thought of as $l$ separate single output functions $f_{i}:\mathbb{N}^{d}\to\mathbb{N}$ , which can be computed in parallel by independent CRCs.

We say a partial function $f:\mathbb{N}^{k}\dashrightarrow\mathbb{N}$ is affine if there exist a vectors $\mathbf{a}\in\mathbb{Q}^{k}$ , $\mathbf{c}\in\mathbb{N}^{k}$ with $\mathbf{x}-\mathbf{c}\geq\mathbf{0}$ and nonnegative integer $b\in\mathbb{N}$ such that $f(\mathbf{x})=\mathbf{a}^{\top}(\mathbf{x}-\mathbf{c})+b.$ This definition of affine function may appear contrived, but the main utility of the definition is that it satisfies Section 5. For convenience, we can ensure to only work with integer valued molecule counts by multiplying by $\frac{1}{d}$ after the dot product, where $d$ may be taken to be the least common multiple of the denominators of the rational coefficients in the original definition such that $n_{i}=d\cdot\mathbf{a}(i)$ : \optfull

f(\mathbf{x})=b+\sum_{i=1}^{k}a_{i}(x_{i}-c_{i})\iff f(\mathbf{x})=b+\frac{1}{% d}\sum_{i=1}^{k}n_{i}(x_{i}-c_{i}).

\opt

submission,final $f(\mathbf{x})=b+\sum_{i=1}^{k}\mathbf{a}(i)(\mathbf{x}(i)-\mathbf{c}(i))\iff f% (\mathbf{x})=b+\frac{1}{d}\sum_{i=1}^{k}n_{i}(\mathbf{x}(i)-\mathbf{c}(i)).$

We say that a partial function $f:\mathbb{N}^{k}\rightarrow\mathbb{N}^{2}$ is a diff-representation of $f$ if $\operatorname{dom}f=\operatorname{dom}\hat{f}$ and, for all $\mathbf{x}\in\operatorname{dom}f$ , if $\left(y_{P},y_{C}\right)=\hat{f}(\mathbf{x})$ , then $f(\mathbf{x})=y_{P}-y_{C}$ , and $y_{P}=O(f(\mathbf{x}))$ . In other words, $\hat{f}$ represents $f$ as the difference of its two outputs $y_{P}$ and $y_{C}$ , with the larger output $y_{P}$ possibly being larger than the original function’s output, but at most a multiplicative constant larger [Doty2013Leaderless].

Lemma 5.1.

Let $f:\mathbb{N}^{k}\rightarrow\mathbb{N}$ be an affine partial function. Then there is a diff-representation $\hat{f}:\mathbb{N}^{k}\longrightarrow\mathbb{N}^{2}$ of $f$ and an execution bounded CRC that monotonically stably computes $\hat{f}$ in expected time $O(n)$ .

Proof 5.2.

Define a CRC $C$ with input species $\Sigma=\{X_{1},\ldots,X_{k}\}$ and output species $\Gamma=\{Y^{P},Y^{C}\}$ . We need to ensure that after stabilizing, $y=\#Y^{P}-\#Y^{C}$

To account for the $b$ offset, start with $b$ copies of $Y^{P}$ .

For the $c_{i}$ offset, we must reduce the number of $X_{i}$ by $c_{i}$ . Since the result will be used in the next reaction, we want to produce a new species $X_{i}^{\prime}$ and require $X_{i}^{\prime}$ to not be consumed during the computation. We achieve this by adding reactions that let $X_{i}$ consume itself $c_{i}$ times (kee** track with a subscript) and converting $X_{i}$ to $X_{i}^{\prime}$ once $c_{i}$ has been reached. For each $i\in\{1,\ldots,k\}$ and $m,p\in\left\{1,\ldots,c_{i}\right\}$ , if $m+p\leq c_{i}$ , add the reaction

X_{i,m}+X_{i,p}\rightarrow X_{i,m+p}

(12)

If $m+p>c_{i}$ , add the reaction

\displaystyle X_{i,m}+X_{i,p}\rightarrow X_{i,c_{i}}+\left(m+p-c_{i}\right)X_{% i}^{\prime}

(13)

Runtime: In volume $n$ , the rate of reactions (12) and (13) would be $\lambda\approx\frac{(x_{i})^{2}}{n}$ ( $x_{i}$ molecules have the chance to react with any of the $x_{i}-1$ others), so the expected time for the next reaction is $\frac{n}{(x_{i})^{2}}$ . The expected time for the whole process is $\sum_{i=1}^{x_{i}}\frac{n}{i^{2}}=n\sum_{i=1}^{x_{i}}\frac{1}{i^{2}}=O(n)$ . Further, the reactions are execution bounded since both strictly decrease the number of their reactants and exactly $x_{i}-1$ reactions will happen.

To account for the $n_{i}/d$ coefficient, we multiply by $n_{i}$ , then divide by $d$ using similar reactions as for the subtraction. To multiply by $n_{i}$ , add the following reaction for each $i\in\{1,\ldots,k\}$ :

\displaystyle X_{i}^{\prime}\rightarrow\begin{cases}n_{i}D_{1}^{P},&\text{ if % }n_{i}>0\\ \left(-n_{i}\right)D_{1}^{C},&\text{ if }n_{i}<0\end{cases}

(14)

For each $m,p\in\left\{1,\ldots,d-1\right\}$ , if $m+p\leq d-1$ , add the reactions

	$\displaystyle D_{m}^{P}+D_{p}^{P}\rightarrow D_{m+p}^{P}$		(15)
	$\displaystyle D_{m}^{C}+D_{p}^{C}\rightarrow D_{m+p}^{C}$		(16)

If $m+p>c_{i}$ , add the reactions

	$\displaystyle D_{m}^{P}+D_{p}^{P}\rightarrow D_{m+p-d}^{B}+Y^{P}$		(17)
	$\displaystyle D_{m}^{C}+D_{p}^{C}\rightarrow D_{m+p-d}^{B}+Y^{C}$		(18)

Reactions (14) complete in expected time $O(\log n)$ , while (17) and (18) complete in $O(n)$ by a similar analysis as for the first two reactions. As for execution boundedness, (14) is only applicable once for every $X_{i}^{\prime}$ ; all other reactions start with a number of reactants which are a constant factor of $X_{i}^{\prime}$ and decrease the count of their reactants by one in each reaction.

We require the following result due to Chen, Doty, Soloveichik [Chen2012DeterministicFunction], guaranteeing that any semilinear function can be built from affine partial functions.

Lemma 5.3 ([Chen2012DeterministicFunction]).

Let $f:\mathbb{N}^{d}\rightarrow\mathbb{N}$ be a semilinear function. Then there is a finite set $\big{\{}f_{1}:\mathbb{N}^{d}\rightarrow\mathbb{N},\ldots,f_{m}:\mathbb{N}^{d}% \rightarrow\mathbb{N}\big{\}}$ of affine partial functions, where each $\operatorname{dom}f_{i}$ is a linear set, such that, for each $\mathbf{x}\in\mathbb{N}^{d}$ , if $f_{i}(\mathbf{x})$ is defined, then $f(\mathbf{x})=f_{i}(\mathbf{x})$ , and $\bigcup_{i=1}^{m}\operatorname{dom}f_{i}=\mathbb{N}^{d}$ .

We strengthen Lemma 5.3 to show we may assume each $\mathrm{dom}\ f_{i}$ is disjoint from the others. This is needed not only to prove Theorem 5.5, but to correct the proof of Lemma 4.4 in [Chen2012DeterministicFunction], which implicitly assumed the domains are disjoint. \optsubmissionSection 5 is proven in the appendix.

{lemmarep}

Let $f:\mathbb{N}^{d}\rightarrow\mathbb{N}$ be a semilinear function. Then there is a finite set $\big{\{}f_{1}:\mathbb{N}^{d}\rightarrow\mathbb{N},\ldots,f_{m}:\mathbb{N}^{d}% \rightarrow\mathbb{N}\big{\}}$ of affine partial functions, where each $\operatorname{dom}f_{i}$ is a linear set, and $\operatorname{dom}f_{i}\cap\operatorname{dom}f_{j}=\emptyset$ for all $i\neq j$ , such that, for each $\mathbf{x}\in\mathbb{N}^{d}$ , if $f_{i}(\mathbf{x})$ is defined, then $f(\mathbf{x})=f_{i}(\mathbf{x})$ , and $\bigcup_{i=1}^{m}\operatorname{dom}f_{i}=\mathbb{N}^{d}$ .

Proof 5.4.

By [Ito1969SemilinearSetsFiniteUnionDisjointLinearSets, Theorem 2], every semilinear set is a finite union of disjoint fundamental linear sets. The author defines a linear set $L=\big{\{}\mathbf{b}+n_{1}\mathbf{u}_{1}+\ldots+n_{p}\mathbf{u}_{p}\mid n_{1},% \ldots,n_{p}\in\mathbb{N}\big{\}}$ as fundamental, if $\mathbf{u}_{1},\dots\mathbf{u}_{p}\in\mathbb{N}^{k}$ span a $p$ -dimensional vector space in $\mathbb{R}^{k}$ , i.e. all vectors are linearly independent in $\mathbb{R}^{k}$ .⁴⁴4 This distinction is significant because not all integer-valued linear sets can be represented using solely linearly independent vectors. An illustrative example is $\mathbf{b}=\mathbf{0},\mathbf{u}_{1}=(1,1,1),\mathbf{u}_{2}=(2,0,1),\mathbf{u}% _{3}=(0,2,1)$ , as discussed in [Chen2012DeterministicFunction]. The vectors $\mathbf{u}_{1},\mathbf{u}_{2},\mathbf{u}_{3}$ are not linearly independent in $\mathbb{R}^{3}$ , yet this set cannot be expressed with less than three basis vectors. The proof of Lemma 5.3 in [Chen2012DeterministicFunction] shows that each linear set $L_{i}$ comprising the semilinear graph of $f$ corresponds to one partial affine function $f_{i}$ . The fact that [Ito1969SemilinearSetsFiniteUnionDisjointLinearSets, Theorem 2] lets us assume each $L_{i}$ is disjoint from the others immediately implies that each $\mathrm{dom}\ f_{i}$ is disjoint from the others.

The next theorem shows that semilinear functions can be computed by execution bounded CRCs in expected time $\Theta(n\log n)$ .

Theorem 5.5.

Let $f:\mathbb{N}^{d}\rightarrow\mathbb{N}$ be a semilinear function. Then there is an execution bounded CRC that stably computes $f$ with stabilization time $O(n\log n)$ , in expectation and with probability at least $1-n^{-c}$ .

Proof 5.6.

We employ the same construction of [Chen2012DeterministicFunction] with minor alterations. A CRC with input species $\Sigma=\{X_{1},\ldots,X_{d}\}$ and output species $\Gamma=\{Y\}$ . By Section 5, we decompose our semilinear function into partial affine functions (with linear, disjoint domains), which can be computed in parallel by Lemma 5.1. Further, we decide which function to use by computing the predicate $\phi_{i}=$ “ $\mathrm{x}\in\operatorname{dom}f_{i}$ ” (Theorem 4.7). We interpret each $\widehat{Y}_{i}^{P}$ and $\widehat{Y}_{i}^{C}$ as an “inactive” version of “active” output species $Y_{i}^{P}$ and $Y_{i}^{C}$ . Let $L_{i}^{Y},L_{i}^{N}$ be the yes and no voters respectively voting whether $\mathbf{x}$ lies in the domain of $i$ -th partial function. Now, we convert the function result of the applicable partial affine function to the global output by adding the following reactions for each $i\in\{1,\ldots,m\}$ .

$\displaystyle L_{i}^{Y}+\widehat{Y}_{i}^{P}$	$\displaystyle\rightarrow L_{i}^{Y}+Y_{i}^{P}+Y$	(19)
$\displaystyle L_{i}^{N}+Y_{i}^{P}$	$\displaystyle\rightarrow L_{i}^{N}+M_{i}$	(20)
$\displaystyle M_{i}+Y$	$\displaystyle\rightarrow\widehat{Y}_{i}^{P}$	(21)

Reaction (19) produces an output copy of species $Y$ and (20) and (21) reverse the first reaction using only bimolecular reactions. Both are catalyzed by the vote of the $i$ -th predicate result. Also add reactions

	$\displaystyle L_{i}^{Y}+\widehat{Y}_{i}^{C}\rightarrow L_{i}^{Y}+Y_{i}^{C}$		(22)
	$\displaystyle L_{i}^{N}+Y_{i}^{C}\rightarrow L_{i}^{N}+\widehat{Y}_{i}^{C}$		(23)

and

	$\displaystyle Y_{i}^{P}+Y_{i}^{C}$	$\displaystyle\rightarrow K$		(24)
	$\displaystyle K+Y$	$\displaystyle\rightarrow\varnothing$		(25)

Reactions (22) and (23) activate and deactivate the “negative” output values and reactions (24) and (25) allow two active partial outputs to cancel out and consume the excess $Y$ in the process. When the input is in the domain of function $i$ , exactly one copy of $L_{i}^{Y}$ will be present, otherwise one copy of $L_{i}^{N}$ . Since we know that the predicate computation is execution bounded and produces at most one voter, the catalytic reaction will also happen at most as often as the leader changes its vote. Therefore, it is also execution bounded.

6 Limitations of execution bounded CRNs

The main positive results of the paper (LABEL:{thm:execution_bdd_CRDs_decide_all_semilinear_sets} and 5.5) rely on the assumption that valid initial configurations have a single leader (in particular, they are execution bounded only from configurations with a single leader, but not from arbitrary configurations). Section 4 shows that we may assume the CRD deciding a semilinear set is all-voting. However, for the “constructive” results LABEL:{lem:boolean-closure-crd} and 5.5, which compose the output of a CRD $\mathcal{D}$ with downstream computation, using $\mathcal{D}$ as a “subroutine” to stably compute a more complex set/function, the constructions crucially use the assumption that $\mathcal{D}$ is single-voting (i.e., only the leader of $\mathcal{D}$ votes) to argue the resulting composed CRN is execution bounded. In this section we show these assumptions are necessary, proving that execution bounded CRNs without those constraints are severely limited in their computational abilities.

We show that entirely execution bounded CRNs (from every configuration) can be characterized by a simpler property of having a “linear potential function” that essentially measures how close the CRN is to reaching a terminal configuration. We then use this characterization to prove that entirely execution bounded CRNs can stably decide only limited semilinear predicates (eventually constant, Definition 6.11), assuming all species vote, and that molecular counts cannot decrease to $O(1)$ in stable configurations (see Definition 6.8).

6.1 Linear potential functions

We define a linear potential function of a CRN to be a nonnegative linear function of states that each reaction strictly decreases.

Definition 6.1.

A linear potential function $\Phi:\mathbb{R}^{\Lambda}\to\mathbb{R}_{\geq 0}$ for a CRN is a nonnegative, linear function of configurations, such that for each reaction $(\mathbf{r},\mathbf{p})$ , $\Phi(\mathbf{p}-\mathbf{r})<0.$

Note that since $\Phi(\mathbf{x})=\sum_{S\in\Lambda}c_{S}\mathbf{x}(S)$ is required to be nonnegative on all configurations $\mathbf{x}$ , it must be nondecreasing in each species, i.e., all coefficients $c_{S}$ must be nonnegative (though some are permitted to be 0). Intuitively, we can think of $\Phi$ as assigning a nonnegative “mass” to each species (the mass of $S$ is $c_{S}$ ), such that each reaction removes a positive amount of mass from the system.

Remark 6.2.

A system of linear inequalities with rational coefficients has a real solution if and only if it has a rational solution. For any homogeneous system (where all inequalities are comparing to 0), any positive scalar multiple of a solution is also a solution. By clearing denominators, a system has a rational solution if and only if it has an integer solution. Thus, one can equivalently define a linear potential function to be a function $\Phi(\mathbf{x})=\sum_{S\in\Lambda}c_{S}\mathbf{x}(S)$ such that each $c_{S}\in\mathbb{N}$ , i.e., we may assume $\Phi:\mathbb{N}^{\Lambda}\to\mathbb{N}$ . In particular, since $\Phi$ is decreased by each reaction, it is decreased by at least 1.

A CRN may or may not have a linear potential function. Although it is not straightforward to “syntactically check” a CRN to see if has a linear potential function, it is efficiently decidable: a CRN has a linear potential function if and only if the following system of linear inequalities has a solution (which can be solved in polynomial time using linear programming techniques; the variables to solve for are the $c_{S}$ for each $S\in\Lambda$ ), where the $i$ ’th reaction has reactants $\mathbf{r}_{i}$ and products $\mathbf{p}_{i}$ , and species $S\in\Lambda$ has mass $c_{S}\geq 0$ : \optfull

(\forall i)\sum_{S\in\Lambda}[\mathbf{p}_{i}(S)-\mathbf{r}_{i}(S)]c_{S}<0

\opt

submission,final $(\forall i)\sum_{S\in\Lambda}[\mathbf{p}_{i}(S)-\mathbf{r}_{i}(S)]c_{S}<0.$ For example, for the reactions $A+A\mathop{\rightarrow}\limits B+C$ and $B+B\mathop{\rightarrow}\limits A$ , for each reaction to strictly decrease the potential function $\Phi(\mathbf{x})=c_{A}\mathbf{x}(A)+c_{B}\mathbf{x}(B)+c_{C}\mathbf{x}(C)$ , $\Phi$ must satisfy $2c_{A}>c_{B}+c_{C}$ and $2c_{B}>c_{A}$ . In this case, $c_{A}=1,c_{B}=1,c_{C}=0$ works.

The following is a variant of Farkas’ Lemma [farkas1902theorie], one of several similar “Theorems of the Alternative” stating that exactly one of two different linear systems has a solution. (See [mangasarian1994nonlinear, Chapter 4] for a list of such theorems.) A proof can be found in [gale1960theory, Theorem 2.10].

Theorem 6.3.

Let $\mathbf{M}$ be a real matrix. Exactly one of the following statements is true.

1.

There is a vector $\mathbf{u}\geqq\mathbf{0}$ such that $\mathbf{M}\mathbf{u}<\mathbf{0}$ .
2.

There is a vector $\mathbf{v}\geq\mathbf{0}$ such that $\mathbf{v}\mathbf{M}\geqq\mathbf{0}$ .

We require the following discrete variant of Theorem 6.3. The geometric intuition of this version is illustrated in Figure 1. \optsubmissionIt is proven in the appendix.

{corollaryrep}

Let $\mathbf{M}$ be a rational matrix. Exactly one of the following statements is true.

1.

There is an integer vector $\mathbf{u}\geq\mathbf{0}$ such that $\mathbf{M}\mathbf{u}\geqq\mathbf{0}$ .
2.

There is an integer vector $\mathbf{v}\geqq\mathbf{0}$ such that $\mathbf{v}\mathbf{M}<\mathbf{0}$ .

Proof 6.4.

For convenience when we use Section 6.1 in proving Theorem 6.5, we swapped the roles of $\mathbf{u}$ and $\mathbf{v}$ in left- vs. right-multiplication with $\mathbf{M}$ ; the real-valued version of the statement of Section 6.1 is equivalent to Theorem 6.3 by taking the transpose of $\mathbf{M}$ .

To see that we may assume the vectors are integer-valued if $\mathbf{M}$ is rational-valued, recall that a system of linear equalities/inequalities with rational coefficients has a solution if and only if it has a rational solution. Since the system is homogeneous (the matrix-vector product is compared to the zero vector $\mathbf{0}$ ), any multiple of a solution is also a solution. By clearing denominators, it has a rational solution if and only if it has an integer solution.

Refer to caption — Figure 1: Geometric intuition of Section 6.1. A matrix $\mathbf{M}$ has column vectors $\mathbf{x}_{1},\mathbf{x}_{2},\mathbf{x}_{3}$ . The *cone* of $\mathbf{M}$ is the nonnegative span of these vectors, shown as a faded gray region. Exactly one of two scenarios occurs: a) The cone of $\mathbf{M}$ intersects the first quadrant (nonnegative orthant in higher dimensions) away from the origin, i.e., some semipositive point ( $\mathbf{M}\mathbf{u}\geq\mathbf{0}$ above) is a nonnegative linear combination of vectors $\mathbf{x}_{1},\mathbf{x}_{2},\mathbf{x}_{3}$ . b) The cone of $\mathbf{M}$ does not intersect the first quadrant except at the origin. In this case we can draw a dashed line (hyperplane in higher dimensions) separating the cone of $\mathbf{M}$ from the first quadrant. The orthogonal vector $\mathbf{v}\geq\mathbf{0}$ to this line lies in the first quadrant, but $\mathbf{v}\mathbf{M}<0$ means each vector $\mathbf{x}_{i}$ has negative dot product $\mathbf{v}\cdot\mathbf{x}_{i}<0$ , i.e., all $\mathbf{x}_{i}$ vectors lie on the *other* side of the line.

{toappendix}

Although we do not need the following fact, it is worthwhile to observe that, if $\mathbf{M}$ is integer-valued (as in our application), then the solution $\mathbf{u}$ or $\mathbf{v}$ (whichever exists) in Section 6.1 has entries that are at most exponential in $\|\mathbf{M}\|$ i.e., at most exponential in the sum of absolute values of entries of $\mathbf{M}$ (see e.g., [papadimitriou1981complexity]). So in particular when we consider $\mathbf{M}$ having small $O(1)$ size entries, this means the solution $\mathbf{u}$ or $\mathbf{v}$ has entries that are at most exponential in the number of rows and columns of $\mathbf{M}$ . When $\mathbf{M}$ is a stoichiometric matrix, this corresponds to the number of species and reactions, respectively, of the CRN.

Section 6.1 will help us prove the following theorem characterizing CRNs with bounded executions from all configurations. Theorem 6.5 is used in this paper only to prove Theorems 6.9 and 6.3, but it may also be of independent interest, since it equates a “global, infinitary, difficult-to-check” condition (bounded executions from all configurations) with a “local, easy-to-check” condition (having a linear potential function).

Theorem 6.5.

A CRN has a linear potential function if and only if it is entirely execution bounded.

Proof 6.6.

Let $\mathcal{C}=(\Lambda,R)$ be a CRN. The forward direction is easy: assuming $\mathcal{C}$ has potential function $\Phi$ , since each reaction decreases $\Phi$ by at least 1 (see Remark 6.2), starting from configuration $\mathbf{x}$ , we can execute at most $\Phi(\mathbf{x})$ reactions while kee** $\Phi$ nonnegative. Thus $\mathcal{C}$ is entirely execution bounded.

To see the reverse direction, assume that $\mathcal{C}$ is execution bounded from every configuration, and let $\mathbf{M}$ be the stoichiometric matrix of $\mathcal{C}$ . We claim there is no integer vector $\mathbf{u}\geq\mathbf{0}$ satisfying $\mathbf{M}\mathbf{u}\geqq\mathbf{0}$ ; for the sake of contradiction suppose otherwise. Interpreting $\mathbf{u}$ as counts of reactions to execute, for any sufficiently large configuration $\mathbf{x}$ , all reactions in $\mathbf{u}$ can be applied (in arbitrary order), and the vector $\mathbf{M}\mathbf{u}$ describes the resulting change in species counts, reaching to configuration $\mathbf{y}=\mathbf{x}+\mathbf{M}\mathbf{u}$ . Since $\mathbf{M}\mathbf{u}\geqq\mathbf{0}$ , this path is self-covering, i.e., $\mathbf{y}\geqq\mathbf{x}$ . But since $\mathcal{C}$ is execution bounded from every configuration, by Section 3, $\mathcal{C}$ has no self-covering path from any configuration, a contradiction. This establishes the claim that $\mathbf{M}\mathbf{u}\geqq\mathbf{0}$ has no integer solution $\mathbf{u}\geq\mathbf{0}$ .

By Section 6.1, there is an integer vector $\mathbf{v}\geqq\mathbf{0}$ such that $\mathbf{v}\mathbf{M}<\mathbf{0}$ . Let $\mathbf{v}\in\mathbb{N}^{\Lambda}$ be the coefficients of a linear function $\Phi:\mathbb{N}^{\Lambda}\to\mathbb{N}$ , i.e., $\Phi(\mathbf{x})=\mathbf{v}\cdot\mathbf{x}$ . Then the vector $\mathbf{v}\mathbf{M}\in\mathbb{Z}^{R}$ represents the amount $\Phi$ changes by one unit of each reaction, i.e., $\mathbf{v}\mathbf{M}(\alpha)$ is the amount $\Phi$ increases when executing reaction $\alpha$ once. Since $\mathbf{v}\mathbf{M}<\mathbf{0}$ , this means that every reaction strictly decreases $\Phi$ , i.e., $\Phi$ is a linear potential function for $\mathcal{D}$ .

{toappendix}

Remark 6.7.

By employing the real-valued version of Section 6.1, the above proof also shows that Theorem 6.5 holds for the continuous model of CRNs [chen2023rate], in which species amounts are modeled as continuous nonnegative real concentrations. In this case, a continuous CRN would be defined to be execution bounded from configuration $\mathbf{x}$ if each reaction can be executed by at most a finite (real-valued) amount from $\mathbf{x}$ .

6.2 Impossibility of stably deciding majority and parity

In this section, we prove Theorem 6.9, which is a special case of our main negative result, Section 6.3. We give a self-contained proof of Theorem 6.9 because it is simpler and serves as an intuitive warmup to some of the key ideas used in proving Section 6.3, without the complexities of dealing with arbitrary semilinear sets.

Theorem 6.9 shows a limitation on the computational power of entirely execution bounded, all-voting CRNs, but it requires an additional constraint on the CRN for the result to hold (and we later give counterexamples showing that this extra hypothesis is provably necessary), described in the following definition.

Definition 6.8.

Let $\mathcal{D}$ be a CRD. The output size of $\mathcal{D}$ is the function $s:\mathbb{N}\to\mathbb{N}$ defined $s(n)=\min_{\mathbf{x},\mathbf{y}}\{\|\mathbf{y}\|\mid\mathbf{x}\Rightarrow% \mathbf{y},\|\mathbf{x}\|=n,\mathbf{x}\text{ is a valid initial configuration}% ,\mathbf{y}\text{ is stable}\}$ , the size of the smallest stable configuration reachable from any valid initial configuration of size $n$ . A CRD is non-collapsing if $\lim_{n\to\infty}s(n)=\infty$ .

Put another way, $\mathcal{D}$ is collapsing if there is a constant $c$ such that, from infinitely many initial configurations $\mathbf{x}$ , $\mathcal{D}$ can reach a stable configuration of size at most $c$ . All population protocols are non-collapsing, since every reaction preserves the configuration size.

Theorem 6.9.

No noncollapsing, all-voting, entirely execution bounded CRD can stably decide the majority predicate $[X_{1}\geq X_{2}?]$ or the parity predicate $[X\equiv 1\mod 2?]$ .

Proof 6.10.

Let $\mathcal{D}=(\Lambda,R,\Sigma,\Upsilon_{\mathrm{Y}},\Upsilon_{\mathrm{N}},% \mathbf{s})$ be a CRD obeying the stated conditions, and suppose for the sake of contradiction that $\mathcal{D}$ stably decides the majority predicate (so $\Sigma=\{X_{1},X_{2}\}$ ).

We consider the sequence of stable configurations $\mathbf{a}_{1},\mathbf{b}_{1},\mathbf{a}_{2},\mathbf{b}_{2},\dots$ defined as follows. Let $\mathbf{a}_{1}$ be a stable configuration reachable from initial configuration $\mathbf{s}+\{X_{1},X_{2}\}$ ; since the correct answer is yes, all species present in $\mathbf{a}_{1}$ vote yes. Now add a single copy of $X_{2}$ . By additivity, the configuration $\mathbf{a}_{1}+\{X_{2}\}$ is reachable from $\mathbf{s}+\{X_{1},2X_{2}\}$ , for which the correct answer in this case is no. Thus, since $\mathcal{D}$ stably decides majority, from $\mathbf{a}_{1}+\{X_{2}\}$ , a stable “no” configuration is reachable; call this $\mathbf{b}_{1}$ . Now add a single $X_{1}$ . Since the correct answer is yes, from $\mathbf{b}_{1}+\{X_{1}\}$ a stable “yes” configuration is reachable, call it $\mathbf{a}_{2}$ .

Continuing in this way, we have a sequence of stable configurations $\mathbf{a}_{1},\mathbf{b}_{1},\mathbf{a}_{2},\mathbf{b}_{2},\dots$ where all species in $\mathbf{a}_{i}$ vote yes and all species in $\mathbf{b}_{i}$ vote no. Since $\mathcal{D}$ is noncollapsing, the size of the configurations $\mathbf{a}_{i}$ and $\mathbf{b}_{i}$ increases without bound as $i\to\infty$ . (Possibly $\|\mathbf{a}_{i+1}\|<\|\mathbf{a}_{i}\|$ , i.e., the size is not necessarily monotonically increasing, but for all sufficiently large $j>i$ , we have $\|\mathbf{a}_{j}\|>\|\mathbf{a}_{i}\|$ .) Since all species vote, for some constant $\delta>0$ , to get from $\mathbf{a}_{i}+\{X_{2}\}$ to $\mathbf{b}_{i}$ , at least $\delta\|\mathbf{a}_{i}\|$ reactions must occur. This is because all species in $\mathbf{a}_{i}$ must be removed since they vote yes, and each reaction removes at most $O(1)$ molecules. (Concretely, let $\delta=1/\max_{(\mathbf{r},\mathbf{p})\in R}\|\mathbf{r}\|-\|\mathbf{p}\|$ , i.e., 1 over the most net molecules consumed in any reaction.) Similarly, to get from $\mathbf{b}_{i}+\{X_{1}\}$ to $\mathbf{a}_{i+1}$ , at least $\delta\|\mathbf{b}_{i}\|$ reactions must occur.

Since $\mathcal{D}$ is entirely execution bounded, by Theorem 6.5, $\mathcal{D}$ has a linear potential function $\Phi(\mathbf{x})=\mathbf{v}\cdot\mathbf{x}$ , where $\mathbf{v}\geq\mathbf{0}$ . Adding a single $X_{2}$ to $\mathbf{a}_{i}$ increases $\Phi$ by the constant $\mathbf{v}(X_{2})$ . Since $\|\mathbf{a}_{i}\|$ grows without bound, the number of reactions to get from $\mathbf{a}_{i}+\{X_{2}\}$ to $\mathbf{b}_{i}$ increases without bound as $i\to\infty$ , and since each reaction strictly decreases $\Phi$ by at least 1, the total change in $\Phi$ that results from adding $X_{2}$ and then going from $\mathbf{a}_{i}+\{X_{2}\}$ to $\mathbf{b}_{i}$ is unbounded in $i$ , so unboundedly negative for sufficiently large $i$ (negative once $i$ is large enough that $\delta\|\mathbf{a}_{i}\|\geq\mathbf{v}(X_{2})+2$ ). Similarly, adding a single $X_{1}$ to $\mathbf{b}_{i}$ and going from $\mathbf{b}_{i}+\{X_{1}\}$ to $\mathbf{a}_{i+1}$ , the resulting total change in $\Phi$ is unbounded and (eventually) negative.

$\Phi$ starts this process at the constant $\Phi(\mathbf{s}+\{X_{1},X_{2}\})$ . Before $\|\mathbf{a}_{i}\|$ and $\|\mathbf{b}_{i}\|$ are large enough that $\delta\|\mathbf{a}_{i}\|\geq\mathbf{v}(X_{2})+2$ and $\delta\|\mathbf{b}_{i}\|\geq\mathbf{v}(X_{1})+2$ (i.e., large enough that the net change in $\Phi$ is negative resulting from adding a single input and going to the next stable configuration), $\Phi$ could increase, if $\Phi(\{X_{1}\})$ (resp. $\Phi(\{X_{2}\})$ ) is larger than the net decrease in $\Phi$ due to following reactions to get from $\mathbf{a}_{i}+\{X_{2}\}$ to $\mathbf{b}_{i}$ (resp. from $\mathbf{b}_{i}+\{X_{1}\}$ to $\mathbf{a}_{i}$ ).

However, since $\mathcal{D}$ is noncollapsing, this can only happen for a constant number of $i$ (so $\Phi$ never reaches more than a constant above its initial value $\Phi(\mathbf{s}+\{X_{1},X_{2}\})$ ), after which $\Phi$ strictly decreases after each round of this process. At some point in this process, $\mathcal{D}$ will not be able to reach all the way to the next $\mathbf{a}_{i}$ or $\mathbf{b}_{i}$ without $\Phi$ becoming negative, a contradiction.

The argument for parity is similar, but instead of alternating adding $X_{1}$ then $X_{2}$ , in each round we always add one more $X$ to flip the correct answer.

Theorem 6.9 is false without the noncollapsing hypothesis. The following collapsing, leaderless (but all-voting and entirely execution bounded) CRD stably decides majority: Species $X_{1},x_{1}$ vote yes, while $X_{2},x_{2}$ vote no:

	$\displaystyle X_{1}+X_{2}$	$\displaystyle\mathop{\rightarrow}\limits x_{1}+x_{2}$
	$\displaystyle X_{1}+x_{2}$	$\displaystyle\mathop{\rightarrow}\limits X_{1}$
	$\displaystyle X_{2}+x_{1}$	$\displaystyle\mathop{\rightarrow}\limits X_{2}$
	$\displaystyle x_{1}+x_{2}$	$\displaystyle\mathop{\rightarrow}\limits x_{1}$

It has bounded executions from any configuration: $\min(\#X_{1},\#X_{2})$ of the first reaction can occur, and the other reactions decrease molecular count, so are limited by the total configuration size. However, it is collapsing since a stable configuration of size 1 is always reachable. Theorem 6.9 is similarly false without the all-voting hypothesis; for each of the reactions with one product above, add another non-voting product $W$ . This converts the CRD to be noncollapsing but not all-voting. Of course, the execution bounded hypothesis is also necessary: the original population protocols paper [angluin2004computation] showed that all-voting, noncollapsing, leaderless population protocols can stably decide all semilinear predicates.

The following collapsing, all-voting, leaderless (but entirely execution bounded) CRD stably decides parity. Let the input species be named $X_{1}$ . Species $X_{1}$ votes yes, $X_{0}$ votes no:

	$\displaystyle X_{1}+X_{1}$	$\displaystyle\mathop{\rightarrow}\limits X_{0}$
	$\displaystyle X_{1}+X_{0}$	$\displaystyle\mathop{\rightarrow}\limits X_{1}$
	$\displaystyle X_{0}+X_{0}$	$\displaystyle\mathop{\rightarrow}\limits X_{0}$

\opt

full It has bounded executions from any configuration: exactly $\#X_{1}+\#X_{0}-1$ reactions can occur since each reduces $\#X_{1}+\#X_{0}$ by 1. Similar to above, by adding the non-voting product $W$ to each reaction above, this CRD becomes noncollapsing but not all-voting, showing that the all-voting hypothesis is also necessary for stably deciding parity.

6.3 Impossibility of stably deciding not eventually constant predicates

We now present our main negative result, Section 6.3, which generalizes Theorem 6.9 to show that such CRNs can stably decide only trivial (eventually constant) predicates. \optsubmissionThe proof is in the appendix.

Definition 6.11.

Let $\phi:\mathbb{N}^{d}\rightarrow\{0,1\}$ be a predicate. We say $\phi$ is eventually constant if there is $n_{0}\in\mathbb{N}$ such that $\phi$ is constant on $\mathbb{N}_{\geq n_{0}}^{d}=\left\{\mathbf{x}\in\mathbb{N}^{d}\mid(\forall i% \in\{1,\ldots,d\})\ \mathbf{x}(i)\geq n_{0}\right\}$ , i.e., either $\phi^{-1}(0)\cap\mathbb{N}_{\geq n_{0}}^{d}=\emptyset$ or $\phi^{-1}(1)\cap\mathbb{N}_{\geq n_{0}}^{d}=\emptyset$ .

In other words, although $\phi$ may have an infinite number of each output, “sufficiently far from the boundary of the positive orthant” (where all coordinates exceed $n_{0}$ ), only one output appears. \optsubmission,fullSee Figure 2 for a 2D example.

{toappendix}

For any set $B\subseteq\mathbb{N}^{d}$ and $\mathbf{v}\in\mathbb{N}^{d}$ , write $B+\mathbf{v}$ to denote the set $\{\mathbf{x}+\mathbf{v}\mid\mathbf{x}\in B\},$ which is $B$ translated by vector $\mathbf{v}$ . Let $\mathbf{u}_{i}\in\mathbb{N}^{d}$ denote the unit vector in direction $i$ , i.e., $\mathbf{u}_{i}(i)=1$ and $\mathbf{u}_{i}(j)=0$ for $j\neq i$ .

Definition 6.12.

We say $A\subseteq\mathbb{N}^{d}$ is periodic if, for some $k\in\mathbb{N}^{+}$ , for some finite set $F\subseteq\{0,1,\dots,k-1\}^{d}$ , $A=\bigcup_{n_{1},\dots,n_{d}\in\mathbb{N}}F+\sum_{i=1}^{d}k\cdot n_{i}\cdot% \mathbf{u}_{i}$ . We say $k$ is the period of $A$ and say that $A$ is $k$ -periodic. Equivalently, $A$ is $k$ -periodic if, for all $\mathbf{x}\in\mathbb{N}^{d}$ and all unit vectors $\mathbf{u}_{i}$ , $\mathbf{x}\in A\iff\mathbf{x}+k\cdot\mathbf{u}_{i}\in A$ .

In other words, $A$ is periodic if it is a union of copies of a finite subset $F$ of the $k\times k\times\dots\times k$ hypercube with a corner at the origin, translated in each direction by every nonnegative integer multiple of the hypercube’s width. See Figure 3. Note that if $A$ is $k$ -periodic, then it is also $k^{\prime}$ -periodic for every positive integer multiple $k^{\prime}=i\cdot k$ of $k$ .

Lemma 6.13.

Let $A\subseteq\mathbb{N}^{d}$ be a Boolean combination of mod sets. Then $A$ is periodic.

Proof 6.14.

We prove this by induction on the number of mod sets. For the base case, let $A=\{\mathbf{x}\mid\mathbf{w}\cdot\mathbf{x}\equiv c\mod m\}$ be a single mod set, where $\mathbf{w}\in\{0,\dots,m-1\}^{d}$ and $c,m\in\mathbb{N}$ are constants. Letting $k=m$ and $F=A\cap\{0,\dots,m-1\}^{d}$ in Definition 6.12 works. Let $\mathbf{x}\in\mathbb{N}^{d}$ . Then for all $1\leq i\leq d$ , $\mathbf{w}\cdot\mathbf{x}\equiv\mathbf{w}\cdot(\mathbf{x}+m\mathbf{u}_{i})\mod m$ , so $\mathbf{w}\cdot\mathbf{x}\equiv c\mod m\iff\mathbf{w}\cdot(\mathbf{x}+m\mathbf% {u}_{i})\equiv c\mod m$ , meaning that $\mathbf{x}\in A\iff\mathbf{x}+m\mathbf{u}_{i}\in A$ , so $A$ is $k$ -periodic.

The inductive case amounts to showing that periodic sets are closed under Boolean operations of union, intersection, and complement. Clearly the complement of any periodic set is also periodic.

Inductively assume that $A_{1},A_{2}\subseteq\mathbb{N}^{d}$ are periodic; we argue that $A_{1}\cup A_{2}$ is periodic. Letting $k$ be the least common multiple of their periods, we may assume both $A_{1}$ and $A_{2}$ are $k$ -periodic with the same period $k$ . Then for all $\mathbf{x}\in\mathbb{N}^{d}$ and all unit vectors $\mathbf{u}_{i}$ , $\mathbf{x}\in A_{1}\iff\mathbf{x}+k\cdot\mathbf{u}_{i}\in A_{1}$ and $\mathbf{x}\in A_{2}\iff\mathbf{x}+k\cdot\mathbf{u}_{i}\in A_{2}$ . Thus $\mathbf{x}\in A_{1}\cup A_{2}\iff\mathbf{x}+k\cdot\mathbf{u}_{i}\in A_{1}\cup A% _{2}$ , so $A_{1}\cup A_{2}$ is also $k$ -periodic. Similar reasoning shows $A_{1}\cap A_{2}$ is $k$ -periodic (one can also appeal to DeMorgan’s Laws).

Each threshold set $T$ is defined by a hyperplane that partitions $\mathbb{N}^{d}$ into the sets $T$ (on one side of the hyperplane, including integer points on the hyperplane itself) and $\overline{T}$ (on the other side of the hyperplane). More generally, several threshold sets partition $\mathbb{N}^{d}$ into multiple disjoint subsets we call “regions”. Furthermore, any predicate that is a Boolean combination of threshold sets has constant output in any region; the next definition formalizes this.

Definition 6.15.

Let $A\subseteq\mathbb{N}^{d}$ be Boolean combination of threshold sets $T_{1},\dots,T_{k}\subset\mathbb{N}^{d}$ . A region of $A$ is a convex polytope $R\subset\mathbb{R}_{\geq 0}^{d}$ such that, for all $\mathbf{x},\mathbf{y}\in R\cap\mathbb{N}^{d}$ , for all $1\leq i\leq k$ , $\mathbf{x}\in T_{i}\iff\mathbf{y}\in T_{i}$ . The output of the region $R$ is the value 1 if $R\cap\mathbb{N}^{d}\subset A$ and 0 if $R\cap\mathbb{N}^{d}\cap A=\emptyset$ . (Note these are the only two possibilities, since no individual threshold set $T_{i}$ is exited or entered as we move within $R$ .) A region $R$ is totally unbounded if, for all $c\in\mathbb{N}$ , $R\cap\mathbb{N}_{\geq c}^{d}\neq\emptyset$ , i.e., $R$ contains points that are arbitrarily large on all components. A region is called partially bounded if it is not totally unbounded.

Put another way, predicates defined by Boolean combinations of threshold sets are defined by $(d-1)$ -dimensional hyperplanes that partition $\mathbb{N}^{d}$ into regions, where in each region, the output of the predicate is all yes, or all no. In fact this is an exact characterization of Boolean combinations of threshold predicates.

Definition 6.16.

For any set $A\subseteq\mathbb{R}^{d}$ , the recession cone of $A$ is

\mathrm{recc}(A)=\{\mathbf{v}\in\mathbb{R}^{d}\mid(\forall\mathbf{x}\in A)(% \forall\lambda>0)\ \mathbf{x}+\lambda\mathbf{v}\in A\},

the set of vectors $\mathbf{v}$ such that, from any point in $A$ , one can move in direction $\mathbf{v}$ forever without leaving $A$ .

{observation}

A region $R$ defined by threshold sets is totally unbounded if and only if $\mathrm{recc}(R)\cap\mathbb{R}_{>0}^{d}\neq\emptyset$ , i.e., the recession cone of $R$ contains a positive vector.

Lemma 6.17.

Let $A\subseteq\mathbb{N}^{d}$ be Boolean combination of threshold sets that is not eventually constant. Then there are two adjacent totally unbounded regions $R_{0}$ , $R_{1}$ with opposite outputs, such that the normal vector $\mathbf{h}$ of the hyperplane $H$ separating $R_{0}$ and $R_{1}$ has at least one negative component and at least one positive component.

Proof 6.18.

See Figure 5 for an example in 2D. Since $A$ is not eventually constant, it must have two totally unbounded regions $R_{0}$ and $R_{1}$ with opposite outputs; assume WLOG that $R_{i}$ has output $i$ . Let $c\geq 0$ be sufficiently large that all partially bounded regions of $A$ are subsets of $B=\mathbb{N}^{d}\setminus\mathbb{N}_{\geq c}^{d}$ . Now, simply pick any points $\mathbf{x}_{0}\in R_{0}\setminus B$ and $\mathbf{x}_{1}\in R_{1}\setminus B$ . There is some path from $\mathbf{x}_{0}$ to $\mathbf{x}_{1}$ that follows only unit vectors (i.e., moves only to adjacent points that are distance 1 from the previous point), such that every intermediate point $\mathbf{x}^{\prime}$ also obeys $\mathbf{x}^{\prime}\not\in B$ .

Then this path never enters a partially bounded region of $A$ , since they are all subsets of $B$ . Thus, since the path starts in a region $R_{0}$ with output 0, ends in a region $R_{1}$ with output 1, there must be two adjacent points $\mathbf{a},\mathbf{b}$ on the path, where $\mathbf{a}$ is in a totally unbounded region with output 0 and $\mathbf{b}$ is in a totally unbounded region with output 1.

Finally, we must that the normal vector of the hyperplane separating $R_{0}$ from $R_{1}$ has a negative and a positive entry. Recall that a threshold set $T$ is defined by $T=\{\mathbf{x}\in\mathbb{N}^{d}\mid\mathbf{w}\cdot\mathbf{x}\leq a\}$ , where $\mathbf{w}=(w_{1},\dots,w_{d})\in\mathbb{N}^{d}$ and $a\in\mathbb{N}$ (Definition 2.4). Since $A$ is a Boolean combination of threshold sets and $R_{0},R_{1}$ are adjacent with opposite outputs, there must be some threshold set $T$ such that $R_{1}\subseteq T$ , but $R_{0}\cap T=\emptyset$ (or vice versa, but assume $R_{1}\subseteq T$ WLOG, since we could replace $T$ with $\overline{T}$ in the Boolean combination defining $A$ ). Equivalently, we can think of the regions $R_{0}$ and $R_{1}$ as being separated by the hyperplane $\mathbf{w}\cdot\mathbf{x}=a$ , with normal vector $\mathbf{w}$ and offset $a$ , such that all points $\mathbf{x}\in R_{1}$ obey $\mathbf{w}\cdot\mathbf{x}\leq a$ , and all points $\mathbf{x}\in R_{0}$ obey $\mathbf{w}\cdot\mathbf{x}>a$ . The transition between the regions at points $\mathbf{a}$ and $\mathbf{b}$ involves crossing the hyperplane, where the inequality changes from $\leq a$ to $>a$ , which defines the boundary between different outputs (0 in $R_{0}$ and 1 in $R_{1}$ ). Therefore, the points on the hyperplane $\mathbf{w}\cdot\mathbf{x}=a$ necessarily lie exactly at the boundary between these regions.

We show that $\mathbf{w}$ cannot be nonnegative or nonpositive. Suppose $\mathbf{w}\geq\mathbf{0}$ (scale the normal vector by $-1$ otherwise). Since $R_{1}$ is totally unbounded, it contains points that are arbitrarily large on all components. More formally, there is a strictly increasing sequence $\mathbf{x}_{1}<\mathbf{x}_{2}<\dots$ such that all $\mathbf{x}_{i}\in R_{1}.$ Since $\mathbf{w}\geq\mathbf{0}$ , $\lim_{i\to\infty}\mathbf{w}\cdot\mathbf{x}_{i}=\infty$ . This contradicts the previous assumption that all points $\mathbf{x}\in R_{1}$ obey $\mathbf{w}\cdot\mathbf{x}\leq a$ (geometrically, we would cross the hyperplane somewhere and land in $R_{0}$ ). Symmetric reasoning applies to the case $\mathbf{w}\leq\mathbf{0}$ . We conclude that the separating hyperplane must have a normal vector $\mathbf{w}$ with at least one positive and at least one negative component, establishing the lemma.

The next lemma shows that the there exists a vector $\mathbf{v}>\mathbf{0}$ parallel to the hyperplane separating the two regions. In other words, we can move along $H$ while increasing every component.

{lemmarep}

Let $H$ be a hyperplane with normal vector $\mathbf{h}$ . Then there is a positive vector $\mathbf{v}>\mathbf{0}$ with $\mathbf{v}\cdot\mathbf{h}=0$ if and only if $\mathbf{h}$ has at least one negative component and at least one positive component.

Proof 6.19.

If $\mathbf{v}>\mathbf{0}$ and $\mathbf{h}\geq\mathbf{0}$ then $\mathbf{v}\cdot\mathbf{h}>0$ . Similarly, if $\mathbf{v}>\mathbf{0}$ and $\mathbf{h}\leq\mathbf{0}$ then $\mathbf{v}\cdot\mathbf{h}<0$ . So to get $\mathbf{v}\cdot\mathbf{h}=0$ , $\mathbf{h}$ must have at least one positive and at least one negative element.

We construct $\mathbf{v}$ as follows: Let $I_{+}$ denote the indices of the positive coordinates of $\mathbf{h}$ and $I_{-}$ the indices of the negative coordinates. Our goal is to balance out the positive and negative parts of the dot product, given by $\mathbf{v}\cdot\mathbf{h}=\sum_{i\in I_{+}}\mathbf{v}(i)\mathbf{h}(i)+\sum_{i% \in I_{-}}\mathbf{v}(i)\mathbf{h}(i)$ . Set $\mathbf{v}(i)$ to be the sum of the positive coordinates of $\mathbf{h}$ if $i\in I_{-}$ and the sum of the absolute values of negative coordinates of $\mathbf{h}$ otherwise:

\displaystyle\mathbf{v}(i)=\begin{cases}\sum_{j\in I_{-}}|\mathbf{h}(j)|&\text% {if }i\in I_{+},\\ \sum_{j\in I_{+}}\mathbf{h}(j)&\text{if }i\in I_{-},\\ 0&\text{otherwise.}\end{cases}

Substituting into the formula shows the correctness. For brevity, let $p:=\mathbf{v}(i)$ if $i\in I_{+}$ and $n:=\mathbf{v}(i)$ if $i\in I_{-}$ as above.

	$\displaystyle\mathbf{v}\cdot\mathbf{h}$	$\displaystyle=\sum_{i\in I_{+}}\mathbf{v}(i)\mathbf{h}(i)+\sum_{i\in I_{-}}% \mathbf{v}(i)\mathbf{h}(i)$
		$\displaystyle=\sum_{i\in I_{+}}\left(n\right)\mathbf{h}(i)+\sum_{i\in I_{-}}% \left(p\right)\mathbf{h}(i)$
		$\displaystyle=n\sum_{i\in I_{+}}\mathbf{h}(i)+p\sum_{i\in I_{-}}\mathbf{h}(i)$
		$\displaystyle=np+p(-n)$
		$\displaystyle=0.$

Finally, if $\mathbf{v}$ is not integer-valued, scale it by the least common multiple of all coordinate denominators to ensure $\mathbf{v}\in\mathbb{N}^{d}$ without altering the dot product.

Lemma 6.20.

Let $\phi:\mathbb{N}^{d}\to\{0,1\}$ be a semilinear predicate that is not eventually constant. Then there is an infinite sequence $\mathbf{x}_{0},\mathbf{x}_{1},\dots$ and constant $c$ , such that for all $j\in\mathbb{N}$ ,

1.

$\phi(\mathbf{x}_{j})\neq\phi(\mathbf{x}_{j+1})$ (correct answer swaps for each subsequent point),
2.

$\mathbf{x}_{j}\leq\mathbf{x}_{j+1}$ (inputs are increasing), and
3.

$\|\mathbf{x}_{j+1}-\mathbf{x}_{j}\|\leq c$ (adjacent inputs are “close”).

Proof 6.21.

We associate to $\phi$ the set $A\subseteq\mathbb{N}^{d}$ where $\phi^{-1}(1)=A$ , i.e., $\phi(\mathbf{x})=1\iff\mathbf{x}\in A$ .

Since $A$ is semilinear, it is a Boolean combination of threshold sets $T_{1},\dots,T_{k}$ and mod sets $M_{1},\dots,M_{l}$ . Recall Definition 6.15, where the threshold sets partition $\mathbb{N}^{d}$ into regions, where moving within a region does not cross and hyperplanes defining the threshold sets, thus does not change the Boolean value [ $\mathbf{x}\in T_{i}$ ?] for any $T_{i}$ . Suppose we have $m$ regions $R_{1},\dots,R_{m}.$ Then we can rewrite $A\cap R_{j}$ as a Boolean combination of mod sets only, intersected with $R_{j}$ . We do this by replacing each $T_{i}$ in the original Boolean expression with either $\mathbb{N}^{d}$ or $\emptyset$ , depending whether $R_{j}\subseteq T_{i}$ or $R_{j}\cap T_{i}=\emptyset$ , respectively.⁵⁵5 For example, if the expression is $T_{1}\cup(M_{1}\cap T_{2})\cup(M_{2}\cup T_{3})\cup(M_{3}\cap M_{4})$ , if the points are in $T_{2}$ but not $T_{1}$ or $T_{3}$ , this becomes $\emptyset\cup(M_{1}\cap\mathbb{N}^{d})\cup(M_{2}\cup\emptyset)\cup(M_{3}\cap M% _{4})=M_{1}\cup M_{2}\cup(M_{3}\cap M_{4})$ . (Note by the definition of region these are the only two possibilities.) Let $M^{\prime}_{j}$ be this Boolean combination of mod sets, such that $M^{\prime}_{j}\cap R_{j}=A\cap R_{j}$ . By Lemma 6.13, $M^{\prime}_{j}$ is periodic.

Consider a totally unbounded region $R_{j}.$ By Section 6.3, $\mathrm{recc}(R_{j})$ contains a positive vector $\mathbf{v}$ . We have two cases:

for some totally unbounded region $R_{j}$ , $M^{\prime}_{j}\cap R_{j}$ is not constant:

This is illustrated in Figures 3 and 4, which show two subcases. Figure 3 shows the subcase where, for some $\mathbf{v}\in\mathrm{recc}(R_{j})\cap\mathbb{N}^{d}$ and point $\mathbf{y}_{0}\in R_{j}$ , defining $\mathbf{y}_{i}=\mathbf{y}_{0}+i\mathbf{v}$ , the sequence $\phi(\mathbf{y}_{0}),\phi(\mathbf{y}_{1}),\dots$ is not constant. Since $M^{\prime}_{j}$ is periodic, the sequence $O=(\phi(\mathbf{y}_{0}),\phi(\mathbf{y}_{1}),\dots)$ is periodic with period $p$ . So we can find a subsequence $\mathbf{x}_{0},\mathbf{x}_{1},\dots$ obeying all three conditions of the lemma. In particular, it suffices to choose a point $\mathbf{x}_{0}=\mathbf{y}_{0}\in R_{j}\cap A$ (resp. $\mathbf{y}_{0}\in R_{j}\setminus A$ ) let $i<p$ such that $\mathbf{y}_{i}\not\in A$ (resp. $\mathbf{y}_{i}\in A$ ), letting $\mathbf{x}_{1}=\mathbf{y}_{i}$ , and let $\mathbf{x}_{2}=\mathbf{y}_{p}$ , and subsequent elements of the subsequence are the same distances apart ( $\mathbf{x}_{3}=\mathbf{y}_{p+i},\mathbf{x}_{4}=\mathbf{y}_{2p},\dots$ ).

Figure 4 shows the subcase where, for all $\mathbf{y}_{0}\in R_{j}$ and $\mathbf{v}\in\mathrm{recc}(R_{j})\cap\mathbb{N}^{d}$ , defining $\mathbf{y}_{i}=\mathbf{y}_{0}+i\mathbf{v}$ , the sequence $\phi(\mathbf{y}_{0}),\phi(\mathbf{y}_{1}),\dots$ is constant. However, since $M^{\prime}_{j}$ is not constant, we can still find a sequence $\mathbf{x}_{0},\mathbf{x}_{1},\dots$ , but unlike the previous subcase, it is not a subsequence of points collinear along one vector $\mathbf{v}$ .

Since $M^{\prime}_{j}$ is periodic and not constant, and since $R_{j}$ is totally unbounded, for every $\mathbf{x}\in R_{j}\cap A$ , there is $\mathbf{x}^{\prime}\geq\mathbf{x}$ such that $\mathbf{x}^{\prime}\in R_{j}\setminus A$ , i.e., for every point in the region in $A$ , there is a larger point in $R_{j}$ not in $A$ . Also, since $M^{\prime}_{j}$ is periodic, there is a constant $c$ independent of $\mathbf{x}$ such that $\|\mathbf{x}^{\prime}-\mathbf{x}\|\leq c$ . By symmetric reasoning, there is a $\mathbf{x}^{\prime\prime}\in R_{j}\cap A$ such that $\mathbf{x}^{\prime\prime}\geq\mathbf{x}^{\prime}$ and $\|\mathbf{x}^{\prime\prime}-\mathbf{x}^{\prime}\|\leq c$ .

Let $\mathbf{x}_{0}\in R_{j}\cap A$ be arbitrary. For all $i\in\mathbb{N}$ , choose $\mathbf{x}_{i+1}\in R_{j}$ based on $\mathbf{x}_{i}$ as above, such that $\mathbf{x}_{i+1}\geq\mathbf{x}_{i}$ , $\|\mathbf{x}_{i+1}-\mathbf{x}_{i}\|\leq c$ , and $\mathbf{x}_{i+1}\in A$ if $i$ is odd and $\mathbf{x}_{i+1}\not\in A$ if $i$ is even. Then the sequence $\mathbf{x}_{0},\mathbf{x}_{1},\dots$ satisfies the lemma.

for all totally unbounded regions $R_{j}$ , $M^{\prime}_{j}\cap R_{j}$ is constant:

This implies that the mod sets $M_{1},\dots,M_{l}$ can be “factored out” of the Boolean expression defining $A$ in terms of threshold sets $T_{1},\dots,T_{k}$ and the mod sets $M_{1},\dots,M_{l}$ , which will give the same output as $A$ in totally unbounded regions. Put another way, $A\cap(R_{1}\cup\dots\cup R_{u})$ is a Boolean combination of the threshold sets $T_{1},\dots,T_{k}$ , where $R_{1},\dots,R_{u}$ represents all the totally unbounded regions.

By Lemma 6.17, two adjacent totally unbounded regions of $A$ have opposite outputs. See Figure 5 for an example of picking the points $\mathbf{x}_{0},\mathbf{x}_{1},\dots$ below. These adjacent regions are separated by some hyperplane $H_{j}$ , such that $H_{j}\subseteq A$ , but for some unit vector $\mathbf{u}_{i}$ , $(H_{j}+\mathbf{u}_{i})\cap A=\emptyset$ , i.e., all of $H_{j}$ is contained in $A$ , but the entire hyperplane adjacent to $H_{j}$ in direction $\mathbf{u}_{i}$ , consists of points not in $A$ . Note this is not true for general hyperplanes, e.g., one whose orthogonal vector is $(1,1)$ , where both unit vectors $\mathbf{u}_{1}=(1,0)$ and $\mathbf{u}_{2}=(0,1)$ would move off the hyperplane, but in the “yes” direction where the point is still contained in the threshold set. However, since $H_{j}$ is separating two totally unbounded regions, some strictly positive vector $\mathbf{v}>\mathbf{0}$ is parallel to $H_{j}$ , i.e., obeys $\mathbf{v}\cdot\mathbf{h}=0$ for $H_{j}$ ’s orthogonal vector $\mathbf{h}$ . By Section 6.3, $\mathbf{h}$ has at least one positive coordinate (say $i$ ) and at least one negative coordinate (say $k$ ), so that unit vector $\mathbf{u}_{i}$ moves to one side of $H_{j}$ and $\mathbf{u}_{k}$ moves to the other side.

In this case, we let $\mathbf{v}>\mathbf{0}$ be some vector parallel to $H_{j}$ , let $\mathbf{x}_{0}\in H_{j}$ , sufficiently large that the vector $\mathbf{v}$ , starting at $\mathbf{x}_{0}$ , does not cross any of the hyperplanes of $T_{1},\dots,T_{k}$ (as in Figure 5). Define the rest of the infinite sequence as

	$\displaystyle\mathbf{x}_{1}$	$\displaystyle=\mathbf{x}_{0}+\mathbf{u},$
	$\displaystyle\mathbf{x}_{2}$	$\displaystyle=\mathbf{x}_{0}+\mathbf{v},$
	$\displaystyle\mathbf{x}_{3}$	$\displaystyle=\mathbf{x}_{0}+\mathbf{v}+\mathbf{u},$
	$\displaystyle\mathbf{x}_{4}$	$\displaystyle=\mathbf{x}_{0}+2\mathbf{v},$
	$\displaystyle\mathbf{x}_{5}$	$\displaystyle=\mathbf{x}_{0}+2\mathbf{v}+\mathbf{u},$
	$\displaystyle\mathbf{x}_{6}$	$\displaystyle=\mathbf{x}_{0}+3\mathbf{v},$
	$\displaystyle\mathbf{x}_{7}$	$\displaystyle=\mathbf{x}_{0}+3\mathbf{v}+\mathbf{u},$
	$\displaystyle\vdots$

By the arguments given above, for all odd $i$ , $\phi(\mathbf{x}_{i})=0$ and for all even $i$ , $\phi(\mathbf{x}_{i})=1$ , satisfying condition (1). If $j$ is even, then $\mathbf{x}_{j+1}=\mathbf{x}_{j}+\mathbf{v}+\mathbf{u}$ , so clearly $\mathbf{x}_{j}\leq\mathbf{x}_{j+1}$ , satisfying condition (2). If $j$ is odd, then $\mathbf{x}_{j+1}=\mathbf{x}_{j}-\mathbf{u}+\mathbf{v}$ . Since $\mathbf{v}>\mathbf{0}$ , we have $\mathbf{v}-\mathbf{u}\geq\mathbf{0}$ , so $\mathbf{x}_{j+1}\geq\mathbf{x}_{j}$ , satisfying condition (2). Finally, $\|\mathbf{x}_{j+1}-\mathbf{x}_{j}\|\leq\|\mathbf{v}\|+1$ , satisfying condition (3).

{thmrep}

If a noncollapsing, all-voting, entirely execution bounded CRD stably decides a predicate $\phi$ , then $\phi$ is eventually constant.

\opt

submissionA complete proof appears in the appendix.

{proofsketch}

This proof is similar to that of Theorem 6.9. In that proof, we repeatedly add a “constant amount of additional input $\{X_{2}\}$ or $\{X_{1}\}$ , which flips the output”. For more general semilinear, but not eventually constant, predicates, we dig into the structure of the semilinear set to find a sequence of constant-size vectors representing additional inputs that flip the correct answer. Any predicate that is not eventually constant has infinitely many yes inputs and infinitely many no inputs, but in general they could be increasingly far apart: e.g., $\phi(\mathbf{x})=1$ if and only if $2^{n}\leq\|\mathbf{x}\|<2^{n+1}$ for even $n$ . For the potential function argument to work, each subsequence input needs to be at most a constant larger than the previous.

But if $\phi$ is semilinear (and not eventually constant) then we can show that there is a sequence of increasing inputs $\mathbf{x}_{0}\leq\mathbf{x}_{1}\leq\mathbf{x}_{2}\leq\dots$ , each a constant distance from the next ( $\|\mathbf{x}_{j+1}-\mathbf{x}_{j}\|=O(1)$ ), flip** the output ( $\phi(\mathbf{x}_{j})\neq\phi(\mathbf{x}_{j+1})$ ). Roughly, this is true for one of two reasons. Using Theorem 2.5, $\phi$ is a Boolean combination of threshold and mod sets. Either the mod sets are not combined to be trivially $\emptyset$ or $\mathbb{N}^{d}$ , in which case we can find some vector $\mathbf{v}$ that, followed infinitely far from some starting point $\mathbf{x}_{0}$ (so $\mathbf{x}_{i}=\mathbf{x}_{0}+i\mathbf{v}$ ) periodically hits both yes inputs ( $\phi(\mathbf{x}_{j})=1$ ) and no inputs ( $\phi(\mathbf{x}_{j})=0$ ). \optsubmission,full(See Figures 3 and 4.) Otherwise, the mod sets can be removed and simplify the Boolean combination to only threshold sets, in which case the infinite sequence $\mathbf{x}_{0},\mathbf{x}_{1},\dots$ can be obtained by moving along a threshold hyperplane that separates yes from no inputs. \optsubmission,full(See Figure 5.)

Proof 6.22.

This proof is similar to that of Theorem 6.9, with the vectors $\mathbf{v}_{i}$ defined below playing the role of the “constant amount of additional input $\{X_{2}\}$ or $\{X_{1}\}$ that flips the correct answer” in that proof.

Let $\mathcal{D}=(\Lambda,R,\Sigma,\Upsilon_{\mathrm{Y}},\Upsilon_{\mathrm{N}},% \mathbf{s})$ be a CRD obeying the stated conditions, and suppose for the sake of contradiction that $\mathcal{D}$ stably decides a semilinear predicate $\phi$ that is not eventually constant.

By Lemma 6.20, there is an infinite sequence $\mathbf{x}_{0},\mathbf{x}_{1},\dots$ such that

1.

$\phi(\mathbf{x}_{i})\neq\phi(\mathbf{x}_{i+1})$ (i.e., the correct answer swaps for each subsequent input)
2.

$\mathbf{x}_{i}\leq\mathbf{x}_{i+1}$ , i.e., the inputs are increasing (on at least one coordinate(s)), and
3.

for some constant $c$ , $\|\mathbf{x}_{i+1}-\mathbf{x}_{i}\|\leq c$ , i.e., adjacent inputs are “close”.

Assume WLOG that $\phi(\mathbf{x}_{0})=0$ . For each $i\in\mathbb{N}$ , let $\mathbf{v}_{i}=\mathbf{x}_{i+1}-\mathbf{x}_{i}$ , noting by condition (2) that $\mathbf{v}_{i}\geq 0$ .

We consider the sequence of stable configurations $\mathbf{a}_{0},\mathbf{a}_{1},\mathbf{a}_{2},\dots$ defined as follows. Let $\mathbf{a}_{0}$ be a stable configuration reachable from $\mathbf{x}_{0}$ ; since the correct answer is no, all species present in $\mathbf{a}_{0}$ vote no. Now add $\mathbf{v}_{0}$ to $\mathbf{a}_{0}$ . By additivity, the configuration $\mathbf{a}_{0}+\mathbf{v}_{0}$ is reachable from $\mathbf{x}_{1}=\mathbf{x}_{0}+\mathbf{v}_{0}$ . Since the correct answer for $\mathbf{x}_{1}$ is yes, $\mathcal{D}$ must go from $\mathbf{a}_{0}+\mathbf{v}_{0}$ to a stable “yes” configuration, call this $\mathbf{a}_{1}$ . Now add $\mathbf{v}_{1}$ to $\mathbf{a}_{1}$ . Since the correct answer is no, $\mathcal{D}$ must now reach from $\mathbf{a}_{1}+\mathbf{v}_{1}$ to a stable “no” configuration, call it $\mathbf{a}_{2}$ . By condition (3), each $\mathbf{v}_{i}$ obeys $\|\mathbf{v}_{i}\|<c$ for some constant $c$ .

Continuing in this way, we have a sequence of stable configurations $\mathbf{a}_{0},\mathbf{a}_{1},\dots$ where all species in $\mathbf{a}_{i}$ vote yes for odd $i$ , and all species in $\mathbf{a}_{i}$ vote no for even $i$ . Since $\mathcal{D}$ is noncollapsing, the size of the configurations $\mathbf{a}_{i}$ increases without bound as $i\to\infty$ . (Possibly $\|\mathbf{a}_{i+1}\|<\|\mathbf{a}_{i}\|$ , i.e., the size is not necessarily monotonically nondecreasing, but for all sufficiently large $j>i$ , we have $\|\mathbf{a}_{j}\|>\|\mathbf{a}_{i}\|$ .)

Since all species vote, for some constant $\delta>0$ , to get from $\mathbf{a}_{i}+\mathbf{v}_{i}$ to $\mathbf{a}_{i+1}$ , at least $\delta\|\mathbf{a}_{i}\|$ reactions must occur. This is because all species in $\mathbf{a}_{i}$ must be removed since they vote the opposite of the voters in $\mathbf{a}_{i+1}$ , and each reaction removes at most $O(1)$ molecules. (Concretely, let $\delta=1/\max_{(\mathbf{r},\mathbf{p})\in R}\|\mathbf{r}\|-\|\mathbf{p}\|$ , i.e., 1 over the most net molecules consumed in any reaction.)

Since $\mathcal{D}$ is entirely execution bounded, by Theorem 6.5, $\mathcal{D}$ has a linear potential function $\Phi(\mathbf{x})=\mathbf{w}\cdot\mathbf{x}$ , where $\mathbf{w}\geq\mathbf{0}$ . Adding $\mathbf{v}_{i}$ to $\mathbf{a}_{i}$ increases $\Phi$ by $\mathbf{w}(\mathbf{v}_{i})$ , which is bounded above by a constant since $\|\mathbf{v}_{i}\|<c$ . Since $\|\mathbf{a}_{i}\|$ grows without bound, the number of reactions to get from $\mathbf{a}_{i}+\mathbf{v}_{i}$ to $\mathbf{a}_{i+1}$ increases without bound as $i\to\infty$ , and since each reaction strictly decreases $\Phi$ by at least 1, the total change in $\Phi$ that results from adding $\mathbf{v}_{i}$ and then going from $\mathbf{a}_{i}+\mathbf{v}_{i}$ to $\mathbf{a}_{i+1}$ is unbounded in $i$ , so unboundedly negative for sufficiently large $i$ (negative once $i$ is large enough that $\delta\|\mathbf{a}_{i}\|\geq\mathbf{w}(\mathbf{v}_{i})+2$ ).

However, $\Phi$ started at the constant $\Phi(\mathbf{x}_{0})$ . Before $\|\mathbf{a}_{i}\|$ is large enough that $\delta\|\mathbf{a}_{i}\|\geq\mathbf{w}(\mathbf{v}_{i})+2$ (i.e., large enough that the net change in $\Phi$ is negative resulting from adding a single input and going to the next stable configuration), $\Phi$ could increase, if $\Phi(\mathbf{v}_{i})$ is larger than the net decrease in $\Phi$ due to following reactions to get from $\mathbf{a}_{i}+\mathbf{v}_{i}$ to $\mathbf{a}_{i+1}$ .

At some point in this process, $\mathcal{D}$ will not be able to reach all the way to the next $\mathbf{a}_{i}$ without $\Phi$ becoming negative, a contradiction.

The statement of Theorem 6.9 does not mention the concept of a leader, but it would typically apply to leaderless CRDs. A CRD may be execution bounded from configurations with a single leader, but not execution bounded when multiple leaders are present (preventing the use of Theorem 6.5, which requires the CRD to be execution bounded from all configurations). For example, in Lemma 4.5, reaction (9) occurs finitely many times if the leader/voter $S_{Y}$ or $S_{N}$ has count 1. However, if $S_{Y}$ and $S_{N}$ can be present simultaneously (e.g., if we start with two leaders), then the reactions $S_{Y}+V_{NN}\mathop{\rightarrow}\limits S_{Y}+V_{YN}$ and $S_{N}+V_{YN}\mathop{\rightarrow}\limits S_{N}+V_{NN}$ can flip between $V_{NN}$ and $V_{YN}$ infinitely often in an unbounded execution.

If the CRN is leaderless, however, we have the following, which says that if it is execution bounded from valid initial configurations, then it is execution bounded from all configurations.

{lemmarep}

If a leaderless CRD or CRC is execution bounded, then it is entirely execution bounded.

{proofsketch}\opt

submissionA proof is in the appendix. Since $\mathcal{C}$ is leaderless, the sum of two valid initial configurations is also valid. Thus if we can produce some species from a valid initial configuration, we can produce arbitrarily large counts of all species by adding up sufficiently many initial configurations. This means that for any configuration $\mathbf{x}$ , from any sufficiently large valid initial configuration $\mathbf{i}$ , some $\mathbf{y}\geqq\mathbf{x}$ is reachable from $\mathbf{i}$ . But if $\mathcal{C}$ is execution bounded from $\mathbf{i}$ , since $\mathbf{i}\Rightarrow\mathbf{y}$ , it must also be execution bounded from $\mathbf{y}$ , thus also from $\mathbf{x}$ since by additivity any reactions applicable to $\mathbf{x}$ are also applicable to $\mathbf{y}$ .

Proof 6.23.

Let $\mathcal{C}$ be a leaderless CRD or CRC. Let $\mathbf{x}$ be any configuration. We first show that some $\mathbf{y}\geqq\mathbf{x}$ is reachable from a valid initial configuration $\mathbf{i}$ .

We may assume without loss of generality that $\mathcal{C}$ only contains species producible from valid initial configurations, otherwise we obtain an equivalent CRN by removing those unproducible species from $\mathcal{C}$ .

Since $\mathcal{C}$ is leaderless, the sum of two valid initial configurations is also valid. Then each species $S$ being producible means that there is a valid initial configuration $\mathbf{i}_{S,1}$ such that for some $\mathbf{y}_{S,1}$ , $\mathbf{i}_{S,1}\Rightarrow\mathbf{y}_{S,1}$ and $\mathbf{y}_{S,1}(S)\geq 1$ , i.e., at least one copy of $S$ can be produced. Let $\mathbf{i}_{S,k}=k\cdot\mathbf{i}_{S,1}$ . By additivity, $\mathbf{i}_{S,k}\Rightarrow\mathbf{y}_{S,k}$ , where $\mathbf{y}_{S,k}=k\cdot\mathbf{y}_{S,1}$ , noting that $\mathbf{y}_{S,k}(S)\geq k$ . In other words, all species are producible in arbitrarily large counts from some valid initial configuration.

Now we argue all species can be made simultaneously arbitrarily large count from some valid initial configuration; in particular, we can reach a configuration with counts at least $\mathbf{x}$ . Let $\mathbf{i}=\sum_{S\in\Lambda}\mathbf{i}_{S,\mathbf{x}(S)}$ . Since each $\mathbf{i}_{S,\mathbf{x}(S)}\Rightarrow\mathbf{y}_{S,\mathbf{x}(S)}$ , by additivity we have $\mathbf{i}\Rightarrow\mathbf{y}$ , where $\mathbf{y}=\sum_{S\in\Lambda}\mathbf{y}_{S,\mathbf{x}(S)}$ . Then for each $S\in\Lambda$ , $\mathbf{y}(S)\geq\mathbf{x}(S)$ , so $\mathbf{y}\geqq\mathbf{x}$ .

Since all executions from $\mathbf{i}$ are finite, all executions from $\mathbf{y}$ are finite. By additivity, any sequence of reactions applicable to $\mathbf{x}$ is also applicable to $\mathbf{y}$ . Thus all executions from $\mathbf{x}\leqq\mathbf{y}$ must be finite as well, i.e., $\mathcal{C}$ is entirely execution bounded since $\mathbf{x}$ is an arbitrary configuration.

Section 6.3 lets us replace “entirely execution bounded” in Section 6.3 with “leaderless and execution bounded”:

Corollary 6.24.

If a noncollapsing, all-voting, leaderless, execution bounded CRD stably decides a predicate $\phi$ , then $\phi$ is eventually constant.

In particular, since the original model of population protocols [angluin2004computation] defined them as leaderless and all-voting—and since population protocols are noncollapsing—we have the following.

Corollary 6.25.

If an execution bounded population protocol stably decides a predicate $\phi$ , then $\phi$ is eventually constant.

{toappendix}

6.4 Feedforward CRNs

We show that another common constraint, feedforwardness, significantly reduces computational power, making it impossible to decide even simple mod and threshold sets.

Definition 6.26.

A CRN is reaction-feedforward if reactions can be ordered $r_{1},r_{2},\ldots,r_{n}$ such that, for all $k<\ell$ , no reactant of $r_{k}$ appears in $r_{\ell}$ (as either reactant or product).

Reaction-feedforward CRNs are significant in the sense that many continuous real-valued CRNs computing numerical-valued functions (where the count of some species $Y$ is interpreted as the output, e.g., $2X\to Y$ computes $f(x)=\lfloor x/2\rfloor$ ) can be computed by reaction-feedforward CRNs [chen2023rate].⁶⁶6 The definition of feedforward in reference [chen2023rate] is different from the definition given here, being based on an ordering of species rather than reactions. However, it is straightforward to verify by inspection that the CRNs given for the positive results of [chen2023rate] are reaction-feedforward according to Definition 6.26. Compared to general CRNs, reaction-feedforward CRNs are easy to analyze and prove correctness. One reason is that, if a reaction-feedforward CRN can reach terminal configuration from $\mathbf{x}$ at all, then it is execution bounded from $\mathbf{x}$ .

There is a similar definition, called simply feedforward in [chen2023rate], based on ordering of species rather than reactions. We use the term species-feedforward to avoid confusion with Definition 6.26. We say a reaction $(\mathbf{r},\mathbf{p})$ produces a species $S$ if $\mathbf{p}(S)>\mathbf{r}(S)$ , and it consumes $S$ if $\mathbf{r}(S)>\mathbf{p}(S)$ .

Definition 6.27.

A CRN is species-feedforward if species can be ordered $S_{1},S_{2},\ldots,S_{n}$ such that every reaction producing a species $S_{\ell}$ consumes a earlier species $S_{k}$ where $k<\ell$ .

Although the term “linear potential function” was not used in [chen2023rate], it is shown in [chen2023rate, Lemma 4.8] that species-feedforward CRNs have a linear potential function (assigning weight $\frac{1}{K^{i}}$ to species $S_{i}$ for a suitably large constant $K$ ), thus are entirely execution bounded. The same is not always true of reaction-feedforward CRNs, for example $X\mathop{\rightarrow}\limits 2X$ is reaction-feedforward but not execution bounded. However, we can use similar techniques to proofs used for so-called noncompetitive CRNs in [vasic2022programming] to show “reasonable” reaction-feedforward CRNs are execution bounded.

Lemma 6.28.

Suppose in a reaction-feedforward CRN that $\mathbf{i}\Rightarrow\mathbf{c}$ by execution $P$ , and $\mathbf{i}\Rightarrow\mathbf{d}$ by execution $Q$ . If any reaction occurs less in $P$ than $Q$ , then $\mathbf{c}$ is not terminal.

Proof 6.29.

Here we equivalently think of an execution from $\mathbf{i}$ as a sequence of reactions, since from those and $\mathbf{i}$ we can deduce the configurations in the execution. Define $\#(r_{k},P)$ as the number of times reaction $r_{k}$ occurs in the execution $P$ . Let $r_{k}$ be the first reaction in the reaction-feedforward order such that $\#(r_{k},P)<\#(r_{k},Q)$ . Assume, for brevity of explanation, that $r_{k}$ has only one reactant, denoted $A$ ; the argument below, however, is general and applies to any number of reactants in $r_{k}$ .

By the definition of a reaction-feedforward CRN, the reactions $r_{k+1}$ through $r_{n}$ do not affect the count of $A$ . Further, reactions $r_{1}$ through $r_{k-1}$ can only produce $A$ and not consume it, reactions $r_{1}$ through $r_{k}$ can increase the count of $A$ , and among them, only $r_{k}$ can decrease it. Let $m=\#(r_{k},P)$ . Let $Q^{\prime}$ represent the prefix sequence $(\mathbf{i},\mathbf{x}_{1},\ldots,\mathbf{x}_{p})$ of $Q$ where the transition $\mathbf{x}_{p}\Rightarrow\mathbf{x}_{p+1}$ corresponds to the $(m+1)$ st execution of reaction $r_{k}$ . The configuration $\mathbf{x}_{p}$ is thus the configuration just before $r_{k}$ occurs more in $Q$ than in $P$ .

Note that reactions $r_{1}$ through $r_{k-1}$ occur at least as often in $P$ as in $Q$ (i.e. $\#(r_{i},P)\geq\#(r_{i},Q)$ for $i=1$ to $k-1$ ). Therefore, they occur at least as often in $P$ as in $Q^{\prime}$ , since $Q^{\prime}$ is a prefix of $Q$ . Moreover, by our choice of $Q^{\prime}$ , $\#(r_{k},P)=\#(r_{k},Q^{\prime})$ . So $A$ is present in $\mathbf{c}$ , i.e. $\mathbf{c}(A)>0$ . Thus, $r_{k}$ is applicable at $\mathbf{c}$ , so $\mathbf{c}$ is not terminal.

The following corollary implies that any reaction-feedforward CRN that can reach a terminal configuration from $\mathbf{i}$ is execution bounded from $\mathbf{i}$ .

Corollary 6.30.

In a reaction-feedforward CRN $\mathcal{C}$ , if there is a terminal configuration $\mathbf{c}_{\mathbf{i}}$ reachable from initial configuration $\mathbf{i}$ , then $\mathbf{c}_{\mathbf{i}}$ is reached by every sufficiently long execution from $\mathbf{i}$ . Furthermore, all of these executions are permutations of the same number of each reaction type. In particular, $\mathcal{C}$ is execution bounded from $\mathbf{i}$ .

Proof 6.31.

Let $P$ be the execution leading from $\mathbf{i}$ to $\mathbf{c}_{\mathbf{i}}$ . Consider any execution $Q$ with $|Q|>|P|$ . By the pigeonhole principle, $Q$ must involve more occurrences of some reaction $r$ than $P$ does. By Lemma 6.28, this would imply that $\mathbf{c}_{\mathbf{i}}$ is not terminal, which contradicts the premise that $\mathbf{c}_{\mathbf{i}}$ is terminal. Therefore, no execution $Q$ can be longer than $P$ . Consider any execution $Q$ where $|Q|=|P|$ . $Q$ must be a permutation of $P$ , as any deviation resulting in more of any reaction would, by the pigeonhole principle, lead to a contradiction of the terminality of $\mathbf{c}_{\mathbf{i}}$ . To address the possibility of a shorter terminal execution, consider any execution $Q$ with $|Q|<|P|$ . There must be some reaction $r$ occurring more frequently in $P$ than in $Q$ , and by Lemma 6.28, $Q$ cannot reach a terminal configuration.

As noted, in the model of continuous CRNs, it is known that all the functions that can be stably computed (the continuous, piecewise linear functions) can be stably computed by reaction-feedforward CRNs [chen2023rate]. In contrast, with discrete CRNs computing predicates, we show that reaction-feedforward CRNs cannot stably decide all semilinear sets by giving two counterexamples, showing that reaction-feedforward CRDs can decide neither “most” mod sets (6.31) nor “most” threshold sets (6.32). Specifically, we chose the parity and majority predicate as our counterexamples, although the techniques generalize to more complex mod and threshold sets, e.g., $[X_{1}+2X_{2}\equiv 3\mod 5?]$ .

{lemmarep}

Reaction-feedforward CRDs can’t stably decide the parity predicate $[X\equiv 1\mod 2?]$ .

Proof 6.32.

We show that in any possible construction, the input species must be a reactant of two distinct reactions. By letting the CRN stabilize and then introducing another input molecule, there must exist a set of rules inverting the output in either way, consisting of at least two reactions with $X$ as reactant, breaking the reaction-feedforward condition.

Consider the set of even numbers. A simple, non-reaction-feedforward CRD that decides parity is:

	$\displaystyle Y+X\rightarrow N$
	$\displaystyle N+X\rightarrow Y$

where $X$ is the input species, $Y$ is a yes voter, and $N$ is a no voter, initialized with $1Y$ and $nX$ . In either way to order these reactions, a reactant of the first reaction appears in the second reaction. Thus, the CRN is not reaction-feedforward.

To show that no such CRN could decide parity, we show that any construction requires us to have at least one reactant reappear in a later reaction, or even stronger: at least one species must be a reactant of two distinct reactions. Specifically, this is true for the input species $X$ .

To motivate the choice of species, let’s consider an even simpler parity computing CRD.

	$\displaystyle X+X\rightarrow Y$
	$\displaystyle Y+X\rightarrow X$

where $X$ is both input and votes no, $Y$ votes yes, initialized with $1Y$ and $nX$ . Only the input species appears twice as a reactant. Intuitively, this is true for all CRDs because we expect the input to be able to change our answer in either way, reversing the previous one.

Suppose for the sake of contradiction that there is a reaction-feedforward CRD $\mathcal{C}$ which stably decides whether the initial number $n=\#X$ of input $X$ is even. We withhold two copies of $X$ and let $\mathcal{C}$ stabilize on the correct output of yes. Denote $\Upsilon$ as the set of yes voters. We denote the no voters with $\overline{\Upsilon}\triangleq\Lambda\backslash\Upsilon$ . Only species contained in $\Upsilon$ are present in the stable, correct output configuration.

Now, we release one of the remaining copies of $X$ . We first run the chain reaction (if any) starting from only one $X$ . Let $\Omega_{X}:=\{S\mid\exists\mathbf{x}\in\mathsf{reach}(\{1X\}):\mathbf{x}(S)>0\}$ be the set of species producible from $\{1X\}$ (e.g., if there is no reaction $X\mathop{\rightarrow}\limits...$ , then $\Omega_{X}$ is just $\{X\}$ ). Without loss of generality, we assume $\Omega_{X}\subseteq\Upsilon$ , that is, all of $X$ ’s direct products are yes-voters (if not, exchange $\Upsilon$ and $\overline{\Upsilon}$ in what follows). To correct the answer ( $n+1\not\equiv n\bmod 2$ ), $\mathcal{C}$ must consume all species currently present and produce at least one copy of a species in $\overline{\Upsilon}$ . It follows that for all $X\in\Omega_{X}$ , $\mathcal{C}$ contains a reaction with $X$ as a reactant. Further, none of these reactions contain a reactant of $\overline{\Upsilon}$ , since none are present in the current configuration.

Finally, we release the last remaining copy of $X$ . Again, we produce the set $\Omega_{X}$ from $X$ . To invert the vote again, we must consume all $Y\in\overline{\Upsilon}$ and produce at least one member of $\Upsilon$ . The reaction(s) consuming $\overline{\Upsilon}$ must have a member of $\Omega_{X}$ as a reactant since the configuration is stable without $\Omega_{X}$ . Further, the reaction cannot be any of the ones from before since they contain a member of $\overline{\Upsilon}$ as reactant.

Since there are least two reactions sharing a common species as reactant, the reactions cannot be ordered such that no reactant of the first of these reactions appears in the latter one. This makes $\mathcal{C}$ non-reaction-feedforward, contradicting our initial assumption.

{lemmarep}

Reaction-feedforward CRDs can’t stably decide the majority predicate $[X_{1}\geq X_{2}?]$ .

Proof 6.33.

Suppose, for the sake of contradiction, there exists a reaction-feedforward CRD $\mathcal{C}$ which stably decides the predicate. We let $\mathcal{C}$ stabilize on input $\{nX_{1},nX_{2}\}$ (yielding output yes), while withholding two copies of $X_{2}$ . We release one $X_{2}$ . Again, we consider the full set of species a single $X_{2}$ could produce before reacting with other molecules (denoted $\Omega_{X_{2}}$ ). Without loss of generality, we consider all of them yes voters i.e. $\Omega_{X_{2}}\subseteq\Upsilon$ . The correct output now changes to no, and all yes voters must be consumed by reactions that only have reactants which are yes voters and further, these reactions contain all species in $\Omega_{X_{2}}$ as reactants.

Once the vote has reversed and stabilized, it contains only species of $\overline{\Upsilon}\triangleq\Lambda\backslash\Upsilon$ . We release the last $X_{2}$ and let it produce $\Omega_{X_{2}}$ . Since $\Omega_{X_{2}}\subseteq\Upsilon$ i.e. all elements are yes voters, but the correct vote is still no, all $X\in\Omega_{X_{2}}$ must be consumed again. This time, they must be consumed in reactions involving no voters, must be distinct reactions from those in the previous step. Thus, all species $X\in\Omega_{X_{2}}$ appear at least twice as a reactant, breaking the reaction-feedforward condition.

7 Conclusion

\opt

full We explored the computational capabilities of execution bounded Chemical Reaction Networks (CRNs), which terminate after a finite number of reactions. This constraint aligns the model with practical scenarios where fuel supply is limited.

Our findings illustrate that the computational power of these CRNs varies significantly based on structural choices. Specifically, CRNs with an initial leader and the ability to allow only the leader to vote can stably compute all semilinear predicates and functions in $O(\|x\|\log\|x\|)$ parallel time. Without an initial leader, and requiring all species to vote, these networks are limited to computing eventually constant predicates. This limitation holds considerable weight for decentralized systems modeled by population protocols, which inherently exhibit these traits. Additionally, we introduced a new characterization of execution bounded networks through a nonnegative linear potential function, providing a novel theoretical tool for analyzing the physical constraints CRNs.

A key question remains open: Can execution bounded CRNs compute semilinear functions and predicates within polylogarithmic time? Angluin, Aspnes and Eisenstat introduced a fast population protocol that simulates a register machine with high probability. This protocol can perform standard operations like comparison, addition, subtraction, and multiplication and division by constants in $O(\log^{5}n)$ time [AngluinAE2008Fast]. Chen, Doty and Soloveichik applied this construction to CRNs in [Chen2012DeterministicFunction], showing that semilinear functions can be computed by CRNs without error in expected polylogarithmic time in the kinetic model. Central to their success in both cases is the “phase clock”, which generates a clock signal to indicate the probable completion of an epidemic style chain reaction and orders more recent instructions to overwrite older ones. This clock is inherently unbounded in its execution, cycling through $m$ stages.

References

[1] Dana Angluin, James Aspnes, Zoë Diamadi, Michael J Fischer, and René Peralta. Computation in networks of passively mobile finite-state sensors. In PODC 2004: Proceedings of the twenty-third annual ACM symposium on Principles of distributed computing, pages 290–299, 2004.
[2] Dana Angluin, James Aspnes, and David Eisenstat. Fast computation by population protocols with a leader. Distributed Computing, 21(3):183–199, September 2008.
[3] Dana Angluin, James Aspnes, David Eisenstat, and Eric Ruppert. The computational power of population protocols, 2006.
[4] Ho-Lin Chen, David Doty, Wyatt Reeves, and David Soloveichik. Rate-independent computation in continuous chemical reaction networks. Journal of the ACM, 70(3), May 2023.
[5] Ho-Lin Chen, David Doty, and David Soloveichik. Deterministic function computation with chemical reaction networks. Natural Computing, 13(4):517–534, 2014. Preliminary version appeared in DNA 2012.
[6] David Doty and Monir Hajiaghayi. Leaderless deterministic chemical reaction networks. Natural Computing, 14(2):213–223, 2015. Preliminary version appeared in DNA 2013.
[7] Julius Farkas. Theorie der einfachen ungleichungen. Journal für die reine und angewandte Mathematik (Crelles Journal), 1902(124):1–27, 1902.
[8] David Gale. The theory of linear economic models. University of Chicago press, 1960.
[9] Daniel T. Gillespie. Exact stochastic simulation of coupled chemical reactions. Journal of Physical Chemistry, 81(25):2340–2361, 1977.
[10] S. Ginsburg and E. H. Spanier. Semigroups, Presburger formulas, and languages. Pacific Journal of Mathematics, 16(2):285–296, 1966.
[11] Ryuichi Ito. Every semilinear set is a finite union of disjoint linear sets. Journal of Computer and System Sciences, 3(2):221–231, 1969.
[12] Richard M Karp and Raymond E Miller. Parallel program schemata. Journal of Computer and system Sciences, 3(2):147–195, 1969.
[13] Olvi L Mangasarian. Nonlinear programming. SIAM, 1994.
[14] Christos H Papadimitriou. On the complexity of integer programming. Journal of the ACM (JACM), 28(4):765–768, 1981.
[15] Charles Rackoff. The covering and boundedness problems for vector addition systems. Theoretical Computer Science, 6(2):223–231, 1978.
[16] David Soloveichik, Matthew Cook, Erik Winfree, and Jehoshua Bruck. Computation with finite stochastic chemical reaction networks. Natural Computing, 7(4):615–633, 2008.
[17] Marko Vasić, Cameron Chalk, Austin Luchsinger, Sarfraz Khurshid, and David Soloveichik. Programming and training rate-independent chemical reaction networks. Proceedings of the National Academy of Sciences, 119(24):e2111552119, 2022.