Decidable models of integer-manipulating programs with recursive parallelism (technical report)

Matthew Hague and Anthony Widjaja Lin

Royal Holloway, University of London, UK and Yale-NUS College, Singapore

Abstract: We study safety verification for multithreaded programs with recursive parallelism (i.e. unbounded thread creation and recursion) as well as unbounded integer variables. Since the threads in each program configuration are structured in a hierarchical fashion, our model is state-extended ground-tree rewrite systems equipped with shared unbounded integer counters that can be incremented, decremented, and compared against an integer constant. Since the model is Turing-complete, we propose a decidable underapproximation. First, using a restriction similar to context-bounding, we underapproximate the global control by a weak global control (i.e. DAGs possibly with self-loops), thereby limiting the number of synchronisations between different threads. Second, we bound the number of reversals between non-decrementing and non-incrementing modes of the counters. Under this restriction, we show that reachability becomes NP-complete. In fact, it is poly-time reducible to satisfaction over existential Presburger formulas, which allows one to tap into highly optimised SMT solvers. Our decidable approximation strictly generalises known decidable models including (i) weakly-synchronised ground-tree rewrite systems, and (ii) synchronisation/reversal-bounded concurrent pushdown systems systems with counters. Finally, we show that, when equipped with reversal-bounded counters, relaxing the weak control restriction by the notion of senescence results in undecidability.

1 Introduction

Verification of multithreaded programs is well-known to be a challenging problem. One approach that has proven effective in addressing the problem is to bound the number of context switches [38, 36]. [Recall that a context switch occurs when the CPU switches from executing one thread to executing a different thread.] When the number of context switches is fixed, one may adopt pushdown systems as a model of a single thread and show that reachability for the concurrent extension of the abstraction (i.e. multi-pushdown systems) is NP-complete [38]. This result has paved the way for an efficient use of highly optimised SMT solvers in verifying concurrent programs (e.g. see [24, 1, 19]). Note that without bounding the number of context switches the model is undecidable [37].

In the past decade the work of Qadeer and Rehof [38] has spawned a lot of research in underapproximation techniques for verifying multithreaded programs, e.g., see [24, 1, 19, 40, 5, 28, 35, 31, 22, 42, 4, 2, 6, 20, 33, 27, 42, 14] among many others. Other than unbounded recursions, some of these results simultaneously address other sources of infinity, e.g., unbounded thread creation [31, 22, 5], unbounded integer variables [24], and unbounded FIFO queues [1, 2].

Contributions.

In this paper we generalise existing underapproximation techniques [23, 31] so as to handle both shared unbounded integer variables and recursive parallelism (unbounded thread creation and unbounded recursions). The paper also provides a cleaner proof of the result in [24]: an NP upper bound for synchronisation/reversal-bounded reachability analysis of concurrent pushdown systems with counters. We describe the details below.

We adopt state-extended ground-tree rewrite systems (sGTRS) [31] as a model for multithreaded programs with recursive parallelism (e.g. programming constructs including fork/join, parbegin/parend, and Parallel.For). Ground-tree rewrite systems (GTRS) are known (see [21]) to strictly subsume other well-known sequential and concurrent models like pushdown systems [11], PA-processes [18], and PAD-processes [34], which are known to be suitable for analysing concurrent programs. [One may think of GTRS as an extension of PA and PAD processes with return values to parent threads [21].] We then equip sGTRS with unbounded integer counters that can be incremented, decremented, and compared against an integer constant.

Since our model is Turing-powerful, we provide an underapproximation of the model for which safety verification becomes decidable. First, we underapproximate the global control by a weak global control [26, 31] (i.e. DAGs possibly with self-loops), thereby limiting the number of synchronisations between different threads. To this end, we may simply unfold the underlying control-state graph of the sGTRS (see Section 3) in the standard way, while preserving self-loops. This type of underapproximation is similar to loop acceleration in the symbolic acceleration framework of [8]. Second, we bound the number of reversals between non-decrementing and non-incrementing modes of the counters [25]. Under these two restrictions, reachability is shown to be NP-complete; in fact, it is poly-time reducible to satisfaction over existential Presburger formulas, which allows one to tap into highly optimised SMT solvers. Our result strictly generalises the decidability (in fact, NP-completeness) of reachability for (i) weakly-synchronised ground-tree rewrite systems [31, 41], and (ii) synchronisation/reversal-bounded concurrent pushdown systems with counters [24].

Finally, we show one negative result that delineates the boundary of decidability. If we relax the weak control underapproximation by the notion of senescence (with age restrictions associated with nodes in the trees) [22], then the resulting model becomes undecidable.

Related Work.

Recursively-parallel program analysis was analysed in detail by Bouajjani and Emmi [10]. However, in contrast to our systems, their model does not allow processes to communicate during execution. Instead, processes hold handles to other processes which allow them to wait on the completion of others, and obtain the return value. They show that when handles can be passed to child processes (during creation) then the state reachability problem is undecidable. When handles may only be returned from a child to its parent, state reachability is decidable, with the complexity depending on which of a number of restrictions are imposed.

The work of Bouajjani and Emmi is closely related to branching vector addition systems [43] which can model a stack of counter values which can be incremented and decremented (if they remain non-negative), but not tested. While it is currently unknown whether reachability of a configuration is decidable, control-state reachability and boundedness are both 2ExpTime-complete [17].

Another variant of vector addition systems with recursion are pushdown vector addition systems, where a single (sequential) stack and several global counters are permitted. As before, these counters can be incremented and decremented, but not compared with a value. Reachability of a configuration, and control-state reachability in these models remain open problems, but termination (all paths are finite) and boundedness are known to be decidable [30]. For reachability of a configuration, an under-approximation algorithm is proposed by Atig and Ganty where the stack behaviour is approximated by a finite index context-free language [7].

Lang and Löding study boundedness problems over sequential pushdown systems [29]. In this model, the pushdown system is equipped with a counter that can be incremented, reset, or recorded. Their model differs from ours first in the restriction to sequential systems, and second because the counter cannot effect execution or be decremented: it is a recording of resource usage. These kind of cost functions have also been considered over static trees [13, 9], however, to our knowledge, they have not been studied over tree rewrite systems.

2 Preliminaries

We write ℕ to denote the set of natural numbers and ℤ the set of integers.

Trees

A ranked alphabet is a finite set of characters Σ together with a rank function ρ : Σ ↦ ℕ. A tree domain D ⊂ ℕ^∗ is a non-empty finite subset of ℕ^∗ that is both prefix-closed and younger-sibling-closed. That is, if η i ∈ D, then we also have η ∈ D and, for all 1 ≤ j ≤ i, η j ∈ D (respectively). A tree over a ranked alphabet Σ is a pair t = (D, λ) where D is a tree domain and λ : D ↦ Σ such that for all η ∈ D, if λ(η) = a and ρ(a) = n then η has exactly n children (i.e. η n ∈ D and η (n + 1) ∉ D). Let T_Σ denote the set of trees over Σ.

Context Trees

A context tree over the alphabet Σ with a set of context variables x₁, …, x_n is a tree C = (D, λ) over Σ ⊎ {x₁, …, x_n} such that for each 1 ≤ i ≤ n we have ρ(x_i) = 0 and there exists a unique context node η_i such that λ(η_i) = x_i. By unique, we mean η_i ≠ η_j for all i ≠ j. We will denote such a tree C[x₁, …, x_n]. Given trees t_i = (D_i, λ_i) for each 1 ≤ i ≤ n, we denote by C[t₁, …, t_n] the tree t′ obtained by filling each variable x_i with t_i. That is, t′ = (D′, λ′) where

D′ = D ⋃ η₁ · D₁ ⋃ ⋯ ⋃ η_n · D_n and λ′

⎛
⎝

⎞
⎠

⎧
⎪
⎨
⎪
⎩

⎛
⎝

⎞
⎠

if η ∈ D ∧ ∀ i . η ≠ η_i

λ_i

⎛
⎝

η′

⎞
⎠

if η = η_i η′ .

Tree Automata

A bottom-up non-deterministic tree automaton (NTA) over a ranked alphabet Σ is a tuple T = (Q, Δ, F) where Q is a finite set of states, F ⊆ Q is a set of final (accepting) states, and Δ is a finite set of rules of the form (q₁, …, q_n) –[a]→ q where q₁, …, q_n, q ∈ Q, a ∈ Σ and ρ(a) = n. A run of T on a tree t = (D, λ) is a mapping π : D ↦ Q such that for all η ∈ D labelled λ(η) = a with ρ(a) = n we have (π(η 1), …, π(η n)) –[a]→ π(η). It is accepting if π(ε) ∈ F. The language defined by a tree automaton T over alphabet Σ is a set L(T) ⊆ T_Σ of trees over which there exists an accepting run of T.

Parikh images

Given an alphabet Σ = {γ₁,…,γ_n} and a word w ∈ Σ^*, we write P(w) to denote a mapping ρ: Σ → ℕ, where ρ(a) is defined to be the number of occurrences of a in w. Given a language L ⊆ Σ^*, we write P(L) to denote the set {P(w) | w ∈ L}. We say that P(L) is the Parikh image of L.

Presburger Arithmetic

Presburger formulas are first-order formulas over integers with addition. Here, we use existential Presburger formulas ϕ(x,y) := ∃ x ϕ, where (i) x and y are sets of variables, and (ii) ϕ is a boolean combination of expressions ∑_i=1^m a_iz_i ∼ b for variables z₁,…,z_m ∈ x ∪ y, constants a₁,…,a_m,b∈ ℤ, and ∼ ∈ {≤,≥,<,>,=} with constants represented in binary. A solution to ϕ is a valuation b: y ↦ ℤ to y such that ϕ(x,b) is true. The formula ϕ is satisfiable if it has a solution. Satisfiability of existential Presburger formulas is known to be NP-complete [39].

3 Formal Models

In this section, we will define our formal models, which are based on ground-tree rewrite systems. Ground-tree rewrite systems (GTRSs) [15] permit subtree rewriting where rules are given as a pair of ground-trees. In the sequel, we use the extension proposed by Löding [32] where NTA (instead of ground trees) appear in the rewrite rules. Hence, a single rule may correspond to an infinite number of concrete rules (i.e. containing concrete trees).

Ground Tree Rewrite Systems with State and Reversal Bounded Counters.

To capture synchronisations between different subthreads, we follow [31, 26, 41] and extend GTRS with state (a.k.a. global control). The resulting model is denoted by sGTRS (state-extended GTRS). To capture integer variables, we further extend the model with unbounded integer counters, which can be incremented, decremented, and compared against an integer constant. Since Minsky’s machines can easily be encoded in such a model, we apply a standard underapproximation technique: reversal-bounded analysis of the counters [23, 25]. This means that one only analyses executions of the machines whose number of reversals between nondecrementing and nonincrementing modes of the counters is bounded by a given constant r ∈ ℕ (represented in unary). The resulting model will be denoted by rbGTRS. We will now define this model in more detail.

An atomic counter constraint on counter variables C = {c₁,…,c_k} is an expression of the form c_i ∼ v, where v ∈ ℤ and ∼ ∈ {<,≤,=,≥,>}. A counter constraint θ on C is a boolean combination of atomic counter constraints on C. Given a valuation ν : C ↦ ℤ to the counter variables, we can determine whether θ[ν] is true or false by replacing a variable c by ν(c) and evaluating the resulting boolean expressions in the obvious way. Let Cons_C denote the set of all counter constraints on C. Intuitively, these formulas will act as guards to determine whether certain transitions can be fired. Given two counter valuations ν and µ we define ν + µ as the pointwise addition of the valuations. That is, (ν + µ)(c) = ν(c) + µ(c).

Given a sequence of counter values, a reversal occurs when a counter switches from being incremented to being decremented or vice-versa. For example, if the values of a counter c along a run are 1,1,1,2,3,4,4,4,3,2,2,3, then the number of reversals of c is 2 (reversals occur in between the overlined positions). A sequence of valuations is reversal-bounded whenever the number of reversals is the sequence is bounded.

Definition 1 (r-Reversal-Bounded) For a counter c from a set of counters C, a sequence ν₁, …, ν_n of counter valuations over C is r-reversal-bounded for c whenever we can partition ν₁, …, ν_n into (r+1) sequences A₁, …, A_r+1 (with ν₀, …, ν_n = A₁, …, A_r+1) such that for all 1 ≤ i ≤ r there is some ∼ ∈ {≤, ≥} such that for all ν_j, ν_j+1 appearing together in A_i, we have ν_j(c) ∼_cν_j+1(c).

We define sGTRS with reversal-bounded counters.

Definition 2 (sGTRSs with r-Reversal-Bounded Counters (rbGTRS)) A state-extended ground tree rewrite system with r-reversal-bounded counters (rbGTRS) is a tuple G = (P, Σ, Γ, R, C, r) where P is a finite set of control-states, Σ is a finite ranked alphabet, Γ is a finite alphabet of output symbols (i.e. transition labels), C is a finite set of counters, R is a finite set of rules of the form (p₁, T₁, θ) –[γ]→ (p₂, T₂, µ) where p₁, p₂ ∈ P, γ ∈ Γ, θ ∈ Cons_C, µ ∈ C ↦ ℤ, and T₁, T₂ are NTAs over Σ.

In the sequel, we will omit mention of the number r in the tuple G if it is clear from the context.

A configuration of an sGTRS with counters is a tuple α = (p, t, ν) where p is a control-state, t a tree, and ν a valuation of the counters. We have a transition (p₁, t₁, ν₁) –[γ]→ (p₂, t₂, ν₂) whenever there is a rule (p₁, T₁, θ) –[γ]→ (p₂, T₂, µ) ∈ R such that: (i) (dynamics of counters) θ[ν₁] is true and ν₂ = ν₁ + µ, and (ii) (dynamics of trees) t₁ = C[t′₁] for some context C and tree t′₁ ∈ L(T₁) and t₂ = C[t′₂] for some tree t′₂ ∈ L(T₂). A run π over γ₁…γ_n−1 is a sequence

⎛
⎝

p₁, t₁, ν₁

⎞
⎠

–[γ₁]→ ⋯ –[γ_n−1]→

⎛
⎝

p_n, t_n, ν_n

⎞
⎠

such that for all 1 ≤ i < n we have (p_i, t_i, ν_i) –[γ_i]→ (p_i+1, t_i+1, ν_i+1) is a transition of G and for each c ∈ C the sequence ν₁, …, ν_n is r-reversal-bounded for c. We say that γ₁…γ_n−1 is the output string of π. We write (p, t, ν) –[γ₁…γ_n]→ (p′, t′, ν′) (or simply (p, t, ν) →^* (p′, t′, ν′)) whenever there is a run from (p, t, ν) to (p′, t′, ν′) over γ₁…γ_n. Let ε denote the empty output symbol.

Whenever we wish to discuss sGTRSs without counters, we simply omit the counter components. That is, we have configurations of the form (p, t) and transitions of the form (p₁, T₁) –[γ]→ (p₂, T₂). The standard notion of GTRS (i.e. not state-extended) [32] is simply sGTRS without counters with only one state.

We next define the problems of (global) reachability. To this end, we use a tree automaton T (resp. an existential Presburger formula ϕ) to represent the tree (resp. counter) component of a configuration. More precisely, a symbolic config-set of an rbGTRS G = (P, Σ, Γ, R, C, r) is a tuple (p, T, ϕ), where p ∈ P, T is an NTA over Σ, and ϕ(x) is an existential Presburger formula with free variables x = {x_c}_{c ∈ C} (i.e. one free variable for each counter). Each symbolic config-set (c, T, ϕ) represents a set of configurations of G defined as follows: [[(p, T, ϕ) ]] := { (p, t, ν) : t ∈ L(T), ϕ(ν) is true }. Global Reachability

Input: an rbGTRS G and two symbolic config-sets (p₁, T₁, ϕ₁) (p₂, T₂, ϕ₂)

Problem: Decide whether (p₁, t₁, ν₁) →^* (p₂, t₂, ν₂), for some (p₁, t₁, ν₁) ∈ [[(p₁, T₁, ϕ₁) ]] and (p₂, t₂, ν₂) ∈ [[(p₂, T₂, ϕ₂) ]] The problem of control-state reachability can be defined by restricting (i) the tree automata T₁ and T₂ to accept, respectively, a singleton tree and the set of all trees, and (ii) the solutions to the formulas ϕ₁ and ϕ₂ are, respectively, {ν₀} (where ν₀ is the valuation assigning 0 to all counters) and the set of all counter valuations.

Remark 1 When we measure the complexity of reachability for rbGTRS, the number r of reversals is represented in unary, while the numbers in counter constraints and valuations are represented in binary. This is consistent with the standard representation of numbers in previous work on reversal-bounded counter machines (e.g. see [23, 24]). The unary representation for r can be justified by the fact that bugs can often be discovered within a small number of reversals.

Weakly Synchronised Ground Tree Rewrite Systems

The control-state and global reachability problems for sGTRS are known to be undecidable [12, 21]. The problems become NP-complete for weakly-synchronised sGTRS [31, 41], where the underlying control-state graph (where there is an edge between p₁ and p₂ whenever there is a transition (p₁, T₁) –[γ]→ (p₂, T₂)) may only have cycles of length 1 (i.e. self-loops), i.e., a DAG (directed acyclic graph) possibly with self-loops. Underapproximation by a weak control is akin to loop acceleration in the symbolic acceleration framework of [8]. We extend the definition to rbGTRSs. The original definition can be easily obtained by omitting the counter components.

We define the underlying control graph of an rbGTRS G = (P, Σ, Γ, R, C) as a tuple (P, Δ) where Δ = {(p₁, p₂) | (p₁, T₁, θ) –[γ]→ (p₁, T₂, µ) ∈ R} .

Definition 3 (Weakly-Synchronised rbGTRS) An rbGTRS is weakly synchronised if its underlying control graph (P, Δ) is a DAG possibly with self-loops.

4 Decidability

In this section we will prove the main result of the paper:

Theorem 1 Global reachability for weakly synchronised rbGTRS is NP-com-
plete. In fact, it is poly-time reducible to satisfiability over existential Presburger formulas.

To prove this theorem, we fix notation for the input to the problem: an rbGTRS G = (P, Σ, Γ, R, C, r) and two symbolic config-sets (p₁, T₁, ϕ₁), (p₂, T₂, ϕ₂) of G. Let C = {c_i}_i=1^k. The gist of the proof is as follows. From G, we construct a new sGTRS G′ (without counters) by encoding the dynamics of the counters in the output symbols of G′. Of course, G′ has no way of comparing the values of counters with constants. [In this sense, G′ only overapproximates the behavior of G.] To deal with this problem, we use the result of [31] to compute an existential Presburger formula ψ capturing the Parikh images of the set of all output strings of G′ from (p₁, T₁, ϕ₁) to (p₂, T₂, ϕ₂). The final formula is ψ ∧ ψ′, where ψ is a constraint asserting that the desired counter comparisons are performed throughout runs of G′. We sketch the details of the construction below.

Modes of the counters.

The first notion that is crucial in our proof is that of mode of a counter [23, 25], which is an abstraction of the values of a counter in a run of an rbGTRS containing three pieces of information: (i) the region of the counter value (i.e. how it compares to constants occurring in counter constraints), (ii) the number of reversals that has been performed by each counter (between 0 and r), and (iii) whether a counter is currently non-decrementing (↑) or non-incrementing (↓). A mode vector is simply a k-tuple of modes, one mode for each of the k counters. We now formalise these notions.

Let d₁ < … < d_m be the integer constants appearing in the counter constraints in G. This sequence of constants gives rise to the set REG of regions defined as REG := { A₀,…,A_m, B₁,…,B_m}, where B_i := {d_i} (where 1 ≤ i ≤ m), A_i := { n ∈ ℤ: d_i < n < d_i+1 } (where 1 ≤ i < m), A₀ := { n ∈ ℤ: n < d₁ }, and A_m := { n ∈ ℤ: n > d_m }. A mode is simply a tuple in REG × [0,r] × {↑,↓}. A mode vector is simply a tuple in Modes := REG^k × [0,r]^k × {↑,↓}^k.

Building the sGTRS G′.

We might be tempted to build G′ by first removing the counters from G and then embedding Modes into the control-states G′. This, however, causes two problems. First, the number of control-states becomes exponential in k. Second, the resulting system is no longer weakly synchronised even though G originally was weakly synchronised. To circumvent this problem, we adapt a technique from [23]. Every run π of G from (p₁, T₁, ϕ₁) to (p₂, T₂, ϕ₂) can be associated with a sequence σ of mode vectors recording the information (i)–(iii) for each counter. The crucial observation is that there are at most N_max := 2mk(r+1) different mode vectors in σ. This is because a counter can only go through at most 2m regions without incurring a reversal. For this reason, we may use the control-states of G′ to store the number of mode vectors that G has gone through, while the actual mode vector guessed by G′ will be made “visible” in the output strings of G′. That way, we can use an additional existential Presburger formula ψ′ (see below) to enforce that the run of G′ faithfully simulates runs of G. In addition, the shape of the control-states (DAG with self-loops) of G′ is preserved. [The product graph of two DAGs with self-loops is also a DAG with self-loops.] We detail the construction below.

Define the weakly-synchronised sGTRS G′ = (P′, Σ, Γ′, R′) as follows. Let P′ := P × [0,N_max]. The output alphabet Γ′ is defined as Γ × R × [0,N_max] × {0,1}, where the boolean flag is used to denote whether the transition taken changes the mode. We define R′ as follows. For each rule τ = (p, T, θ) –[γ]→ (p′, T′, µ) in R, we add the rule ((p,i), T) –[(γ,τ,i,0)]→ ((p′,i), T′) for each i ∈ [0,N_max], and ((p,i), T) –[(γ,τ,i,1)]→ ((p′,i+1), T′) for each i ∈ [0,N_max). Since G is weakly-synchronised and the mode counter never decreases, it follows that G′ is weakly-synchronised too. Note also that this construction can be performed in polynomial-time.

Constructing the formula ψ ∧ ψ′.

As we mentioned, ψ is an existential Presburger formula encoding the Parikh image P(L) of the set L of all output strings of G′ from ((p₁,0), T₁) to (S, T₂), where S = {p₂} × [0,N_max]. More precisely, the set z of free variables of ψ include z_a for each a ∈ Γ′. Furthermore, for each valuation µ ∈ z ↦ ℤ, it is the case that ψ(µ) is true iff µ ∈ P(L). Such a formula is known to be polynomial-time computable since G′ is a weakly-synchronised sGTRS [31].

Recall that ψ′ should assert that the desired counter comparisons are performed throughout runs of G′. To this end, the formula ψ′ will have extra variables for guessing the existence of a sequence of N_max distinct mode vectors through runs of G′. More precisely, the formula ψ′ is the conjunction

ϕ₁(x) ∧ ϕ₂(y) ∧ Dom(m₀,…,m_{N_max}) ∧ Init(m₀) ∧

GoodSeq(m₀,…,m_{N_max}) ∧ Respect(z,m₀,…,m_{N_max}) ∧ EndVal(x,y,z).

The set x consists of variables x_i (1 ≤ i ≤ k) which contain the initial value of the ith counter. Similarly, the set y consists of variables y_i (1 ≤ i ≤ k) which contain the final value of the ith counter. Each m_i denotes a set of variables for the ith mode vector defined as follows:

reg_jⁱ (for each j ∈ [1,k]) — to encode which of the 2m+1 possible regions the jth counter is in.
rev_jⁱ (for each j ∈ [1,k]) — to encode how many reversals have been used up by the jth counter.
arr_jⁱ (for each j ∈ [1,k]) — to encode whether the jth counter is non-incrementing or non-decrementing.

We detail each subformula below.

The subformula Dom asserts that each variable in m_i (for each i) has the right domain (i.e. range of integer values). More precisely, for each j ∈ [1,k], we add the conjuncts: (i) 0 ≤ reg_jⁱ ≤ 2m, (ii) 0 ≤ rev_jⁱ ≤ r, and (iii) 0 ≤ arr_jⁱ ≤ 1. For the first constraint, we use an even number of the form 2i to represent the region A_i, and an odd number 2i−1 to represent the region B_i. The last constraint simply encodes non-decrementing (↑) as 1, and non-incrementing (↓) as 0.

The subformula Init asserts that m₀ is an initial mode vector. More precisely, for each j ∈ [1,k], we add the conjuncts rev_j⁰ = 0.

The subformula GoodSeq asserts that m₀,…,m_{N_max} forms a valid sequence of mode vectors. More precisely, for each i ∈ [0,N_max) and each j ∈ [1,k], we add the conjuncts: (i) arr_jⁱ ≠ arr_jⁱ⁺¹ ⇒ rev_jⁱ⁺¹ = rev_jⁱ+1, (ii) arr_jⁱ = arr_jⁱ⁺¹ ⇒ rev_jⁱ⁺¹ = rev_jⁱ, (iii) reg_jⁱ < reg_jⁱ⁺¹ ⇒ arr_jⁱ⁺¹ = 1, and (iv) reg_jⁱ > reg_jⁱ⁺¹ ⇒ arr_jⁱ⁺¹ = 0. For example, the first constraint asserts that a change in the direction (non-incrementing or non-decrementing) of the counter incurs one reversal. The other constraints are similar.

The subformula Respect asserts that the Parikh image z of the run of G′ respects the sequence m₀,…,m_{N_max} of mode vectors. In effect, this subformula ensures that G′ faithfully simulates G. Firstly, we need to assert that the jth counter values at the start and at the end of the ith mode of G′ (which are encoded in z) are in the right regions reg_jⁱ. To state this more precisely, for each rule τ = (p, T, θ) –[γ]→ (p′, T′, µ) in R, we let µ_j(τ) denote the value µ(c_j). For each i ∈ [0,N_max] and j ∈ [1,k], we denote by the notation StartCounter_jⁱ the term x_j + ∑_s=0ⁱ⁻¹∑_(γ,τ,s,l) µ_j(τ) × z_(γ,τ,s,l), where γ, τ, and l, range over, respectively, Γ, R, and {0,1}. Similarly, we denote by EndCounter_jⁱ the term StartCounter_jⁱ + ∑_(γ,τ,i,0) µ_j(τ) × z_(γ,τ,i,0). We add the conjuncts: (i) reg_jⁱ = 2h ⇒ EndCounter_jⁱ ∈ A_h, for each h ∈ [0,m], and (ii) reg_jⁱ = 2h+1 ⇒ EndCounter_jⁱ ∈ B_h, for each h ∈ [0,m). [Note that formulas of the form g ∈ A, for a Presburger term g and a set S ∈ {A₀,…,A_m,B₁,…,B_m}, can be easily replaced by quantifier-free Presburger formulas, e.g., g ∈ A₀ stands for g < d₁.] To ensure that the initial condition is correct, for each j ∈ [1,k], we add the following conjuncts: (1) StartCounter_j⁰ ∈ A_h ⇒ reg_j⁰ = 2h, and (2) StartCounter_j⁰ ∈ B_h ⇒ reg_j⁰ = 2h+1. Secondly, we need to state that the transitions executed in each mode are valid (i.e. satisfy the counter constraints). More precisely, for each γ ∈ Γ, τ ∈ R, i ∈ [0,N_max], and l ∈ {0,1}, if θ is the counter constraint in τ, we add the conjunct z_(γ,τ,i,l) > 0 ⇒ θ(StartCounter₁ⁱ,…,StartCounter_kⁱ). Next we assert that, when the jth counter is non-incrementing (resp. non-decrementing), only non-negative (resp. non-positive) counter increments are permitted. More precisely, for each i ∈ [0,N_max], j ∈ [1,k], l ∈ {0,1}, and τ ∈ R, if µ_j(τ) > 0, then add the conjunct arr_jⁱ = 0 ⇒ z_(γ,τ,i,l) = 0; if µ_j(τ) < 0, then add the conjunct arr_jⁱ = 1 ⇒ z_(γ,τ,i,l) = 0.

Finally, the subformula EndVal simply asserts that, starting from the initial counter value x and following the transitions z, the end counter values are y. To this end, we can simply add the conjunct y_j = EndCounter_j^N_max for each j ∈ [1,k].

This concludes the formula construction. It is immediate that G′ faithfully simulates G iff ψ ∧ ψ′ is true. In addition, the formula construction runs in polynomial-time. Since satisfiability over existential Presburger formulas is NP-complete [39], the NP upper bound for Theorem 1 follows. NP-hardness already holds for the restricted model where the tree component is a stack [23].

5 Senescent Ground-Tree Rewrite Systems

A natural question arising from the result on weakly synchronised rbGTRS is whether the “weakly synchronised” restriction can be relaxed while maintaining decidability. It is known that allowing arbitrary underlying control-state graphs leads to undecidability of reachability even without reversal bounded counters. In this section we explore the notion of senescence [22], which is more general than the weakly synchronised restriction, but still permits a decidable reachability problem (without counters). After giving the formal definition of senescent GTRS, we show the following result.

Theorem 2 (Control-State Reachability of Senescent rbGTRS) The control-state reachability problem for senescent rbGTRS is undecidable.

5.1 Model Definition

Senescence allows the underlying control-state graph to have arbitrary cycles (instead of only self-loops). For sGTRS, control-state reachability is decidable under an “age restriction” that is imposed on the nodes that can be rewritten. That is, when the control-state changes, the nodes in the tree age by one timestep. Once a node reaches an a priori fixed age r, it becomes fixed (i.e. cannot be rewritten by further transitions in the run).

Figure 1: A transition changing the control-state.

Figure 2: A transition that does not change the control-state.

Figure 3: Transitions of a senescent GTRS.

Before the formal definition, two example transitions of a senescent rbGTRS are shown in Figure 3. A configuration is written as its control-state and counter values ((p, ν) or (p′, ν′)) with the tree appearing below. In the tree, the label of each node appears in the centre of the node. The ages of each node is depicted as a subscript on the right. Dotted lines are used to indicate the part of the tree rewritten by a rule. In Figure 1 the transition changes the control-state, causing the age of the nodes that are not rewritten to increase by 1. The rewritten nodes are given the age 0 as they are new, fresh, nodes. The situation when the control-state does not change is shown in Figure 2. In this case, the nodes that are not rewritten maintain the same age. The senescence restriction disallows runs where nodes older than a fixed age are rewritten.

More formally, given a run

⎛
⎝

p₁, t₁, ν₁

⎞
⎠

–[γ₁]→ ⋯ –[γ_n−1]→

⎛
⎝

p_n, t_n, ν_n

⎞
⎠

of an rbGTRS, let C₁, …, C_n−1 be the sequence of tree contexts used in the transitions from which the run was constructed. That is, for all 1 ≤ i < n, we have t_i = C_i[t_i^out] and t_i+1 = C_i[t_i+1ⁱⁿ] where (p_i, T_i, θ_i) –[γ_i]→ (p_i+1, T′_i, µ_i) was the rewrite rule used in the transition and t_i^out ∈ L(T_i), t_i+1ⁱⁿ ∈ L(T′_i) were the trees that were used in the tree update.

For a given position (p_i, t_i, ν_i) in the run and a given node η in the domain of t_i, the birthdate of the node is the largest 1 ≤ j ≤ i such that η is in the domain of C_j[t_jⁱⁿ] and η is in the domain of C_j[x] only if its label is x. The age of a node is the cardinality of the set {i′ | j ≤ i′ < i ∧ p_i′ ≠ p_i′+1}. That is, the age is the number of times the control-state changed between the jth and the ith configurations in the run.

A lifespan-restricted run with a lifespan of r is a run such that each transition (p_i, C_i[t_i^out], ν_i) –[γ_i]→ (p_i+1, C_i[t_i+1ⁱⁿ], ν_i+1) has the property that all nodes η in t_i^out have an age of at most r. That is, more precisely, that all nodes η in the domain of C_i[t_i^out] but only in the domain of C_i[x] if the label is x have an age of at most r.

Definition 4 (Senescent rbGTRS) A senescent rbGTRS with lifespan r is an rbGTRS G = (P, Σ, R, C) where runs are lifespan-restricted with a lifespan of r.

Note that the senescence restriction is weaker than the weakly-synchronised restriction in that the number of times the finite control could change state is unbounded. In fact, a node could be affected by an unbounded number of control-state changes so long as it is always rewritten without becoming fixed (i.e. reaches age r).

5.2 Undecidability

We show control-state reachability for senescent rbGTRSs is undecidable in the appendix, and give the intuition here. In the following, we refer to nodes whose age is within the age bound as live. We refer to nodes that are not live as fixed. Note, each time a node is rewritten, its age is reset to zero. Thus, we can keep leaves of the tree live by allowing them to rewrite to themselves. That is, for all symbols a we wish to keep live and all control-states p, we have a transition (p, a, θ) –[γ]→ (p, a, µ) where θ is a formula that is always satisfied, and µ assigns 0 to all counters (i.e. the rule does not depend on, nor change the counter values). In addition, by omitting the above rules for certain control-states, we can prevent a node from keeping itself fresh in certain situations.

We follow the proof that reachability for reset Petri nets is undecidable [3]. We simulate a two-counter machine. Testing whether such a machine can reach a given control-state while having counters with value zero is undecidable.

Let the two counters be c₁ and c₂. In the tree, we track the value of a counter c ∈ {c₁, c₂} by the number of live leaves labelled with the counter name c. E.g. the tree •(c₁, •(c₂, ∗)) represents the situation where both counters have value 1, assuming these leaves are live. We will always use internal nodes labelled •. The node ∗ is for adding new leaves when required. To increment a counter we add a new leaf labelled c. To decrement a counter, we rewrite a leaf labelled c to a null label. Thus, we can easily increment and decrement counters. Zero tests, however, are more subtle. To help with this, we track, using reversal-bounded counters, the number of increments made to each counter, and in separate reversal-bounded counters, the number of decrements. That is, we have reversal bounded counters {c₁⁺,c₁⁻,c₂⁺,c₂⁻}. When we simulate an increment of c₁ we add a leaf and increment c₁⁺. When we simulate a decrement of c₂ we rewrite a leaf to a null character and increment c₁⁻. Similarly for c₂. We simulate zero tests as follows.

To simulate a zero test on a counter c we perform the following checks. First, we “reset” the counter to zero by forcing enough control-state changes to fix the nodes corresponding to the counter. That is, we move to a control-state p where all leaf labels may rewrite to themselves, except those labelled c. After the move to p all leaves will have age 1. Leaves not labelled c can refresh their age to 0 by rewriting themselves. Leaves labelled c will stay aged 1. Then, we move to the target control-state of the transition we are simulating. Thus, after these moves, all leaves labelled c will reach age 2, while all other nodes will only reach age 1. Thus, if our lifespan is 2, nodes labelled c will no longer be live. That is, the simulated value of c in the tree has been forced to 0.

After this reset operation, the counter value is definitely zero. However, we did not enforce that the counter value was zero before the transition. Recall, we track the number of increments and decrements to c in the reversal bounded counters. If the counter was not zero before the test, there will be a discrepancy with the reversal bounded counters: more increments will be recorded than decrements. E.g. for counter c₁ we will have c₁⁺ > c₁⁻. This cannot be corrected by the simulation. Thus, at the end of the run, we check whether the number of increments is equal to the number of decrements. If not, we know the run made a spurious transition. That is, it performed a zero test transition when the counter was not zero. If no spurious transitions were made, we know the two-counter machine has a corresponding run. This completes the gist of the simulation of a two-counter machine.

6 Extensions and Future Work

We proposed sGTRS with counters as a model of recursively parallel programs with unbounded recursion, thread creation, and integer variables. To obtain decidability, we gave an underapproximation in the form of weak sGTRS with reversal-bounded counters. We showed that the reachability problem for this model is NP-complete; in fact, polynomial-time reducible to satisfiability of linear integer arithmetic, for which highly optimised SMT solvers are available (e.g. Z3 [16]). Additionally, we explored the possibility of relaxing the weakly-synchronised constraint to that of senescence, and showed that the resulting model has an undecidable control-state reachability problem.

One possible avenue of future work is to investigate what happens when local integer values are permitted. That is, reversal-bounded counters can be stored on the nodes of the tree. We may also study techniques that allow nodes to contain multiple labels, permitting the modelling of multiple local variables without an immediate exponential blow up.

Acknowledgments

We thank anonymous reviewers for their helpful feedback. This work was supported by the Engineering and Physical Sciences Research Council [EP/K009907/1] and Yale-NUS College Startup Grant.

References

[1]: Parosh Aziz Abdulla, Mohamed Faouzi Atig, and Jonathan Cederberg. Analysis of message passing programs using SMT-solvers. In ATVA, pages 272–286, 2013.
[2]: C. Aiswarya, P. Gastin, and K. Narayan Kumar. Verifying communicating multi-pushdown systems via split-width. In ATVA, pages 1–17, 2014.
[3]: T. Araki and T. Kasami. Some Decision Problems Related to the Reachability Problem for Petri Nets. Theoretical Computer Science, 3(1):85–104, 1977.
[4]: M. F. Atig, B. Bollig, and P. Habermehl. Emptiness of multi-pushdown automata is 2etime-complete. In DLT, pages 121–133, 2008.
[5]: M. F. Atig, A. Bouajjani, and S. Qadeer. Context-bounded analysis for concurrent programs with dynamic creation of threads. Logical Methods in Computer Science, 7(4), 2011.
[6]: M. F. Atig, K. Narayan Kumar, and P. Saivasan. Adjacent ordered multi-pushdown systems. Int. J. Found. Comput. Sci., 25(8):1083–1096, 2014.
[7]: Mohamed Faouzi Atig and Pierre Ganty. Approximating petri net reachability along context-free traces. In FSTTCS, pages 152–163, 2011.
[8]: S. Bardin, A. Finkel, J. Leroux, and P. Schnoebelen. Flat acceleration in symbolic model checking. In ATVA, pages 474–488, 2005.
[9]: Achim Blumensath, Thomas Colcombet, Denis Kuperberg, Pawel Parys, and Michael Vanden Boom. Two-way cost automata and cost logics over infinite trees. In CSL-LICS, pages 16:1–16:9, 2014.
[10]: A. Bouajjani and M. Emmi. Analysis of recursively parallel programs. ACM Trans. Program. Lang. Syst., 35(3):10, 2013.
[11]: Ahmed Bouajjani, Javier Esparza, and Oded Maler. Reachability analysis of pushdown automata: Application to model-checking. In CONCUR, pages 135–150, 1997.
[12]: Laura Bozzelli, Mojmír Kretínský, Vojtech Rehák, and Jan Strejcek. On decidability of LTL model checking for process rewrite systems. Acta Inf., 46(1):1–28, 2009.
[13]: Thomas Colcombet and Christof Löding. Regular cost functions over finite trees. In LICS, pages 70–79, 2010.
[14]: Wojciech Czerwinski, Piotr Hofman, and Slawomir Lasota. Reachability problem for weak multi-pushdown automata. Logical Methods in Computer Science, 9(3), 2013.
[15]: M. Dauchet and S. Tison. The theory of ground rewrite systems is decidable. In LICS, pages 242–248, 1990.
[16]: L. Mendonça de Moura and N. Bjørner. Z3: An efficient smt solver. In TACAS, pages 337–340, 2008.
[17]: Stéphane Demri, Marcin Jurdzinski, Oded Lachish, and Ranko Lazic. The covering and boundedness problems for branching vector addition systems. J. Comput. Syst. Sci., 79(1):23–38, 2013.
[18]: J. Esparza and A. Podelski. Efficient algorithms for pre^* and post^* on interprocedural parallel flow graphs. In POPL, pages 1–11, 2000.
[19]: Javier Esparza, Pierre Ganty, and Tomás Poch. Pattern-based verification for multithreaded programs. ACM Trans. Program. Lang. Syst., 36(3):9:1–9:29, 2014.
[20]: P. Ganty, R. Majumdar, and M. Monmege. Bounded underapproximations. FMSD, 40(2), 2012.
[21]: Stefan Göller and Anthony Widjaja Lin. Refining the process rewrite systems hierarchy via ground tree rewrite systems. In CONCUR, pages 543–558, 2011.
[22]: M. Hague. Senescent ground tree rewrite systems. In CSL-LICS, pages 48:1–48:10, 2014.
[23]: M. Hague and A. W. Lin. Model checking recursive programs with numeric data types. In CAV, pages 743–759, 2011.
[24]: Matthew Hague and Anthony Widjaja Lin. Synchronisation- and reversal-bounded analysis of multithreaded programs with counters. In CAV, pages 260–276, 2012.
[25]: Oscar H. Ibarra. Reversal-bounded multicounter machines and their decision problems. J. ACM, 25(1):116–133, 1978.
[26]: Mojmír Kretínský, Vojtech Rehák, and Jan Strejcek. Extended process rewrite systems: Expressiveness and reachability. In CONCUR, pages 355–370, 2004.
[27]: Salvatore La Torre, Margherita Napoli, and Gennaro Parlato. Scope-bounded pushdown languages. In DLT, pages 116–128, 2014.
[28]: A. Lal, T. Touili, N. Kidd, and T. Reps. Interprocedural analysis of concurrent programs under a context bound. In TACAS, pages 282–298, Berlin, Heidelberg, 2008. Springer-Verlag.
[29]: Martin Lang and Christof Löding. Modeling and verification of infinite systems with resources. Logical Methods in Computer Science, 9(4), 2013.
[30]: Jérôme Leroux, M. Praveen, and Grégoire Sutre. Hyper-ackermannian bounds for pushdown vector addition systems. In CSL-LICS, pages 63:1–63:10, 2014.
[31]: A. W. Lin. Weakly-synchronized ground tree rewriting (with applications to verifying multithreaded programs). In MFCS, pages 630–642, 2012.
[32]: C. Löding. Reachability problems on regular ground tree rewriting graphs. Theory Coput. Syst., 39(2):347–383, 2006.
[33]: P. Madhusudan and Gennaro Parlato. The tree width of auxiliary storage. In POPL, pages 283–294, 2011.
[34]: R. Mayr. Decidability and Complexity of Model Checking Problems for Infinite-State Systems. PhD thesis, TU-Munich, 1998.
[35]: M. Musuvathi and S. Qadeer. Iterative context bounding for systematic testing of multithreaded programs. In PLDI, pages 446–455, 2007.
[36]: S. Qadeer. The case for context-bounded verification of concurrent programs. In SPIN, pages 3–6, 2008.
[37]: G. Ramalingam. Context-sensitive synchronization-sensitive analysis is undecidable. Transactions on Programming Languages and Systems (TOPLAS), 2000.
[38]: S. Qadeer and J. Rehof. Context-bounded model checking of concurrent software. In TACAS, pages 93–107, 2005.
[39]: Bruno Scarpellini. Complexity of subcases of Presburger arithmetic. Trans. of AMS, 284(1):203–218, 1984.
[40]: D. Suwimonteerabuth, J. Esparza, and S. Schwoon. Symbolic context-bounded analysis of multithreaded java programs. In SPIN, pages 270–287, 2008.
[41]: Anthony Widjaja To and Leonid Libkin. Algorithmic metatheorems for decidable LTL model checking over infinite systems. In FOSSACS, pages 221–236, 2010.
[42]: S. La Torre, P. Madhusudan, and G. Parlato. A robust class of context-sensitive languages. In In LICS, pages 161–170. IEEE Computer Society, 2007.
[43]: Kumar Neeraj Verma and Jean Goubault-Larrecq. Karp-Miller trees for a branching extension of VASS. Discrete Mathematics & Theoretical Computer Science, 7(1):217–230, 2005.

A Proofs and Definitions for Senescent sGTRS

We show that the control-state reachability problem is undecidable via a reduction from the reachability problem for two-counter machines.

A two-counter machine is a tuple M = (S, Δ) where P is a finite set of control-states, Δ is a finite set of rules of the form p₁ –[op, ]→ p₂ where p₁, p₂ ∈ S, and op ∈ {inc₁, inc₂, dec₁, dec₂, zero₁, zero₂}. A configuration of M is a tuple (p, v₀, v₁) ∈ S × ℕ × ℕ. We have a transition (p₁, v₀¹, v₁¹) —→ (p₂, v₀², v₁²) if we have a rule p₁ –[op, ]→ p₂ and

if op = inc_i, v_i² = v_i¹ + 1 and v_1−i² = v_1−i¹,
if op = dec_i, v_i² = v_i¹ − 1 ≥ 0 and v_1−i² = v_1−i¹,
if op = zero_i, v_i² = v_i¹ = 0, and v_1−i² = v_1−i¹.

Let ν₀ be the valuation assigning 0 to all counters. For given two-counter machine M and control-states s₀ and s_f we define a senescent rbGTRS G_M such that there is a run

⎛
⎝

s₀, T₀, ν₀

⎞
⎠

–[ε]→ ⋯ –[ε]→

⎛
⎝

s_f, T, ν

⎞
⎠

for some T and ν iff there is a run

⎛
⎝

p₀, 0, 0

⎞
⎠

—→ ⋯ —→

⎛
⎝

p_f, 0, 0

⎞
⎠

of M. Since this latter problem is well-known to be undecidable, we obtain undecidability of control-state reachability for senescent rbGTRS.

In the following definition we use the following 1-reversal-bounded counters: c₀⁺, c₁⁺, c₀⁻ and c₁⁻. We use R_fresh to keep leaf nodes within the lifespan, R_inc, R_dec, and R_zero to simulate the counter operations, and R_fin to check c_i⁺ = c_i⁻ for both i at the end of the run. Furthermore, let

µ_i⁺

⎛
⎝

⎞
⎠

⎧
⎨
⎩

1	c = c_i⁺
0	otherwise,

µ_i⁻

⎛
⎝

⎞
⎠

⎧
⎨
⎩

1	c = c_i⁻
0	otherwise, and

µ_i⁼

⎛
⎝

⎞
⎠

⎧
⎪
⎨
⎪
⎩

−1

c ∈

⎧
⎨
⎩

c_i⁺,c_i⁻

⎫
⎬
⎭

otherwise.

Recall ν₀ maps all counters to zero.

Given a node η and trees t₁, …, t_n, we will often write η(t₁, …, t_n) to denote the tree with root node η and left-to-right child sub-trees t₁, …, t_n. When η is labelled a, we may also write a(t₁, …, t_n) to denote the same tree. We will often simply write a to denote the tree with a single node labelled a.

For a tree t, let T_t be an NTA accepting only t. For example, T_a(b) is the automaton accepting only the tree a(b), and T_a accepts only the tree containing a single node labelled a. Note, we do not use natural numbers as tree labels, hence T₁, T₂, … may range over all NTAs.

Definition 5 (G_M) Given a two-counter machine M = (S, Δ) and two control-states s₀, s_f ∈ S, we define a senescent rbGTRS with lifespan 1

G_M=

⎛
⎝

P, Σ, Γ, R, C

⎞
⎠

where

S ⊎

⎧
⎨
⎩

⎛
⎝

s, i

⎞
⎠

s ∈ S ∧ i ∈

⎧
⎨
⎩

0,1

⎫
⎬
⎭

⊎

⎧
⎨
⎩

f, p₌

⎫
⎬
⎭

⎧
⎨
⎩

•, ∗, ∘,

⎫
⎬
⎭

⎧
⎨
⎩

⎫
⎬
⎭

⎧
⎨
⎩

c₁⁺, c₂⁺, c₁⁻, c₂⁻

⎫
⎬
⎭

R_fresh ⋃ R_inc ⋃ R_dec ⋃ R_zero ⋃ R_fin

where

R_fresh

⎧
⎨
⎩

⎛
⎝

s, T_η, ⊤

⎞
⎠

–[ε]→

⎛
⎝

s, T_η, ν₀

⎞
⎠

s ∈ S ∧ η ∈

⎧
⎨
⎩

∗,

⎫
⎬
⎭

⋃

⎧
⎨
⎩

⎛
⎝

s, i

⎞
⎠

, T_η, ⊤

⎞
⎠

–[ε]→

⎛
⎝

s, i

⎞
⎠

, T_η, ν₀

⎞
⎠

s ∈ S ∧ η ∈

⎧
⎨
⎩

∗,

⎫
⎬
⎭

∖

⎧
⎨
⎩

⎫
⎬
⎭

R_inc

⎧
⎨
⎩

⎛
⎝

s₁, T_∗, ⊤

⎞
⎠

–[ε]→

⎛
⎜
⎝

s₂,

•

⎛
⎜
⎝

, ∗

⎞
⎟
⎠

, µ_i⁺

⎞
⎟
⎠

| p₁ –[inc_i, ]→ p₂ ∈ Δ

⎫
⎬
⎭

R_dec

⎧
⎨
⎩

⎛
⎜
⎝

s₁,

, ⊤

⎞
⎟
⎠

–[ε]→

⎛
⎝

s₂, T_∘, µ_i⁻

⎞
⎠

| p₁ –[dec_i, ]→ p₂ ∈ Δ

⎫
⎬
⎭

R_zero

⎧
⎪
⎨
⎪
⎩

⎛
⎝

s₁, T_∗, ⊤

⎞
⎠

–[ε]→

⎛
⎝

s₂, i

⎞
⎠

, T_∗, ν₀

⎞
⎠

⎛
⎝

s₂, i

⎞
⎠

, T_∗, ⊤

⎞
⎠

–[ε]→

⎛
⎝

s₂, T_∗, ν₀

⎞
⎠

| p₁ –[zero_i, ]→ p₂ ∈ Δ

⎫
⎪
⎬
⎪
⎭

R_fin

⎧
⎪
⎪
⎪
⎨
⎪
⎪
⎪
⎩

⎛
⎝

s_f, T_∗, ⊤

⎞
⎠

–[ε]→

⎛
⎝

p₌, T_∗, ν₀

⎞
⎠

⎛
⎝

p₌, T_∗, ⊤

⎞
⎠

–[ε]→

⎛
⎝

p₌, T_∗, µ₀⁼

⎞
⎠

⎛
⎝

p₌, T_∗, ⊤

⎞
⎠

–[ε]→

⎛
⎝

p₌, T_∗, µ₁⁼

⎞
⎠

⎛
⎝

p₌, T_∗, c₀⁺ = 0 ∧ c₀⁻ = 0 ∧ c₁⁺ = 0 ∧ c₁⁻ = 0

⎞
⎠

–[ε]→

⎛
⎝

f, T_∗, ν₀

⎞
⎠

⎫
⎪
⎪
⎪
⎬
⎪
⎪
⎪
⎭

Proposition 1 (Simulation of M) For a given two-counter machine M and control-states s₀ and s_f there is a run

⎛
⎝

p₀, 0, 0

⎞
⎠

—→ ⋯ —→

⎛
⎝

p_f, 0, 0

⎞
⎠

of M iff there is a run

⎛
⎝

s₀, T₀, ν₀

⎞
⎠

–[ε]→ ⋯ –[ε]→

⎛
⎝

s_f, T, ν

⎞
⎠

for some T and ν of G_M.

Proof. Let s₁ = s₀ and s_n = s_f and suppose we have a run

⎛
⎝

s₁, 0, 0

⎞
⎠

—→ ⋯ —→

⎛
⎝

s_n, 0, 0

⎞
⎠

We build the required run of G_M by induction such that for configuration (s_j, v₀^j, v₁^j) along the run of M, we have a run to a configuration (s_j, T_j, ν_j) of G_M such that

there is one leaf node labelled ∗, this node has age 0,
the number of nodes i in T_j is v_i^j for each j ∈ {0, 1}, each having age 0, and
ν_j(c_i⁺) − ν_j(c_i⁺) = v_i^j for each i ∈ {0,1}.

In the base case the result holds trivially for the configuration (s₁, ∗, ν₀). Now take a transition (s_j, op, s_j+1) from the run of M. By induction we have a run to (s_j, T_j, ν_j) as above. We show how to extend this run to (s_j+1, T_j+1, ν_j+1). There are several cases depending on op. In each case we show how to reach a tree satisfying the induction hypothesis, except the age of the leaf nodes. After the case analysis we show how to satisfy the age requirement also.

When op = inc_i, we use (s_j, T_∗, ⊤) –[ε]→ (s_j+1, T_{•(i, ∗)}, µ_i⁺). It is easy to verify we reach (s_j+1, T_j+1, ν_j+1) as required.
When op = dec_i, we know the ith counter must have a value greater than zero, hence we can apply (s_j, T_i, ⊤) –[ε]→ (s_j+1, T_∘, µ_i⁻). It is easy to verify we reach (s_j+1, T_j+1, ν_j+1) as required.
When op = zero_i, we know the ith counter must have value zero, hence there are no leaves labelled i in T_j. We can apply the following sequence of rules.
1. (s_j, T_∗, ⊤) –[ε]→ ((s_j+1, i), T_∗, ν₀),
2. ((s_j+1, i), T_η, ⊤) –[ε]→ ((s_j+1, i), T_η, ν₀) to each leaf labelled by some η ∈ {∗, 0, 1} ∖ {i},
3. ((s_j+1, i), T_∗, ⊤) –[ε]→ (s_j+1, T_∗, ν₀).
It is easy to verify we reach (s_j+1, T_j+1, ν_j+1) as required.

Finally, to obtain the age restriction on all leaf nodes, we apply (s_j+1, T_η, ⊤) –[ε]→ (s_j+1, T_η, ν₀) to each leaf labelled by some η ∈ {∗, 0, 1}.

Thus, by induction, we can reach a configuration (s_f, T, ν) such that, for each i we have ν(c_i⁺) = ν(c_i⁻). Thus, we can apply a sequence of rules from R_fin to reach (f, T, ν). In particular, we apply (s_f, T_∗, ⊤) –[ε]→ (p₌, T_∗, ν₀) and then simultaneously reduce each reversal-bounded counter to zero using (p₌, T_∗, ⊤) –[ε]→ (p₌, T_∗, µ_i⁼) repeatedly for each i, and then finally apply

⎛
⎝

p₌, T_∗, c₀⁺ = 0 ∧ c₀⁻ = 0 ∧ c₁⁺ = 0 ∧ c₁⁻ = 0

⎞
⎠

–[ε]→

⎛
⎝

f, T_∗, ν₀

⎞
⎠

to complete this direction of the proof.

We prove the opposite direction via two inductions. First, take some run of G_M, which necessarily has the form

⎛
⎝

p₁, T₁, ν₁

⎞
⎠

–[ε]→ ⋯ –[ε]→

⎛
⎝

p_n, T_n, ν_n

⎞
⎠

–[ε]→

⎛
⎝

p₌, T_n, ν_n

⎞
⎠

–[ε]→ ⋯ –[ε]→

⎛
⎝

p₌, 0, 0

⎞
⎠

–[ε]→

⎛
⎝

f, 0, 0

⎞
⎠

where the last sequence of transitions (from p_n) are all from R_fin, p₁ = s₀, T₁ = ∗, ν₁ = ν₀, and p_n = s_f. Let #_i(T) be the number of leaves labelled i in T. We first prove by induction over the run that for all 1 ≤ j ≤ n and i ∈ {0,1} we have #_i(T_j) = ν_j(c_i⁺) − ν_j(c_i⁻). This is a straightforward induction that can be seen by observing

the base case is immediate,
all rules in R_fresh ∪ R_zero do not change #_i(T_j), ν_j(c_i⁺), or ν_j(c_i⁻),
all rules in R_inc increase both #_i(T_j), and ν_j(c_i⁺), by one, and leave ν_j(c_i⁻) unchanged,
all rules in R_dec decrease #_i(T_j) by one, increase ν_j(c_i⁻) by one, and leave ν_j(c_i⁺), unchanged, and
there are no rules from R_fin.

Given #_i(T_j) = ν_j(c_i⁺) − ν_j(c_i⁻) for all j and i, we construct, also by induction, a sequence

⎛
⎝

s₁, v₀¹, v₁¹

⎞
⎠

, …,

⎛
⎝

s_n, v₀ⁿ, v₁ⁿ

⎞
⎠

of M such that for all j and i we have #_i(T_j) = v₀^j and p_j ∈ {s_j, (s_j, 0), (s_j, 1)} and, either

(s_j, v₀^j, v₁^j) —→ (s_j+1, v₀^j+1, v₁^j+1) is a transition of M, or
(s_j, v₀^j, v₁^j) = (s_j+1, v₀^j+1, v₁^j+1).

In the base case we set (s₁, v₀¹, v₀¹) = (s₀, 0, 0). Next, take a transition

⎛
⎝

p_j, T_j, ν_j

⎞
⎠

–[ε]→

⎛
⎝

p_j+1, T_j+1, ν_j+1

⎞
⎠

of G_M. There are several cases depending on which rule τ was applied.

If τ ∈ R_fresh then we set (s_j, v₀^j, v₁^j) = (s_j+1, v₀^j+1, v₁^j+1) and the properties follow from (s_j, v₀^j, v₁^j) by induction.
If τ ∈ R_inc then for some i we have τ = (s_j, T_∗, ⊤) –[ε]→ (s_j+1, T_{•(i, ∗)}, µ_i⁺) and s_j –[inc_i, ]→ s_j+1 is a rule of M. We apply this rule to obtain (s_j, v₀^j, v₁^j) —→ (s_j+1, v₀^j+1, v₁^j+1) and we can directly verify #_i(T_j+1) = v_i^j+1 for each i as required.
If τ ∈ R_dec then for some i we have τ = (s_j, T_i, ⊤) –[ε]→ (s_j+1, T_∘, µ_i⁻) and s_j –[dec_i, ]→ s_j+1 is a rule of M. We apply this rule to obtain (s_j, v₀^j, v₁^j) —→ (s_j+1, v₀^j+1, v₁^j+1) and we can directly verify #_i(T_j+1) = v_i^j+1 for each i as required.
If τ ∈ R_zero there are two sub-cases.
- In the first case, for some i we have τ = (s_j, T_∗, ⊤) –[ε]→ ((s_j+1, i), T_∗, ν₀) and s_j –[zero_i, ]→ s_j+1 is a rule of M. If we apply this rule we obtain (s_j, v₀^j, v₁^j) —→ (s_j+1, v₀^j+1, v₁^j+1) and we can directly verify #_i(T_j+1) = v_i^j+1 for each i as required. However, we need to prove s_j –[zero_i, ]→ s_j+1 can be applied. That is, we need to prove v_i^j is zero. Here we use #_i(T_j′) = ν_j′(c_i⁺) − ν_j′(c_i⁻) for all j′. From the definition of G_M we know that the run from ((s_j+1, i), T_j+1, ν_j+1) must eventually reach s_j+1 where (s_j+1, i) is the only control-state seen before s_j+1 is reached. During this time, we cannot refresh any node labelled i. Thus, assume for contradiction that v_i^j is not zero. Since #_i(T_j) = v_i^j we know there is at least one leaf labelled i. Since this node cannot refresh while the control-state is (s_j+1, i) this node will have age 2 once s_j+1 is reached. Thus, since the lifespan is 1, this node cannot be rewritten by the end of the run. This means T_n has at least one node labelled i. Since 1 ≤ #_i(T_n) = ν_n(c_i⁺) − ν_n(c_i⁻) we know ν_n(c_i⁺) ≠ ν_n(c_i⁻). However, the final transitions of the run of G_M use rules from R_fin and have the effect of ensuring ν_n(c_i⁺) = ν_n(c_i⁻). Hence, we have a contradiction, and v_i^j = 0. Thus we can apply s_j –[zero_i, ]→ s_j+1 as needed.
- If τ = ((s_j, i), T_∗, ⊤) –[ε]→ (s_j+1, T_∗, ν₀) we set (s_j, v₀^j, v₁^j) = (s_j+1, v₀^j+1, v₁^j+1) which satisfies the required properties since (s_j, v₀^j, v₁^j) did by induction.

Thus, we have a sequence (s₁, v₀¹, v₁¹), …, (s_n, v₀ⁿ, v₁ⁿ) from which we can immediately extract a run of M from (s₁, v₀¹, v₁¹) = (s₀, 0, 0) to (s_n, v₀ⁿ, v₁ⁿ) = (s_f, v₀ⁿ, v₁ⁿ). That v₀ⁿ = v₁ⁿ = 0 follows since the final transitions from s_n have the effect of asserting ν_n(c_i⁺) − ν_n(c_i⁻) = 0 from which we conclude #_i(T_n) = 0 and since v_iⁿ = #_i(T_n) we complete the proof as required.

Thus, via Property 1 (Simulation of M) we can reduce the reachability problem for two-counter machines to the control-state reachability problem for senescent rbGTRS. Thus, we show the control-state reachability problem is undecidable and complete the proof of Theorem 2.

This document was translated from L^AT_EX by H^EV^EA.