Unboundedness and Downward Closures of Higher-Order Pushdown Automata

Matthew Hague
Royal Holloway, University of London

Jonathan Kochems
Department of Computer Science, Oxford University

C.-H. Luke Ong
Department of Computer Science, Oxford University

Abstract: We show the diagonal problem for higher-order pushdown automata (HOPDA), and hence the simultaneous unboundedness problem, is decidable. From recent work by Zetzsche this means that we can construct the downward closure of the set of words accepted by a given HOPDA. This also means we can construct the downward closure of the Parikh image of a HOPDA. Both of these consequences play an important rôle in verifying concurrent higher-order programs expressed as HOPDA or safe higher-order recursion schemes.

1  Introduction

Recent work by Zetzsche [40] has given a new technique for computing the downward closure of classes of languages. The downward closure ↓(L) of a language L is the set of all subwords of words in L (e.g. aa is a subword of babab). It is well known that the downward closure is regular for any language [19]. However, there are only a few classes of languages for which it is known how to compute this closure. In general it is not possible to compute the downward closure since it would easily lead to a solution to the halting problem for Turing machines.

However, once a regular representation of the downward closure has been obtained, it can be used in all kinds of analysis, since regular languages are well behaved under all kinds of transformations. For example, consider a system that waits for messages from a complex environment. This complex environment can be abstracted by the downward closure of the messages it sends or processes it spawns. This corresponds to a lossy system where some messages may be ignored (or go missing), or some processes may simply not contribute to the remainder of the execution. In many settings – e.g. the analysis of safety properties of certain kinds of systems – unread messages or unscheduled processes do not affect the precision of the analysis. Since many types of system permit synchronisation with a regular language, this environment abstraction can often be built into the system being analysed.

Many popular languages such as JavaScript, Python, Ruby, and even C++, include higher-order features – which are increasingly important given the popularity of event-based programs and asynchronous programs based on a continuation or callback style of programming. Hence, the modelling of higher-order function calls is becoming key to analysing modern day programs.

A popular approach to verifying higher-order programs is that of recursion schemes and several tools and practical techniques have been developed [23, 38, 26, 24, 30, 5, 6, 34]. Recursion schemes have an automaton model in the form of collapsible pushdown automata (CPDA) [18] which generalises an order-2 model called 2-PDA with links [1] or, equivalently, panic automata [22]. When these recursion schemes satisfy a syntactical condition called safety, a restriction of CPDA called higher-order pushdown automata (HOPDA or n-PDA for order-n HOPDA) is sufficient [29, 21]. HOPDA can be considered an extension of pushdown automata to a “stack of stacks” structure. It remains open as to whether CPDA are strictly more expressive than nondeterministic HOPDA when generating languages of words. It is known that, at order 2, nondeterministic HOPDA and CPDA generate the same word languages [1]. However, there exists a language generated by a deterministic order-2 CPDA that cannot be generated by a deterministic HOPDA of any order [31].

It is well known that concurrency and first-order recursion very quickly lead to undecidability (e.g. [33]). Hence, much recent research has focussed on decidable abstractions and restrictions (e.g. [14, 4, 20, 27, 13, 37, 28, 10, 16]). Recently, these results have been extended to concurrent versions of CPDA and recursion schemes (e.g. [35, 25, 15, 32]). Many approaches rely on combining representations of the Parikh image of individual automata (e.g. [13, 17, 16]). However, combining Parikh images of HOPDA quickly leads to undecidability (e.g. [17]). In many cases, the downward closure of the Parikh image is an adequate abstraction.

Computing downward closures appears to be a hard problem. Recently Zetzsche introduced a new general technique for classes of automata effectively closed under rational transductions – also referred to as a full trio. For these automata the downward closure is computable iff the simultaneous unboundedness problem (SUP) is decidable.

Definition 1 (SUP [40])   Given a language L ⊆ a1* ⋯ aα*, does ↓(L) = a1* ⋯ aα* hold?
Theorem 1   [40, Theorem 1] Let C be a class of languages that is a full trio. Then downward closures are computable for C if and only if the SUP is decidable for C.

Zetzsche used this result to obtain the downward closure of languages definable by 2-PDA, or equivalently, languages definable by indexed grammars [2]. Moreover, for classes of languages closed under rational transductions, Zetzsche shows that the simultaneous unboundedness problem is decidable iff the diagonal problem is decidable. The diagonal problem was introduced by Czerwiński and Martens [11]. Intuitively, it is a relaxation of the SUP that is insensitive to the order in which the characters are output. For a word w, let |w|a be the number of occurrences of a in w.

Definition 2 (Diagonal Problem [11])   Given a language L we define

    Diagonala1, …, aα(L) = ∀ m . ∃ w ∈ L . ∀ 1 ≤ i ≤ α . |w|ai ≥ m .

The diagonal problem asks if Diagonala1, …, aα(L) holds of L.
Corollary 1 (Diagonal Problem and Downward Closures)   Let C be a class of languages that is a full trio. Then downward closures are computable for C if and only if the diagonal problem is decidable for C.

Proof. The only-if direction follows from Theorem 1 since, given a language L ⊆ a1* ⋯ aα*, the diagonal problem is immediately equivalent to the SUP. In the if direction, the result follows since L satisfies the diagonal problem iff ↓(L) also satisfies the diagonal problem. Since the diagonal problem is decidable for regular languages and ↓(L) is regular, we have the result.


In this work, we generalise Zetzsche’s result for 2-PDA to the general case of n-PDA. We show that the diagonal problem is decidable. Since HOPDA are closed under rational transductions, we obtain decidability of the simultaneous unboundedness problem, and hence a method for constructing the downward closure of a language defined by a HOPDA.

Corollary 2 (Downward Closures)   Let P be an n-PDA. The downward closure ↓(L(P)) is computable.

Proof. From Theorem 3 (proved in the sequel), we know that the diagonal problem for HOPDA is decidable. Thus, using Corollary 1, we can construct the downward closure of P.


This result provides an abstraction upon which new results may be based. It also has several immediate consequences:

  1. decidability of separability by piecewise testable languages, which follows from Czerwiński and Martens [11],
  2. decidability of reachability for parameterised concurrent systems of HOPDA communicating asynchronously via a shared global register, from La Torre et al. [36],
  3. decidability of finiteness of a language defined by a HOPDA, and
  4. computability of the downward closure of the Parikh image of a HOPDA.

We present our decidability proof in two stages. First we show how to decide Diagonala(P) for a single character and HOPDA P in Sections 3 and 4. In Sections 5, 6, and 7 we generalise our techniques to the full diagonal problem.

In Section 3.1 we give an outline of the proof techniques for deciding Diagonala(P). In short, the outermost stacks of an n-PDA are created and destroyed using pushn and popn operations. These pushn and popn operations along a run of an n-PDA are “well-bracketed” (each pushn has a matching popn and these matchings don’t overlap). The essence of the idea is to take a standard tree decomposition of these well-bracketed runs and observe that each branch of such a tree can be executed by an (n−1)-PDA. We augment this (n−1)-PDA with “regular tests” that allow it to know if, each time a branch is chosen, the alternative branch could have output some a characters. If this is true, then the (n−1)-PDA outputs a single a to account for these missed characters. We prove that, although the (n−1)-PDA outputs far fewer characters, it can still output an unbounded number iff the n-PDA could. Hence, by repeating this reduction, we obtain a 1-PDA, for which the diagonal problem is decidable since it is known how to compute their downward closures [39, 9].

In Section 6.1 we outline the generalisation of the proof to the full problem Diagonala1, …, aα(P). The key difficulty is that it is no longer enough for the (n−1)-PDA to follow only a single branch of the tree decomposition: it may need up to one branch for each of the a1, …, aα. Hence, we define HOPDA that can output trees with a bounded number (α) of branches. We then show that our reduction can generalise to HOPDA outputting trees (relying essentially on the fact that the number of branches is bounded).

2  Preliminaries

2.1  Downward Closures

Given two words w = γ1 … γm ∈ Σ* and w′ = σ1 … σl ∈ Σ* over some alphabet Σ, we write w ⊑ w′ iff there exist i1 < … < im such that for all 1 ≤ j ≤ m we have γj = σij. Given a set of words L ⊆ Σ*, we denote its downward closure ↓(L) = { w | ∃ w′ ∈ L . w ⊑ w′ }.
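These definitions are easy to check computationally for finite languages; the following Python sketch (illustrative only – for an infinite L the closure is regular but must be obtained by techniques such as those developed in this paper) implements both. The function names are our own.

```python
from itertools import combinations

def is_subword(w, v):
    """Check w ⊑ v: w embeds in v preserving the order of characters."""
    it = iter(v)
    return all(c in it for c in w)   # `c in it` consumes the iterator

def downward_closure(language):
    """Downward closure of a *finite* language by brute-force
    enumeration of subsequences (exponential; illustration only)."""
    closure = set()
    for v in language:
        for k in range(len(v) + 1):
            for idxs in combinations(range(len(v)), k):
                closure.add("".join(v[i] for i in idxs))
    return closure
```

For instance, "aa" is a subword of "babab", and ↓({"ab"}) = {"", "a", "b", "ab"}.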

2.2  Trees

A Σ-labelled finite tree is a tuple T = (D, λ) where Σ is a set of node labels, D ⊂ ℕ* is a finite prefix-closed set of nodes, that is, ηδ ∈ D implies η ∈ D, and λ : D → Σ is a function labelling the nodes of the tree.

We write ε to denote the root of a tree (the empty sequence). We also write

    a[T1, …, Tm]

to denote the tree whose root node is labelled a and has children T1, …, Tm. That is, we define a[T1, …, Tm] = (D′, λ′) where, for each 1 ≤ δ ≤ m, we have Tδ = (Dδ, λδ), and D′ = { δη | 1 ≤ δ ≤ m ∧ η ∈ Dδ } ∪ { ε }, and

    λ′(η) = a if η = ε,     and     λ′(η) = λδ(η′) if η = δη′ .

Also, let T[a] denote the tree ({ε}, λ) where λ(ε) = a. A branch in T = (D, λ) is a sequence of nodes of T, η1 ⋯ ηn, such that η1 = ε, ηn = δ1 δ2 ⋯ δn−1 is maximal in D, and ηj+1 = ηj δj for each 1 ≤ j ≤ n−1.

2.3  HOPDA

HOPDA are a generalisation of pushdown systems to a stack-of-stacks structure. An order-n stack is a stack of order-(n−1) stacks. An order-n push operation pushes a new order-(n−1) stack onto the stack that is a copy of the existing topmost order-(n−1) stack. Rewrite operations update the character that is at the top of the topmost stacks.

Definition 1 (Order-n Stacks)   The set of order-n stacks SnΓ over a given stack alphabet Γ is defined inductively as follows:

    S0Γ = Γ     and     Sk+1Γ = { [s1 … sm]k+1 | ∀ i . si ∈ SkΓ } .
Stacks are written with the top part of the stack to the left. We define several operations:

    topk([s1 … sm]k) = s1
    topk([s1 … sm]n) = topk(s1)     when n > k

    rewγ([γ1 … γm]1) = [γ γ2 … γm]1
    rewγ([s1 … sm]n) = [rewγ(s1) s2 … sm]n     when n > 1

    pushk([s1 … sm]k) = [s1 s1 … sm]k
    pushk([s1 … sm]n) = [pushk(s1) s2 … sm]n     when n > k

    popk([s1 … sm]k) = [s2 … sm]k
    popk([s1 … sm]n) = [popk(s1) s2 … sm]n     when n > k

and set

    Opsn = { rewγ | γ ∈ Γ } ∪ { pushk, popk | 1 ≤ k ≤ n }

to be the set of order-n stack operations.

For example,

    push2([ [γ σ]1 ]2) = [ [γ σ]1 [γ σ]1 ]2
    rewσ([ [γ σ]1 [γ σ]1 ]2) = [ [σ σ]1 [γ σ]1 ]2 .
Definition 2 (HOPDA or n-PDA)   An order-n higher-order pushdown automaton (HOPDA or n-PDA) is given by a tuple (P, Σ, Γ, R, F, pin, γin) where P is a finite set of control states, Σ is a finite output alphabet (that contains the empty word character ε), Γ is a finite stack alphabet, R ⊆ P × Γ × Σ × Opsn × P is a set of transition rules, F is a set of accepting control states, pin ∈ P is the initial control state, and γin ∈ Γ is the initial stack character.

We write (p, γ) —a→ (p′, o) for a rule (p, γ, a, o, p′) ∈ R.

A configuration of an n-PDA is a tuple ⟨ p, s ⟩ where pP and s is an order-n stack over Γ. We have a transition ⟨ p, s ⟩ —a→ ⟨ p′, s′ ⟩ whenever we have (p, γ) —a→ (p′, o), top1(s) = γ, and s′ = o(s).

A run over a word w ∈ Σ is a sequence of configurations c0a1→ ⋯ —amcm such that the word a1am is w. It is an accepting run if c0 = ⟨ pin, [[ γin ]] ⟩ — where we write [[ γ ]] for [⋯[γ]1⋯]n — and where cm= ⟨ p, s ⟩ with pF. Furthermore, for a set of configurations C, we define

    PreP(C)

to be the set of configurations c such that there is a run over some word from c to some c′ ∈ C. When C is defined as the language of some automaton A accepting configurations, we abuse notation and write PreP(A) instead of PreP(L(A)).

For convenience, we sometimes allow a set of characters to be output instead of only one. This is to be interpreted as outputting each of the characters in the set once (in some arbitrary order). We also allow sequences of operations o1; …; om in the rules instead of single operations. When using sequences we allow a test operation γ? that only allows the sequence to proceed if the top1 character of the stack is γ. All of these extensions can be encoded by introducing intermediate control states.
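To make the operational semantics concrete, here is a small, hedged sketch of a rule interpreter: rules are tuples (p, γ, b, o, p′) with o an arbitrary function on stacks (so the sequences of operations permitted above are just composed functions), and stacks are nested lists with the top leftmost. The fuel bound, the state names, and the example 1-PDA for { aⁿbⁿ | n ≥ 1 } are our own illustrative choices.

```python
def top1(s):
    """top_1 of a nested-list stack (top is leftmost)."""
    while isinstance(s, list):
        s = s[0]
    return s

def accepts(rules, p_in, s_in, finals, word, fuel=10000):
    """Search for an accepting run over word.  rules are tuples
    (p, gamma, b, op, p2); b == "" stands for an ε-output.  The search
    is fuel-bounded, so this is only a sketch for tiny examples."""
    frontier = [(p_in, s_in, word)]
    while frontier and fuel > 0:
        fuel -= 1
        p, s, rest = frontier.pop()
        if p in finals and not rest:
            return True
        for (q, g, b, op, q2) in rules:
            if q == p and s and top1(s) == g:
                if b == "":
                    frontier.append((q2, op(s), rest))
                elif rest and rest[0] == b:
                    frontier.append((q2, op(s), rest[1:]))
    return False

# A 1-PDA for { a^n b^n | n >= 1 }; push_A abbreviates push1 ; rew_A.
push_A = lambda s: ["A"] + s
pop1 = lambda s: s[1:]
stay = lambda s: s
rules = [
    ("q0", "Z", "a", push_A, "q1"),
    ("q1", "A", "a", push_A, "q1"),
    ("q1", "A", "b", pop1, "q2"),
    ("q2", "A", "b", pop1, "q2"),
    ("q2", "Z", "", stay, "qf"),
]
```

For instance, accepts(rules, "q0", ["Z"], {"qf"}, "aabb") holds, while "aab" is rejected.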

2.3.1  Regular Sets of Stacks

We will need to represent sets of stacks. To do this we will use automata to recognise stacks. We define the stack automaton model of Broadbent et al. [8] restricted to HOPDA rather than CPDA. We will sometimes call these bottom-up stack automata or simply automata. The automata operate over stacks interpreted as words, hence the opening and closing braces of the stacks appear as part of the input. We annotate these braces with the order of the stack the braces belong to. Let Γ[] = {[n−1,…,[1, ]1,…,]n−1} ⊎ Γ. Note, we don’t include [n, ]n since these appear exclusively at the start and end of the stack.

Definition 3 (Bottom-up Stack Automata)   A tuple A is a bottom-up stack automaton when A is (Q, Γ, qin, QF, Δ) where Q is a finite set of states, Γ is a finite input alphabet, qinQ is the initial state and Δ : (Q × Γ) → Q is a deterministic transition function.

Representing higher order stacks as a linear word graph, where the start of an order-k stack is an edge labelled [k and the end of an order-k stack is an edge labelled ]k, a run of a bottom-up stack automaton is a labelling of the nodes of the graph with states in Q such that

  1. the rightmost (final) node is labelled by qin, and
  2. whenever, for some γ ∈ Γ[], there is an edge q —γ→ q′ between a pair of labelled nodes, we have q = Δ(q′, γ).

The run is accepting if the leftmost (initial) node is labelled by qQF. An example run over the word graph representation of [ [ [γ   σ]1[σ]1 ]2 [ [σ]1 ]2 ]3 is given in Figure 1.

Let L(A) be the set of stacks with accepting runs of A. Sometimes, for convenience, if we have a configuration c = ⟨ p, s ⟩ of a HOPDA, we will write cL(A) when sL(A).


Figure 1: A run over [ [ [γ   σ]1[σ]1 ]2 [ [σ]1 ]2 ]3
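As a hedged illustration of how such an automaton processes a stack right to left, the following sketch runs a deterministic bottom-up automaton over a stack written as a word (outermost braces omitted, as above). The toy automaton, its state names A, B, C, and the property it tests – that top1 of the stack is a given character, here "g" – are our own choices, echoing the automata Ap, γ used later.

```python
def run(delta, q_in, finals, word):
    """Run a bottom-up stack automaton: the rightmost node is labelled
    q_in, and an edge q --sym--> q' forces q = delta[(q', sym)];
    accept if the leftmost label is final."""
    q = q_in
    for sym in reversed(word):
        q = delta[(q, sym)]
    return q in finals

# Toy automaton over order-2 stacks testing top_1(s) = "g", i.e. that
# the symbol following the leftmost "[1" is "g".
SYMS = ["[1", "]1", "g", "s"]
delta = {}
for q in ["A", "B", "C"]:
    for sym in SYMS:
        if sym == "g":
            delta[(q, sym)] = "A"          # remember: just read a "g"
        elif sym == "[1" and q == "A":
            delta[(q, sym)] = "B"          # "[1" immediately left of "g"
        else:
            delta[(q, sym)] = "C"
```

Then run(delta, "C", {"B"}, ["[1", "g", "s", "]1", "[1", "s", "]1"]) accepts the stack [ [γ σ]1 [σ]1 ]2 with γ = "g" and σ = "s".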

3  The Single Character Case

We assume Σ = {a, ε} and use b to range over Σ. This can be obtained by simply replacing all other characters with ε. We also assume that all rules of the form (p, γ) —b→ (p′, o) with o = pushn or o = popn have b = ε. We can enforce this using intermediate control states: first apply o in one step, then output b in a second step (the stack operation on the second step is rewγ where γ is the current top character). We start with an outline of the proof, and then explain each step in detail.

For convenience, we assume acceptance is by reaching a unique control state in F with an empty stack (i.e. the lowermost stack was removed with a popn and F = {pf}). This can easily be obtained by adding a rule to a new accepting state whenever we have a rule leading to a control state in F. From this new state we can loop and perform popn operations until the stack is empty.

3.1  Outline of Proof

The approach is to take an n-PDA P and produce an (n−1)-PDA P−1 that satisfies the diagonal problem iff P does. The idea behind this reduction is that an (accepting) run of P can be decomposed into a tree with out-degree at most 2: each pushn has a matching popn that brings the stack back to be the same as it was before the pushn; we cut the run at the popn and hang the tail next to the pushn and repeat this to form a tree from a run. This is illustrated in Figure 2 where nodes are labelled by their configurations, and the pushn and popn points are marked. The dotted arcs connect nodes matched by their pushes and pops – these nodes have the same stacks. Notice that at each branching point, the left and right subtrees start with the same order-(n−1) stacks on top. Notice also that for each branch, none of its transitions remove the topmost order-(n−1) stack. Hence, we can produce an (n−1)-PDA that picks a branch of this tree decomposition to execute and only needs to keep track of the topmost order-(n−1) stack of the n-PDA. When picking a branch to execute, the (n−1)-PDA outputs a single a if the branch not chosen could have output some a characters. We prove that this is enough to maintain unboundedness.


Figure 2: Tree decompositions of runs.

In more detail, we perform the following steps.

  1. Instrument P to record whether an a character has been output. Then, using known reachability results, obtain regular sets of configurations from which the current topn stack can be popped, knowing moreover whether an a is output on the way. These tests can be seen as a generalisation of pushdown systems with regular tests introduced by Esparza et al. [12].
  2. From an n-PDA P, we define an (n−1)-PDA with tests P−1 and then an (n−1)-PDA P′ such that

        Diagonala(P) ⇐⇒ Diagonala(P′) .

     The tests will be used to check the branches of the tree decomposition not explored by P−1.
  3. By repeated applications of the above reduction, we obtain a 1-PDA P′ for which Diagonala(P′) is decidable since the downward closure of a context-free grammar (equivalent to 1-PDA) is computable [39, 9] and this is equivalent to the diagonal problem.

The (n−1)-PDA with tests P−1 will simulate the n-PDA P in the following way.

Although P−1 will output far fewer a characters than P (since it does not execute the full run), we show that it still outputs enough as for the language to remain unbounded.

We thus have the following theorem.

Theorem 1 (Decidability of the Diagonal Problem)   Given an n-PDA P and output character a, whether Diagonala(P) holds is decidable.

Proof. We construct via Lemma 2 an (n−1)-PDA P′ such that Diagonala(P) iff Diagonala(P′). We repeat this step until we have a 1-PDA. It is known that Diagonala(P) for a 1-PDA P is decidable since it is possible to compute the downward closure [39, 9].


3.2  HOPDA with Tests

When executing a branch of the tree decomposition, to ensure the branch is correct, and to decide whether we should output an extra a, we need to know how the system could have behaved on the skipped branch. To do this we add tests to the HOPDA that allow it to know if the current stack belongs to a given regular set. We show in the following sections that the properties required for our reduction can be represented as regular sets of stacks. Although we take Broadbent et al.’s logical reflection as the basis of our proof, HOPDA with tests can be seen as a generalisation of pushdown systems with regular valuations due to Esparza et al. [12].

Definition 1 (n-PDA with Tests)   Given a sequence of automata A1, …, Am recognising regular sets of stacks, an n-PDA with tests is a tuple P = (P, Σ, Γ, R, F, pin, γin) where P, Σ, Γ, F, pin, and γin are as in HOPDA, and

    R ⊆ P × Γ × { A1, …, Am } × Σ × Opsn × P

is a set of transition rules.

We write (p, γ, Ai) —b→ (p′, o) for (p, γ, Ai, b, o, p′) ∈ R. We have a transition ⟨ p, s ⟩ —b→ ⟨ p′, s′ ⟩ whenever (p, γ, Ai) —b→ (p′, o) ∈ R and top1(s) = γ, sL(Ai), and s′ = o(s).

We know from Broadbent et al. that these tests do not add any extra power to HOPDA. Intuitively, we can embed runs of the automata into the stack during runs of the HOPDA.

Theorem 2 (Removing Tests)   [8, Theorem 3 (adapted)] For every n-PDA with tests P, we can compute an n-PDA P′ with L(P) = L(P′).

Proof. This is a straightforward adaptation of Broadbent et al. [8]. A more general theorem is proved in Theorem 1.


3.2.1  Marking Outputs

When the HOPDA is in a configuration of the form ⟨ p, [s]n ⟩ – i.e. the outermost stack contains only a single order-(n−1) stack – we require the HOPDA to be able to know whether the topmost order-(n−1) stack can be completely removed by a run ending in a given control state, and whether such a run can output an a.

Given P, we first augment P to record whether an a has been produced. This can be done simply by recording in the control state whether a has been output.

Definition 2 (Pa)   Given P = (P, Σ, Γ, R, F, pin, γin) we define

        Pa = (P ∪ Pa, Σ, Γ, R ∪ Ra, F ∪ Fa, pin, γin)

where

        Pa = { pa | p ∈ P },

        Ra = { (pa, γ) —b→ (p′a, o) | (p, γ) —b→ (p′, o) ∈ R } ∪ { (p, γ) —a→ (p′a, o) | (p, γ) —a→ (p′, o) ∈ R },

        Fa = { pa | p ∈ F }.
It is easy to see that P and Pa accept the same languages, and that Pa is only in a control state pa if an a has been output.
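Definition 2 is a purely syntactic transformation on rule sets; the following sketch (our own encoding, with rules as tuples (p, γ, b, o, p′) and marked states represented as pairs (p, "a")) shows the construction.

```python
def mark_outputs(rules, finals, a="a"):
    """The P_a construction: add a marked copy (p, "a") of each control
    state; a rule outputting `a` may set the mark, and marked states
    stay marked.  Returns the rules R ∪ R_a and finals F ∪ F_a."""
    mk = lambda p: (p, "a")
    extra = set()
    for (p, g, b, o, q) in rules:
        extra.add((mk(p), g, b, o, mk(q)))   # copy of the rule between marked states
        if b == a:
            extra.add((p, g, b, o, mk(q)))   # outputting an `a` sets the mark
    return rules | extra, finals | {mk(p) for p in finals}
```

A run of the augmented automaton can only enter a marked state by recording an output of a, so reaching a marked final state witnesses that an a was output.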

3.2.2  Building the Automata

Fix some P = (P, Σ, Γ, R, F, pin, γin) and the corresponding Pa = (P ∪ Pa, Σ, Γ, R ∪ Ra, F ∪ Fa, pin, γin). To obtain a HOPDA with tests, we need, for each p1, p2 ∈ P, the following automata. Note, we define these automata to accept order-(n−1) stacks since they will be used in an (n−1)-PDA with tests.

  1. Ap1, p2 accepting all stacks s such that there is a run of P from ⟨ p1, [s]n ⟩ to ⟨ p2, []n ⟩,
  2. Ap1, p2a accepting all stacks s such that there is a run of P from ⟨ p1, [s]n ⟩ to ⟨ p2, []n ⟩ that outputs at least one a.

To do this we will use a reachability result due to Broadbent et al. that appeared in ICALP 2012 [7]. This result uses an automata representation of sets of configurations. However, these automata are slightly different in that they read full configurations “top down”, whereas the automata of Theorem 2 (Removing Tests) read only stacks “bottom up”.

It is known that these two representations are effectively equivalent, and that both form an effective boolean algebra [8, 7]. In particular, for a top-down automaton A and a control state p we can build a bottom-up stack automaton B such that ⟨ p, s ⟩ ∈ L(A) iff sL(B) and vice versa. We recall the reachability result.

Theorem 3   [7, Theorem 1 (specialised)] Given a HOPDA P and a top-down automaton A, we can construct an automaton A′ accepting PreP(A).

Let Ap, γ be a top-down automaton accepting configurations of the form ⟨ p, [s]n ⟩ where top1(s) = γ. Next, let

    Ap = ⋃ { Ap′, γ | (p′, γ) —ε→ (p, popn) ∈ R }

and

    Apa = ⋃ { Ap′a, γ | (p′, γ) —ε→ (p, popn) ∈ R } .

I.e. Ap and Apa accept configurations of Pa from which it is possible to perform a popn operation to p and reach the empty stack.

Definition 3 (Ap1, p2 and Ap1, p2a)   Using the preceding notation, given p1 and p2 we define bottom-up automata Ap1, p2 and Ap1, p2a accepting the stacks s such that ⟨ p1, [s]n ⟩ belongs to PreP(Ap2) and PrePa(Ap2a) respectively.

It is easy to see both Ap1, p2 and Ap1, p2a are regular and representable by bottom-up automata since both

    PreP(Ap2)     and     PrePa(Ap2a)

are regular from Theorem 3, and bottom-up and top-down automata are effectively equivalent. To enforce only stacks of the form [s]n we intersect with an automaton A1 accepting all stacks containing a single order-(n−1) stack (this is clearly regular).

3.3  Reduction to Lower Orders

We are now ready to complete the reduction. Correctness is shown in Section 4. Let Att be the automaton accepting all stacks. In the following definition, a control state (p1, p2) means that we are currently in control state p1 and are aiming to empty the stack on reaching p2. The rules Rsim simulate all operations apart from pushn and popn directly; Rfin detect when the run is accepting; Rpush follow the push branch of the tree decomposition, using tests to ensure the existence of the pop branch; and Rpop follow the pop branch of the tree decomposition, also using tests to check the existence of the push branch.

Definition 4 (P−1)   Given an n-PDA P described by the tuple (P, Σ, Γ, R, {pf}, pin, γin) as well as families of automata (Ap1, p2)p1, p2 ∈ P and (Ap1, p2a)p1, p2 ∈ P we define an (n−1)-PDA with tests

        P−1 = (P−1, Σ, Γ, R−1, F−1, (pin, pf), γin)

where

        P−1 = { (p1, p2) | p1, p2 ∈ P } ∪ { f },
        R−1 = Rsim ∪ Rfin ∪ Rpush ∪ Rpop,
        F−1 = { f },

and the rule sets Rsim, Rfin, Rpush, and Rpop play the roles described above.

In the next section, we show the reduction is correct.

Lemma 1 (Correctness of P−1)  

        Diagonala(P) ⇐⇒ Diagonala(P−1) .

To complete the reduction, we convert the HOPDA with tests into a HOPDA without tests.

Lemma 2 (Reduction to Lower Orders)   For every n-PDA P we can construct an (n−1)-PDA P′ such that

        Diagonala(P) ⇐⇒ Diagonala(P′) .

Proof. From Definition 4 (P−1) and Lemma 1 (Correctness of P−1), we obtain from P an (n−1)-PDA with tests P−1 satisfying the conditions of the lemma. To complete the proof, we invoke Theorem 2 (Removing Tests) to find P′ as required.


4  Correctness of Reduction

This section is dedicated to the proof of Lemma 1 (Correctness of P−1).

The idea of the proof is that each run of P can be decomposed into a tree: each pushn operation creates a node whose left child is the run up to the matching popn, and whose right child is the run after the matching popn. All other operations create a node with a single child which is the successor configuration.

Each branch of such a tree corresponds to a run of P−1. To prove that P−1 can output an unbounded number of as we prove that any tree containing m edges outputting a must have a branch along which P−1 would output at least log(m) a characters. Thus, if P can output an unbounded number of a characters, so can P−1.

4.1  Tree Decomposition of Runs

Given a run

    ρ = c0 —b1c1 —b2→ ⋯ —bm→ cm

of P where each pushn operation has a matching popn, we can construct a tree representation of ρ inductively. That is, we define Tree(c) = T[ε] for the single-configuration run c, and, when

    ρ = c —b→ ρ′

where the first rule applied does not contain a pushn operation, we have

    Tree(ρ) = b[Tree(ρ′)]

and, when

    ρ = c0 —ε→ ρ1 —ε→ ρ2

with c1 being the first configuration of ρ2 and where the first rule applied in ρ contains a pushn operation, c0 = ⟨ p, s ⟩ and c1 = ⟨ p′, s ⟩ for some p, p′, s and there is no configuration in ρ1 of the form ⟨ p″, s ⟩, then

    Tree(ρ) = ε[Tree(ρ1), Tree(ρ2)] .

An accepting run of P has the form ρ —εc where ρ has the property that all pushn operations have a matching popn and the final transition is a popn operation to c = ⟨ p, []n ⟩ for some pF. Hence, we define the tree decomposition of an accepting run to be

    Tree(ρ —ε→ c) = ε[Tree(ρ), T[ε]] .

4.2  Scoring Trees

In the above tree decomposition of runs, the tree branches at each instance of a pushn operation. This mimics the behaviour of P−1, which performs such branching non-deterministically. Hence, given a run ρ of P, each branch of Tree(ρ) corresponds to a run of P−1.

We formalise this intuition in the following section. In this section, we assign scores to each subtree T of Tree(ρ). These scores correspond directly to the largest number of a characters that P−1 can output while simulating a branch of T.

Note, in the following definition, we exploit the fact that only nodes with exactly one child may have a label other than ε. We also give a general definition applicable to trees with out-degree larger than 2. This is needed in the simultaneous unboundedness section. For the moment, we only have trees with out-degree at most 2.

Let

    ⌊b⌋ = 0 if b = ε, and ⌊b⌋ = 1 if b = a,     and     ⌊m⌋ = 0 if m = 0, and ⌊m⌋ = 1 if m > 0 .

Then,

     Score
T
=












T = T
ε
Score
T1
b
 
T = b
T1
 
max
1 ≤ i ≤ m





 Score
Ti
 
 
j ≠ i
 Score
Tj
 





T = ε
T1, …, Tm

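The scoring function can be checked on small examples with the following sketch (tree encoding as pairs (label, children), with "e" for ε; the naming is our own):

```python
from math import log2

def score(t):
    """Score of a decomposition tree (label, children): leaves score 0,
    a unary node labelled "a" adds 1, and a branching node keeps the
    best child's score, adding 1 if the other children score anything."""
    label, kids = t
    if not kids:
        return 0
    if len(kids) == 1:
        return score(kids[0]) + (1 if label == "a" else 0)
    scores = [score(k) for k in kids]
    total = sum(scores)
    return max(s + min(1, total - s) for s in scores)
```

For instance, a balanced tree with 4 a-labelled nodes scores 3 ≥ log2(4).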
We then have the following lemma for trees with out-degree 2.

Lemma 1 (Minimum Scores)   Given a tree T containing m nodes labelled a, we have

        Score(T) ≥ log(m) .

Proof. The proof is by induction over m. In the base case m = 1 and there is a single node η in T labelled a. By definition, the subtree T′ rooted at η has Score(T′) = 1. Since the score of a tree is bounded from below by the score of any of its subtrees, we have Score(T) ≥ log(1) as required.

Now, assume m > 1. Find the smallest subtree T′ of T containing m nodes labelled a. We necessarily have either

  1. T′ = a[T1], or
  2. T′ = ε[T1, T2] where T1 and T2 each have at least one node each labelled a.

In case (1), the subtree T1 contains m − 1 nodes labelled a, and we have by induction

        Score(T′) = 1 + Score(T1) ≥ 1 + log(m − 1) ≥ log(m) .

In case (2) we have

        Score(T′) = max( Score(T1) + ⌊Score(T2)⌋ , Score(T2) + ⌊Score(T1)⌋ ) .

We pick whichever of T1 and T2 has the most nodes labelled a. This tree has at least ⌈m / 2⌉ nodes labelled a. Note, since both trees contain nodes labelled a, the right-hand side of the addition is always 1. Hence, we need to show

        log(⌈m / 2⌉) + 1 ≥ log(m)

which follows from

        log(m) − log(⌈m / 2⌉) = log( m / ⌈m / 2⌉ ) ≤ log( m / (m / 2) ) = log(2) = 1 .

By our choice of T′ we thus have Score(T) ≥ Score(T′) ≥ log(m) as required.


4.3  From Branches to Runs

Lemma 2 (Scores to Runs)   Given an accepting run ρ of P, if Score(Tree(ρ)) = m then a^m ∈ L(P−1).

Proof. Let pf be the final (accepting) control state of P and let T = Tree(ρ). We begin at the root node of T, which corresponds to the initial configuration of ρ. Let ⟨ p, [s]n ⟩ be this initial configuration and let ⟨ (p, pf), s ⟩ be the initial configuration of P−1.

Thus, assume we have a node η of T, with a corresponding configuration c = ⟨ p, s ⟩ of P and configuration c−1 = ⟨ (p, ppop), topn(s) ⟩ of P−1 and a run ρ−1 of P−1 ending in c−1 and outputting (m − Score(T′)) a characters where T′ is the subtree of T rooted at η. The subtree T′ corresponds to a sub-run ρ′ of ρ where the transition immediately following ρ′ is a popn transition to a control state ppop.

There are two cases when we are dealing with internal nodes.

Finally, we reach a leaf node η with a run outputting the required number of as. We need to show that the run constructed is accepting. Let η′ be the first ancestor of η that contains η in its leftmost subtree. Let T′ be the subtree rooted at η′. This tree corresponds to a sub-run ρ′ of ρ that is followed immediately by a popn rule (p, γ) —ε→ (ppop, popn). Moreover, we have ((p, ppop), γ, Att) —ε→ (f, rewγ) with which we can complete the run of P−1 as required.


4.4  The Other Direction

Finally, we need to show that each accepting run of P−1 gives rise to an accepting run of P containing at least as many as.

Lemma 3 (P−1 to P)   We have Diagonala(P−1) implies Diagonala(P).

Proof. Let pf be the unique accepting control state of P. Take an accepting run ρ−1 of P−1. We show that there exists a corresponding run ρ of P outputting at least as many as.

Let

    c0 —b1→ ⋯ —bm→ cm —ε→ ⟨ f, s ⟩

for some s be the accepting run of P−1. We define inductively for each 0 ≤ im a pair of runs ρ1i, ρ2i of P such that

  1. ρ2i ends in a configuration ⟨ pf, []n ⟩ (i.e. is accepting), and
  2. if ci= ⟨ (p, ppop), s ⟩ then
    1. the final configuration of ρ1i is ⟨ p, [s s1sl]n ⟩, for some s1, …, sl, and
    2. the first configuration of ρ2i is ⟨ ppop, [s1sl]n ⟩, and
  3. the sum of the number of a characters output by ρ1i and ρ2i is at least the number of a characters output by c0b1→ ⋯ —bici.

Initially we have c0 = ⟨ (pin, pf), s ⟩ and s = [[ γin ]]. We define ρ10 = ⟨ pin, [s]n ⟩ and ρ20 = ⟨ pf, []n ⟩ which immediately satisfy the required conditions.

Assume we have ρ1i and ρ2i as required. We show how to obtain ρ1i+1 and ρ2i+1. There are several cases depending on the rule used on the transition cibi+1ci+1. Let ci= ⟨ (p, ppop), s ⟩, the final configuration of ρ1i be ⟨ p, [s s1sl]n ⟩ and the first configuration of ρ2i be ⟨ ppop, [s1sl]n ⟩.

Finally, when we reach i = m we have from the final transition of the run of P−1 that there is a rule (p, γ) —ε→ (ppop, popn). We combine ρ1m and ρ2m with this pop transition, resulting in an accepting run of P that outputs at least as many a characters as the run of P−1.


5  Multiple Characters

We generalise the previous result to the full diagonal problem. A naïve application of the previous approach cannot work. Consider the HOPDA executing

    push1^m; pushn; pop1^m; popn; pop1^m

where the first sequence of pop1 operations outputs a1 and the second sequence outputs a2.

The corresponding run trees are of the form given in Figure 3. In particular, P−1 can only choose one branch, hence all runs of P−1 produce a bounded number of a1s or a bounded number of a2s. They cannot be simultaneously unbounded.


Figure 3: An example showing that following a single branch does not work for simultaneous unboundedness.

For P−1 to be able to output both an unbounded number of a1 and a2 characters, it must be able to output two branches of the tree. To this end, we define a notion of α-branch HOPDA, which output trees with up to α branches. We then show that the reduction from n-PDA to (n−1)-PDA can be generalised to α-branch HOPDA.

5.1  Branching HOPDA

We define n-PDA outputting trees with at most α branches, denoted (n, α)-PDA. Note, an n-PDA that outputs a word is an (n, 1)-PDA. Indeed, any (n, α)-PDA is also an (n, α′)-PDA whenever α ≤ α′.

Definition 1 ((n, α)-PDA)   We define an order-n α-branch pushdown automaton ((n, α)-PDA) to be given by a tuple P = (P, Σ, Γ, R, F, pin, γin, θ) where P, Σ, Γ, F, pin, and γin are as in HOPDA. The set of rules is R ⊆ ⋃1 ≤ m ≤ α P × Γ × Σ × Opsn × Pm, together with a mapping θ : P → {1, …, α} such that for all (p, γ, b, o, p1, …, pm) ∈ R we have θ(p) ≥ θ(p1) + ⋯ + θ(pm).

We use the notation (p, γ) —b→ (p1, …, pm, o) to denote a rule (p, γ, b, o, p1, …, pm) ∈ R. Intuitively, such a rule generates a node of a tree with m children. The purpose of the mapping θ is to bound the number of branches that this tree may have. Hence, at each branching rule, the quota of branches is split between the different subtrees. The existence of such a mapping implies this information is implicit in the control states and an (n, α)-PDA can only output trees with at most α branches.
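
The quota condition on θ is a simple arithmetic check over the rules. The following Python sketch validates it; the state names and the rule encoding are our own illustration, not notation from the paper:

```python
# Check the branch-quota condition of Definition 1: for every rule
# (p, gamma, b, o, p1, ..., pm) we need theta(p) >= theta(p1) + ... + theta(pm).
# Rule and state encodings below are illustrative only.

def respects_quota(rules, theta):
    """rules: iterable of (p, gamma, b, o, targets), targets a tuple of states."""
    return all(theta[p] >= sum(theta[q] for q in targets)
               for (p, gamma, b, o, targets) in rules)

# A toy 2-branch example: state "root" may split into "left" and "right".
theta = {"root": 2, "left": 1, "right": 1}
rules = [
    ("root", "g", "eps", "rew_g", ("left", "right")),  # branching rule
    ("left", "g", "a1", "pop1", ("left",)),            # non-branching rule
]
assert respects_quota(rules, theta)

# A quota violation is detected: a 1-quota state cannot split into two.
bad = [("left", "g", "eps", "rew_g", ("left", "right"))]
assert not respects_quota(bad, theta)
```

Since the quotas are additive along every split, an automaton passing this check can never grow a run tree with more than θ(pin) ≤ α branches.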

From the initial configuration c0 = ⟨ pin, [[ γin ]] ⟩ a run of an (n, α)-PDA is a tree T = (D, λ) whose nodes are labelled with n-PDA configurations, and generates an output tree T′ = (D, λ′) whose nodes are labelled with symbols from the output alphabet. Precisely

The run is accepting if for all leaf nodes η we have λ(η) = ⟨ p, []n ⟩ and pF. Let L(P) be the set of output trees of P.

Given an output tree T we write |T|a to denote the number of nodes labelled a in T. For an (n, α)-PDA P, we define

    Diagonala1, …, aα(P)  =  ∀ m . ∃ T ∈ L(P) . ∀ 1 ≤ i ≤ α . |T|ai ≥ m .

6  Reduction For Simultaneous Unboundedness

Given an (n, α)-PDA P we construct an (n−1, α)-PDA P−1 such that

    Diagonala1, …, aα(P)  ⇐⇒  Diagonala1, …, aα(P−1) .

Moreover, we show Diagonala1, …, aα(P) is decidable for a (0, α)-PDA P (i.e. a regular automaton outputting an α-branch tree).

For simplicity, we assume for all rules (p, γ) —b→ (p1, …, pm, o) that if m > 1 then o = rewγ (i.e. the stack is unchanged) and b = ε.

We also make analogous assumptions to the single character case. That is, we assume Σ = {a1, …, aα, ε} and use b to range over Σ. Moreover, all rules of the form (p, γ) —b→ (p′, o) with o = pushn or o = popn have b = ε. Finally, we assume acceptance is by reaching a unique control state in F with an empty stack.

6.1  Some Intuition

We briefly sketch the intuition behind the algorithm. We illustrate the reduction from (n, α)-PDA to (n−1, α)-PDA in Figure 4.


Figure 4: Illustrating the reduction steps.

6.2  Branching HOPDA with Regular Tests

As before, we instrument our HOPDA with tests. Removing these tests requires a simple adaptation of Broadbent et al. [8].

Definition 1 ((n, α)-PDA with Tests)   Given a sequence of automata A1, …, Am, an (n, α)-PDA with tests is given by a tuple P = (P, Σ, Γ, R, F, pin, γin, θ) where P, Σ, Γ, F, pin, γin are as in HOPDA. The set of rules R ⊆ ∪1 ≤ m ≤ α P × Γ × {A1, …, Am} × Σ × Opsn × Pm together with a mapping θ : P → {1, …, α} such that for all (p, γ, A, b, o, p1, …, pm) ∈ R we have θ(p) ≥ θ(p1) + ⋯ + θ(pm).

We use the notation (p, γ, A) —b→ (p1, …, pm, o) to denote a rule (p, γ, A, b, o, p1, …, pm) ∈ R.

From the initial configuration c0 = ⟨ pin, [[ γin ]] ⟩ a run of an (n, α)-PDA with tests is a tree T = (D, λ) and generates an output tree T′ = (D, λ′) where

The run is accepting if for all leaf nodes η we have λ(η) = ⟨ p, []n ⟩ and pF. Let L(P) be the set of output trees of P.

Theorem 1 (Removing Tests)   [8, Theorem 3 (adapted)] For every (n, α)-PDA with tests P, we can compute an (n, α)-PDA P′ with L(P) = L(P′).

Proof. This is a straightforward adaptation of Broadbent et al. [8]. Let the (n, α)-PDA with tests be P = (P, Σ, Γ, R, F, pin, γin, θ) with test automata A1, …, Am. We build an (n, α)-PDA that mimics P almost directly. The only difference is that each character γ appearing in the stack is replaced by

    (γ, τ1, …, τm) .

For each test A we have a vector of functions

    τ = (τ1, …, τn) .

The function τk : QQ intuitively describes runs of A from the bottom of topk+1(s) to the top of popk(topk+1(s)). Thus, we can reconstruct an entire run over pop1(s) from initial state q as

    q′ = τ1(⋯ τn(q) ⋯)

and then we can consult Δ to complete the run by adding the effect of reading top1(s).
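
The composition q′ = τ1(⋯ τn(q) ⋯) is ordinary function composition and can be sketched directly; the states and summary functions below are invented for illustration:

```python
# Reconstructing a run over pop1(s) by composing the summary functions
# tau_1, ..., tau_n (each a finite map Q -> Q), as in q' = tau_1(... tau_n(q) ...).
# The state set and the maps are made up for illustration.

def apply_taus(taus, q):
    """taus = [tau_1, ..., tau_n]; apply tau_n first and tau_1 last."""
    for tau in reversed(taus):
        q = tau[q]
    return q

tau1 = {"q0": "q1", "q1": "q2", "q2": "q0"}
tau2 = {"q0": "q0", "q1": "q0", "q2": "q1"}

# With n = 2: q' = tau1(tau2("q1")) = tau1("q0") = "q1".
assert apply_taus([tau1, tau2], "q1") == "q1"
```

Representing each τk as a finite map keeps the whole summary of a test automaton's behaviour over a higher-order stack in a single stack character, which is exactly what makes the rewriting of Γ into ΓT finite.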

Thus, let Ai = (Qi, Γ[], qini, Δi, QFi). We define

    PT = (P, Σ, ΓT, RT, F, pin, γinT, θ)

where

    ΓT = { (γ, τ1, …, τm) | γ ∈ Γ ∧ ∀ i . τi ∈ (Qi → Qi)n }


and RT is the smallest set of rules of the form

    
    (p, γT) —b→ (p1, …, pl, Update(o, γT))

where γT = (γ, τ1, …, τm) and (p, γ, Ai) —b→ (p1, …, pl, o) ∈ R and Accepts(γ, τi, Δi, qini, QFi) and we define

    
    Accepts(γ, (τ1, …, τn), Δ, qin, QF)  ⇐⇒  q = τ1(⋯ τn(qin) ⋯) ∧ Δ(q, [n ⋯ [1 γ) ∈ QF

where Δ(q, [n ⋯ [1 γ) is shorthand for the repeated application of Δ on γ then [1, up to [n, and we define Update(o, γT) = oT following the cases below. Let γT = (γ, τ1, …, τm).

Finally, we set γinT = (γin, τ1, …, τm) where for each i we have τi = (τ1, …, τn) such that for each k we have τk(q) = Δi(q, ]k ⋯ ]n).


6.3  Building The Automata

Previously we built automata Ap1, p2 to indicate that from p1, the current top stack could be removed, arriving at p2. This is fine for words, however, we now have α-branch trees. It is no longer enough to specify a single control state: the top stack may be popped once on each branch of the tree, hence for a control state p we need to recognise configurations with control state p from which there is a run tree where the leaves of the trees are labelled with configurations with control states p1, …, pm and empty stacks. Moreover we need to recognise the set O of characters output by the run tree. More precisely, for these automata we write

    Ap,p1,…,pmO

where θ(p) ≥ θ(p1) + ⋯ + θ(pm) and O ⊆ {a1, …, aα}. We have sL(Ap,p1,…,pmO) iff there is a run tree T with the root labelled ⟨ p, [s]n ⟩ and m leaf nodes labelled ⟨ p1, []n ⟩, …, ⟨ pm, []n ⟩ respectively. Moreover, we have aO iff the corresponding output tree T′ has |T′|a > 0.

6.3.1  Alternating HOPDA

To construct the required stack automata, we need to do reachability analysis of (n, α)-PDA. We show that such analyses can be rephrased in terms of alternating higher-order pushdown systems (HOPDS), for which the required algorithms are already known [7]. Note, we refer to these machines as “systems” rather than “automata” because they do not output a language.

Definition 2 (Alternating HOPDS)   An alternating order-n pushdown system is a tuple P = (P, Γ, R) where P is a finite set of control states, Γ is a finite stack alphabet, and
            R ⊆ (P × Γ × Opsn × P) ∪ (P × Γ × 2P)
is a set of transition rules.

We write (p, γ) → (p′, o) to denote (p, γ, o, p′) ∈ R and (p, γ) → p1, …, pm to denote (p, γ, {p1, …, pm}) ∈ R.

A run of an alternating HOPDS may split into several configurations, each of which must reach a target state. Hence, the branching of the alternating HOPDS mimics the branching of the (n, α)-PDA. Given a set C of configurations, we define PreP(C) to be the smallest set C′ such that

    
C′ = C ∪ { ⟨ p, s ⟩ | (p, γ) → (p′, o) ∈ R ∧ top1(s) = γ ∧ ⟨ p′, o(s) ⟩ ∈ C′ }
       ∪ { ⟨ p, s ⟩ | (p, γ) → p1, …, pm ∈ R ∧ top1(s) = γ ∧ ∀ i . ⟨ pi, s ⟩ ∈ C′ } .
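
For intuition, when the configuration space is small enough to enumerate, this least fixpoint can be computed directly by saturation; at higher orders one needs the automata-based algorithm of [7]. A Python sketch with illustrative encodings of configurations and rules:

```python
# Direct fixpoint computation of Pre_P(C) in the special case where the
# configuration space can be enumerated explicitly. Stacks are tuples of
# stack characters; rules follow the two shapes of Definition 2.
# All encodings here are illustrative, not from the paper.

def pre_star(configs, simple_rules, alt_rules, targets):
    """simple_rules: (p, gamma, op, p') with op a function on stacks;
       alt_rules: (p, gamma, successors) with successors a frozenset;
       targets: the initial set C of configurations (p, stack)."""
    cprime = set(targets)
    changed = True
    while changed:
        changed = False
        for (p, stack) in configs:
            if (p, stack) in cprime or not stack:
                continue
            gamma = stack[0]  # top1 of the stack
            # Shape 1: (p, gamma) -> (p', o) with <p', o(s)> already in C'.
            ok = any(g == gamma and (p2, op(stack)) in cprime
                     for (p1, g, op, p2) in simple_rules if p1 == p)
            # Shape 2: (p, gamma) -> p1, ..., pm with every <pi, s> in C'.
            ok = ok or any(g == gamma and all((q, stack) in cprime for q in succs)
                           for (p1, g, succs) in alt_rules if p1 == p)
            if ok:
                cprime.add((p, stack))
                changed = True
    return cprime

# Toy system: p splits into p1 and p2, each of which pops "g" to reach f.
pop1 = lambda s: s[1:]
simple = [("p1", "g", pop1, "f"), ("p2", "g", pop1, "f")]
alt = [("p", "g", frozenset({"p1", "p2"}))]
configs = {(q, s) for q in ("p", "p1", "p2", "f") for s in (("g",), ())}
result = pre_star(configs, simple, alt, targets={("f", ())})
assert ("p", ("g",)) in result  # both branches can empty the stack
```

The alternating rule is only fired once every successor configuration is already known to reach the target set, which is exactly the "each copy must reach a target state" semantics above.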

6.3.2  Constructing the Tests

In order to use standard results to obtain Ap,p1,…,pmO we construct an alternating HOPDS P and automaton A such that checking c ∈ PreP(A) for a suitably constructed c allows us to check whether sL(Ap,p1,…,pmO).

The alternating HOPDS P will mimic the branching of P with alternating transitions (p, γ) → p1, …, pm. It will maintain in its control states information about which characters have been output, as well as which control states should appear on the leaves of the branches. This final piece of information prevents all copies of the alternating HOPDS from verifying the same branch of P.

Definition 3 (P)   Given an (n, α)-PDA P described by the tuple (P, Σ, Γ, R, F, pin, γin), we define the alternating HOPDS

        P = (P, Γ, R)

where

        P = { (p, O, p1, …, pm) | 1 ≤ m ≤ α ∧ O ⊆ {a1, …, aα} ∧ p, p1, …, pm ∈ P }

and R is the set of rules containing, for each

        (p, γ) —b→ (p′, o) ∈ R

all rules

        ((p, O, p1, …, pi), γ) → ((p′, O ∖ {b}, p1, …, pi), o)

and for each

        (p, γ) —ε→ (p1, …, pm, rewγ) ∈ R

with m > 1 all alternating rules

        ((p, O, p1, …, pi), γ) → (p1, O1, p11, …, pi11), …, (pm, Om, p1m, …, pimm)

where p1, …, pi is a permutation of p11, …, pi11, …, p1m, …, pimm and O = O1 ∪ ⋯ ∪ Om.

In the above definition, the permutation condition ensures that the target control states are properly distributed amongst the newly created branches.
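
The two side conditions, that the leaf obligations are distributed as a permutation and that O = O1 ∪ ⋯ ∪ Om, can be checked mechanically. The sketch below uses our own illustrative encodings (lists for leaf obligations, sets for output obligations):

```python
# Checking the side conditions of the alternating rules of Definition 3:
# the leaf control states attached to the parent must be distributed among
# the children as a multiset (the permutation condition), and the children's
# output obligations must together cover O. Encodings are illustrative.

from collections import Counter

def valid_split(leaves, leaf_split, O, O_split):
    """leaves: leaf control states of the parent; leaf_split: one list per
    child; O: parent's output obligations; O_split: one set per child."""
    same_multiset = Counter(leaves) == Counter(q for part in leaf_split for q in part)
    covers = set(O) == set().union(*map(set, O_split))
    return same_multiset and covers

# Parent owes leaves [pA, pA, pB] and outputs {a1, a2}, split over two children.
assert valid_split(["pA", "pA", "pB"], [["pA"], ["pA", "pB"]],
                   {"a1", "a2"}, [{"a1"}, {"a2"}])
# Silently dropping a leaf obligation is rejected.
assert not valid_split(["pA", "pA", "pB"], [["pA"], ["pB"]],
                       {"a1", "a2"}, [{"a1"}, {"a2"}])
```

Comparing multisets rather than sets matters: the same control state may be owed by several leaves, and each occurrence must be claimed by exactly one child.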

Lemma 1   We have s ∈ L(Ap,p1,…,pmO) iff

        ⟨ (p, O, p1, …, pm), [s]n ⟩ ∈ PreP(A)

where A is such that

        L(A) = { ⟨ (p, ∅, p), []n ⟩ | p ∈ {p1, …, pm} } .

Proof. First take sL(Ap,p1,…,pmO) and the run tree witnessing this membership. We can move down the tree, maintaining a frontier c1, …, cl and building a tree witnessing that ⟨ (p, O, p1, …, pm), [s]n ⟩ ∈ PreP(A). Initially we have the frontier ⟨ p, [s]n ⟩ and the initial configuration ⟨ (p, O, p1, …, pm), [s]n ⟩.

Hence, take a configuration c = ⟨ p′, s′ ⟩ from the frontier and corresponding configuration c′ = ⟨ (p′, O′, p1, …, pi), s′ ⟩. If the rule applied to c is not a branching rule, we simply take the matching rule of P and apply it to c′. Note that if the rule outputs b we remove b from O′. Hence, O′ contains only characters that have not been output on the path from the initial configuration.

If the rule applied is branching, that is (p′, γ) —ε→ (p1, …, pj, rewγ), then we apply the rule

    ((p′, O′, p1, …, pi), γ) → (p1, O1, p11, …, pi11), …, (pj, Oj, p1j, …, pijj)

where p1, …, pi is a permutation of p11, …, pi11, …, p1j, …, pijj and O′ = O1 ∪ ⋯ ∪ Oj. These partitions are made in accordance with the distribution of the leaves and outputs of the run tree of P. I.e. if a control state p″ appears in the i′th subtree, then it should appear in the i′th target state of P. Similarly, if the i′th subtree outputs a b ∈ O′, then b should be placed in Oi′. Applying this alternating transition creates a matching configuration for each new branch in the frontier.

We continue in this way until we reach the leaf nodes of the frontier. Each leaf ⟨ p′, []n ⟩ has a matching ⟨ (p′, ∅, p′), []n ⟩ and hence is in L(A). Thus, we have witnessed ⟨ (p, O, p1, …, pm), [s]n ⟩ ∈ PreP(A) as required.

To prove the other direction, we mirror the previous argument, showing that the witnessing tree for P can be used to build a run tree of P.


It is known that PreP(A) is computable for alternating HOPDS.

Theorem 2   [7, Theorem 1 (specialised)] Given an alternating HOPDS P and a top-down automaton A, we can construct an automaton A′ accepting PreP(A).

Hence, we can now build Ap,p1,…,pmO from the control state p and top-down automaton representation of PreP(A) since we can effectively translate from top-down to bottom-up stack automata.

6.4  Reduction to Lower Orders

We generalise our reduction to (n, α)-PDA. Let Att be the automata accepting all configurations. Note, in the following definition we allow all transitions (including branching) to be labelled by sets of output characters. To maintain our assumed normal form we have to replace these transitions using intermediate control states to ensure all branching transitions are labelled by ε and all transitions labelled O are replaced by a sequence of transitions outputting a single instance of each character in O.
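
The replacement of O-labelled transitions by single-character transitions can be sketched as follows; the rule encoding and the state-naming helper `fresh` are hypothetical:

```python
# Restoring the assumed normal form: a transition labelled by a set O of
# output characters is replaced by a chain of intermediate control states,
# each emitting a single character, followed by the original (possibly
# branching) step labelled eps. Encodings are illustrative.

def expand_set_label(p, gamma, O, op, targets, fresh):
    """Return single-character rules replacing (p, gamma) --O--> (targets, op)."""
    rules, src = [], p
    for i, b in enumerate(sorted(O)):
        dst = fresh(p, i)                             # fresh intermediate state
        rules.append((src, gamma, b, "rew", (dst,)))  # emit one character of O
        src = dst
    rules.append((src, gamma, "eps", op, targets))    # original effect, eps-labelled
    return rules

fresh = lambda p, i: f"{p}#{i}"
rules = expand_set_label("p", "g", {"a2", "a1"}, "rew", ("q1", "q2"), fresh)
labels = [b for (_, _, b, _, _) in rules]
assert labels == ["a1", "a2", "eps"]  # one instance of each character, then eps
```

Only the final rule of the chain carries the branching targets, so the resulting automaton satisfies both normal-form assumptions: branching transitions are ε-labelled, and every non-ε transition outputs exactly one character.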

The construction follows the intuition of the single character case, but with a lot more bookkeeping. Given an (n, α)-PDA P we define an (n−1, α)-PDA with tests P−1 such that P satisfies the diagonal problem iff P−1 also satisfies the diagonal problem. The main control states of P−1 take the form

        (p, p1, …, pm, O, B)

where p, p1, …, pm are control states of P and both O and B are sets of output characters. We explain the purpose of each of these components.

We will define P−1 to generate up to m branches of the tree decomposition of a run of P. In particular, for each of the characters a ∈ {a1, …, aα} there will be a branch of the run of P−1 responsible for outputting “enough” of the character a to satisfy the diagonal problem. Note that two characters a and a′ may share the same branch. When a control state of the above form appears on a node of the run tree, the final component B makes explicit which characters the subtree rooted at that node is responsible for generating in large numbers. Thus, the initial control state will have B = {a1, …, aα} since all characters must be generated from this node. However, when the output tree branches – i.e. a node has more than one child – the contents of B will be partitioned amongst the children. That is, the responsibility of the parent to output enough of the characters in B is divided amongst its children.

The remaining components play the role of a test Ap,p1,…,pmO. That is, the current node is simulating the control state p of P, and is required to produce m branches, where the stack is emptied on each leaf and the control states appearing on these leaves are p1, …, pm. Moreover, the tree should output at least one of each character in O.

Note, P−1 also has (external) tests of the form Ap,p1,…,pmO that it can use to make decisions, just like in the single character case. However, it also performs tests “online” in its control states. This is necessary because the tests were used to check what could have happened on branches not followed by P−1. In the single character case, there was only one branch, hence P−1 would use tests to check all the branches not followed, and then continue down a single branch of the tree. In the multi-character case the situation is different. Suppose a subtree rooted at a given node was responsible for outputting enough of both a1 and a2. Amongst the possible children of this node we may select two children: one for outputting enough a1 characters, and one for outputting enough a2 characters. The alternatives not taken will be checked using tests as before. However, the child responsible for outputting a1 may have also wanted to run a test on the child responsible for outputting a2. Thus, as well as having to output enough a2 characters, this latter child will also have to run the test required by the former. Thus, we have to build these tests into the control state. As a sanity condition we enforce O ∩ B = ∅ since a branch outputting a should never ask itself if it is able to produce at least one a.

We explain the rules of P−1 intuitively. It will be beneficial to refer to the formal definition (below) while reading the explanations. The case for Rpush is illustrated in Figure 5 since it covers most of the situations appearing in the other rules as well.


Figure 5: Illustrating the rules in Rpush.

Before giving the formal definition, we summarise the discussion above by recalling the meaning of the various components. A control state (p, p1, …, pm, O, B) means we’re currently simulating a node at control state p that is required to produce m branches terminating in control states p1, …, pm respectively, that the produced tree should output at least one of each character in O and the entire subtree should output enough of each character in B to satisfy the diagonal problem. In the definition below, the set O′ is the set of new single character output obligations produced when the automaton decides which branches to follow faithfully and which to test (for the output of at least one of each character). The sets X1, …, Xi and Y1, …, Yj represent the partitioning of the single character output obligations amongst the tests and new branches.

The correctness of the reduction is stated after the definition. A discussion of the proof appears in Section 7.

Definition 4 (P−1)   Given an (n, α)-PDA P described by (P, Σ, Γ, R, {pf}, pin, γin, θ) and automata Ap,p1,…,pmO for all 1 ≤ m ≤ α, p, p1, …, pm ∈ P, and O ⊆ {a1, …, aα} we define an (n−1, α)-PDA with tests

        P−1 = (P−1, Σ, Γ, R−1, F−1, pin−1, γin, θ−1)

where P−1 is the set

        { (p, p1, …, pm, O, B) | 1 ≤ m ≤ α ∧ p, p1, …, pm ∈ P ∧ O, B ⊆ {a1, …, aα} ∧ O ∩ B = ∅ } ⊎ { pin−1, f }

and

        R−1 = Rinit ∪ Rsim ∪ Rbr ∪ Rfin ∪ Rpush ∪ Rpop
        F−1 = { f }

and θ−1((p, p1, …, pm, O, B)) = |B| and is 1 for all other control states. We define the sets of rules, where in all cases, p1, …, pm ∈ P and O, O′, B ⊆ {a1, …, aα}, to be as follows:

In Section 7 we show that the reduction is correct.

Lemma 2 (Correctness of P−1)  
            Diagonala1, …, aα(P)  ⇐⇒  Diagonala1, …, aα(P−1)
To complete the reduction, we convert the (n−1, α)-PDA with tests into an (n−1, α)-PDA without tests.

Lemma 3 (Reduction to Lower Orders)   For every (n, α)-PDA P we can build an order-(n−1) α-branch HOPDA P′ such that

        Diagonala1, …, aα(P)  ⇐⇒  Diagonala1, …, aα(P′) .

Proof. From Definition 4 (P−1) and Lemma 2 (Correctness of P−1), we obtain from P an (n−1, α)-PDA with tests P−1 satisfying the conditions of the lemma. To complete the proof, we invoke Theorem 1 (Removing Tests) to find P′ as required.


We show correctness of the reduction in Section 7. First we show that we have decidability once we have reduced to order-0.

6.5  Decidability at Order-0

We show that the problem becomes decidable for a (0, α)-PDA P. This is essentially a finite state machine and we can linearise the trees it generates by saving the list of states that have been branched to in the control state. After one branch has completed, we run the next in the list, until all branches have completed. Hence, a run tree of P becomes a run of the linearised 0-PDA, and vice-versa. Since each output tree has a bounded number of branches, the list length is bounded. Thus, we convert P into a finite state word automaton, for which the diagonal problem is decidable. Note, this result can also be obtained from the decidability of the diagonal problem for pushdown automata.
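
The linearisation idea can be sketched on plain trees: run each subtree to completion before its pending siblings, so every output character of the tree survives into the word. The tree encoding (label, children) is an illustration of ours, not the paper's notation:

```python
# Linearising a bounded-branching output tree into a word by keeping a list
# of pending subtrees, mirroring the order-0 construction. Since the tree
# has at most alpha branches, the pending list stays bounded.

def linearise(tree):
    """Depth-first, left-to-right listing of node labels: the word produced
    by the linearised automaton, preserving every character count."""
    out, pending = [], [tree]
    while pending:
        label, children = pending.pop(0)
        out.append(label)
        pending = list(children) + pending  # finish this subtree before siblings
    return out

# eps(a1(a1), a2(a2)) has two branches; linearised, it yields a single word.
tree = ("eps", [("a1", [("a1", [])]), ("a2", [("a2", [])])])
word = linearise(tree)
assert word.count("a1") == 2 and word.count("a2") == 2
```

Because linearisation only reorders the output, the number of occurrences of each ai is unchanged, which is all the diagonal problem cares about.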

Definition 5 (P′)   Given a (0, α)-PDA P described by the tuple (P, Σ, Γ, R, F, pin, γin, θ) we define a 0-PDA

        P′ = (P′, Σ, Γ, R′, F, pin, γin)

such that

        P′ = { (p, (p1, γ1), …, (pm, γm)) | p, p1, …, pm ∈ P ∧ γ1, …, γm ∈ Γ ∧ 0 ≤ m ≤ α }

and R′ is the set containing, for each

        (p, γ) —b→ (q1, …, ql, rewσ) ∈ R

all rules of the form

        ((p, (p1, γ1), …, (pm, γm)), γ) —b→ ((q1, (p1, γ1), …, (pm, γm), (q2, σ), …, (ql, σ)), rewσ)

and all rules

        ((p, (p1, γ1), …, (pm, γm)), γ) —ε→ ((p1, (p2, γ2), …, (pm, γm)), rewγ1)

whenever p ∈ F.

Lemma 4 (Decidability at Order-0)   We have

        Diagonala1, …, aα(P)  ⇐⇒  Diagonala1, …, aα(P′)

and hence Diagonala1, …, aα(P) is decidable.

Proof. Take an accepting run tree ρ of P. If this tree contains no branching, then it is straightforward to construct an accepting run of P′. Hence, assume all trees with fewer than α branches have a corresponding run of P′. At a subtree c[T1, …, Tm] we take the run trees ρ1, …, ρm corresponding to the subtrees. Let c = ⟨ p, γ ⟩ and c1 = ⟨ p1, γ ⟩, …, cm = ⟨ pm, γ ⟩ be the configurations at the roots of the subtrees. We build a run beginning at c and transitioning to ⟨ (p1, (p2, γ), …, (pm, γ)), γ ⟩. The run then follows ρ1 with the extra information in its control state. After ρ1 accepts, we transition to ⟨ (p2, (p3, γ), …, (pm, γ)), γ ⟩ and then replay ρ2. We repeat until all subtrees have been dispatched. This gives an accepting run of P′ outputting the same number of each a.

In the other direction, we replay the accepting run ρ of P′ until we reach a configuration ⟨ (p1, (p2, γ), …, (pm, γ)), γ ⟩ via a rule

        (p, σ) —ε→ ((p1, (p2, γ), …, (pm, γ)), rewγ)

At this point we apply

        (p, σ) —ε→ (p1, …, pm, rewγ)

of P. We obtain runs for each of the new children as follows. We split the remainder ρ′ of the run into m parts ρ′1, …, ρ′m where the break points correspond to each application of a rule of the second kind. For each i we replay the transitions of ρ′i from ⟨ pi, γ ⟩ to obtain a new run of P′ with fewer applications of the second rule. Inductively, we obtain an accepting run of P that we plug into the ith child. This gives us an accepting run of P outputting the same number of each a.


6.6  Decidability of The Diagonal Problem

We thus have the following theorem.

Theorem 3 (Decidability of the Diagonal Problem)   For an n-PDA P and output characters a1, …, aα, it is decidable whether Diagonala1, …, aα(P).

Proof. We first interpret P as an (n, α)-PDA and then construct via Lemma 3 (Reduction to Lower Orders) an (n−1, α)-PDA P′ such that Diagonala1, …, aα(P) iff Diagonala1, …, aα(P′). We repeat this step until we have a (0, α)-PDA. Then, from Lemma 4 (Decidability at Order-0) we obtain decidability as required.


7  Correctness for Simultaneous Unboundedness

In this section we prove Lemma 2 (Correctness of P−1). The proof follows the same outline as the single character case. To show there is a run of P−1 with at least m of each character, we set m′ = (α+1)^m and take a run of P outputting at least m′ of each character; by Lemma 1 (Section 7.2) its tree decomposition then has a score of at least m for each character. Then from Lemma 2 (Section 7.3) we obtain a run of P−1 outputting at least m of each character as required. The other direction is shown in Lemma 3 (Section 7.4).

We first generalise our tree decomposition and notion of scores. We then show that every α-branch subtree of a tree decomposition generates a run tree of P−1 matching the scores of the tree. Finally we prove the opposite direction.

7.1  Tree Decomposition of Output Trees

Given an output tree T of P where each pushn operation has a matching popn on all branches, we can construct a decomposed tree representation of the run inductively as follows. We define Tree(T[ε]) = T[ε] and, when

    T = b(T1, …, Tm)

where the rule applied at the root does not contain a pushn operation, we have

    Tree(T) = b(Tree(T1), …, Tree(Tm)) .

In the final case, suppose the rule applied at the root contains a pushn operation and the corresponding popn operations occur at nodes η1, …, ηm.

Note, if the output trees had an arbitrary number of branches, m may be unbounded. In our case, m ≤ α, without which our reduction would fail: P−1 would be unable to accurately count the number of popn nodes. In fact, our trees would have unbounded out degree and Lemma 1 (Minimum Scores) would not generalise.

Let T1, …, Tm be the output trees rooted at η1, …, ηm respectively and let T′ be T with these subtrees removed. Observe all branches of T are cut by this operation since the pushn must be matched on all branches. We define

    Tree(T) = ε(Tree(T′), Tree(T1), …, Tree(Tm)) .

An accepting run of P has an extra popn operation at the end of each branch leading to the empty stack. Let T′ be the tree obtained by removing the final popn-induced edge leading to the leaves of each branch. The tree decomposition of an accepting run is

    Tree(T) = ε(Tree(T′), T[ε], …, T[ε])

where there are as many T[ε] as there are leaves of T.

Notice that our trees have out-degree at most (α + 1).

7.2  Scoring Trees

We score branches in the same way as the single character case. We simply define Scorea(ρ) to be Score(ρ) when a is considered as the only output character (all others are replaced with ε).

We have to slightly modify our minimum score lemma to accommodate the increased out-degree of the nodes in the trees.

Lemma 1 (Minimum Scores)   Given a tree T with maximum out-degree (α + 1), containing, for each a ∈ {a1, …, aα}, at least m nodes labelled a, for each a ∈ {a1, …, aα} we have
        Scorea(T) ≥ log(α+1)(m) .
Proof. This is a simple extension of the proof of Lemma 1 (Minimum Scores). We simply replace the two-child case with a tree with up to (α+1) children. In this case, we have to use log(α+1) rather than log to maintain the lemma.


7.3  From Branches to Runs

Lemma 2 (Scores to Runs)   Given an accepting output tree ρ of P, if for all a ∈ {a1, …, aα} we have Scorea(Tree(ρ)) ≥ m, then TL(P−1) with |T|am for all a ∈ {a1, …, aα}.

Proof. We will construct a tree ρ−1 in L(P−1) top down. At each step we will maintain a “frontier” of ρ−1 and extend one leaf of this frontier until the whole tree is constructed. The frontier is of the form

    
    (c1, η1, O1, B1), …, (cl, ηl, Ol, Bl)

which means that there are l nodes in the frontier. We have B1 ⊎ ⋯ ⊎ Bl = {a1, …, aα} and each Bi indicates that the ith branch, ending in configuration ci, is responsible for outputting enough of each of the characters in Bi. Each ηi is the corresponding node in Tree(ρ) that is being tracked by the ith branch of the output of P−1.

Let pf be the final (accepting) control state of P and let T = Tree(ρ). We begin at the root node of T, which corresponds to the initial configuration of ρ. Let ⟨ p, [s]n ⟩ be this initial configuration and let c = ⟨ (p, pf, …, pf, ∅, {a1, …, aα}), s ⟩ be the configuration of P−1 after an application of a rule from Rinit. The initial frontier is (c, ε, ∅, {a1, …, aα}).

Thus, assume we have a frontier

    
    (c1, η1, O1, B1), …, (ch, ηh, Oh, Bh)

and for each of the tuples (c−1, η, O, B) of the frontier we have

  1. T′ is the subtree of T rooted at η, and
  2. c = ⟨ p, s ⟩ labelling η, and
  3. c−1 = ⟨ (p, p1, …, pm, O, B), topn(s) ⟩, and
  4. the node of ρ corresponding to η has m locations where the topn stack is first popped via rules reaching p1, …, pm, moreover, these leaves have corresponding leaves in T′, and
  5. the branch from the root of the constructed run to the node labelled c−1 in the frontier outputs, for each aB, at least (m − Scorea(T′)) occurrences of a, and
  6. OB = ∅ and for each aO there is at least one node labelled by a in T′.

Pick such a sequence c−1, η, O, B. We replace this sequence using a transition of P−1 in a way that produces a new frontier with the above properties and moves us a step closer to reaching leaves of T. There are three cases when we are dealing with internal nodes.

Finally, we reach a leaf node η with a run outputting the required number of as. We need to show that the run constructed is accepting. From the tree decomposition, we know that the corresponding node of ρ is immediately followed by a popn. Thus, from our conditions on the frontier, we must have m = 1 and O = ∅. We also have a rule (p, γ) —ε→ (p1, popn) and therefore ((p, p1, ∅, B), γ, Att) —ε→ (f, rewγ) with which we can complete the run of P−1 as required.


7.4  The Other Direction

Finally, we need to show that each accepting run tree of P−1 gives rise to an accepting run tree of P containing at least as many of each output character a.

Lemma 3 (P−1 to P)   We have Diagonala1, …, aα(P−1) implies Diagonala1, …, aα(P).

Proof. Take an accepting run tree ρ−1 of P−1. We show that there exists a corresponding run tree ρ of P outputting at least as many of each character a.

We maintain a frontier

    c1, …, ch

of ρ−1 and a run ρ of P “with holes” such that

Initially after a rule from Rinit we have the frontier c = ⟨ (p, pf, …, pf, ∅, {a1, …, aα}), s ⟩ with corresponding run ρ of P being

    
    ⟨ p, [s]n ⟩(c, ⟨ pf, []n ⟩, …, ⟨ pf, []n ⟩) .

Pick a configuration c−1 = ⟨ (p, p1, …, pm, O, B), topn(s) ⟩ of the frontier that is not a leaf of ρ−1 and its corresponding node in ρ with parent labelled c = ⟨ p, s ⟩. Let ρ′−1 be the subtree of ρ−1 rooted at this configuration.

We show how to extend the frontier closer to the leaves of ρ−1. There are several cases depending on the transition of P−1 used to exit our chosen node.

Thus, the frontier moves towards the leaves of the tree and finally is empty. At this point we have an accepting run of P as required. To see that the run outputs enough of each character, observe that at each stage the tests and the O component of the control state ensure at least one output of each character appearing in some O′ labelling a transition, while the characters output along the branches that were followed are reproduced faithfully.


8  Conclusions

We have shown, using a recent result by Zetzsche, that the downward closures of languages defined by HOPDA are computable. We believe this to be a useful foundational result upon which new analyses may be based. Our result already has several immediate consequences, including separation by piecewise testability and asynchronous parameterised systems.

Regarding the complexity of the approach, we are unaware of any complexity bounds implied by Zetzsche’s techniques. Due to the complexity of the reachability problem for HOPDA, the test automata may be a tower of exponentials of height n for HOPDA of order n. These test automata are built into the system before proceeding to reduce to order (n−1). Thus, we may reach a tower of exponentials of height O(n^2).

A natural next step is to consider collapsible pushdown systems, which are equivalent to recursion schemes (without the safety constraint). However, it is not currently clear how to generalise our techniques due to the non-local behaviour introduced by collapse. We may also try to adapt our techniques to a higher-order version of BS-automata [3], which may be used, e.g., to check boundedness of resource usage for higher-order programs.

Acknowledgements

We thank Georg Zetzsche for keeping us up to date with his work, Jason Crampton for knowing about logarithms when they were most required, and Chris Broadbent for discussions. This work was supported by the Engineering and Physical Sciences Research Council [EP/K009907/1 and EP/M023974/1].

References

[1]
K. Aehlig, J. G. de Miranda, and C.-H. L. Ong. Safety is not a restriction at level 2 for string languages. In Foundations of Software Science and Computational Structures, 8th International Conference, FOSSACS 2005, Held as Part of the Joint European Conferences on Theory and Practice of Software, ETAPS 2005, Edinburgh, UK, April 4-8, 2005, Proceedings, pages 490–504, 2005.
[2]
A. V. Aho. Indexed grammars - an extension of context-free grammars. J. ACM, 15(4):647–671, 1968.
[3]
M. Bojańczyk. Beyond omega-regular languages. In 27th International Symposium on Theoretical Aspects of Computer Science, STACS 2010, March 4-6, 2010, Nancy, France, pages 11–16, 2010.
[4]
A. Bouajjani, M. Müller-Olm, and T. Touili. Regular symbolic analysis of dynamic networks of pushdown systems. In CONCUR, pages 473–487, 2005.
[5]
C. H. Broadbent, A. Carayol, M. Hague, and O. Serre. C-shore: a collapsible approach to higher-order verification. In ICFP, pages 13–24, 2013.
[6]
C. H. Broadbent and N. Kobayashi. Saturation-based model checking of higher-order recursion schemes. In CSL, pages 129–148, 2013.
[7]
C. H. Broadbent, A. Carayol, M. Hague, and O. Serre. A saturation method for collapsible pushdown systems. In Automata, Languages, and Programming - 39th International Colloquium, ICALP 2012, Warwick, UK, July 9-13, 2012, Proceedings, Part II, pages 165–176, 2012.
[8]
C. H. Broadbent, A. Carayol, C.-H. L. Ong, and O. Serre. Recursion schemes and logical reflection. In Proceedings of the 25th Annual IEEE Symposium on Logic in Computer Science, LICS 2010, 11-14 July 2010, Edinburgh, United Kingdom, pages 120–129, 2010.
[9]
B. Courcelle. On constructing obstruction sets of words. Bulletin of the EATCS, 44:178–186, 1991.
[10]
A. Cyriac, P. Gastin, and K. N. Kumar. MSO decidability of multi-pushdown systems via split-width. In CONCUR, pages 547–561, 2012.
[11]
W. Czerwiński and W. Martens. A note on decidable separability by piecewise testable languages. CoRR, abs/1410.1042, 2014.
[12]
J. Esparza, A. Kucera, and S. Schwoon. Model checking LTL with regular valuations for pushdown systems. Inf. Comput., 186(2):355–376, 2003.
[13]
J. Esparza and P. Ganty. Complexity of pattern-based verification for multithreaded programs. In POPL, pages 499–510, 2011.
[14]
J. Esparza and A. Podelski. Efficient algorithms for pre* and post* on interprocedural parallel flow graphs. In POPL, pages 1–11, 2000.
[15]
M. Hague. Saturation of concurrent collapsible pushdown systems. In FSTTCS, pages 313–325, 2013.
[16]
M. Hague. Senescent ground tree rewrite systems. In Joint Meeting of the Twenty-Third EACSL Annual Conference on Computer Science Logic (CSL) and the Twenty-Ninth Annual ACM/IEEE Symposium on Logic in Computer Science (LICS), CSL-LICS ’14, Vienna, Austria, July 14 - 18, 2014, pages 48:1–48:10, 2014.
[17]
M. Hague and A. W. Lin. Synchronisation- and reversal-bounded analysis of multithreaded programs with counters. In Computer Aided Verification - 24th International Conference, CAV 2012, Berkeley, CA, USA, July 7-13, 2012 Proceedings, pages 260–276, 2012.
[18]
M. Hague, A. S. Murawski, C.-H. Luke Ong, and O. Serre. Collapsible pushdown automata and recursion schemes. In LICS, pages 452–461, 2008.
[19]
L. H. Haines. On free monoids partially ordered by embedding. J. Combinatorial Theory, 6:94–98, 1969.
[20]
V. Kahlon. Boundedness vs. unboundedness of lock chains: Characterizing decidability of pairwise CFL-reachability for threads communicating via locks. In LICS, pages 27–36, 2009.
[21]
T. Knapik, D. Niwinski, and P. Urzyczyn. Higher-order pushdown trees are easy. In FoSSaCS ’02: Proceedings of the 5th International Conference on Foundations of Software Science and Computation Structures, pages 205–222, London, UK, 2002. Springer-Verlag.
[22]
T. Knapik, D. Niwinski, P. Urzyczyn, and I. Walukiewicz. Unsafe grammars and panic automata. In ICALP, pages 1450–1461, 2005.
[23]
N. Kobayashi. Model-checking higher-order functions. In PPDP, pages 25–36, 2009.
[24]
N. Kobayashi. GTRecS2: A model checker for recursion schemes based on games and types. A tool available at http://www-kb.is.s.u-tokyo.ac.jp/~koba/gtrecs2/, 2012.
[25]
N. Kobayashi and A. Igarashi. Model-checking higher-order programs with recursive types. In ESOP, pages 431–450, 2013.
[26]
N. Kobayashi, R. Sato, and H. Unno. Predicate abstraction and cegar for higher-order model checking. In PLDI, pages 222–233, 2011.
[27]
A. Lal and T. W. Reps. Reducing concurrent analysis under a context bound to sequential analysis. Formal Methods in System Design, 35(1):73–97, 2009.
[28]
P. Madhusudan and G. Parlato. The tree width of auxiliary storage. In POPL, pages 283–294, 2011.
[29]
A. N. Maslov. Multilevel stack automata. Problems of Information Transmission, 15:1170–1174, 1976.
[30]
R. P. Neatherway, S. J. Ramsay, and C.-H. L. Ong. A traversal-based algorithm for higher-order model checking. In ICFP, pages 353–364, 2012.
[31]
P. Parys. On the significance of the collapse operation. In Proceedings of the 27th Annual IEEE Symposium on Logic in Computer Science, LICS 2012, Dubrovnik, Croatia, June 25-28, 2012, pages 521–530, 2012.
[32]
V. Penelle. Rewriting higher-order stack trees. In Computer Science - Theory and Applications - 10th International Computer Science Symposium in Russia, CSR 2015, Listvyanka, Russia, July 13-17, 2015, Proceedings, pages 364–397, 2015.
[33]
G. Ramalingam. Context-sensitive synchronization-sensitive analysis is undecidable. ACM Trans. Program. Lang. Syst., 22(2):416–430, 2000.
[34]
S. J. Ramsay, R. P. Neatherway, and C.-H. L. Ong. A type-directed abstraction refinement approach to higher-order model checking. In The 41st Annual ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages, POPL ’14, San Diego, CA, USA, January 20-21, 2014, pages 61–72, 2014.
[35]
A. Seth. Games on higher order multi-stack pushdown systems. In RP, pages 203–216, 2009.
[36]
S. La Torre, A. Muscholl, and I. Walukiewicz. Safety of parametrized asynchronous shared-memory systems is almost always decidable. In CONCUR, 2015. To appear.
[37]
S. La Torre and M. Napoli. Reachability of multistack pushdown systems with scope-bounded matching relations. In CONCUR, pages 203–218, 2011.
[38]
H. Unno, N. Tabuchi, and N. Kobayashi. Verification of tree-processing programs via higher-order model checking. In APLAS, 2010.
[39]
J. van Leeuwen. Effective constructions in well-partially-ordered free monoids. Discrete Mathematics, 21(3):237–252, 1978.
[40]
G. Zetzsche. An approach to computing downward closures. In Automata, Languages, and Programming - 42nd International Colloquium, ICALP 2015, Kyoto, Japan, July 6-10, 2015, Proceedings, Part II, pages 440–451, 2015.

Footnote 1: We slightly alter the alternation rule from ICALP 2012 [7] by matching the top-of-stack character as well as the control state. This is a benign alteration since one can track the top-of-stack character in the control state.