MinIR rewriting, operationally

3.5. MinIR rewriting, operationally

The previous section proposed to view minIR rewrites as the result of a (DPO or SqPO) graph transformation. This yields valid rewriting semantics elegantly (and with little effort!). However, the conditions that must be imposed on the transformation for it to be valid, along with the fact that pushouts may not exist, mean that the existence of a rewrite given a transformation rule and a match is not guaranteed.

In this section, we address this by considering a more restricted notion of minIR rewriting, for which the existence of the right-hand side of the rewrite is guaranteed. In addition, in place of the categorical presentation of the last section, we express the rewriting operation operationally, i.e. as data and a procedure on sets that translates directly into an algorithmic implementation.

We find that this rewrite definition is sufficient in practice. We conclude the section with an example of how more complex rewrites can be achieved by composition of simpler rewrites that can be expressed in this framework.

Graph glueings and rewrites

Throughout, we consider graph glueings on disjoint vertex and (hyper)edge sets. To underline this, we will use the $\sqcup$ symbol to denote disjoint set unions.
As we will be working exclusively with vertex and edge sets in this section (as opposed to the objects in the indexing category), we will drop the bold typeface for sets, writing e.g. $V$ instead of $\mathbf V$ for the set of vertices of a hypergraph.
Finally, all minIR graphs in this section are IO-free.

We define local graph rewrites in terms of graph glueings. Consider first the case of two arbitrary graphs $G_1 = (V_1, E_1)$ and $G_2 = (V_2, E_2)$ , along with a relation $\mu\ \subseteq V_1 \times V_2$ . Let $\sim_\mu \ \subseteq (V_1 \sqcup V_2)^2$ be the equivalence relation induced by $\mu$ , i.e. the smallest relation on $V_1 \sqcup V_2$ that is reflexive, symmetric and transitive, and satisfies for all $v_1 \in V_1$ and $v_2 \in V_2$ ,

(v_1, v_2) \in \mu \Rightarrow v_1 \sim_\mu v_2.

Then, we can define

$V = (V_1 \sqcup V_2)/\sim_\mu$ is the set of all equivalence classes of $\sim_\mu$ , and
for $v \in V_1 \sqcup V_2$ , $\alpha_\mu(v) \in V$ is the equivalence class of $\sim_\mu$ that $v$ belongs to.

Definition 3.8. Graph glueing

The glueing of $G_1$ and $G_2$ according to the glueing relation $\mu$ is given by the vertices $V = (V_1 \sqcup V_2)/\sim_\mu$ and the edges

E = \{(\alpha_\mu(u), \alpha_\mu(v)) \mid (u,v) \in E_1 \sqcup E_2 \} \subseteq V^2.

We write the glueing graph as $(G_1 \sqcup G_2) / \sim_\mu$ .

In other words, the glueing is the disjoint union of the two graphs, with identification (and merging) of vertices that are related in $\mu$ .
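The construction of Definition 3.8 can be sketched in a few lines of Python. This is only an illustrative sketch under stated assumptions, not an implementation taken from this work: the function name `glue`, the tags `1` and `2` used to keep the union disjoint, and the union-find encoding of $\sim_\mu$ are all hypothetical choices.

```python
def glue(g1, g2, mu):
    """Glue two graphs along a relation mu, given as a set of (v1, v2) pairs.

    Each graph is a pair (vertices, edges), with edges given as (u, v) pairs.
    Vertices are tagged with 1 or 2 so that the union is disjoint (assumption).
    """
    (v1, e1), (v2, e2) = g1, g2
    parent = {(1, v): (1, v) for v in v1}
    parent.update({(2, v): (2, v) for v in v2})

    def find(x):
        # Representative of the equivalence class of x (with path halving).
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    # Merging the classes of every related pair yields the smallest
    # equivalence relation containing mu.
    for a, b in mu:
        parent[find((1, a))] = find((2, b))

    # alpha maps every tagged vertex to its equivalence class.
    classes = {}
    for x in parent:
        classes.setdefault(find(x), set()).add(x)
    alpha = {x: frozenset(classes[find(x)]) for x in parent}

    vertices = set(alpha.values())
    edges = {(alpha[(1, u)], alpha[(1, v)]) for u, v in e1}
    edges |= {(alpha[(2, u)], alpha[(2, v)]) for u, v in e2}
    return vertices, edges, alpha


# Two single-edge graphs, glued by identifying "b" with "x".
g1 = ({"a", "b"}, {("a", "b")})
g2 = ({"x", "y"}, {("x", "y")})
vertices, edges, alpha = glue(g1, g2, {("b", "x")})
```

In this example the glued graph is a path on three vertices, since `"b"` and `"x"` end up in the same equivalence class. The returned dictionary `alpha` plays the role of $\alpha_\mu$.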

This allows us to define a rewrite on a graph $G$ :

Definition 3.9. Graph rewrite

A rewrite $r$ on a graph $G = (V, E)$ is given by a tuple $r = (G_R, V^-, E^-, \mu)$ , with

$G_R = (V_R, E_R)$ is a graph called the replacement graph,
$V^- \subseteq V$ is the vertex deletion set,
$E^- \subseteq E \cap dom(\mu)^2$ is the edge deletion set, and
$\mu: V^- \rightharpoonup V_R$ is the glueing relation, a partial function that maps a subset of the deleted vertices of $G$ to vertices in the replacement graph.

The domain of definition $dom(\mu)$ is known as the boundary values of $r$ .

A graph rewrite per this definition can always be generated by a single pushout (SPO) transformation [Löwe, 1991: Michael Löwe. 1991. Extended algebraic graph transformation. Retrieved from http://d-nb.info/910935696]:

define $L$ as the graph $(V^-, E^-)$ . Then the injection $L \subseteq G$ is the match morphism $L \to G$ ;
the partial map $\mu$ maps a subset of $V^-$ to vertices in the replacement $R = G_R$ . By injectivity of the match morphism, it also defines a partial map $L \rightharpoonup R$ .

We opted for SPO-like semantics in this definition, as they are the simplest to write in set-theoretic terms and coincide with DPO and SqPO in our restricted domain of interest.

The result of the rewrite is computed by glueing the right-hand side $G_R$ to the context subgraph $G_C = (V_C, E_C)$ of $G$ given by

\begin{aligned}V_C &= (V \smallsetminus V^-) \ \cup\ dom(\mu)\\E_C &= (E \smallsetminus E^-)\ \cap\ V_C^2.\end{aligned}

The partial function $\mu$ is a special case of a glueing relation $\mu \subseteq V_C \times V_R$ , and thus defines a glueing of $G_C$ with $G_R$ . The rewritten graph resulting from applying $r$ to $G$ is $r(G) = (G_C \sqcup G_R) / \sim_\mu.$
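As an illustration, the rewrite procedure can be written directly on sets. The sketch below is a hypothetical Python rendering of Definition 3.9 and the context subgraph above, not code from this work: the name `apply_rewrite` and the tags `"G"` and `"R"` are assumptions. Because $\mu$ is a partial function, the glueing reduces to renaming each boundary vertex to its image in $G_R$.

```python
def apply_rewrite(graph, replacement, v_del, e_del, mu):
    """Apply r = (G_R, V^-, E^-, mu) to G using the set-based definition.

    graph, replacement: pairs (vertices, edges), edges given as (u, v) pairs.
    v_del, e_del: the vertex and edge deletion sets.
    mu: dict from boundary vertices (a subset of v_del) to vertices of G_R.
    """
    vertices, edges = graph
    r_vertices, r_edges = replacement
    assert set(mu) <= v_del <= vertices
    assert set(mu.values()) <= r_vertices
    # E^- must be a subset of E, with both endpoints in dom(mu).
    assert all(e in edges and e[0] in mu and e[1] in mu for e in e_del)

    # Context subgraph G_C = (V_C, E_C).
    v_ctx = (vertices - v_del) | set(mu)
    e_ctx = {(u, v) for (u, v) in edges - e_del if u in v_ctx and v in v_ctx}

    def alpha(v):
        # Boundary vertices are identified with their image in G_R.
        return ("R", mu[v]) if v in mu else ("G", v)

    new_vertices = {alpha(v) for v in v_ctx} | {("R", w) for w in r_vertices}
    new_edges = {(alpha(u), alpha(v)) for u, v in e_ctx}
    new_edges |= {(("R", u), ("R", v)) for u, v in r_edges}
    return new_vertices, new_edges


# "b" and "c" are boundary vertices, "g" is deleted, "a" is untouched.
G = ({"a", "b", "c", "g"}, {("a", "b"), ("b", "c"), ("a", "g"), ("g", "c")})
G_R = ({"x", "y", "z"}, {("x", "z"), ("z", "y")})
result = apply_rewrite(G, G_R, {"b", "c", "g"}, {("b", "c")}, {"b": "x", "c": "y"})
```

In this toy instance the edge `("a", "b")` survives because `"b"` is a boundary vertex, whereas `("a", "g")` and `("g", "c")` disappear together with the deleted vertex `"g"`.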

An example of a graph rewrite is given in the next figure. This is equivalent to an SPO transformation with the graph induced by $V^-$ on the left-hand side, the graph $G_R$ on the right-hand side and the partial map $L \rightharpoonup R$ given by $\mu$ .

Application of a graph rewrite. On the left, the original graph $G$ along with the replacement graph $G_R$ (grey box). On the right, the rewritten graph $r(G)$ . Only the vertex $g$ has been deleted, as other vertices in $V^-$ are in the boundary $dom(\mu)$ (in orange). The (singleton) edge deletion set is red. The blue edge connects a vertex of $V \smallsetminus V^-$ to a boundary vertex, and is thus also present on the right-hand side. The purple edge, on the other hand, connects a vertex of $V \smallsetminus V^-$ to a non-boundary vertex of $V^-$ , and is thus deleted.

When there are no edges between $V \smallsetminus V^-$ and $V^- \smallsetminus dom(\mu)$ (purple in the example above), this definition corresponds to graph rewrites that can be produced using DPO transformations (see discussion in section 3.4). Otherwise, such edges are deleted.

The notions of graph glueing and graph rewrite can straightforwardly be lifted to hypergraphs and, by extension, to minIR graphs. Notice that in this case, values are glued together, not operations (the former were defined as the graph’s vertices, the latter as its hyperedges).

However, the glueing of two valid minIR graphs – and the result of applying a valid rewrite – may not be a valid minIR graph. Glueing two values of a linear type, for instance, is a sure way to introduce multiple uses (or definitions) of it. Thus, we must be careful to only consider glueings and rewrites of minIR graphs that preserve all the constraints we have imposed in Definition 3.4.

Ensuring rewrite validity: interfaces

As a sufficient condition for valid minIR rewrites, we introduce minIR interfaces, a concept closely related to the “hypergraph with interfaces” construction of [Bonchi, 2017: Filippo Bonchi, Fabio Gadducci, Aleks Kissinger, Paweł Sobociński and Fabio Zanasi. 2017. Confluence of Graph Rewriting with Interfaces] or the supermaps of quantum causality [Hefford, 2024: James Hefford and Matt Wilson. 2024. A Profunctorial Semantics for Quantum Supermaps. In Proceedings of the 39th Annual ACM/IEEE Symposium on Logic in Computer Science, July 2024. ACM, 1--15. doi: 10.1145/3661814.3662123]. We eschew the presentation of holes as a slice category in favour of a definition that fits naturally within minIR and is sufficient for our purposes.

Let $G$ be a $\Sigma$ -typed minIR graph with data types $T$ and linear types $T_L \subseteq T$ . Consider type strings $S, S' \in T^\ast$ . We define the index sets

\begin{aligned}\mathrm{Idx}(S) &= \{i \in \mathbb{N} \mid 1 \leq i \leq |S|\}\\\mathrm{Idx}_L(S) &= \{i \in \mathrm{Idx}(S) \mid S_i \in T_L\} \subseteq \mathrm{Idx}(S)\end{aligned}

corresponding respectively to the set of all indices into $S$ and the subset of indices of linear types. For any $i \in \mathrm{Idx}(S)$ , we denote by $S_i$ the type at position $i$ in $S$ .

We define a partial order $\preccurlyeq$ ¹ on $T^\ast$ : we write $S \preccurlyeq S'$ , and say that $S'$ can be coerced into $S$ , if there exists an index map $\rho: \mathrm{Idx}(S) \to \mathrm{Idx}(S')$ such that

types are preserved: $S_i = S'_{\rho(i)}$ for all $i \in \mathrm{Idx}(S)$ , and
$\rho$ is well-defined and bijective on the restriction to indices of linear types
$\left.\rho\right|_{\mathrm{Idx}_L(S)}: \mathrm{Idx}_L(S) \to \mathrm{Idx}_L(S').$
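The two conditions on $\rho$ are easy to check mechanically. The following Python sketch is a hypothetical helper (the name `is_coercion`, the dictionary encoding of $\rho$ with 1-based indices, and the type names in the example are assumptions made for illustration) that tests whether a given index map witnesses $S \preccurlyeq S'$.

```python
def is_coercion(s, s_prime, rho, linear_types):
    """Check whether rho witnesses s ≼ s_prime.

    s, s_prime: sequences of type names.
    rho: dict from 1-based indices of s to 1-based indices of s_prime.
    linear_types: the set of type names that are linear.
    """
    idx = range(1, len(s) + 1)
    idx_prime = range(1, len(s_prime) + 1)

    # rho must be a total map Idx(S) -> Idx(S').
    if set(rho) != set(idx) or any(rho[i] not in idx_prime for i in idx):
        return False

    # Types are preserved: S_i = S'_{rho(i)}.
    if any(s[i - 1] != s_prime[rho[i] - 1] for i in idx):
        return False

    # Restricted to linear indices, rho must hit every linear index of S'
    # exactly once, i.e. be a bijection Idx_L(S) -> Idx_L(S').
    image = sorted(rho[i] for i in idx if s[i - 1] in linear_types)
    linear_prime = [j for j in idx_prime if s_prime[j - 1] in linear_types]
    return image == linear_prime


linear = {"qubit"}
# The non-linear "bool" of S' may be targeted twice (a copy) ...
copy_ok = is_coercion(["qubit", "bool", "bool"], ["bool", "qubit"],
                      {1: 2, 2: 1, 3: 1}, linear)
# ... but a linear "qubit" may not.
copy_linear = is_coercion(["qubit", "qubit"], ["qubit"], {1: 1, 2: 1}, linear)
```

Under these assumptions `copy_ok` holds and `copy_linear` does not. Note that the function only verifies a candidate $\rho$; deciding whether some suitable $\rho$ exists would require a search over index maps.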

Definition 3.10. Interface

Let $T$ be a set of data types. An interface $I = (U, D)$ is a pair of type strings $U, D \in T^\ast$ .

We say that an interface $I' = (U', D')$ can be coerced into an interface $I = (U, D)$ , written $I \triangleleft I'$ , if $U \succcurlyeq U'$ and $D \preccurlyeq D'$ .

We can define the interface associated with an operation $o$ in a minIR graph $G$ by considering the values used and defined by $o$ . Calling $\tau$ the type morphism on $G$ and assuming $o \in E_{st}$ to be an operation in $G$ with $s$ inputs and $t$ outputs, we define the interface of $o$ in $G$ as the pair of strings in $T^\ast$

I(o) = (\tau(\textit{src}_1(o))\cdots\tau(\textit{src}_s(o)), \tau(\mathit{tgt}_1(o))\cdots\tau(\mathit{tgt}_t(o))).

Similarly, we can assign interfaces to subgraphs of minIR graphs:

Definition 3.11. MinIR subgraph

Consider a subset of values and operations $V_H \subseteq V$ and $E_H \subseteq E$ . Define the use and define boundary sets

\begin{aligned} B_U &= \{v \in V_H \mid \mathit{def}\,(v) \in E \smallsetminus E_H \},\\B_D &= \{v \in V_H \mid use(v) \in E \smallsetminus E_H \}.\end{aligned}

The tuple $H = (V_H, E_H)$ of $G$ is called a minIR subgraph of $G$ if there exists a region $R$ of $G$ such that all boundary values of $H$ are in $R$ :

B= B_U \cup B_D \subseteq R.

We write $H \subseteq G$ to indicate that $H$ is a minIR subgraph of $G$ .

Note that $B_U$ is exactly the set of inputs $I$ in the non-IO free minIR graph given by the subgraph $(V_H, E_H)$ of the minIR graph. $B_D$ is a superset of the outputs $O$ of $H$ : it includes all linear values in $H$ that do not have a use in $H$ , but also any non-linear value that has a use outside of $H$ .
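The boundary sets can be computed in a single pass over the operations outside $E_H$. The sketch below is illustrative only: the function name `boundary_sets` and the encoding of a minIR graph as a dictionary from operation names to (source list, target list) pairs are assumptions, and $use(v) \in E \smallsetminus E_H$ is read as "some use of $v$ lies outside $E_H$". The region condition of Definition 3.11 is not checked here.

```python
def boundary_sets(ops, v_h, e_h):
    """Compute the boundary sets (B_U, B_D) of the subgraph (v_h, e_h).

    ops: dict mapping each operation name to a pair (src, tgt) of value lists.
    v_h: set of values of the subgraph; e_h: set of its operation names.
    """
    outside = {o: io for o, io in ops.items() if o not in e_h}
    # Values of H that are defined by an operation outside of H.
    b_u = {v for _, tgt in outside.values() for v in tgt if v in v_h}
    # Values of H that are used by an operation outside of H.
    b_d = {v for src, _ in outside.values() for v in src if v in v_h}
    return b_u, b_d


# A chain a -> b -> c -> d of operations passing along values x, y, z.
ops = {
    "a": ([], ["x"]),
    "b": (["x"], ["y"]),
    "c": (["y"], ["z"]),
    "d": (["z"], []),
}
b_u, b_d = boundary_sets(ops, v_h={"x", "y", "z"}, e_h={"b", "c"})
```

For this chain, `b_u` contains only `"x"` (defined by `"a"`, outside the subgraph) and `b_d` contains only `"z"` (used by `"d"`), while the internal value `"y"` is in neither set.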

Unlike interfaces, subgraph boundary values are not ordered. An ordering of $B \subseteq V$ is a string $S \in V^\ast$ along with a bijective map

\mathrm{ord}: B \to \mathrm{Idx}(S) \quad\textrm{such that}\quad v = S_{\mathrm{ord}(v)}.

If there are strings $S_U, S_D \in V^\ast$ and orderings of $B_U$ and $B_D$

\begin{aligned}\textrm{ord}_U:\ &B_U \to \mathrm{Idx}(S_U)&\quad\textrm{ord}_D:\ &B_D \to \mathrm{Idx}(S_D),\end{aligned}

then we can set $\textit{src}_i\,(H) = (S_U)_i$ and $\textit{tgt}_i\,(H) = (S_D)_i$ in complete analogy to operations. We will write $\textit{src}(H)$ and $\textit{tgt}(H)$ for the strings $\textit{src}_1(H)\cdots\textit{src}_{|S_U|}(H)$ and $\textit{tgt}_1(H)\cdots\textit{tgt}_{|S_D|}(H)$ respectively. We say that the subgraph $H$ implements the interface

I_H = (\tau(\textit{src}(H)), \tau(\mathit{tgt}(H))),

where the type morphism $\tau$ was extended element-wise to strings $V^\ast$ .

Remark, though, that unlike operations, the same subgraph may implement more than one interface as a result of various choices of orderings $\textrm{ord}_U$ and $\textrm{ord}_D$ .

As mentioned, the subgraph $H$ forms a non-IO-free minIR graph. We can always construct an IO-free minIR graph from $H$ by adding two operations $o_{in} \in E_{0, |S_U|}$ and $o_{out} \in E_{|S_D|, 0}$ in the root region, i.e. $o_{in}$ has no inputs and $|S_U|$ outputs, whereas $o_{out}$ has $|S_D|$ inputs and no outputs. They are defined by

\textit{tgt}_i\,(o_{in}) = \textit{src}_i(H),\quad\quad \textit{src}_i(o_{out}) = \textit{tgt}_i\,(H).

We call the resulting graph $\bar{H}$ an interface graph. It implements the interface $I_H$ if $H$ implements $I_H$ . Calling to mind the illustrations of section 3.3, $\bar{H}$ looks like one of the nested regions within regiondef operations that we were considering.

MinIR operation rewrite

Consider

an operation $o$ in a minIR graph $G$ with values $V,$
an interface graph $\bar{H}$ with values $V_H$ and its associated subgraph $H \subseteq \bar{H}$ , such that $H$ implements an interface $I(o) \triangleleft I_H,$
the index maps $\rho: \mathrm{Idx}(\textit{src}(H)) \to \mathrm{Idx}(\textit{src}(o))$ and $\sigma: \mathrm{Idx}(\textit{tgt}\,(o)) \to \mathrm{Idx}(\textit{tgt}\,(H))$ that define the coercion $I(o) \triangleleft I_H$ (per Definition 3.10).

We can define a glueing relation $\mu_o \subseteq V \times V_H$

\begin{aligned}\mu_o =\ & \{ \left(\textit{src}_{\rho(i)}(o), \textit{src}_{i}(H)\right) \mid i \in \mathrm{Idx}(\textit{src}(H)) \}\ \cup \\& \{ \left(\mathit{tgt}_{i}\,(o), \mathit{tgt}_{\sigma(i)}(H)\right) \mid i \in \mathrm{Idx}(\textit{tgt}\,(o)) \}.\end{aligned}

This is almost enough to define a rewrite that replaces the operation $o$ in $G$ with the values and operations of $H$ – the interface compatibility constraint $I(o) \triangleleft I_H$ that we have imposed ensures that the resulting minIR graph is valid. Unfortunately, $\mu_o$ is in general not a partial function, as required by Definition 3.9.
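A small example makes the failure concrete. The Python sketch below builds $\mu_o$ from the source and target strings and the index maps; the names `glueing_relation` and `is_partial_function`, the 1-based dictionaries for $\rho$ and $\sigma$, and the value names are hypothetical and chosen only for illustration.

```python
def glueing_relation(op_src, op_tgt, h_src, h_tgt, rho, sigma):
    """Build mu_o as a set of (value of G, value of H) pairs.

    op_src, op_tgt: the strings src(o) and tgt(o), as lists of values.
    h_src, h_tgt: the strings src(H) and tgt(H), as lists of values.
    rho: 1-based index map Idx(src(H)) -> Idx(src(o)).
    sigma: 1-based index map Idx(tgt(o)) -> Idx(tgt(H)).
    """
    mu = {(op_src[rho[i] - 1], h_src[i - 1]) for i in range(1, len(h_src) + 1)}
    mu |= {(op_tgt[i - 1], h_tgt[sigma[i] - 1]) for i in range(1, len(op_tgt) + 1)}
    return mu


def is_partial_function(relation):
    """True if no first component is related to two distinct second components."""
    return len({v for v, _ in relation}) == len(relation)


# The operation o uses a single non-linear value "a", which H takes as two
# separate inputs "x1" and "x2": rho is not injective.
mu_o = glueing_relation(
    op_src=["a"], op_tgt=["b"],
    h_src=["x1", "x2"], h_tgt=["y"],
    rho={1: 1, 2: 1}, sigma={1: 1},
)
```

Here `mu_o` relates `"a"` to both `"x1"` and `"x2"`, so `is_partial_function(mu_o)` is false even though the interfaces are compatible.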

This is resolved in the following proposition:

Proposition 3.5. MinIR operation rewrite

Let $G$ , $o$ and $H$ such that $I(o) \triangleleft I_H$ , as defined above. Then

\big((G \sqcup H) / \sim_{\mu_o}\!\big) \smallsetminus \{o\},

i.e. the graph obtained by removing the operation $o$ from the glueing of $G$ and $H$ along $\mu_o$ , is a valid minIR graph.

There is a graph $G_R$ with values $V_R$ and a partial function $\mu_o': V \rightharpoonup V_R$ such that the graph (5) is the graph $r_o(G)$ , obtained from the rewrite

r_o = (G_R, dom(\mu_o), \{o\}, \mu_o').

We call $r_o$ the rewrite of $o$ into $H$ .

The definition of the rewrite of $o$ into a graph $H$ behaves as one would expect – the only subtleties relate to handling non-linear (i.e. copyable) values at the boundary of the rewrite. The following example illustrates some of these edge cases.

Rewriting operation $o$ in the graph $G$ (top left) into the operations $o_1$ and $o_2$ of the graph $\bar{H}$ (bottom left). Coloured dots indicate the index maps $\rho$ and $\sigma$ from inputs $B_U$ of $\bar{H}$ to inputs of $o$ , respectively from outputs of $o$ to outputs $B_D$ of $\bar{H}$ .

When the index maps $\rho$ and $\sigma$ are not injective (yellow and green dots), values are merged, resulting in multiple uses of the value (i.e. copies). This is why the index maps must be injective on linear values (dots in shades of blue). Value merging also happens when a value is used multiple times in $o$ (yellow and red dots). This will never happen with linear values (as they can never have more than one use in $o$ ), nor with any value definitions (the same value can never be defined more than once). Finally, values not in the image of $\rho$ or $\sigma$ (purple dot) are discarded. This case is also excluded for linear values by requiring surjectivity.

Proof

We start this proof with the explicit construction of $G_R$ and $\mu_o'$ . Define $\sim_R \subseteq (V_H)^2$ as the smallest equivalence relation such that

\textit{src}_{\rho(i)}(o) = \textit{src}_{\rho(j)}(o) \Rightarrow \textit{tgt}_i\,(o_{in}) \sim_R \textit{tgt}_j\,(o_{in}).

Then we define $\bar{G}_R = \bar{H} / \sim_R$ , the graph obtained by glueing together values within the same equivalence class of $\sim_R$ .

Claim 1: $\bar{G}_R$ is a valid minIR graph.

Claim 1 follows from the observation that only values of non-linear types are glued together. If $v \sim_R v'$ , then either $v = v'$ or there exist $i \neq j$ such that

\textit{tgt}_i\,(o_{in}) \sim_R \textit{tgt}_j\,(o_{in}).

If $\rho(i) = \rho(j)$ , then $\rho$ is not injective on $i$ and $j$ , and by the definition of $\rho$ , $\tau(v)\notin T_L$ and $\tau(v') \notin T_L$ . Otherwise, there are $i' = \rho(i) \neq \rho(j) = j'$ such that $\textit{src}_{i'}(o) = \textit{src}_{j'}(o)$ . The same value is used twice, which is only valid in a minIR graph if $v$ and $v'$ are not linear, thus proving Claim 1.

Define $G_R$ as the subgraph obtained from $\bar{G}_R$ by removing the operations $\{o_{in}, o_{out}\}$ . Let $V_R = V_H / \sim_R$ be the set of values of $G_R$ (and of $\bar{G}_R$ ). Writing $\alpha_R(v) \in V_R$ for the equivalence class of $\sim_R$ that $v \in V_H$ belongs to, we can define $\mu_o' \subseteq V \times V_R$ as:

(v, w) \in \mu_o \Leftrightarrow (v, \alpha_R(w)) \in \mu_o'.

Claim 2: $\mu_o'$ is a partial function $V \rightharpoonup V_R$ .

In other words, for all $(v, \alpha_1), (v, \alpha_2) \in \mu_o'$ , we have $\alpha_1 = \alpha_2$ . Let $w_1 \in \alpha_1$ and $w_2 \in \alpha_2$ be values in $V_H$ such that $(v, w_1), (v, w_2) \in \mu_o$ . First of all, $\textit{src}_i(o) \neq \textit{tgt}_j\,(o)$ for all $i, j$ , otherwise $G$ is not acyclic. So either $use(v) = o$ , or $\textit{def}(v) = o$ , but not both.

The simpler case: if $\textit{def}\,(v) = o$ , then there exists $i$ such that $\textit{tgt}_i\,(o) = v$ . Furthermore $i$ is unique because by minIR definition, $v$ has a unique definition in $G$ . It follows from (4) that $w_1 = \textit{src}_{\sigma(i)}(o_{out}) = w_2$ and hence $\alpha_1 = \alpha_2$ .

Otherwise, there exist $i$ and $j$ such that $v = \textit{src}_{\rho(i)}(o) = \textit{src}_{\rho(j)}(o)$ and $\textit{tgt}_i\,(o_{in}) = w_1$ as well as $\textit{tgt}_j\,(o_{in}) = w_2$ . By definition of $\sim_R$ , we have $w_1 \sim_R w_2$ , and thus

\alpha_1 = \alpha_R(w_1) = \alpha_R(w_2) = \alpha_2,

proving Claim 2.
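The construction of $\sim_R$ and $\mu_o'$ can be replayed on a small instance. The following Python sketch is a hypothetical illustration (the names `quotient_glueing` and `is_partial_function` are assumptions, and the entries of `h_src` are assumed pairwise distinct, as guaranteed by a bijective boundary ordering): it merges the inputs of $H$ that are glued to the same value of $o$, and pushes $\mu_o$ through the resulting quotient.

```python
def quotient_glueing(op_src, h_src, rho, mu):
    """Return mu_o' as a set of (value of G, equivalence class of ~_R) pairs.

    op_src: the string src(o); h_src: the string src(H), i.e. tgt(o_in).
    rho: 1-based index map Idx(src(H)) -> Idx(src(o)).
    mu: the relation mu_o, as a set of (value of G, value of H) pairs.
    """
    # Inputs i, j of H are merged whenever src_{rho(i)}(o) = src_{rho(j)}(o).
    groups = {}
    for i, w in enumerate(h_src, start=1):
        groups.setdefault(op_src[rho[i] - 1], set()).add(w)
    merged = {w: frozenset(cls) for cls in groups.values() for w in cls}

    def alpha_r(w):
        # Values of H that are not inputs stay in singleton classes.
        return merged.get(w, frozenset({w}))

    return {(v, alpha_r(w)) for v, w in mu}


def is_partial_function(relation):
    """True if no first component is related to two distinct second components."""
    return len({v for v, _ in relation}) == len(relation)


# mu_o relates "a" to both inputs "x1" and "x2" of H: not a partial function.
mu_o = {("a", "x1"), ("a", "x2"), ("b", "y")}
mu_o_prime = quotient_glueing(["a"], ["x1", "x2"], {1: 1, 2: 1}, mu_o)
```

In this instance `"x1"` and `"x2"` collapse into one class, so `mu_o_prime` relates `"a"` to a single element of $V_R$ and passes the `is_partial_function` check that `mu_o` fails.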

Claim 3: $r_o(G)$ is given by $((G \sqcup H) / \sim_{\mu_o}) \smallsetminus \{o\}$ .

It follows directly from our construction of $\sim_R$ and $\mu_o'$ that the equivalence classes of (the smallest equivalence relation closure of) $\mu_o' \circ \alpha_R$ are equal to the equivalence classes of (the smallest equivalence relation closure of) $\mu_o$ . The claim follows by Definition 3.8 and the definition of $r_o$ .

And finally, Claim 4: $r_o(G)$ is a valid minIR graph.

Per Definition 3.4, we must check four properties: (i) every value is defined exactly once, (ii) every linear value is used exactly once, (iii) the graph is acyclic, and (iv) every region has (at most) one parent.

(iii) follows from the fact that $G$ and $H$ are acyclic and a single operation $o$ in $G$ is replaced: any cycle across $G$ and $H$ would also be a cycle in $G$ by replacing the subpath in $H$ with $o$ . (iv) follows from the fact that $o_{in}$ and $o_{out}$ are in the root region of $\bar{H}$ , by definition of interface implementation. (i): removing $o$ from $G$ removes the unique definitions of all values that are targets of $o$ . Each such value $v$ is glued to a unique value $\mathit{src}_i\,(o_{out})$ in $H$ – the new and unique definition of $v$ in $r_o(G).$ (ii) follows from the same argument as in (i), but relying on injectivity of $\rho$ on linear values to establish uniqueness.

Arbitrary minIR rewrites

We have so far defined rewrites of single operations into graphs $H$ . We can generalise these rewrites to rewrite subgraphs $P \subseteq G$ , provided the minIR subgraphs satisfy some constraints. We require for this a notion of convexity, as discussed in [Bonchi, 2022: Filippo Bonchi, Fabio Gadducci, Aleks Kissinger, Pawel Sobocinski and Fabio Zanasi. 2022. String diagram rewrite theory II: Rewriting with symmetric monoidal structure. Mathematical Structures in Computer Science 32, 4 (April 2022), 511--541. doi: 10.1017/s0960129522000317].

As usual, let us consider a minIR graph $G$ with values $V$ , linear values $V_L \subseteq V$ , edges $E$ , the incidence maps $\textit{src}_i$ and $\textit{tgt}_j$ as well as their inverses $\textit{use}$ and $\textit{def}$ . Consider further a subgraph of $G$ that we will now call $P = (V_P, E_P) \subseteq G$ , to distinguish from $H$ .

Let us further define the partial $parent$ morphism that maps a value $v \in V$ to the parent of the region of $v$ .

Definition 3.12. Convex minIR subgraph

A minIR subgraph $P \subseteq G$ is convex if the following conditions hold:

for all $v_1, v_2 \in V_P$ , any path along $\leadsto$ from $v_1$ to $v_2$ contains only vertices in $V_P$ ,
parent-child relations are contained within the subgraph, i.e.
$v \in V_P \cap dom(\mathit{parent}) \Leftrightarrow \mathit{parent}(v) \in V_P.$
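The first convexity condition can be tested with two reachability searches. The Python sketch below is a hypothetical illustration that covers only the path condition, not the parent-child condition; the names `reachable` and `is_path_convex`, and the encoding of $\leadsto$ as a successor dictionary on values, are assumptions. A subgraph fails the condition exactly when some value outside $V_P$ is both reachable from $V_P$ and able to reach $V_P$.

```python
from collections import deque


def reachable(adjacency, sources):
    """Vertices reachable from `sources` in one or more steps."""
    seen, queue = set(), deque(sources)
    while queue:
        u = queue.popleft()
        for w in adjacency.get(u, ()):
            if w not in seen:
                seen.add(w)
                queue.append(w)
    return seen


def is_path_convex(succ, v_p):
    """Check that every path between two values of v_p stays inside v_p.

    succ: dict mapping a value to the set of its direct successors.
    """
    pred = {}
    for u, ws in succ.items():
        for w in ws:
            pred.setdefault(w, set()).add(u)
    below = reachable(succ, v_p) - v_p  # outside values reachable from v_p
    above = reachable(pred, v_p) - v_p  # outside values that reach v_p
    return not (below & above)


# A chain a ~> b ~> c: {a, c} is not convex since the path passes through b.
succ = {"a": {"b"}, "b": {"c"}}
convex = is_path_convex(succ, {"a", "b"})
not_convex = is_path_convex(succ, {"a", "c"})
```

The intersection `below & above` also identifies the values that would need to be added (or outlined) to make the subgraph convex, which is the convex hull used later in this section.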

Define the sets of boundary values $B_U, B_D$ and $B = B_U \cup B_D$ , as in (2); then fix the boundary orderings $\textit{src}(P)$ and $\textit{tgt}\,(P)$ as in (3). The subgraph $P$ implements the interface

I_P = (\tau(\textit{src}(P)), \tau(\mathit{tgt}\,(P))).

Consider an interface graph $\bar{H}$ that implements $I_H$ such that $I_P \triangleleft I_H$ . Instead of defining a glueing relation from values of an operation $o$ to values of $H$ , we replace the interface $I(o)$ with $I_P$ . This generalises the definition of $\mu_o$ from (4) to a glueing $\mu\subseteq B \times V_H$ defined as

\begin{aligned}\mu =\ & \{ \left(\textit{src}_{\rho(i)}(P), \textit{src}_{i}\,(H)\right) \mid i \in \mathrm{Idx}(\textit{src}(H)) \}\ \cup \\& \{ \left(\textit{tgt}_{i}\,(P), \textit{tgt}_{\sigma(i)}\,(H)\right) \mid i \in \mathrm{Idx}(\textit{tgt}\,(P)) \}.\end{aligned}

With the set of boundary operations defined as²

E_B = \left\{o \in E_P \mid \left(\mathit{tgt}\,(o) \cup \textit{src}(o)\right) \subseteq B\right\},

we are able to define minIR rewrites in their most general form.

Proposition 3.6. MinIR subgraph rewrite

Let $P \subseteq G$ and $H$ such that $I_P \triangleleft I_H$ and $P$ is convex, as defined above. Then,

\big((G \sqcup H) / \sim_{\mu}\!\big) \smallsetminus (V_P \smallsetminus B, E_B),

i.e. the graph obtained by removing the values $V_P \smallsetminus B$ and operations $E_B$ from the glueing of $G$ and $H$ along $\mu$ , is a valid minIR graph.

There is a graph $G_R$ with values $V_R$ and a partial function $\mu': V_P \rightharpoonup V_R$ such that the graph (8) is the graph $r_P(G)$ , obtained from the rewrite

r_P = (G_R, V_P, E_B, \mu').

We call $r_P$ the rewrite of $P$ into $H$ .

Proof

Consider an operation $o$ that implements $I_P = (U_P, D_P)$ . We can define the interface graph $\bar{H}_o$ given by three operations $o_{in}$ , $o_{out}$ and $o$ . Its associated subgraph $H_o \subseteq \bar{H}_o$ only includes $o$ . Let $\tilde \mu$ be the glueing relation

\begin{aligned}\tilde \mu =\ &\{ (\textit{src}_i(P), \textit{src}_i(o)) \mid i \in \mathrm{Idx}(U_P) \}\ \cup \\& \{ (\textit{tgt}_i\,(P), \textit{tgt}_i\,(o)) \mid i \in \mathrm{Idx}(D_P) \}.\end{aligned}

Consider the rewrite $r = (H_o, V_P \smallsetminus B, E_B, \tilde\mu)$ . If we write $G' = (V', E')$ for the subgraph of $G$ given by

\begin{aligned}V' &= (V \smallsetminus (V_P \smallsetminus B)) \\E' &= (E \smallsetminus E_B) \cap (V')^\ast,\end{aligned}

then according to (1), the graph resulting from applying $r$ to $G$ can be expressed as the glueing

G_o = r(G) = (G' \sqcup H_o) / \sim_{\tilde \mu}.

Our claim is that $G_o$ is a valid minIR graph.

The graph (8) is then obtained by applying the rewrite $r_o$ as given by (6) to $G_o$ . Defining the rewrite $r_P$ as the composition of $r$ followed by $r_o$ , the result follows from our claim and Proposition 3.5.

We now prove the claim, by showing the four properties of minIR graphs as per Definition 3.4. Property i) requires showing that every value is defined exactly once. As $G'$ is obtained by removing values and operations from a valid minIR graph $G$ , no value in $V'$ can be defined more than once. A value $v \in V'$ that is not defined in $G'$ must be in the boundary $v \in B$ of $P$ . By the boundary definitions of (2), $v$ cannot be in $B_U$ and thus must be in $B_D$ . It follows by the definition of the glueing $\tilde \mu$ that in $G_o$ , $v$ will be in the definitions of $o$ : $\textit{def}(v) = o$ . The glueing $\tilde \mu$ is bijective between the values of $P$ and $o$ and thus we can conclude that $v$ has a unique definition in $G_o$ .

The same argument applies to property ii). Property iii) follows from the convexity requirement of $P$ . Finally, property iv) (every region has at most one parent) follows from two observations. First, by convexity of $P$ , no deleted value or operation could be the parent of any value not in $P$ , and thus the $parent$ relation is well-defined on $G'$ : $im(parent) \subseteq E'$ . Secondly, all new values and operations added to the boundary region of $G'$ are from the root region of $H$ , and thus do not have a parent, ensuring that parent uniqueness is preserved.

This simple and limited graph transformation framework captures a remarkably large set of minIR program transformations. It may seem at first that the restriction to boundary values within a single region of Definition 3.11, as well as the convexity requirements of Definition 3.12, represent significant limitations on the expressivity of the rewrites. In practice, however, the semantics of minIR operations can be used to decompose more complex rewrites into a sequence of simple rewrites to which Proposition 3.6 applies.

Consider minIR graphs with a type system that includes regiondef and call operations as discussed in examples of the previous section – respectively defining a code block by a nested region and redirecting control flow to a code block defined using a regiondef. Then all constraints that we impose on rewriting can be effectively side-stepped using the region outlining and value hoisting transformations.

Region outlining moves a valid minIR subgraph into its own separate region, and replaces the hole left by the subgraph in the computation by a call operation to the newly outlined region.

Value hoisting moves a value definition within a region to its parent region and passes the value down to the nested region through an additional input. In case of linear values, we can similarly hoist the unique use of the value to the parent region.

Using these transformations, non-convex subgraphs can always be made convex by taking the convex hull and outlining any parts within it that are not part of the subgraph. Outlined regions can then be passed as additional inputs to the subgraph. Step 1 of the figure below illustrates this transformation. Similarly, a subgraph that includes operations without their parent can be extended to cover the entire region and its parent, outlining any parts of the region that are not part of the subgraph.

Finally, whenever a boundary value $v$ belongs to a region that is not the top level region of the subgraph³, we can repeatedly hoist $v$ to its parent region until it is in the top level region. The value is then recursively passed as an argument to descendant regions until it reaches the region in which it is required. Subgraphs can thus always be transformed to only have input and output boundary values at the top level region. Step 2 of the figure below illustrates this transformation.

A non-convex minIR graph rewrite, obtained by decomposition into valid convex rewrites, using outlining and hoisting. For simplicity, `regiondef` operations were made implicit and represented by nested boxes: a region within an operation corresponds to a region definition that is passed as an argument to the operation. Edge colours correspond to value types. Step 1 *outlines* the `...` operations into a dedicated region, which step 2 *hoists* outside of the region being rewritten. Steps 3 and 4 together correspond to a minIR subgraph rewrite. They have been split into two steps following the proof strategy. Step 4 is an instance of a minIR operation rewrite.

¹ To be precise, $\preccurlyeq$ is a partial order on the type strings up to isomorphism.
² The set operations $\subseteq$ and $\cup$ are again understood to apply to the unordered set of elements contained in the lists $\mathit{tgt}\,(o)$ and $\mathit{src}\,(o)$ .
³ We can always extend a subgraph to contain more ancestor regions, until there is indeed a unique top-level region in the subgraph.