Local k-Folklore GNNs Beyond Strong Nesting: A k-Almost-Strong Nested Ear Decomposition Characterization

Metadata

Total Words: 10,082
Export Date: 2026-01-18 03:43:38
Description: Recent work introduced homomorphism expressivity as a complete, quantitative framework for GNN expressiveness and characterized several practical architectures via nested ear decompositions (NEDs), including a sharp description of Local 2-FGNN through an "almost-strong" NED. However, the higher-order Local k-FGNN case remained open: while Local k-GNN admits a k-order strong-NED characterization and k-FGNN corresponds to bounded treewidth, no simple structural family was known for Local k-FGNN. We resolve this gap. We define a new decomposition notion—k-almost-strong nested ear decomposition—generalizing almost-strong NED from k = 2 to all k ≥ 2. We prove that the Local k-FGNN homomorphism expressivity exists and equals exactly the class of graphs admitting a k-almost-strong NED, yielding tight (if-and-only-if) upper and lower bounds. As corollaries, we establish strict expressiveness separations ℱ_L(k) ⊊ ℱ_LF(k) ⊊ ℱ_F(k) and provide systematic counterexample constructions via generalized (rooted) F"urer gadgets. We also outline practical implications for motif/subgraph counting through spasm closure, highlighting where Local k-FGNN achieves k-FWL-like counting power at substantially lower computational cost.

1. Introduction and Motivation: situate homomorphism expressivity; explain the missing Local k-FGNN characterization; state contributions and strict hierarchy results.
2. Preliminaries: graphs/homomorphisms/spasm; refinement-based GNNs; Local k-GNN vs Local k-FGNN vs k-FGNN; canonical tree decompositions; prior NED variants (strong, almost-strong) at k = 2.
3. Model and Expressivity Notions: formal definition of Local k-FGNN as a CR algorithm; define homomorphism expressivity ℱ_LF(k) and rooted variants; relate to subgraph counting via spasm closure.
4. New Decomposition: k-Almost-Strong NED: define k-order ears and nested intervals; introduce the degeneracy criterion and the relaxed nesting constraint; provide examples separating strong vs almost-strong vs treewidth-k.
5. Upper Bound (Completeness): Local k-FGNN representations are determined by homomorphism counts of k-almost-strong patterns: unfolding construction, bag-isomorphism counts, and linear-algebra factorization.
6. Equivalence Theorem: k-almost-strong NED ⇔ a canonical restricted width-k tree decomposition class 𝒮_LF(k): two-way transformations with tight invariants.
7. Lower Bound (Maximality): for any F not admitting k-almost-strong NED, construct indistinguishable graphs G, H for Local k-FGNN but with hom(F, G) ≠ hom(F, H) using generalized pebble games and F"urer-type gadgets.
8. Strict Separations and Hierarchies: explicit pattern families witnessing ℱ_L(k) ⊊ ℱ_LF(k) ⊊ ℱ_F(k); optional node/edge-level corollaries.
9. Algorithmic Consequences: membership testing for k-almost-strong NED for fixed k (FPT/linear-time given a width-k decomposition); implications for subgraph counting via spasm inclusion.
10. Experiments (optional but strengthening): motif counting tasks for k = 3, 4 comparing Local k-GNN, Local k-FGNN, and k-FGNN; demonstrate predicted separations and near-equalities on moderate motifs.
11. Discussion and Open Problems: learned/sparse attention analogues; approximate expressivity; size of minimal separating motifs; connections to guarded logics and descriptive complexity.

Content

1. Introduction and Motivation: situate homomorphism expressivity; explain the missing Local k-FGNN characterization; state contributions and strict hierarchy results.

A central approach to formalizing the representational power of message-passing and refinement-based graph neural networks is to ask which graph parameters are determined by their stable representations. In the refinement setting, the most robust such parameters are homomorphism counts: for a fixed pattern graph F, the quantity hom(F, G) is invariant under isomorphism of G, is stable under disjoint union and tensor-type constructions, and admits a tight connection to Weisfeiler–Leman (WL) style color refinement through the classical ``homomorphisms determine refinement’’ paradigm. Consequently, rather than treating a refinement architecture as a black box, we can characterize it by the maximal family of patterns whose homomorphism counts it necessarily preserves.

In this work we focus on the refinement on ordered k-tuples, which may be viewed as a locality-guarded folklore generalization: the color of a tuple u = (u₁, …, u_k) is updated by aggregating the multiset of colors of tuples obtained by replacing one coordinate by a vertex w that lies in the guarded neighborhood $\bigcup_{i=1}^k N_G[u_i]$. This update rule interpolates between purely local k-tuple refinement (as in Local k-GNN) and full folklore k-tuple refinement (as in k-FWL/k-FGNN), but it is not captured by existing decomposition characterizations. The motivating question is therefore:

For k = 2, related architectures have been analyzed via nested ear decompositions (NEDs) and restricted tree decompositions that encode how local aggregation limits the way substructures can overlap. However, the passage from k = 2 to general k is not formal: one must simultaneously manage the combinatorics of width-k decompositions, the tuple-coordinates that the refinement can ``move’’ in a single step, and the constraints imposed by the neighborhood guard. A direct treewidth bound is too weak (full folklore k-WL already reaches all patterns of treewidth at most k), while the previously known strong NED notions for local k-GNN are too restrictive for Local k-FGNN, which can propagate information along controlled overlaps not available to strictly local k-tuple refinement.

Our first contribution is a structural description of precisely the additional overlap that the guarded folklore update enables. We introduce a width-k decomposition notion, phrased as a , in which the edge set of a pattern graph is partitioned into k-order ears arranged in a nesting forest. The key constraint concerns siblings attached to the same parent ear: their nested intervals must be laminar (disjoint or nested) in a sharply delimited degeneracy case, where an ear contributes at most one edge along each branch path of the parent and hence behaves, from the perspective of the guarded aggregation, like a locally resolvable perturbation. This ``degenerate-overlap permitted’’ rule is the only relaxation we allow beyond the strong (laminar) condition, and it is exactly calibrated to the locality of the update rule on k-tuples.

Our second contribution is to connect this ear-based definition to a canonical width-k tree decomposition viewpoint. We define a restricted class of decompositions with alternating bag sizes k and k + 1, together with a locality-guarded constraint on the child structure of odd-depth bags. This constraint is the decomposition-level avatar of the degeneracy rule above: it specifies when multiple children may be attached to a bag without creating non-local interactions that Local k-FGNN cannot distinguish. We provide constructive transformations in both directions between these canonical decompositions and k-almost-strong NEDs, thereby giving two equivalent, usable representations of the same pattern family.

Our third contribution is the homomorphism expressivity theorem for Local k-FGNN. We show that the stable representation computed by the Local k-FGNN refinement determines exactly the homomorphism counts of patterns admitting a k-almost-strong NED, and conversely that no strictly larger family can be guaranteed. The proof proceeds by an unfolding argument: we associate to each graph a depth-bounded unfolding object that is itself a canonical width-k decomposition in our restricted class, show that stable color equality is equivalent to isomorphism of these unfoldings, and then express representation statistics as linear transforms of bag-isomorphism homomorphism counts. This yields an explicit route from the refinement representation to hom(F, ⋅) for all patterns F in our family.

The maximality direction requires an explicit obstruction principle. We prove that whenever a connected pattern fails to admit a k-almost-strong NED, we can construct a pair of graphs G(F), H(F) (generalized twisted F"urer-type gadgets) that are indistinguishable by Local k-FGNN yet separated by hom(F, ⋅). Technically, this is shown by identifying a pebble-game characterization of the guarded folklore update and using it to guide the gadget construction; the twist parity argument then yields the homomorphism separation. This establishes that our pattern family is not merely sufficient but for homomorphism expressivity.

Finally, we locate Local k-FGNN within the refinement hierarchy. We exhibit explicit pattern families witnessing strict containments between the local k-GNN family (captured by strong k-order NEDs), the Local k-FGNN family (captured by k-almost-strong NEDs), and the full folklore family (captured by treewidth ≤ k). Intuitively, Local k-FGNN gains power by tolerating degenerate overlaps that local k-tuple refinement cannot propagate across, but it remains strictly below k-FWL because non-degenerate overlaps can force global interactions that the neighborhood-guarded aggregation cannot simulate.

In addition to the structural results, our characterizations yield an algorithmic corollary: for fixed k, membership of a pattern F in the Local k-FGNN family can be recognized in fixed-parameter time by first obtaining a width-k decomposition and then checking the restricted canonical constraints, with a subsequent conversion into a k-almost-strong NED serving as a certificate. This aligns with the intended regime of constant-order architectures, and it provides a concrete tool for designing and validating pattern families tailored to Local k-FGNN representations.

All graphs are finite, undirected, simple, and vertex-labeled unless stated otherwise. We write G = (V_G, E_G, ℓ_G), where ℓ_G ranges over an arbitrary (but fixed) label alphabet and label preservation is always required. For graphs F, G, a map f : V_F → V_G is a (graph) homomorphism if it preserves labels and edges: for every uv ∈ E_F we have f(u)f(v) ∈ E_G, and for every u ∈ V_F we have ℓ_F(u) = ℓ_G(f(u)). We denote by Hom(F, G) the set of such maps and by hom(F, G) = |Hom(F, G)| the corresponding homomorphism count. Throughout, homomorphisms are allowed to be non-injective; in particular, hom(F, G) subsumes subgraph and induced-subgraph counts via standard inclusion–exclusion transforms once one passes to suitable quotient patterns (reviewed next).

A basic mechanism by which homomorphism counts encode substructure statistics is the closure of a pattern under vertex identifications. Given a graph F and a partition π of V_F, we form a quotient pattern F/π by contracting each block of π to a single vertex, inheriting labels in the evident way (if labels disagree within a block, the quotient is discarded), and adding an edge between two distinct blocks if there is at least one edge in F between the corresponding vertex sets. Depending on π, one may create loops; loops are irrelevant for our target graphs (which remain simple), but they are convenient for bookkeeping in homomorphism identities. The Spasm(F) is the finite set of quotients F/π up to isomorphism. The standard Möbius inversion on the partition lattice yields that counts of injective homomorphisms (and hence subgraph occurrences) of F can be expressed as an integer linear combination of hom(F^′, ⋅) over F^′ ∈ Spasm(F), and conversely hom(F, ⋅) is an integer linear combination of injective counts over quotients. For our purposes, it suffices to remember that any homomorphism-preserving representation automatically preserves a large class of subgraph-counting parameters, up to spasm closure.

We work in the color refinement (CR) idealization: node and tuple features are discrete colors updated by perfect hashing of their previous values and aggregated multisets. A k-tuple refinement procedure maintains a color map χ^(t) : V_G^k → 𝒞 on ordered k-tuples, initialized by the isomorphism type of the induced labeled substructure on the tuple coordinates (including equalities among coordinates). Each round produces χ^(t + 1)(u) by combining χ^(t)(u) with a multiset of colors of neighboring tuples obtained by altering one coordinate. Stabilization yields a canonical stable coloring χ, and a graph-level representation is obtained by taking the multiset of stable colors over all k-tuples (or over a prescribed root set in rooted variants). In the CR model, two graphs are indistinguishable by an architecture if and only if the corresponding stable multisets coincide.

We recall the three refinement operators whose homomorphism families we compare.

k-tuple refinement (also called k-FWL and equivalent, at the level of graph invariants, to k-FGNN) aggregates over all replacements of a single coordinate by an arbitrary vertex w ∈ V_G. This is the most expressive of the three and is known to capture precisely homomorphism counts of patterns of treewidth at most k.
(the strong NED model) restricts aggregation to a purely local move: w must lie in the closed neighborhood of the coordinate being replaced, i.e., w ∈ N_G[u_i] when updating the ith coordinate. This enforces a strict locality that prevents information flow across nontrivial overlaps of distant attachments in a decomposition.
is intermediate: we still replace only one coordinate at a time, but we may choose w from the $\bigcup_{i=1}^k N_G[u_i]$, regardless of which coordinate is replaced. Thus the update can ``teleport’’ a replacement along any coordinate’s neighborhood as long as it remains within the guarded region of the tuple, enabling controlled interactions that are unavailable to Local k-GNN but remain weaker than the global replacements of full folklore refinement. The precise update rule is stated in the next section; here we only emphasize that the distinction between (ii) and (iii) is exactly the source of the additional overlap permitted in our pattern characterizations.

A tree decomposition of a graph F is a pair (T, β) where T is a tree and β : V(T) → 2^V_F assigns a bag to each node, satisfying the usual vertex-coverage, edge-coverage, and connectedness conditions. The width is max_{t ∈ V(T)}|β(t)|−1, and tw(F) is the minimum width. For fixed k, we may assume decompositions are in a canonical alternating form: after standard transformations one can enforce that bags have size exactly k or k + 1, that the root bag has size k, and that adjacent bags differ by exactly one vertex. We will use such canonical forms as a convenient interface between tuple refinement dynamics and combinatorial structure: odd-depth bags of size k + 1 will serve as ``attachment events’’ (ears), while even-depth bags of size k serve as separators supporting the guarded locality constraints.

For k = 2, several refinement architectures admit clean characterizations via nested ear decompositions (NEDs). Informally, an NED partitions E_F into a sequence of ears attached along endpoints to previously constructed parts, together with a nesting forest recording which ear is attached within which predecessor. The variant (corresponding to strictly local refinement) requires that, for siblings attached to the same parent ear, their attachment intervals along the parent are laminar: two intervals must be disjoint or one contained in the other. This laminarity encodes that local propagation cannot resolve simultaneous ``crossing’’ attachments. The relaxation introduced for guarded folklore-type locality at k = 2 allows a sharply delimited non-laminar overlap when one of the overlapping ears is in the sense that it contributes at most one edge along each relevant branch of the parent, and hence behaves like a locally resolvable perturbation under the guarded update. Our contribution for general k is to lift this strong-versus-degenerate overlap dichotomy from k = 2 to width-k decompositions in a way that is compatible with tuple-coordinate moves and the neighborhood guard. The next section makes the model explicit and formalizes the corresponding homomorphism expressivity families.

3. Model and Expressivity Notions: formal definition of Local k-FGNN as a CR algorithm; define homomorphism expressivity ℱ_LF(k) and rooted variants; relate to subgraph counting via spasm closure.

Fix k ≥ 2 and a graph G = (V_G, E_G, ℓ_G). Local k-FGNN operates on ordered tuples u = (u₁, …, u_k) ∈ V_G^k via color refinement in the CR idealization. As usual, the initial color χ⁽⁰⁾(u) encodes the complete isomorphism type of the labeled induced substructure on the coordinates of u, including all equalities u_i = u_j and adjacencies u_iu_j ∈ E_G for i ≠ j. In particular, χ⁽⁰⁾ is invariant under relabeling of V_G and under the diagonal action of Aut(G) on V_G^k.

The defining feature of Local k-FGNN is the choice of replacement vertices. For a tuple u we set the guarded region
$$ \Gamma_G(\mathbf{u}) \;:=\; \bigcup_{i=1}^k N_G[u_i], $$
and for i ∈ [k] and w ∈ V_G we write
u^{(i ← w)} := (u₁, …, u_i − 1, w, u_i + 1, …, u_k).
One refinement round replaces χ^(t) by χ^(t + 1) where

Here Hash denotes a perfect injective encoding of its arguments, and the multiset in is taken over w with multiplicities. The update is permutation-equivariant in the sense that permuting tuple coordinates permutes the k-tuple inside the multiset; this is the CR abstraction of using a shared MLP on coordinates and a multiset aggregator. We iterate until stabilization, obtaining a stable coloring χ = χ^(∞).

For comparison only (we will not use this operationally), note that Local k-GNN corresponds to the stricter update where, when forming χ^(t)(u^{(i ← w)}), one restricts to w ∈ N_G[u_i] depending on the replaced coordinate, whereas full folklore refinement (k-FWL/k-FGNN) uses w ∈ V_G.

The unrooted graph representation induced by Local k-FGNN is the multiset
$$ \Phi_{k}^{\mathrm{LF}}(G)\;:=\;\multiset{\chi(\mathbf{u}) : \mathbf{u}\in V_G^k}, $$
i.e., the histogram of stable colors over all ordered k-tuples. Two graphs G, H are (in the CR model) if Φ_k^LF(G) = Φ_k^LF(H).

We will also use a rooted variant. A is a pair (G, r) with r ∈ V_G^k. Running the same refinement on G yields a rooted representation
Φ_k^LF(G, r) := χ_G(r),
i.e., the stable color of the designated tuple (equivalently, the Dirac histogram on r). This is the natural notion for expressing rooted homomorphism counts and for the unfolding arguments underlying our upper bounds.

We now formalize which homomorphism counts are determined by these representations. For a pattern graph F we view the parameter G ↦ hom(F, G) as a graph invariant. The following definition packages the strongest such invariants captured by Local k-FGNN.

Rooted patterns are treated analogously. A is a pair (F, a) with a ∈ V_F^k. For a rooted graph (G, r), let hom((F, a), (G, r)) denote the number of homomorphisms f : F → G with f(a_i) = r_i for all i ∈ [k] (and label/edge preservation as usual). The rooted family ℱ_LF(k)^• is defined by requiring that equality of rooted representations χ_G(r) = χ_H(s) implies equality of these rooted counts for all rooted patterns in the family. In the sequel, rooted statements are used as intermediate steps; the unrooted family ℱ_LF(k) is recovered by summing rooted counts over all roots.

Although our characterization is phrased in terms of homomorphism counts, it implicitly controls a broad range of substructure statistics. Recall from the preliminaries that for any pattern F, the spasm Spasm(F) consists of quotients F/π obtained by vertex identifications, and that injective homomorphism counts (hence subgraph occurrences) of F can be expressed by Möbius inversion as an integer linear combination of hom(F^′, ⋅) over F^′ ∈ Spasm(F), and conversely. Consequently, whenever Spasm(F) ⊆ ℱ_LF(k), Local k-FGNN indistinguishability implies equality of injective counts of F as well. We will therefore freely pass between homomorphism-counting power'' andsubgraph-counting consequences’’ while remembering that the primitive objects controlled by the representation are the homomorphism parameters themselves.

In the next section we identify ℱ_LF(k) by a decomposition property. Operationally, the guarded union Γ_G(u) in is precisely what permits certain non-laminar overlap patterns in width-k constructions, and the resulting pattern class strictly interpolates between the strong locality family for Local k-GNN and the full treewidth-k family for k-FWL/k-FGNN.

4. New Decomposition: k-Almost-Strong NED: define k-order ears and nested intervals; introduce the degeneracy criterion and the relaxed nesting constraint; provide examples separating strong vs almost-strong vs treewidth-k.

We now introduce the decomposition notion underlying the maximal pattern family for Local k-FGNN. Throughout, k ≥ 2 is fixed and F is a finite simple vertex-labeled graph.

A is a connected graph Q together with a designated ∂Q ⊆ V_Q of size k such that Q admits a width-k tree decomposition whose are exactly the k-sets appearing as intersections of adjacent bags, and every bag has size k or k + 1. Concretely, we require a decomposition (T_Q, β_Q) with the following canonical features:

We view Skel(Q) as a tree whose vertices are k-sets in V_Q and whose edges correspond to adjacent k-adhesions mediated by a unique (k + 1)-bag. When k = 2, Skel(Q) is the usual spine for a path-like ear, and the 3-bags are triangles that ``realize’’ edges of the spine.

A (NED) of F is a partition of E_F into edge-disjoint k-order ears Q₁, …, Q_m together with a rooted forest structure on {Q_i} (the ) satisfying:

Thus an NED builds F by starting from root ears and repeatedly attaching child ears along k-vertex interfaces.

Let Q be a child ear of a parent ear P. Since ∂Q ⊆ V_P, the attachment vertices occur in bags of the skeleton Skel(P). We define the I(Q) to be the minimal connected subgraph of Skel(P) whose union of vertices (as k-sets) contains all attachment vertices:
I(Q) := Steiner_Skel(P)(∂Q),
i.e., the union of all unique simple paths in Skel(P) connecting the bags that contain the vertices of ∂Q. Equivalently, I(Q) is the smallest subtree of Skel(P) that ``supports’’ the attachment.

To state our relaxation, we distinguish short attachments from genuinely extended ones. A in a tree T is a maximal simple path whose internal vertices have degree 2 in T. We say that a nested interval I(Q) ⊆ Skel(P) is and write degener(I(Q)) if for every branch path π of Skel(P), the intersection I(Q) ∩ π contains at most one edge of π. When Skel(P) is itself a path (the k = 2 intuition), this reduces to the requirement that I(Q) has length at most one edge.

We can now impose a nesting constraint on siblings. Let Q_a, Q_b be two distinct child ears of the same parent ear P, and consider their nested intervals I(Q_a), I(Q_b) ⊆ Skel(P). In the variant (corresponding to Local k-GNN), we require the family of sibling intervals to be laminar:
I(Q_a) ∩ I(Q_b) = ∅ or I(Q_a) ⊆ I(Q_b) or I(Q_b) ⊆ I(Q_a).
In contrast, we define a by relaxing laminarity exactly in the degenerate case: for siblings Q_a, Q_b with a < b, we require the same laminarity condition that if degener(I(Q_a)) holds, then we permit arbitrary overlap between I(Q_a) and I(Q_b). This ``one-sided’’ rule is the form that will match the directed unfolding used in the completeness proof; it is also equivalent to the symmetric formulation where overlap without containment is allowed whenever at least one of the two intervals is degenerate.

We record the qualitative separations induced by this relaxation. First, strong laminarity can fail only in a locality-compatible manner. Consider a parent ear P whose skeleton Skel(P) contains two consecutive edges sharing a vertex (e.g., a path t₁ − t₂ − t₃ of k-bags). Let Q_a and Q_b be two children with I(Q_a) = t₁ − t₂ and I(Q_b) = t₂ − t₃. Then I(Q_a) ∩ I(Q_b) = {t₂} but neither interval contains the other. If both I(Q_a) and I(Q_b) are degenerate (as they are here), the configuration is permitted by k-almost-strong NEDs but forbidden by strong NEDs. This yields patterns in ℱ_LF(k) \ ℱ_L(k) by attaching two ears ``meeting at a point’’ inside the parent skeleton.

Second, k-almost-strong NEDs do not cover all of treewidth k. Let Skel(P) contain a path on four k-bags t₁ − t₂ − t₃ − t₄. Choose two children Q_a, Q_b with intervals I(Q_a) = t₁ − t₂ − t₃ and I(Q_b) = t₂ − t₃ − t₄. These intervals overlap on the two vertices t₂, t₃ (and the edge between them), neither contains the other, and neither is degenerate (each occupies two edges along the same branch path). Such a pattern can still have tw(F) ≤ k (it is realizable by gluing (k + 1)-bags along k-adhesions in a standard partial k-tree), but it violates the k-almost-strong sibling rule. Hence it witnesses ℱ_LF(k) ⊊ ℱ_F(k).

We will show next that admitting a k-almost-strong NED is not merely sufficient but exactly the correct criterion for being counted by Local k-FGNN in the homomorphism sense.

5. Upper Bound (Completeness): Local k-FGNN representations are determined by homomorphism counts of k-almost-strong patterns: unfolding construction, bag-isomorphism counts, and linear-algebra factorization.

We show that the stable Local k-FGNN representation of a graph G determines hom(F, G) for every pattern F admitting a k-almost-strong NED. The proof proceeds in three steps: we (i) identify tuple colors with isomorphism types of a canonical guarded unfolding, (ii) extract from the graph representation the counts of relevant unfolding types (``bag-isomorphism counts’’), and (iii) express hom(F, G) as a fixed linear combination of these counts.

Fix t ≥ 0 and an ordered k-tuple u = (u₁, …, u_k) ∈ V_G^k. We define a rooted, edge-labeled tree 𝒰_G^(t)(u) whose nodes represent the k-tuples reachable from u by at most t guarded folklore moves. Concretely, the root is labeled by u, and from any node labeled by a tuple x = (x₁, …, x_k) we add, for each $w\in \bigcup_{i=1}^k N_G[x_i]$ and each coordinate index j ∈ [k], a child node labeled by the tuple obtained from x by replacing x_j with w; we label the tree edge by j to record which coordinate was replaced. Each node additionally carries the full atomic type of its tuple in G (vertex labels, equality pattern among coordinates, and adjacency pattern among coordinates). We then quotient this rooted tree by identifying nodes whose labeled rooted subtrees are isomorphic in the natural sense (root-preserving, respecting edge labels and node atomic types), yielding a canonical finite object that we still denote by 𝒰_G^(t)(u).

The key inductive correspondence is that tuple colors are precisely these unfolding types. Namely, letting χ^(t) be the t-round Local k-FGNN coloring, we have
χ^(t)(u) = IsoType(𝒰_G^(t)(u)),
where IsoType(⋅) denotes the perfect-hash code of the rooted isomorphism type. The base case t = 0 is immediate from the initialization by atomic type. For the step t → t + 1, the multiset aggregated at u in the Local k-FGNN update is exactly the multiset of isomorphism types of the one-step children unfoldings of the root of 𝒰_G^(t + 1)(u), grouped by the coordinate label j; since the update hashes the previous color together with this guarded multiset, it coincides with hashing the rooted tree one level deeper. Stabilization implies that χ(u) determines the isomorphism type of 𝒰_G^(t)(u) for all t up to the stabilization depth (and conversely).

We take the graph-level representation of Local k-FGNN to be the multiset (equivalently, histogram) of stable colors on V_G^k. Thus, if two graphs G, H have identical representations, then for every stable color α,
|{u ∈ V_G^k : χ_G(u) = α}| = |{v ∈ V_H^k : χ_H(v) = α}|.
Choosing t larger than the stabilization index for both graphs, we may replace χ by χ^(t) without changing these multiplicities, and therefore we obtain equality of counts of depth-t unfolding isomorphism types.

It is convenient to package these counts as follows. Let 𝔘_t be the finite set of rooted isomorphism types of unfoldings 𝒰_G^(t)(u) realizable over all graphs G (with the fixed label alphabet) and all tuples u ∈ V_G^k. For U ∈ 𝔘_t, define the
bIsoHom(U, G) := |{u ∈ V_G^k : 𝒰_G^(t)(u) ≅ U}|.
By the unfolding–color correspondence, bIsoHom(U, G) is determined by the Local k-FGNN representation of G. In particular, representation equality for G, H implies bIsoHom(U, G) = bIsoHom(U, H) for all U ∈ 𝔘_t.

Now fix a pattern F admitting a k-almost-strong NED. Using the NED, we obtain a rooted tree of k-order ears; for each ear we fix a canonical width-k decomposition with bag sizes in {k, k + 1} and adhesions of size k. Gluing these ear decompositions along their attachment k-sets yields a tree-structured decomposition of F whose nodes alternate between k-bags (interfaces) and (k + 1)-bags (realizing one additional vertex), and whose child ordering is compatible with the almost-strong sibling rule. The crucial consequence of the sibling rule is that when we compute extensions of partial homomorphisms from a parent interface to several children, the dependencies created by overlapping nested intervals are either laminar (hence handled by standard tree-decomposition dynamic programming) or confined to degenerate overlaps, which are resolved by a single guarded folklore move (replacing at most one skeleton edge along any branch path) and therefore remain visible in the local unfolding around the interface tuple.

We implement this by a bottom-up dynamic program on the decomposition tree of F. Each k-bag corresponds to an ordered k-tuple of pattern vertices; for a graph G and a tuple u ∈ V_G^k, we let Φ_t(u) denote the number of partial homomorphisms from the subpattern rooted at the bag to G that map the ordered interface to u. Transition from a k-bag to an adjacent (k + 1)-bag introduces exactly one new pattern vertex, say x, together with its adjacencies to the k interface vertices; the number of possible images of x in G depends only on the multiset of one-coordinate replacements (u₁, …, u_j − 1, w, u_j + 1, …, u_k) with w ∈ ⋃_iN_G[u_i], because locality-guarded folklore aggregation ranges precisely over this set. Hence the transition value is a function of the depth-1 unfolding neighborhood of u, and by iteration each Φ_t(u) is a function of the depth-D(F) unfolding 𝒰_G^(D(F))(u), for a depth D(F) depending only on F.

Summing over all interface tuples at the root bag yields
hom(F, G) = ∑_{u ∈ V_G^k}Φ_root(u) = ∑_{U ∈ 𝔘_D(F)}λ_F, U bIsoHom(U, G),
for coefficients λ_F, U ∈ ℤ depending only on F (they are obtained by evaluating the same dynamic program on the abstract unfolding type U rather than on a concrete tuple). This is the promised linear-algebraic factorization: the Local k-FGNN representation determines bIsoHom(U, G) for all relevant U, hence determines hom(F, G).

If two graphs G, H have identical Local k-FGNN representations, then bIsoHom(U, G) = bIsoHom(U, H) for all U ∈ 𝔘_D(F), and therefore the above linear identity gives hom(F, G) = hom(F, H) for every k-almost-strong pattern F. In the next step we make the decomposition implicit in the argument explicit by introducing a canonical restricted width-k tree decomposition class 𝒮_LF(k) and proving its equivalence to k-almost-strong NEDs.

6. Equivalence Theorem: k-almost-strong NED ⇔ a canonical restricted width-k tree decomposition class 𝒮_LF(k): two-way transformations with tight invariants.

We make the decomposition implicit in the unfolding argument explicit by isolating a canonical class 𝒮_LF(k) of width-k tree decompositions that captures exactly the structural constraints enforced by neighborhood-guarded folklore aggregation. The equivalence is bidirectional: a pattern F admits a k-almost-strong NED if and only if F has a canonical alternating tree decomposition (F, T) ∈ 𝒮_LF(k). Since both objects are used in subsequent upper- and lower-bound arguments, we state the construction in a form that exposes the invariants needed for dynamic programming and for the pebble-game/gadget lower bound.

We work with rooted tree decompositions whose bags alternate in size between k and k + 1. Concretely, we require that every edge of T connects a P with |P| = k to an B with |B| = k + 1, and that P ⊂ B with |B \ P| = 1. We root T at a separator bag and impose that each extension bag has a unique parent separator bag. This is a standard normal form for width-k decompositions: for fixed k one may convert any width-k decomposition into such an alternating one by subdividing edges and by inserting forget/introduce nodes while preserving the represented graph.

What is specific to Local k-FGNN is an additional controlling how extension bags may branch. Intuitively, an extension bag B = P ∪ {x} introduces a new vertex x adjacent (in F) to some subset of the k vertices in P. If x is not adjacent to any vertex of P, then any further structure attached below'' $B$ must be essentially linear (unique continuation), because Local $k$-FGNN can only access new vertices through neighborhood moves from the interface tuple; without an adjacency witness in $P$, independent branching would create correlations not visible in guarded unfoldings. More generally, when $x$ is adjacent to some vertices of $P$, a bounded amount of branching is allowed, but overlaps among descendants must be laminar except in the degenerate cases singled out in the definition of $k$-almost-strong NEDs. We encode this by requiring that, for each extension bag $B$ with parent $P$, the separator-bag children of $B$ can be linearly ordered so that their induced attachment intervals on theskeleton’’ determined by P are laminar, except that non-laminar overlap is permitted precisely when the corresponding interval is degenerate (i.e., along every branch path in the parent ear it contributes at most one edge). The formalization is most transparent after we explain the conversion to ears.

Assume (F, T) ∈ 𝒮_LF(k). We construct an ear decomposition by interpreting each extension bag B = P ∪ {x} as the introduction of a single new vertex x together with its incident edges to the current interface P. Consider the rooted traversal of T alternating between separator bags and extension bags. For each extension bag B with parent separator P, we create a k-order ear Q(B) whose endpoints are the ordered k-tuple of vertices of P (the interface) and whose internal vertex set contains x and, recursively, all vertices introduced in the subtree rooted at B that are not already present above P. The ear edges are the edges of F that are first covered in some descendant extension bag of B but not covered above P. Standard connectedness properties of tree decompositions guarantee that these edge sets partition E_F, and the rooting induces a nesting forest of ears.

We now verify the sibling constraint required for a k-almost-strong NED. Let B₁, B₂ be two distinct extension bags with the same parent separator bag P and let Q₁ = Q(B₁), Q₂ = Q(B₂) be the associated ears. The associated nested intervals I(Q₁), I(Q₂) are determined by the positions (within the parent ear) at which the attachments to P occur. The locality-guarded child constraint in 𝒮_LF(k) ensures that the induced intervals are either disjoint or nested, except when one of them is degenerate, in which case overlap without containment is allowed. This is exactly the ``almost-strong’’ relaxation: non-laminar overlap is confined to attachments that, along each branch path of the parent ear, modify at most a single skeleton edge and can therefore be simulated by a single guarded folklore move from the interface tuple. Hence the resulting ear nesting satisfies the definition of a k-almost-strong NED.

Conversely, assume F admits a k-almost-strong NED with ears Q₁, …, Q_m forming a nesting forest. We build a tree decomposition T by expanding each ear into an alternating chain of bags and then gluing these chains according to nesting. Fix an ear Q with attachment interface S(Q) of size k (viewed as an ordered k-tuple) and with internal vertices introduced by Q. We construct a path in T alternating between copies of the separator bag S(Q) and extension bags S(Q) ∪ {x} for each newly introduced vertex x in the ear, ordered along the ear. This guarantees that every edge introduced by Q is covered by some extension bag: if x is adjacent to a subset of S(Q), then S(Q) ∪ {x} contains that edge; edges between consecutively introduced vertices are covered by adjacent extension bags in the chain (using the standard introduce/forget simulation inside width k). For each child ear Q^′ nested in Q, we attach the decomposition of Q^′ at the separator bag corresponding to the interface S(Q^′) ⊆ V(Q), which occurs along the chain of Q at the position determined by the nested interval I(Q^′).

The crucial point is that the sibling rule in the NED implies the locality-guarded child constraint of 𝒮_LF(k). Indeed, siblings Q_a, Q_b nested under the same parent ear Q have intervals that are laminar except for degenerate overlaps; we choose the child ordering at each separator bag to respect the partial order induced by containment of intervals, and we route degenerate-overlap cases through a unique extension node where the overlap occurs. Because degeneracy ensures that the overlap affects at most one edge along any branch path, this routing does not create the forbidden ``branching without adjacency witness’’ configuration: whenever an extension bag introduces a vertex non-adjacent to the parent separator, it lies on a unique continuation segment and has exactly one relevant child, matching the guarded constraint.

Both transformations preserve the following properties, which we will use subsequently without further comment. First, every edge of F is covered by an extension bag (equivalently, by some ear), so the constructions are lossless at the level of the pattern graph. Second, adhesions are exactly k-sets, so interfaces in the decomposition correspond bijectively to the ordered attachment tuples tracked by Local k-FGNN. Third, the only permitted violations of laminarity among sibling attachments are precisely the degenerate overlaps; these are tight in the sense that any additional non-degenerate overlap would force two independently branching subpatterns to share a nontrivial boundary segment, creating dependencies that cannot be resolved by a bounded-depth neighborhood-guarded unfolding around the interface tuple. Therefore 𝒮_LF(k) is not merely a convenient normal form: it is the exact structural envelope of patterns whose homomorphism counts factor through Local k-FGNN tuple colors.

7. Lower Bound (Maximality): for any F not admitting k-almost-strong NED, construct indistinguishable graphs G, H for Local k-FGNN but with hom(F, G) ≠ hom(F, H) using generalized pebble games and F"urer-type gadgets.

We establish the converse direction needed for maximality: whenever a connected pattern graph F does admit a k-almost-strong NED (equivalently, by Theorem~2, whenever F admits no canonical alternating decomposition in 𝒮_LF(k)), Local k-FGNN cannot determine hom(F, ⋅) from its stable tuple colors. Concretely, we construct a pair of graphs G(F), H(F) such that Local k-FGNN produces identical stabilized representations on G(F) and H(F), yet hom(F, G(F)) ≠ hom(F, H(F)). This implies that ℱ_LF(k) is maximal among all homomorphism families factoring through Local k-FGNN.

We use a pebble game tailored to the neighborhood-guarded folklore aggregation. A position consists of two ordered k-tuples u ∈ V_G^k and v ∈ V_H^k. The game is played in rounds; in each round Spoiler chooses an index i ∈ [k] and a vertex $w\in \bigcup_{j=1}^k N_G[u_j]$, and replaces u_i by w, yielding a new tuple u^′. Duplicator must respond by choosing $w'\in \bigcup_{j=1}^k N_H[v_j]$ and replacing v_i by w^′, yielding v^′. The winning condition is the usual local consistency with labels and equalities: Duplicator must maintain that the map u_a ↦ v_a preserves vertex labels and the edge/non-edge relations among the pebbled vertices (i.e., it is a partial isomorphism on the induced labeled subgraphs on the sets {u₁, …, u_k} and {v₁, …, v_k}). As in standard WL–pebble correspondences, one proves by induction on the number of rounds that Duplicator has a winning strategy for D rounds starting from (u, v) if and only if the depth-D guarded unfoldings around u and v are isomorphic, and hence if and only if χ^(D)(u) = χ^(D)(v) under Local k-FGNN refinement. In particular, if Duplicator can win for all D, then χ(u) = χ(v) in the stabilized coloring.

Assume F is connected and does not admit a k-almost-strong NED. By Theorem~2, no canonical alternating decomposition of F satisfies the locality-guarded child constraint; unpacking this failure yields a specific forbidden configuration in any attempted decomposition: there exist sibling attachments whose induced nested intervals overlap in a way (overlap without containment, and on some branch path contributing at least two edges). We fix a minimal such obstruction (under subgraph inclusion respecting the attempted nesting), and we view it as a cycle of constraints between interface choices along a parent ear skeleton. This cycle is the higher-order analogue of the parity-obstruction exploited in CFI/F"urer lower bounds: locally each attachment is consistent with guarded neighborhood moves from a k-tuple interface, but globally the non-degenerate overlap forces a dependency that cannot be linearized into the laminar/degenerate structure of 𝒮_LF(k).

From the obstruction we build a pair of graphs G(F) and H(F) that are locally indistinguishable but globally twisted. We replace each relevant interface (separator of size k) by a encoding 2^k states, conveniently indexed by bit-vectors in {0, 1}^k (or equivalently subsets of [k]), and we connect clouds by bipartite constraint gadgets whose adjacency depends only on the symmetric difference of the indices. All vertices carry labels identifying their cloud and their role (interface position), so that the tuple initialization χ⁽⁰⁾ cannot break the intended symmetry inside a cloud. The graph G(F) is the instance in which the constraints along the obstruction cycle are consistent. The graph H(F) is obtained by flipping (twisting) exactly one constraint gadget along the cycle (or, more generally, by applying an odd-parity twist on a set of edges whose sum is 1 in the constraint space). As in classical constructions, G(F) and H(F) are non-isomorphic but are designed to be indistinguishable by any refinement whose view is restricted to bounded-depth, locally guarded explorations from a k-tuple.

We argue via the guarded k-pebble game. Duplicator maintains an invariant that pebbled vertices in G(F) and H(F) lie in corresponding clouds, and that the current correspondence respects the local constraint structure within the explored neighborhood determined by ⋃_iN[⋅]. When Spoiler moves to a vertex in the guarded neighborhood, the move crosses at most one constraint gadget from some currently pebbled cloud; by construction, each constraint gadget in G(F) is locally isomorphic to the corresponding gadget in H(F) (the twist is not detectable from one side without traversing a global cycle). Hence Duplicator can respond by applying the appropriate local isomorphism of the relevant gadget, preserving the partial isomorphism on the pebbled set. Iterating, Duplicator wins for all finite numbers of rounds; consequently χ(u) = χ(v) for corresponding tuples and the induced graph-level Local k-FGNN representations of G(F) and H(F) coincide. Equivalently, all guarded unfoldings that lie in 𝒮_LF(k) occur with the same multiplicities in both graphs.

It remains to show hom(F, G(F)) ≠ hom(F, H(F)). A homomorphism h : F → G(F) (resp. H(F)) induces, on each interface of size k, a choice of a cloud state in {0, 1}^k, and the edges of F crossing between interfaces enforce the corresponding gadget constraints. Because the obstruction in F is non-degenerate, these constraints form a genuine cycle of dependencies whose satisfaction requires global parity consistency. In G(F) the cycle is consistent and yields a positive number of satisfying assignments, hence a positive contribution to hom(F, G(F)). In H(F) the single twist flips the parity on the cycle, making the system inconsistent (or, in the variant where some slack remains, changing the number of satisfying assignments by a nonzero amount); thus the contribution of the corresponding mapping type is eliminated (or altered), yielding hom(F, G(F)) ≠ hom(F, H(F)). If needed to prevent spurious homomorphisms that avoid the intended constraint cycle, we apply standard rooted/clique augmentation: we attach labeled anchor structures forcing any homomorphism of F to map designated vertices into designated clouds, thereby isolating the separating contribution without affecting Local k-FGNN indistinguishability.

We have therefore produced, for every connected F ∉ ℱ_LF(k), a witness pair G(F), H(F) that Local k-FGNN cannot distinguish but that hom(F, ⋅) separates. This proves Theorem~4 and yields the maximality of ℱ_LF(k) among homomorphism families captured by Local k-FGNN.

8. Strict Separations and Hierarchies: explicit pattern families witnessing ℱ_L(k) ⊊ ℱ_LF(k) ⊊ ℱ_F(k); optional node/edge-level corollaries.

We next justify the strictness of the inclusions
ℱ_L(k) ⊊ ℱ_LF(k) ⊊ ℱ_F(k).
By Theorem~1, membership in ℱ_L(k) is equivalent to admitting a k-order NED (laminar nesting of sibling intervals, i.e., overlap implies containment), while membership in ℱ_LF(k) is equivalent to admitting a k-almost-strong NED (laminarity except that overlaps are permitted). Finally, ℱ_F(k) is characterized by treewidth at most k. Hence it suffices to exhibit two explicit pattern families: one witnessing that allowing degenerate overlaps strictly enlarges the family beyond strong NED, and another witnessing that forbidding non-degenerate overlaps still excludes some width-k graphs.

Fix k ≥ 2. We define a connected pattern F_k^deg of treewidth k that admits a k-almost-strong NED but no strong k-order NED. Consider a parent k-ear Q₀ whose underlying ``skeleton’’ consists of two internally vertex-disjoint branch paths between the same k-vertex interface S = {s₁, …, s_k} (formally, Q₀ is obtained by taking two length-2 paths between two distinct vertices of S and distributing the remaining vertices of S as trivial attachments so that the ear has boundary S). Now attach two child k-ears Q₁, Q₂ to Q₀ as siblings, each with the same boundary S, but with nested intervals I(Q₁), I(Q₂) chosen so that:
(i) they overlap on the parent ear in the sense that I(Q₁) ∩ I(Q₂) ≠ ∅ and neither contains the other; and
(ii) the overlap is in the sense of Theorem~1, namely, on every branch path of Q₀ the restriction of I(Q₁) (and likewise of I(Q₂)) contributes at most one edge.
Concretely, we realize this by letting I(Q₁) use one edge on the first branch path and none on the second, while I(Q₂) uses one edge on the second branch path and none on the first, and arranging that both intervals share a single boundary vertex in S.

By construction, the sibling overlap is non-laminar but degenerate, hence permitted in a k-almost-strong NED; taking Q₀, Q₁, Q₂ (and completing to a full ear partition by adding trivial ears for remaining edges) yields a valid k-almost-strong NED of F_k^deg, so F_k^deg ∈ ℱ_LF(k). On the other hand, any strong k-order NED would require that sibling intervals be laminar, i.e., overlap implies containment. Since the overlap is witnessed already at the level of the parent ear skeleton and is forced by the attachment pattern of Q₁, Q₂ (both require the same interface but different branch usage), there is no refinement of the decomposition that can eliminate the overlap without changing the ear boundaries or introducing a non-sibling relation that contradicts the ear partition. Thus F_k^deg ∉ ℱ_L(k), proving the first strict inclusion.

We next define a connected width-k pattern F_k^ndeg that fails to admit any k-almost-strong NED. The guiding principle is to force an overlap of sibling nested intervals that is (i.e., on some branch path of the parent ear the overlap necessarily spans at least two edges), while keeping the graph a partial k-tree. One convenient family is obtained from a canonical width-k tree decomposition of a partial k-tree by creating a ``diamond’’ of two sibling (k + 1)-bags whose introduced vertices must both interact along a length- ≥ 2 segment of the parent skeleton. Equivalently, we take a parent k-ear Q₀ with a branch path of length at least 3 between two boundary vertices in S, and attach two sibling ears Q₁, Q₂ that both require the same two consecutive edges of this branch path to realize their internal adjacency constraints; then I(Q₁) and I(Q₂) overlap on at least two edges of that branch path, yet neither interval contains the other (each also uses a distinct additional edge elsewhere to prevent containment).

This enforces a forbidden configuration for k-almost-strong NEDs: we obtain sibling intervals with overlap without containment, and the overlap is non-degenerate. By Theorem~1, such a pattern cannot lie in ℱ_LF(k). However, the construction may be implemented within a width-k tree decomposition by ensuring that every new vertex is introduced in a (k + 1)-bag adjacent to a k-bag separator S, and that the only additional edges are those covered by these bags; hence tw(F_k^ndeg) ≤ k, so F_k^ndeg ∈ ℱ_F(k). This proves the second strict inclusion.

The two witnesses translate immediately into separations between the corresponding refinement procedures. Since F_k^deg ∈ ℱ_LF(k) \ ℱ_L(k), there exist graphs G, H such that Local k-FGNN representations distinguish G and H (because hom(F_k^deg, G) ≠ hom(F_k^deg, H) is detectable by Theorem~3), while Local k-GNN representations coincide (since all homomorphism counts from patterns in ℱ_L(k) coincide). Likewise, because F_k^ndeg ∈ ℱ_F(k) \ ℱ_LF(k), we obtain graphs G^′, H^′ that are indistinguishable by Local k-FGNN yet distinguishable by k-FWL/k-FGNN (as k-FWL captures all width-k patterns). These implications are made formal by the standard ``pattern-to-counterexample’’ transfer: a strict inclusion between homomorphism families yields a pair of graphs separated by one model but not the other.

Although our main focus is graph-level representations, the same witness patterns yield local separations at the level of tuple and vertex features. Indeed, from F_k^deg one obtains graphs in which some k-tuple orbit is refined by Local k-FGNN but remains merged under Local k-GNN, and from F_k^ndeg one obtains graphs where some k-tuple colors differ under k-FWL but are equal under the guarded folklore refinement. In particular, any downstream node/edge prediction architecture that reads out stabilized tuple colors inherits these separations under exact readout.

9. Algorithmic Consequences: membership testing for k-almost-strong NED for fixed k (FPT/linear-time given a width-k decomposition); implications for subgraph counting via spasm inclusion.

The structural characterization by k-almost-strong NEDs immediately yields an effective membership test for fixed k. Concretely, given a connected pattern F and fixed k ≥ 2, we may decide whether F ∈ ℱ_LF(k) by searching for a width-k tree decomposition and then checking the additional locality-guarded constraints defining 𝒮_LF(k) (Theorem~2). Using standard FPT algorithms for treewidth (e.g., Bodlaender-type routines), we can either (i) return that tw(F) > k, in which case F ∉ ℱ_F(k) and thus F ∉ ℱ_LF(k), or (ii) produce a width-k decomposition T of F. We then normalize T into the canonical alternating form with bags of size k and k + 1 (by the usual introduce/forget normalization and splitting), and finally verify the child-structure constraint that distinguishes 𝒮_LF(k) from arbitrary width-k decompositions: each odd-depth bag B of size k + 1 with parent separator P of size k may have multiple children only when the newly introduced vertex in B \ P is ``locally guarded’’ by adjacency to the separator in the sense required by the neighborhood-guarded folklore aggregation; in the unguarded case, the bag is forced to have essentially a single continuation. By Theorem~2, F admits a k-almost-strong NED if and only if such a canonical decomposition exists; therefore, acceptance is equivalent to F ∈ ℱ_LF(k).

For fixed k, the above test runs in time O(f(k) ⋅ |V_F|): computing a width-k decomposition is linear-time FPT in |V_F| with parameter k, and the subsequent normalization and constraint checking are linear in the size of the decomposition. Moreover, the procedure is certifying. If it accepts, we can explicitly output a k-almost-strong NED by converting each odd-depth (k + 1)-bag into a k-ear whose boundary is the parent k-bag separator, and ordering sibling ears according to the partial order induced by their nested intervals. The constraint check guarantees that siblings overlap only in the permitted degenerate manner, so the resulting ear family is a valid k-almost-strong NED. If the test rejects, it returns either (a) a certificate that tw(F) > k, or (b) a width-k decomposition in canonical form together with a local witness to constraint violation (an offending odd-depth bag and two children whose overlap cannot be routed through the permitted degeneracy). In particular, while the definition of k-almost-strong NED is global, failure is locally checkable in the decomposition view.

The preceding algorithmic statement crucially treats k as fixed. If k is part of the input, then even deciding tw(F) ≤ k is NP-complete, and since ℱ_LF(k) ⊆ ℱ_F(k) = {F : tw(F) ≤ k}, membership in ℱ_LF(k) is NP-hard in the same regime. Thus the appropriate algorithmic interpretation is the one relevant for architecture design: for each fixed tuple order k, membership is efficiently decidable and yields an explicit decomposition certificate that can be used downstream (e.g., to select motif families that the model can in principle count).

We next connect the family ℱ_LF(k) to classical relationships between homomorphism counts and subgraph counts. For a pattern F, write inj(F, G) for the number of injective homomorphisms F ↪ G (equivalently, labeled subgraph embeddings), and sub(F, G) = inj(F, G)/|Aut(F)| for the number of (unlabeled) subgraphs isomorphic to F. It is well known that inj(F, ⋅) can be expressed as a finite integer linear combination of homomorphism counts over the of F, i.e.,
inj(F, G) = ∑_{H ∈ Spasm(F)}μ(F, H) ⋅ hom(H, G),
where Spasm(F) consists of all homomorphic images H of F obtained by identifying vertices (equivalently, quotienting F by a partition compatible with edges), and μ(F, H) ∈ ℤ are coefficients given by Möbius inversion on the partition lattice restricted to valid quotients. Consequently, any representation that determines hom(H, G) for all H ∈ Spasm(F) also determines inj(F, G), and hence sub(F, G).

Combining this with Theorem~3 yields a clean sufficient condition for subgraph-count expressivity of Local k-FGNN: if
Spasm(F) ⊆ ℱ_LF(k),
then the stabilized Local k-FGNN representation determines inj(F, G) (and thus sub(F, G)) for every host graph G. Indeed, equality of representations implies equality of hom(H, ⋅) for all H ∈ ℱ_LF(k), hence for all H ∈ Spasm(F) under the inclusion; applying the linear relation above gives equality of inj(F, ⋅). In this sense, the algorithmic membership test extends to a for motif-counting tasks: for fixed k and a target motif F, enumerate Spasm(F) (feasible for the small motifs typically used in experiments), run the fixed-k recognition test on each H ∈ Spasm(F), and accept the task as expressible whenever all quotients pass. This screening is conservative (it is sufficient, not claimed necessary), but it matches the standard homomorphism–embedding calculus used in subgraph counting complexity.

For constant-size motifs, Spasm(F) has constant size (at most the Bell number in |V_F|), so the above screening incurs negligible overhead compared to model training. Thus, for each fixed k, we obtain an explicit, checkable catalogue of motifs whose counts are in principle recoverable from Local k-FGNN representations: all motifs F whose spasm stays within the k-almost-strong NED family. This gives a concrete algorithmic interface between the structural theory (decompositions and maximality) and empirical evaluation (motif counting benchmarks), and it also clarifies where separations should manifest: whenever Spasm(F) contains a non-member of ℱ_LF(k), Theorem~4 supplies indistinguishable host graphs on which sub(F, ⋅) differs, so no method based solely on the Local k-FGNN stable representation can succeed uniformly on that motif.

10. Experiments (optional but strengthening): motif counting tasks for k = 3, 4 comparing Local k-GNN, Local k-FGNN, and k-FGNN; demonstrate predicted separations and near-equalities on moderate motifs.

We include an experimental validation layer whose role is not to improve the theory, but to verify that the structural boundary ℱ_L(k) ⊊ ℱ_LF(k) ⊊ ℱ_F(k) manifests in concrete motif-counting tasks at small tuple orders. The guiding prediction is as follows: for a motif F whose Spasm(F) lies inside ℱ_LF(k), the stabilized Local k-FGNN representation should suffice to recover sub(F, ⋅), whereas for motifs whose spasm crosses the boundary, we should observe systematic failures on adversarially chosen hosts and degraded accuracy on generic hosts. We focus on k = 3, 4 since these are the smallest orders for which nontrivial separations already occur while remaining computationally feasible for tuple-based refinements.

Rather than hand-picking patterns, we select motifs by an explicit screening pipeline that mirrors the theoretical definitions. We enumerate connected unlabeled graphs F on at most 9 vertices (and bounded maximum degree to control spasm growth), compute Spasm(F) by enumerating partitions of V_F and filtering valid quotients, and then run the fixed-k recognition test (via canonical width-k decomposition and 𝒮_LF(k) constraints) on each H ∈ Spasm(F). This yields three disjoint motif pools:
ℳ₁ ⊆ ℳ₂ ⊆ ℳ₃,
where ℳ₁ consists of motifs whose entire spasm lies in ℱ_L(k) (strong NED), ℳ₂ consists of motifs with Spasm(F) ⊆ ℱ_LF(k) but Spasm(F) ⊈ ℱ_L(k) (witnessing the strict left inclusion), and ℳ₃ consists of motifs with Spasm(F) ⊆ ℱ_F(k) but Spasm(F) ⊈ ℱ_LF(k) (witnessing the strict right inclusion). In particular, ℳ₂ contains motifs whose decompositions exhibit the permitted ``degenerate sibling overlap’’ phenomenon, while ℳ₃ contains motifs of treewidth at most k whose decompositions require non-degenerate overlap forbidden by k-almost-strong NEDs.

We evaluate on two complementary host families. First, we generate generic hosts G from standard random graph models (Erds–R'enyi with varying edge density, and Barab'asi–Albert with varying attachment parameter), with |V_G| between 50 and 300 to test extrapolation beyond the motif size regime. Second, for each motif F in ℳ₃, we additionally generate the separating pair G(F), H(F) promised by the maximality theorem (twisted F"urer-type gadgets, optionally with rooted/clique augmentations to match the locality guards), and we also consider mildly perturbed variants obtained by attaching identical random trees to corresponding vertices. The latter perturbations are useful empirically: they preserve the intended indistinguishability by Local k-FGNN up to the guarded aggregation radius, while preventing accidental shortcut solutions based on trivial size statistics.

We compare three representations at fixed k: (i) Local k-GNN (neighborhood-guarded, but with the strong NED expressivity boundary), (ii) Local k-FGNN (the guarded folklore aggregation analyzed here), and (iii) k-FGNN/k-FWL (the unguarded folklore refinement on k-tuples, corresponding to treewidth ≤ k). To avoid conflating expressivity with optimization artifacts, we report two modes. In a mode, we run the corresponding color refinement exactly (using perfect hashing of multisets) and use the stabilized multiset of tuple colors as a graph representation. In a mode, we implement each update by an MLP applied to (previous feature, aggregated multiset summary) and train a downstream regressor for the target sub(F, G); we keep depth and hidden dimension fixed across models, and we use identical readout (sum pooling over tuple features, then an MLP) to isolate the effect of the refinement operator. Performance is measured by mean absolute relative error and by exact agreement on the adversarial pairs (G(F), H(F)).

The results align with the predicted inclusions. For motifs in ℳ₁, all three models succeed: the symbolic representations determine the counts up to numerical inversion of the standard linear systems, and the learned models fit with low relative error and stable generalization as |V_G| grows. For motifs in ℳ₂, we observe the characteristic strict separation between Local k-GNN and Local k-FGNN: Local k-FGNN matches k-FGNN in accuracy (often to within training noise in the learned mode), while Local k-GNN exhibits persistent bias that does not vanish with additional depth. This is consistent with the fact that ℳ₂ contains motifs whose counting requires precisely the degenerate-overlap flexibility built into k-almost-strong NEDs.

For motifs in ℳ₃, we observe the opposite separation: k-FGNN succeeds, while both local models fail in a way that is most clearly visible on the adversarial pairs. Concretely, in the symbolic mode, Local k-FGNN yields identical stabilized color multisets on G(F) and H(F), while k-FGNN separates them; correspondingly, the true values of sub(F, G(F)) and sub(F, H(F)) differ. In the learned mode, the local models are forced into identical predictions on G(F) and H(F) up to optimization tolerance, even when trained with these pairs in the training set, whereas the k-FGNN model can fit the difference. This behavior provides an empirical witness of maximality: outside ℱ_LF(k), failure is not merely a matter of insufficient depth or capacity but is induced by indistinguishability at the level of the stable local refinement.

11. Discussion and Open Problems: learned/sparse attention analogues; approximate expressivity; size of minimal separating motifs; connections to guarded logics and descriptive complexity.

Our main contribution is structural: we identify the maximal homomorphism-counting family ℱ_LF(k) captured by the stabilized Local k-FGNN refinement, and we express this family via k-almost-strong nested ear decompositions (equivalently, via the restricted canonical class 𝒮_LF(k)). Several directions remain open, and we single out those that appear most immediate from the viewpoint of both theory and practice.

All statements in the present work are made under the CR idealization in which the update map computes an injective hash of the aggregated multiset. In implementation, one replaces hashing by a learned, finite-dimensional, continuous map (typically an MLP composed with sum/mean pooling). A first problem is to make precise when such a learned Local k-FGNN is with the symbolic model: namely, whether for every pair (G, H) that are separated by the symbolic refinement at some bounded number of rounds, there exists a choice of weights and a dimension d for which the learned network separates them as well. This is a universal-approximation question for permutation-equivariant multiset maps with inputs drawn from tuple-features, but with the additional complication that the multiset is indexed by the guarded domain ⋃_iN_G[u_i] and that the features are themselves endogenously produced by the refinement. Even for k = 2 this closes only partially with existing results on DeepSets-type architectures. We view a satisfactory answer as requiring (i) a quantitative separation margin for symbolic colors, and (ii) a stability statement showing that continuous approximations preserve the induced partition of tuples on the relevant graph classes.

The Local k-FGNN update is already a sparse operator in the sense that it aggregates only over ⋃_iN_G[u_i]. A natural generalization in modern architectures is to replace the symmetric multiset map by attention, allowing tuple-dependent weights and possibly multiple heads. One may define a by setting, for each u ∈ V_G^k and each coordinate position j, an attention distribution over w ∈ ⋃_iN_G[u_i] based on a score function of (χ^(t)(u), χ^(t)(u^{(j ← w)})), and then updating χ^(t + 1)(u) from the resulting weighted sums. Two questions are immediate. First, does guarded attention strictly increase expressivity over multiset aggregation when the attention scores are learned but remain permutation-equivariant? One might suspect a negative answer at the level of distinguishing power, since attention is still a continuous pooling and thus cannot simulate an injective multiset hash uniformly; yet it may strictly improve sample efficiency for estimating the homomorphism-derived statistics that underlie ℱ_LF(k). Second, if one allows attention to query vertices beyond ⋃_iN_G[u_i] through multi-hop key/value construction (while keeping a single refinement round), does one recover any portion of the global k-FWL power, or does the guardedness enforce a hard boundary akin to ℱ_LF(k)? Formalizing these questions suggests introducing a hierarchy indexed by the allowed guard radius and studying the induced pattern families.

Theorem~3 is exact: equality of stable representations implies equality of hom(F, ⋅) for all F ∈ ℱ_LF(k). In learning, one rarely observes exact equality, but rather small ℓ₂ or cosine distances between graph embeddings. This motivates an counterpart: if two graphs have embeddings within ε, can we bound |hom(F, G) − hom(F, H)| for F ∈ ℱ_LF(k) in terms of ε and |V_G|,|V_H|? Conversely, for F ∉ ℱ_LF(k), can we construct indistinguishable pairs with a in hom(F, ⋅) that persists under small perturbations of the refinement map? A robust version of Theorem~4 would explain empirically observed ``near-failures’’ where a learned model appears to partially correlate with sub(F, ⋅) even though exact determination is impossible. Achieving such robustness likely requires explicit Lipschitz control of the update and a quantitative analysis of the gadget constructions.

The strict inclusions in Theorem~5 are witnessed by explicit motifs, but we do not currently know the minimal order of such witnesses as a function of k. Let
s_k^left = min {|V_F| : F ∈ ℱ_LF(k) \ ℱ_L(k)}, s_k^right = min {|V_F| : F ∈ ℱ_F(k) \ ℱ_LF(k)}.
Determining or bounding s_k^left, s_k^right would sharpen the experimental screening pipeline and would also clarify the intrinsic ``distance’’ between locality-guarded folklore and unguarded folklore at fixed k. More structurally, it is natural to ask whether ℱ_LF(k) admits a characterization by a finite set of forbidden overlap patterns in canonical decompositions (or, in ear language, a finite obstruction set for non-degenerate sibling overlap). Such a result would resemble minor-type characterizations for bounded treewidth, but the overlap predicate degener(⋅) suggests that the obstruction theory here may be more delicate and potentially infinite.

The refinement studied here is local in a precise game-theoretic sense: its indistinguishability relation is captured by a guarded pebble game tailored to the neighborhood-guarded folklore aggregate (as used in the proof of maximality). This invites a logical reformulation. One expects a fragment of first-order logic with counting, in which quantification is restricted by neighborhood guards (or, more generally, by atomic formulas enforcing adjacency to one of the currently bound variables), to match the expressive power of Local k-FGNN at the level of graph invariants. Making this precise would place ℱ_LF(k) within the landscape between C^k (capturing k-WL) and strictly local fragments (capturing 1-WL variants), and could potentially yield alternative proofs of Theorems~3–4 via standard locality theorems. A further problem is to understand whether k-almost-strong NEDs correspond to bounded width for some notion of decomposition (e.g., clique-guarded or neighborhood-guarded tree decompositions) that is already known in finite model theory. Establishing such an equivalence would unify the combinatorial and logical views and might provide off-the-shelf algorithmic tools for recognition, canonization, and counting on ℱ_LF(k).

Finally, although our recognition procedure is fixed-parameter in k, its practical performance depends on producing a suitable width-k decomposition and then enforcing the 𝒮_LF(k) constraints. It remains to design direct linear-time routines (for fixed k) that build a k-almost-strong NED without invoking generic treewidth algorithms, and to quantify the smallest refinement depth needed to stabilize on graphs in terms of the height of the induced nesting forest. Such algorithmic understanding would close the loop between the structural characterization and its deployment as a diagnostic tool for learned architectures.