A standalone proof that, above a semantic-compression threshold, any system that computes evaluative signals but does not allow them to causally steer selection/control cannot remain generally competent under novelty.
We prove a necessity theorem (a no-go result), not an identity claim.
Hot Zombie Failure Theorem (informal).
When a task/environment contains more potentially relevant information than a system can represent at once, the system must continuously select what to represent (attention / retrieval / compute allocation). If the system computes evaluation (error/loss/surprise/value) but evaluation is causally inert for selection/control, then there exists a novelty regime in which the system necessarily fails (persistent error / breakdown of competence).
This result does not assume or conclude that evaluation is consciousness. It only establishes that evaluative leverage is required for stable competence under severe compression.
Let the environment contain an n-dimensional latent state at each discrete time step t:

Zt = (Zt(1), Zt(2), …, Zt(n)).
Interpretation: each coordinate Zt(i) is a potentially task-relevant factor.
The system has bounded representational/control capacity k, with k < n.
This is the super-threshold / compression regime: there are more degrees of freedom than can be concurrently tracked.
At each time t, the system allocates its limited resources to a subset of indices:

At ⊆ {1, …, n}, with |At| ≤ k.
Interpretation: the system can actively track (or operate on) at most k latent factors at a time.
Remark (Generality).
You can read At as attention, retrieval selection, compute routing, branching, tool-calling focus, or any other capacity-allocation mechanism. Nothing in the theorem depends on “attention” specifically; it depends only on bounded selection.
The system maintains an internal state ht, constructed from its observation stream and selection history:

ht = g(ht−1, At, {Zt(i) : i ∈ At}),

for some state-update map g.
We assume only the following capacity property:
(Capacity property) If i ∉ At, then the system cannot reliably encode the current value of Zt(i) into its state ht.
This is the minimal meaning of “bounded working context”.
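As a concrete (hypothetical) reading of the capacity property, the sketch below models a system whose state ht retains only the coordinates selected by At; an unselected coordinate simply never enters the state. All names (observe, A_t, h_t) and constants are illustrative, not from the text.

```python
import random

# Toy instance of the compression regime: n = 8 latent factors, capacity k = 3.
N, K = 8, 3

def observe(z, selected):
    """Bounded working context: the state retains only selected coordinates.

    Coordinates outside `selected` never enter the state, so their current
    values cannot be reliably encoded in h_t -- the capacity property.
    """
    return {i: z[i] for i in selected}

random.seed(0)
z = [random.randint(0, 1) for _ in range(N)]   # latent state Z_t
A_t = {0, 3, 5}                                # selection, |A_t| = 3 <= K
h_t = observe(z, A_t)

print(sorted(h_t))   # the state covers exactly the selected indices
```

The point of the sketch is only that `h_t` is a function of the selected coordinates: whatever happens at index 1 or 7 at time t is unrecoverable from the state.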
At each time step t the system must predict a target Yt; it outputs a prediction Ŷt.
Loss is any nonnegative error measure ℓ(Ŷt, Yt). For concreteness, think of:

- 0-1 loss: ℓ(Ŷt, Yt) = 1[Ŷt ≠ Yt];
- log-loss: ℓ(Ŷt, Yt) = −log P̂t(Yt), where P̂t is the system’s predictive distribution.
The system computes an evaluative signal Et that includes (at least) loss/error/surprise:

Et = (ℓ(Ŷt, Yt), …).
No other structure of Et is needed.
A general system updates its selection policy and other control variables via an update function:

(At+1, Ct+1) = U(ht, At, Ct, Et).
Here Ct represents all other internal control variables (e.g., policies, gates, temperatures, routes, memory-write rules).
The system has evaluative leverage if evaluation can affect future control: U depends nontrivially on Et, i.e., changing Et (with ht, At, Ct held fixed) can change (At+1, Ct+1).
In short: evaluation is part of the feedback loop.
A hot zombie computes evaluation but evaluation is causally inert for control:

(At+1, Ct+1) = U(ht, At, Ct),

with no dependence on Et.
Equivalently: the system may compute, broadcast, log, or internally represent an arbitrarily rich evaluation Et, but Et cannot influence any future selection/control variable.
So it may “generate heat” (loss exists), but it cannot channel it into control.
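A minimal sketch of the two update types (the function names, the threshold, and the `rotate` rule are my own illustrative choices): a hot-zombie update computes and receives Et but never lets it affect the next selection, while an update with evaluative leverage can reallocate capacity when error is high.

```python
def hot_zombie_update(h_t, A_t, C_t, E_t):
    """Evaluation-blind control: E_t is computed (the 'heat' exists)
    but ignored -- the next selection depends only on h_t, A_t, C_t."""
    _ = E_t                  # logged, broadcast... but causally inert
    return rotate(A_t), C_t  # fixed schedule, regardless of error

def leveraged_update(h_t, A_t, C_t, E_t):
    """Evaluative leverage: high error can change the next selection."""
    if E_t > 0.5:            # persistent mismatch -> reallocate capacity
        return rotate(A_t), C_t
    return A_t, C_t          # things are working; keep the allocation

def rotate(A_t, n=8):
    """Toy reallocation: shift every selected index by one (mod n)."""
    return {(i + 1) % n for i in A_t}

A = {0, 1, 2}
print(hot_zombie_update(None, A, None, 1.0)[0])  # moves no matter what E_t is
print(leveraged_update(None, A, None, 0.0)[0])   # low error: allocation kept
```

The contrast is purely structural: both systems compute Et, but only the second has a causal path from Et to At+1.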
We require only a weak competence condition:
Nontrivial novelty (moving relevance).
The system must remain competent in environments where which latent factor matters can change over time in a way not perfectly predictable from a fixed internal schedule, so that the only reliable way to track what matters is through performance feedback.
This distinguishes “general competence” from “works only in one fixed situation forever”.
Remark (worst-case intent).
The theorem is a worst-case novelty theorem: it allows environments to be interactive (the task may depend on the system’s current allocation). This is not a trick; it formalizes “moving relevance” in its strongest form.
(Optional note.) A closely related non-interactive version can be obtained by replacing the adaptive choice of jt with an unpredictable schedule of relevance that is independent of the agent but not computable from the agent’s fixed internal clock.
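The non-interactive variant can be sketched directly (all constants, the round-robin baseline, and the error-tracking rule below are my own illustrative choices, not from the text): the relevant coordinate moves on a schedule the agent cannot read off its internal clock, and only the agent whose error signal steers its selection can re-find it.

```python
import random

def moving_relevance(T=30_000, n=8, seed=2, switch_every=500):
    """Non-interactive 'moving relevance': the relevant coordinate j drifts
    on a schedule not computable from a fixed internal clock (k = 1 here)."""
    rng = random.Random(seed)
    j = 0
    errs_fixed = errs_lev = 0
    cand = 0          # leverage agent's current guess at j
    recent_err = 0.0  # running error estimate: its evaluative signal E_t
    for t in range(T):
        if t % switch_every == 0:
            j = rng.randrange(n)                 # relevance moves
        z = [rng.randint(0, 1) for _ in range(n)]
        y = z[j]
        # Evaluation-blind agent: fixed round-robin schedule over coordinates.
        errs_fixed += (z[t % n] != y)
        # Evaluative-leverage agent: error feedback steers its selection.
        err = (z[cand] != y)
        errs_lev += err
        recent_err = 0.9 * recent_err + 0.1 * err
        if recent_err > 0.3:                     # mismatch -> try next index
            cand = (cand + 1) % n
            recent_err = 0.0
    return errs_fixed / T, errs_lev / T

f_err, l_err = moving_relevance()
print(f"fixed schedule: {f_err:.2f}  evaluative leverage: {l_err:.2f}")
```

The fixed schedule sits on the relevant coordinate only 1/n of the time, so its error stays near (1 − 1/n)/2; the leverage agent pays a brief search cost after each relevance shift and then tracks the coordinate exactly.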
Assume n > k. For any hot-zombie system (evaluation-blind control) with selection capacity |At| ≤ k, there exists a novelty environment in which the system incurs persistent nontrivial loss (cannot maintain competent performance).
Equivalently: in the compression regime, any architecture that aims to be generally competent across novelty must allow evaluation to causally steer selection/control (must have evaluative leverage).
Fix an arbitrary hot-zombie system. Because the system is evaluation-blind, its selection/control sequence cannot be driven by error feedback.
We now construct an environment that forces persistent error.
At each time step t:
System chooses selection At ⊆ {1..n} with |At| ≤ k.
Environment samples latent state Zt uniformly at random from {0, 1}n (all bits independent fair coins).
System observes only selected coordinates Zt(i) for i ∈ At and forms its state ht.
Environment chooses the target from an unselected coordinate. Because n > k, there exists at least one index not selected at time t. Choose any jt ∈ {1..n} \ At and define:

Yt = Zt(jt).
System predicts Ŷt based on ht.
This environment exhibits “moving relevance”: the relevant coordinate is always outside the currently allocated capacity.
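The construction can be simulated directly. The sketch below (a fixed round-robin selection stands in for the evaluation-blind policy; any Et-independent rule behaves the same, and all names are illustrative) shows the adversary always placing the target outside At, pinning accuracy at chance.

```python
import random

def run(T=20_000, n=8, k=3, seed=1):
    """Simulate the proof's environment against an evaluation-blind policy."""
    rng = random.Random(seed)
    correct = 0
    for t in range(T):
        # Evaluation-blind selection: a fixed round-robin window of size k.
        A_t = {(t + j) % n for j in range(k)}
        # Environment samples Z_t uniformly from {0,1}^n.
        z = [rng.randint(0, 1) for _ in range(n)]
        # Adversary: target is an unselected coordinate (exists since n > k).
        j_t = min(set(range(n)) - A_t)
        y_t = z[j_t]
        # h_t carries no information about z[j_t], so any prediction
        # rule is equivalent to guessing a fair coin.
        y_hat = rng.randint(0, 1)
        correct += (y_hat == y_t)
    return correct / T

print(run())   # stays near 0.5: persistent chance-level accuracy
```

Swapping in any other Et-independent selection rule leaves the result unchanged, since the adversary reads At before choosing jt.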
Conditioned on everything the system knows at time t (its state ht), the bit Zt(jt) is still a fair coin.
Reason:
- Zt was sampled uniformly.
- jt was chosen to be an unobserved coordinate.
- By the capacity property, ht does not contain reliable information about Zt(jt).
Therefore:
- P(Yt = 1 ∣ ht) = 1/2.
No prediction strategy can beat chance on this target. Hence the best achievable expected 0-1 loss is:

E[ℓ(Ŷt, Yt) ∣ ht] = 1/2.

For log-loss, the best expected loss is at least log 2. So for every time step t:

E[ℓ(Ŷt, Yt)] ≥ c,

for some constant c > 0 independent of t (c = 1/2 for 0-1 loss, c = log 2 for log-loss).
Because the above lower bound holds at every time step, loss cannot converge toward 0. The system is persistently wrong on a constant fraction of steps.
Thus the system cannot remain competent on this novelty environment.
This proves that for every hot-zombie architecture, there exists a (simple) novelty regime that forces persistent nontrivial error.
Therefore, in the compression regime (n > k), evaluation-blind selection/control cannot support general competence across novelty.
QED.
The theorem’s contrapositive gives the necessity statement:
If a bounded-capacity system (k < n) is required to remain competent across novelty regimes, then its evaluative signal must have causal influence on future selection/control.
Equivalently:
General competence across novelty ⇒ evaluative leverage.
This is the formal sense in which “hot zombies” are unstable: once information pressure exceeds working capacity, there must be a feedback channel from mismatch/error to selection/control.
In the super-threshold regime (n > k), a system that cannot use its own evaluation to steer selection/control cannot be generally competent under novelty; therefore hot zombies necessarily fail.