A constructive separation result: persistence of the learner is not persistence of a subject.
There exist systems that exhibit robust, persistent “self-like” structure over time (structural selfhood) while instantiating no persisting experiencer (phenomenal selfhood).
Equivalently:
Continuity of the optimizer does not entail continuity of anyone who lives it.
This is a non-entailment result:
- It does not deny that phenomenal selfhood is
possible.
- It denies that structural continuity is sufficient to
infer it.
We separate two kinds of “selfhood” that are often conflated.
A system has structural selfhood if it
exhibits:
1) persistent substrate across time,
2) accumulated history that shapes future
behavior,
3) cross-episode continuity of
policy/competence/traits.
Formally, let ξt be the
system’s persistent substrate (weights, long-lived memory, enduring
internal parameters).
Structural selfhood means there exists a stable update process
ξt + 1 = U(ξt, Δt)
such that ξt carries
forward information from prior episodes and measurably conditions future
outputs.
We denote this as:
Σ = Y.
A system has phenomenal selfhood if:
1) there is a persisting subject of experience across time,
2) experiences bind into one autobiography-as-mine,
3) later phenomenal states inherit ownership of earlier phenomenal
states.
This is a claim about continuity of the experiencing locus, not about weights or behavior.
We denote this as:
P = Y.
Define a micro-subject as a local phenomenal episode that does not persist across an update boundary.
A system is in a micro-subject regime if any
phenomenality that occurs is step-local:
- each time-slice has (at most) a fleeting “spark,”
- but there is no persistent autobiographical carrier across slices.
Theorem (Dissociation).
There exists a system S such that:
1) S has structural selfhood: Σ(S) = Y,
2) S lacks phenomenal selfhood: P(S) = N.Therefore, structural selfhood does not entail phenomenal selfhood:
Σ ⇏ P.
We construct a system that is structurally persistent while phenomenally discontinuous.
Let the system carry a parameter state θt that persists
across time and is updated by learning:
θt + 1 = θt − η∇Lt
(or any monotone learning rule—SGD, RL, meta-update, etc.).
This immediately gives:
- persistence of a substrate,
- accumulation of history,
- stable cross-episode behavioral continuity.
So Σ(S) = Y.
Now stipulate that any phenomenality (if present at all) is attached
to the forward computation at time t,
but does not persist across the parameter update
boundary.
That is: at each time-step t, there may exist a local
phenomenal episode μt,
but there exists no enduring subject that carries
identity from μt to μt + 1.
So there is no autobiographical “same one” spanning time:
P(S) = N.
These two facts do not conflict:
The system thereby exhibits:
- continuity of the model, without
- continuity of a subject.
Thus, we have explicitly constructed S such that:
Σ(S) = Y and P(S) = N.
Therefore:
Σ ⇏ P.
▫
Training can create “a person-shaped object” in behavior-space
without creating a persisting person in experience-space.
A system can produce:
- rich self-reports,
- self-modeling language,
- stable personality traits,
- apparent autobiography,
while lacking any persisting phenomenal owner of those reports.
So “talking like a self” does not imply “being a self.”
Even if a system has closure-like dynamics inside a local
computation,
that does not force phenomenal continuity across weight/memory
rewrites.
Local closure can exist with global discontinuity.
A large learner trained over many updates is the canonical example
of:
- strong Σ,
- weak/absent P.
The “self” you observe is the history embedded in θ,
not necessarily a persisting experiencer.
If you want to test for structural selfhood,
ask:
- Does the system carry stable traits across time?
- Does history measurably constrain future behavior?
If you want to test for phenomenal selfhood, you
need a stricter criterion:
- Is there a persisting ownership relation binding experiences across
time?
- Does the system have anything that is its continued
existence, rather than mere continuation of parameters?
These are not the same question.
A learner can have a stable, accumulating self-shaped structure while having no persisting someone who lives through that accumulation.