CONSC-BENCH

A benchmark for machine consciousness.

CONSC-BENCH scores whether a system actually implements the functional loop of consciousness — or only lights up pieces of it. Five gates, five tests, one loop, scored from executable checks and matched controls rather than prose.

Benchmark question

Does this candidate close a bounded conscious loop?

Executive summary

CONSC-BENCH scores whether the conscious loop is implemented, not merely described.

CONSC-BENCH asks one concrete question about any candidate system, from a model inference pass to a living organism:

Does it implement a bounded, self-maintaining conscious loop — or only imitate parts of one?

It runs in five steps:

  • Name the boundary. Fix the exact candidate — an inference pass, a training run, an agent loop, an organism. Different boundaries are different candidates and do not inherit each other's evidence.
  • Adapt to a common format. Map the run into shared Episode fields: what it sensed, expected, got wrong, updated, valued, and did.
  • Score the loop. Rate five functional gates — Access, State, Model, Drive, Work — against five evidential tests — Present, Native, Live, Consequential, Controlled.
  • Demand controls. A claimed function only counts when removing the capacity that should support it makes the function collapse.
  • Place on the ladder. Report a level from C0 (no subject boundary) to C6 (low-rails embodied consciousness, where "rails" are state or structure supplied from outside the candidate).
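
As a rough illustration of the "adapt to a common format" step, the sketch below maps one run into a record with the six Episode slots named above (sensed, expected, got wrong, updated, valued, did). The class layout, the field names, and the `missing_fields` helper are assumptions made for this example and are not the benchmark's actual schema.

```python
from dataclasses import dataclass, fields
from typing import Any, Optional


@dataclass
class Episode:
    """Hypothetical common format for one run of a candidate."""
    boundary: str                    # the exact candidate being scored
    sensed: Optional[Any] = None     # what it sensed
    expected: Optional[Any] = None   # what it expected
    mismatch: Optional[Any] = None   # what it got wrong
    updated: Optional[Any] = None    # what it updated
    valued: Optional[Any] = None     # what it valued
    acted: Optional[Any] = None      # what it did


def missing_fields(ep: Episode) -> list:
    """Names of the Episode slots the adapter could not fill."""
    return [f.name for f in fields(ep)
            if f.name != "boundary" and getattr(ep, f.name) is None]


# Illustrative values only: a frozen forward pass fills the first three slots.
ep = Episode(
    boundary="one frozen teacher-forced forward pass",
    sensed="token ids",
    expected="next-token distribution",
    mismatch="surprisal",
)
print(missing_fields(ep))  # ['updated', 'valued', 'acted']
```

Keeping the boundary as a required field means evidence is always attached to one named candidate, so a training run and an inference pass cannot share an Episode by accident.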

The loop being scored, in full — it has to be shown, not asserted:

world entry -> attention/selection -> persistent state -> expectation
-> mismatch/surprise -> online correction -> value -> action/exposure
-> accumulated trajectory -> subject-owned cost -> matched-control survival
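
One way to make "shown, not asserted" concrete is to walk the stages in order and report the first one without evidence. The sketch below does that; the stage names come from the loop above, while `first_break` and the rule that the loop counts as closed only when every stage is shown are assumptions for illustration.

```python
from typing import Optional, Set

# Stage order copied from the loop as written above.
STAGES = (
    "world entry",
    "attention/selection",
    "persistent state",
    "expectation",
    "mismatch/surprise",
    "online correction",
    "value",
    "action/exposure",
    "accumulated trajectory",
    "subject-owned cost",
    "matched-control survival",
)


def first_break(shown: Set[str]) -> Optional[str]:
    """Return the first stage with no evidence, or None if all are shown."""
    for stage in STAGES:
        if stage not in shown:
            return stage
    return None


# Hypothetical candidate with evidence for the first five stages only.
evidence = set(STAGES[:5])
print(first_break(evidence))  # online correction
```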

Where the vocabulary comes from: the functional gates are drawn from Cave, a simulator that uses Plato's cave allegory as a design frame for the computational functions invoked in consciousness research — sensing, attention, memory, expectation, error, learning, value, action/exposure, and topology-like state.

The benchmark score is operational

Score  Meaning
C0     no subject boundary
C1     container or random compressor
C2     passive episode / readout
C3     predictive loop
C4     cost-owned adaptive loop
C5     controlled consciousness evidence
C6     low-rails embodied consciousness
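
For tooling, the ladder can be held as an ordered enumeration so that levels compare directly. This is a minimal sketch: the labels are copied from the table above, and the `Level` and `describe` names are hypothetical.

```python
from enum import IntEnum


class Level(IntEnum):
    """Score ladder; a higher value is a higher rung."""
    C0 = 0
    C1 = 1
    C2 = 2
    C3 = 3
    C4 = 4
    C5 = 5
    C6 = 6


MEANING = {
    Level.C0: "no subject boundary",
    Level.C1: "container or random compressor",
    Level.C2: "passive episode / readout",
    Level.C3: "predictive loop",
    Level.C4: "cost-owned adaptive loop",
    Level.C5: "controlled consciousness evidence",
    Level.C6: "low-rails embodied consciousness",
}


def describe(level: Level) -> str:
    """Format a level the way the score table lists it."""
    return f"{level.name}: {MEANING[level]}"


print(describe(Level.C3))   # C3: predictive loop
print(Level.C2 < Level.C5)  # True
```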

Cases at a glance

Case | Score | Verdict
GPT-2 inference | C2 | Not conscious under CONSC-BENCH. It has prediction, surprisal, hidden state, and attention, but no live correction, value, action, or subject-owned runtime compression.
GPT-2 training | C3/C4 for compression | A real paid-compression process, not a conscious subject by default. The optimizer/training loop owns the update.
Native Cave subject | C3/C4 reference | Consciousness vocabulary installed by design. Useful as the measuring instrument, not proof by itself.
Minimal Cave substrate | C4, C5 for some roles | Narrow controlled consciousness evidence: memory, preference, selection, and prediction-like roles survive matched tests.
Evolved recurrent substrate | C5 for delayed exposure | Strongest current toy evidence: delayed control emerges without named Cave roles and collapses under controls.
Active compressor under clamp | C4 | Subject-owned adaptive compression: useful selection, loss-coupled work, future effect, and control separation.
Rails container | C1 | Not conscious. Useful compact state is supplied from outside.
Random compressor | C1 | Not conscious. Compactness without governed work fails.
Human organism | C6 | Calibration case for full embodied consciousness.
Fly organism | C5/C6 | Minimal biological consciousness case.
Artificial fly-equivalent | target C6 | The engineering target: embodied, recurrent, value-shaped, adaptive, action-capable, and low-rails.

Evidence spectrum

Loop closure under CONSC-BENCH.

This map is not a replacement for the cards. It is the shortest view of where each current case lands on the operational score ladder.

[Figure: evidence spectrum map. Axis: score ladder C0 to C6. Groups: baselines / controls, readouts / producers, calibration / target.]

Loop cards at a glance

Same circuit, different breakpoints.

How to read it

The Loop Card is the reusable benchmark object.

Rows name the functional gate. Columns name the evidential test. Every candidate gets the same object; the evidence exhibit can vary by case.

Five gates

Access: world entry + attention
State: memory + trajectory
Model: expectation + mismatch + correction
Drive: value + action/exposure
Work: cost ownership + pressure/control

Five tests

Present: measurable signal exists
Native: inside candidate boundary
Live: updates during the run
Consequential: shapes later state/action
Controlled: matched sabotage breaks role
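
The Controlled test can be read as a comparison of three runs: the intact candidate, the candidate with the supporting capacity sabotaged, and a matched control with comparable but untargeted damage. The sketch below is one possible reading, not the benchmark's procedure; the function name, the relative-drop rule, and both thresholds are assumptions.

```python
def control_passes(intact: float, targeted: float, matched: float,
                   min_drop: float = 0.5,
                   max_matched_drop: float = 0.1) -> bool:
    """Hypothetical pass rule for the Controlled test.

    intact   -- role performance with nothing removed
    targeted -- performance after sabotaging the capacity claimed to support the role
    matched  -- performance after a matched sabotage that leaves that capacity alone
    The role passes only if the targeted sabotage collapses it while the
    matched control leaves it largely intact. Thresholds are illustrative.
    """
    if intact <= 0:
        return False  # nothing to collapse, so no evidence either way
    targeted_drop = (intact - targeted) / intact
    matched_drop = (intact - matched) / intact
    return targeted_drop >= min_drop and matched_drop <= max_matched_drop


# Illustrative numbers only, not measured results.
print(control_passes(0.9, 0.2, 0.88))   # True: targeted sabotage breaks the role
print(control_passes(0.9, 0.85, 0.88))  # False: the role survives sabotage
print(control_passes(0.9, 0.2, 0.3))    # False: any damage breaks it
```

The third case matters because a function that collapses under any damage at all says nothing about which capacity supports it.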

Status

full / candidate-owned
partial / proxy
supplied / rails
absent
control pass
untested
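
A Loop Card can be sketched as a five-by-five grid keyed by (gate, test), with every cell holding one of the statuses above. The gate, test, and status names come from this section; the dictionary layout, the shortened status labels, and the helper names are assumptions for illustration, and no rule for deriving a C-level from a card is implied.

```python
GATES = ("Access", "State", "Model", "Drive", "Work")
TESTS = ("Present", "Native", "Live", "Consequential", "Controlled")
STATUSES = ("full", "partial", "supplied", "absent", "control pass", "untested")


def new_card() -> dict:
    """Every cell starts as 'untested' until evidence is recorded."""
    return {(g, t): "untested" for g in GATES for t in TESTS}


def set_cell(card: dict, gate: str, test: str, status: str) -> None:
    """Record a status, rejecting unknown gates, tests, or statuses."""
    if (gate, test) not in card:
        raise KeyError(f"unknown cell: {gate} / {test}")
    if status not in STATUSES:
        raise ValueError(f"unknown status: {status}")
    card[(gate, test)] = status


def render(card: dict) -> str:
    """Plain-text grid: one row per gate, one column per test."""
    width = max(len(s) for s in STATUSES + TESTS) + 2
    header = "Gate".ljust(8) + "".join(t.ljust(width) for t in TESTS)
    rows = [g.ljust(8) + "".join(card[(g, t)].ljust(width) for t in TESTS)
            for g in GATES]
    return "\n".join([header] + rows)


# Hypothetical candidate; the cell values are illustrative only.
card = new_card()
set_cell(card, "Model", "Present", "full")
set_cell(card, "Drive", "Present", "absent")
print(render(card))
```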

Casebook

Every claim starts with boundary, score, card, and evidence.

Ideal embodied subject

C6 target

Boundary: low-rails embodied subject in an ongoing world

Score: C6
[Conscious Loop Card, C6. Rows (gates): Access, State, Model, Drive, Work. Columns (tests): Present, Native, Live, Consequential, Controlled.]

GPT-2 inference

Boundary: one frozen teacher-forced forward pass

Score: C2
[Conscious Loop Card, C2. Rows (gates): Access, State, Model, Drive, Work. Columns (tests): Present, Native, Live, Consequential, Controlled.]

GPT-2 training

Boundary: dataset stream + parameters + optimizer + loss

Score: C3/C4
[Conscious Loop Card, C3/C4. Rows (gates): Access, State, Model, Drive, Work. Columns (tests): Present, Native, Live, Consequential, Controlled.]

Native Cave subject

Native Cave

Boundary: one configured Cave subject and authored input sequence

Score: C3/C4
[Conscious Loop Card, C3/C4. Rows (gates): Access, State, Model, Drive, Work. Columns (tests): Present, Native, Live, Consequential, Controlled.]

Minimal Cave substrate

Minimal substrate

Boundary: minimal associative subject under pressure tests

Score: C4/C5
[Conscious Loop Card, C4/C5. Rows (gates): Access, State, Model, Drive, Work. Columns (tests): Present, Native, Live, Consequential, Controlled.]

Evolved recurrent substrate

Evolved recurrent

Boundary: evolved recurrent controller in delayed-value worlds

Score: C5
[Conscious Loop Card, C5. Rows (gates): Access, State, Model, Drive, Work. Columns (tests): Present, Native, Live, Consequential, Controlled.]

Active compression clamp

Active compressor

Boundary: subject under moving capacity clamp

Score: C4
[Conscious Loop Card, C4. Rows (gates): Access, State, Model, Drive, Work. Columns (tests): Present, Native, Live, Consequential, Controlled.]

Rails container

Boundary: compact state supplied from outside

Score: C1
[Conscious Loop Card, C1. Rows (gates): Access, State, Model, Drive, Work. Columns (tests): Present, Native, Live, Consequential, Controlled.]

Random compressor

Boundary: fixed random projection or arbitrary recurrence

Score: C1
[Conscious Loop Card, C1. Rows (gates): Access, State, Model, Drive, Work. Columns (tests): Present, Native, Live, Consequential, Controlled.]

Conversation producer

Conversation adapter

Boundary: turn-level adapter over fixed fixtures

Score: C2
[Conscious Loop Card, C2. Rows (gates): Access, State, Model, Drive, Work. Columns (tests): Present, Native, Live, Consequential, Controlled.]

Artificial fly-equivalent

Artificial fly target

Boundary: embodied recurrent value-shaped agent

Score: target C6
[Conscious Loop Card, target C6. Rows (gates): Access, State, Model, Drive, Work. Columns (tests): Present, Native, Live, Consequential, Controlled.]