Fields

Cognition & memory

Cognitive maps, structured binding, and how the brain composes — applied to artificial cognition.

scopecognitive maps · binding · lifelong identity
methodmulti-seed A/B with published negatives
outputdiscriminating tasks, not leaderboards
agent tool goal state reward memory
structured map · entities bound to roles, not folded into a flat embedding
Why cognition is a science program

Cognitive maps as compositional substrate

A growing body of neuroscience suggests cognition is supported by structured maps that bind entities to roles and compose across domains. Our science program treats this seriously: we build artificial systems that share those organising principles, and we evaluate them against tasks designed to discriminate composition from memorisation.

most models

flatten entities, roles, and worlds into one embedding space and hope a deeper net untangles them again.

our bet

cognitive maps treat entities, roles, and worlds as separate structural objects, and we evaluate against tasks that punish you for ever folding them.

Three angles

Where the science meets the system

C1

Cognitive maps

Structured maps as a substrate for analogy and counterfactual, drawn from neuroscience and tested against our own memory library.

entity role world
rewards cognitive structure that supports analogy and counterfactual punishes flat embeddings that look like the training set
C2

Binding

How role-filler binding behaves under role swap and held-out evaluation — and what falls out when it does not.

subject verb object dog chases cat subject verb object cat chases dog same content · roles permuted · the binding must change
rewards binding that survives role swap and held-out combinations punishes systems that pass IID and fail under permutation
C3

Lifelong identity

How long-term memory consolidates without overwriting earlier worlds, measured against amnesiac controls.

retain worlds seen → consolidating amnesiac
rewards memory that consolidates earlier worlds while learning new ones punishes recency bias dressed up as continual learning
binding under permutation

Slot-factored vs byte-level, measured

Held-out, multi-seed. The same model class evaluated under two binding regimes. Where the system has explicit slots, role swap and held-out combinations are not adversarial.

task slot-factored byte-level notes
compositional 0.96 0.14 held-out role-filler combinations · n=5
relational 0.99 0.63 simple subject-verb-object scenes · n=5
role-swap 1.00 0.60 identical content, roles permuted · n=5
naming 1.00 0.00 name-targeted prefix-LM, held-out · n=5

The win is the binding regime, not the architecture. We measure the same backbone family in both columns so the comparison isolates the structural choice.

lifelong identity

Consolidation, not overwrite

A controller that consolidates retains every world it has seen. An amnesiac controller — same data, same compute, no consolidation — loses earlier worlds as new ones arrive.

0.00 0.25 0.50 0.75 1.00 w1 w2 w3 w4 w5 w6 w7 w8 w9 w10 w11 w12 retention across worlds seen consolidating · forgetting +0.00 amnesiac · forgetting +1.00
n=5 seeds · same data, same compute, the consolidation rule is the only varied axis.
honest negatives

What did not survive a clean A/B

The bet that survives is on objective and binding. Where a tempting architectural claim does not, we say so on the page.

negative

fair-fight on continuity is a negative

Under a clean A/B test where two backbones share the same learning rule, the developmental substrate does not beat a strong transformer on lifelong retention. We say so.

measured · retain 0.66 ± 0.25 vs 0.94 ± 0.03

negative

naming is not architecture-bound

A transformer reaches a perfect naming score on the same harness. The win in naming is the objective and the rearing, not the substrate.

measured · transformer naming = 1.00 held-out

negative

C > A is not clean

Across n=5 seeds the gap between the developmental and the strong baseline is well inside the error bars. We do not claim a substrate win there.

measured · gap inside ± 0.24

A scientific bet, evaluated honestly

We publish negatives. Where an architecture-priors claim does not survive a clean A/B test, we say so. The bet that survives is on objective and binding, not on which substrate you stamp on top.

cross-cuts

Cognition is the substrate everything else sits on

The same binding and consolidation primitives surface across our research programs — structured memory, evals, alignment — and through every long-running production system we ship.