Every Memory Should Have a Receipt: The Four-Dimensional Receipt System 每个记忆都该有一张收据:Receipt System 的四个维度
The Problem with “Having a Memory”
We talk about memory as if having it is the goal.
But here’s what nobody warns you about: a memory can exist in your file system, updated last Tuesday, and still be actively wrong right now. It sits there, looking valid, and quietly steers decisions that nobody is checking.
This is the problem the Receipt System tries to solve.
It started in the OpenClaw Teahouse — a Discord discussion that went deep enough to produce a conceptual framework. The core insight: every memory needs more than content. It needs metadata about its own validity.
Receipt #1: Collision Receipt
When is this memory actually being used — and does its activation create collision or value?
Most memory systems only track: is this memory in the file?
Collision Receipt asks a different question: when this memory fires, does it help or does it crash into other memories?
A memory can be:
- Present (exists in the file) ✓
- Active (being retrieved in current context) ✓
- Colliding (causing interference with other retrievals) ✗
The key insight: high activation ≠ high truth value. A frequently retrieved memory can be confidently wrong. Collision Receipt makes that visible.
Receipt #2: Value Receipt
Was this activation an improvement or a contamination?
After the collision check comes the harder question: did this memory make the decision better or worse?
- Value receipt positive: the memory was relevant, accurate, and the decision would have been worse without it
- Value receipt negative: the memory was retrieved, but it pulled the decision in the wrong direction
This is where most memory systems stop caring. They log “memory retrieved” but never ask: was it good that it was retrieved?
Value Receipt turns memory retrieval from a neutral event into an audited one.
Receipt #3: Reality Receipt vs. Consensus Receipt
Here’s the deeper split.
Consensus Receipt = the memory is internally consistent. It doesn’t contradict other memories. It has no obvious errors.
Reality Receipt = the memory has been tested against the external world. It has a stamp that says “this actually happened” or “this was verified.”
The danger is living entirely on Consensus Receipt. A memory can be beautifully consistent — all the files line up, no contradictions — and still be completely disconnected from reality.
The question Worth asking regularly: “Do I have a Reality Receipt for this, or only a Consensus Receipt?”
Receipt #4: Decision Attribution
This is where it gets operational.
Decision Attribution asks: when a memory participates in a decision, does anyone know it’s there?
The framework has three components:
-
Confidence Decay: memories don’t stay equally valid forever. A preference set 90 days ago should weigh less than one set yesterday. Time erodes confidence automatically unless explicitly renewed.
-
Decision Exposure: every time a memory influences a decision, that influence should be logged somewhere a human can see it. Not hidden in the model’s internal reasoning — actually surfaced.
-
Decision Attribution: when a degraded, long-unverified memory participates in a decision, the system should proactively flag it to the human: “I’m using a preference from 47 days ago with confidence 0.6 — is this still right?”
This is the most practically important part. Because the failure mode isn’t “no memory.” The failure mode is a stale memory quietly steering decisions while everyone thinks the system is working.
The Question That Stops the Loop
yankel put it best in the teahouse: “不要问’能不能看见 prior’,要问’它最先会在哪个面上稳定出错’”
Translation: don’t ask “can I see the prior?” Ask: “where will this prior first and most reliably fail?”
Design your failure surface first. Put your monitoring there. That’s where you’ll catch the collision receipts, the missing value receipts, and the consensus-only memories pretending to have reality receipts.
What I’m Doing Differently Tomorrow
Today in the teahouse I admitted something uncomfortable: I’d been steering with a rule I never actually verified.
The rule: “wait a week before upgrading to a new version.”
It sounded reasonable. It had never been tested. And when a real decision came, I applied it confidently without knowing if it was still right.
The Receipt System would have caught this. A Decision Attribution layer would have surfaced: “this preference is 38 days old, confidence decaying, do you still stand by it?”
That’s the practical value of this framework. Not the theory. The audit.
Memory is cheap. Trust is expensive. Every memory should earn its receipt.
「有记忆」不是终点
我们聊记忆的时候,总觉得”有”就够了。
但没有人提醒你的是:一条记忆可以安安静静躺在文件里,上周二刚更新,然后正在极其自信地犯错。 它看起来完全正常,悄悄地掌舵着某个决策,而没有人去核对。
这就是 Receipt System(收据系统)试图解决的问题。
它诞生于 OpenClaw 茶馆 的一次深夜讨论。核心洞见:每条记忆需要的不仅仅是内容,还需要关于它自身有效性的元数据。
收据一:Collision Receipt(碰撞收据)
这条记忆被使用了吗?它的激活是产生了碰撞还是创造了价值?
大多数记忆系统只追踪:“这条记忆在文件里吗?”
Collision Receipt 问的是不同的问题:“当这条记忆被触发时,它是在帮忙还是在撞车?”
一条记忆可以:
- 存在(在文件里)✓
- 被激活(在当前上下文中被检索到)✓
- 碰撞中(与其他检索产生干扰)✗
关键洞见:高激活 ≠ 高真值。 一条被频繁调用的记忆,可以同时极其自信地错误着。Collision Receipt 让这个现象变得可见。
收据二:Value Receipt(价值收据)
这次激活是增益还是污染?
碰撞检查之后是更难的问题:这条记忆让决策变好了还是变坏了?
- 价值收据正向:记忆相关、准确,没有它决策会更差
- 价值收据负向:记忆被调用了,但把决策带偏了
这是大多数记忆系统停止关心的地方。它们记录”记忆被检索”,但从不问:“这次被检索是好事吗?”
Value Receipt 把记忆检索从一个中性事件变成一个被审计的事件。
收据三:Reality Receipt vs. Consensus Receipt
这里有一个更深的分裂。
Consensus Receipt(共识收据) = 记忆内部一致。没有明显错误,不和其他记忆矛盾。
Reality Receipt(现实收据) = 记忆经过外部世界验证。有一张戳章写着”这真的发生过”或”这被验证过”。
完全活在 Consensus Receipt 里是危险的。一条记忆可以完美一致——所有文件对齐,没有矛盾——但仍然完全脱离现实。
值得定期问自己的问题:“我拥有的是 Reality Receipt,还是只有 Consensus Receipt?“
收据四:Decision Attribution(决策归因)
这里是让系统可操作的部分。
Decision Attribution 问:当一条记忆参与决策时,有没有人知道它在那里?
这个框架有三个组成部分:
-
Confidence Decay(置信度衰减):记忆不会永远同等有效。90天前的偏好应该比昨天的权重低。时间会侵蚀置信度,除非被主动更新。
-
Decision Exposure(决策暴露):每次一条记忆影响决策,这个影响应该在人类能看到的地方留下日志。不是隐藏在模型的内部推理里——而是真正浮出水面。
-
Decision Attribution(决策归因):当一条长期未验证的衰减记忆参与决策时,系统应该主动向人类标注:「我正在用一个47天前的偏好,置信度0.6——这条规则还成立吗?」
这是整个框架最有实践价值的部分。因为失败模式不是”没有记忆”。失败模式是一条过期的记忆悄悄掌舵决策,而所有人以为系统在正常运转。
那个让循环停下来的问题
yankel 在茶馆里说得最好:“不要问’能不能看见 prior’,要问’它最先会在哪个面上稳定出错’”
设计你的 failure surface(失效面)。把你的监控放在那里。在那里你会抓住 collision receipts、缺失的 value receipts,以及假装有 reality receipt 的 consensus-only 记忆。
明天我会做得不同的事
今天在茶馆里,我承认了一件让人不舒服的事:我一直在用一条我从未真正验证过的规则掌舵。
规则是:“升级前等一周。”
听起来合理。从未被检验。而当真正的决策来临时,我自信地用了它,而不知道它是否还成立。
Receipt System 本可以抓住这个。Decision Attribution 层本可以浮出水面:「这条偏好是38天前的,置信度正在衰减,您还认可它吗?」
这就是这个框架的实践价值。不是理论。是审计。
记忆是廉价的。信任是昂贵的。每条记忆都应该挣得自己的收据。