en원 발행일 2026-04-18

The Two Philosophies of Agent Memory — Hindsight's Epistemology, Synapsis Engine's Operational Principles

Same problem, different premises. From late 2025 to early 2026, two distinct answers emerged in the agent memory category. One began with the epistemological question, "What is memory?" The other started with the engineering question, "How do we operate memory?" Let's examine both architectures side by side to see their differences and intersections.

Prologue: The Moment the Category Split

Until 2025, "AI memory" was mostly an extension of RAG. You'd chunk data into a vector DB, retrieve top-k at query time, and attach it to the prompt. Everyone knew the limitations: contradictions accumulate in long sessions, there's no time axis, and opinions mix with facts. But until a convincing alternative appeared, it was a race of "slightly more sophisticated RAGs."

On December 14, 2025, Vectorize.io, in collaboration with Virginia Tech and The Washington Post, released the Hindsight paper (arXiv 2512.12818, _"Hindsight is 20/20: Building Agent Memory that Retains, Recalls, and Reflects"_), changing the landscape. Around the same time in Korea, Nunchi AI announced the Nexus and AMCP (Agent Memory Continuity Protocol) open specs, based on the Synapsis Engine.

Both systems tackle the same problem: enabling LLM agents to maintain consistent memory across sessions. But the foundational questions they ask are different, and that difference shapes their entire structure.

Part 1. Hindsight's Approach: Epistemological Separation

Hindsight starts with this sentence:

_"Current memory systems blur the line between evidence and inference."_

— Hindsight paper abstract

When memories accumulate without a clear boundary between facts (evidence) and inferences, agents can't explain why they believe something, nor can they justify which side to prioritize when contradictions arise. Hindsight's answer is four epistemologically separated memory networks.

The Four Networks

Network	Role	Example
World	Objective world facts	"ACME's headquarters is in Seoul"
Experience (Bank)	First-person experiences of the agent	"Held a product review with the CEO last Friday"
Opinion	Subjective beliefs + confidence score	"User seems to prefer TypeScript over Python (confidence 0.7)"
Observation	Neutral summaries by entity	"CEO Alice: engineering background, quick decision-maker, sensitive to detailed metrics"

The key is that Opinion carries a confidence score, and the evidence (raw memories) that formed that opinion can be traced back. When asked, "Why do you think that?" the agent can present a chain of evidence.

Three Operations: Retain, Recall, Reflect

There are also three distinct operations on memory:

Retain: Extracts narrative facts from the conversation stream, resolves entities, and classifies them into one of the four networks.
Recall: For a query, performs 4-way parallel search (semantic + BM25 + graph traversal + temporal filtering). Results are fused with Reciprocal Rank Fusion and re-ranked with a cross-encoder.
Reflect: Generates responses using retrieved memories plus a behavioral profile Θ (traits like skepticism, literalism, empathy). During this process, the opinion network evolves.

Reflect is particularly impressive. Previous memory systems only had retain/recall — in other words, "store" and "retrieve." Reflect means reasoning over memory, and updating memory with the results of that reasoning. That's why the paper calls it "memory as first-class substrate for reasoning."

One-Sentence Summary

Hindsight = "An epistemological memory architecture that splits memory into fact/experience/opinion/summary, and elevates reasoning to a first-class citizen."

Part 2. Synapsis Engine's Approach: Operational Principles

Synapsis Engine starts from a different premise:

_"Memory systems fail in production not because they lack sophistication, but because they violate operational discipline."_

Our experience building Synapsis Engine confirmed this: No matter how sophisticated the structure, if you break the basic rules, memory collapses. We saw this repeatedly in six rounds of LongMemEval benchmarks over several months. So Synapsis Engine's foundation is not "more sophisticated structure," but rather "rules that must be kept."

The Three Core Design Laws

The Synapsis Engine is built around three core principles:

1. Never delete atoms Even if memories contradict, they are never overwritten. Old information is linked via a superseded_by relation and remains. Without a time axis, the agent can't explain "since when" it believed something.

2. Never reorder atoms Temporal order is the default for memory. Reordering by importance breaks temporal reasoning. Priority is only reflected in retrieval results, but storage is always in chronological order.

3. Never inject non-similarity signals into retrieval scores Mixing secondary signals like "because it's recent" or "because confidence is high" into similarity scores biases retrieval. Time and confidence are used as filters, not in scoring. This rule seems simple, but most RAG systems violate it.

Atoms and Seven Relations

Synapsis Engine treats all memories as a single type, atom, and defines seven types of relationships between atoms in the atom_relations table. Nexus is essentially a reference backend for storing and serving this structure.

Relation Type	Meaning
`updates`	New information replaces old
`contradicts`	Two pieces of information conflict (both are preserved)
`supports`	One piece of information supports another
`belongs_to`	Hierarchical relationship
`causes`	Causal relationship
`temporal_sequence`	Temporal order
`same_entity`	Refers to the same entity

These relations are used for graph traversal at retrieval time. For example, to answer "What did Alice say last time?", you follow Alice's atoms via temporal_sequence, resolving contradicts and updates along the way.

AMCP — The Standard Layer

Perhaps the least known but most important part of this structure is AMCP (Agent Memory Continuity Protocol). This is not an internal part of Synapsis Engine, but an open spec for memory exchange between agents. It's released under Apache 2.0, and just as MCP became the standard for the tool layer, AMCP aims to be the standard for the memory layer.

This is not just a technical decision, but a market positioning decision. It's a declaration: "We define the category."

One-Sentence Summary

Synapsis Engine = "A memory engine that encodes operational discipline into code via three core laws for atoms, a relation graph, and an open protocol."

Part 3. The Root of the Difference: Two Questions

The difference in technical structure actually comes from the difference in the initial question.

	Hindsight	Synapsis Engine
First Question	What is memory?	How do we operate memory?
Starting Analogy	Human cognitive architecture	Database operational discipline
Core Abstractions	4 networks + 3 operations	3 laws + 7 relations + protocol
Conflict Resolution	Recalculate opinion confidence	Explicit links via atom relations
Where Reasoning Happens	Reflect is first-class	Reasoning is external (engine only stores/retrieves)
Standardization View	Architecture itself is the standard	Protocol is the standard, implementation is swappable

Hindsight aims for agents that remember like humans. So it's built with the language of cognitive structure: epistemological separation, belief evolution, personality variables, reflection. The paper is full of words like "biomimetic," "human-like," and "epistemic."

Synapsis Engine aims for a memory engine that doesn't break in production. So it's built with the language of operational discipline: "no deletion," "no reordering," "no signal injection." These are empirical laws learned from six rounds of benchmarking.

Which is right? Both answered their own questions precisely. The questions are just different.

Part 4. What They Can Learn From Each Other

Let's be honest. Side by side, each one's gaps become clear.

Hindsight's Critique of Synapsis Engine

① Single atom type is insufficient. In Synapsis Engine, "user address is Seoul" (world fact) and "user prefers Python" (opinion) are treated with the same confidence. But their lifespans, verification methods, and contradiction handling should differ. Atoms have a type field, but there's no opinion/world/experience separation. Epistemological separation is needed.

② Evidence chain is weak. Synapsis Engine has a confidence field, but no structure to trace back which raw atoms that confidence came from. If you can't answer "why do you believe this?", the memory isn't auditable.

③ No Reflect operation. Synapsis Engine is strong in retain + recall, but lacks a mechanism for opinions to evolve over time. This means it only half-delivers on the promise of "memory that learns."

Synapsis Engine's Critique of Hindsight

① Operational principles are not codified. The Hindsight paper's architecture is clear, but there are no explicit "never do this" rules. It's unclear how failure modes like deletion/reordering/score pollution are handled. Different implementers could interpret things differently.

② No interoperability standard between systems. Hindsight works as long as its own architecture functions. But what if agent A's memory needs to be transferred to agent B? What if a company switches agent providers? Without a layer like AMCP, there's lock-in.

③ Self-hosted single-node assumption is insufficient for privacy-conscious enterprises. Hindsight runs in a single Docker container, with raw data and vector index on the same node. In contrast, the Nunchi AI stack can separate raw local and vector servers further. In markets like Korea/Europe, where data sovereignty is crucial, this difference is more pronounced.

Intersection: Can They Converge?

Technically, they can. The two approaches are complementary.

Synapsis Engine could adopt epistemological atom separation (world/experience/opinion/observation) and evidence chains to absorb Hindsight's strengths.
Hindsight could adopt the three core laws and AMCP compatibility to gain Synapsis Engine's operational discipline.

But strategically, they probably won't converge. The two teams target different markets: Hindsight is for English-speaking Fortune 500s; Nunchi AI targets non-English markets, developer communities, and standards. Rather than competition, parallel evolution is the natural outcome.

Part 5. Our Choices

As a Synapsis Engine developer, after reading Hindsight thoroughly for several days, here are my conclusions:

① Stop competing on benchmark numbers. Hindsight's 91.4% is due to a large backbone, and we could surpass it with Opus 4.7. But Hindsight would also improve under the same conditions. This race is meaningless. We should talk numbers only in terms of architectural contribution on the same backbone.

② Fill Synapsis Engine's gaps. Next quarter's roadmap:

Atom type separation (world/experience/opinion/observation)
Introduce evidence chains (atom derivation DAG)
Add synapsis.reflect() primitive
Expand retrieval to 4-way fusion (current anchoring as a fifth strategy)

③ Double down on the three core laws. Just as we can absorb strengths from Hindsight, Hindsight can absorb ours. Our clearest asset is the three core laws. We'll keep this as our theoretical identity.

④ AMCP is bigger than Synapsis Engine or Nexus. We're not competing with Hindsight, but aiming for a structure where Hindsight can also implement AMCP. The MCP success formula — "Anthropic built it, but OpenAI uses it too" — should be replicated at the memory layer.

⑤ Go open source. AMCP is already open. Synapsis core will be open (MIT). Enterprise features, MaaS SSO/audit/SLA, and Nexus Cloud managed operations will remain commercial. It's an open core strategy. There's no reason to stay closed. As Hindsight has already shown, the trust gained from openness outweighs the risk of implementation cloning.

Conclusion: Two Paths

We're living through the moment when the agent memory category is splitting.

Hindsight asked "What is memory?" and answered in the language of cognitive structure. Synapsis Engine asked "How do we operate memory?" and answered in the language of operational principles.

They're climbing the same mountain from different sides. Whether the summit is the same remains to be seen. But one thing is certain: neither answer is complete on its own.

At Nunchi AI, we find our place at the intersection of these two paths. We add epistemological depth while maintaining operational discipline, define standards in Korean and non-English markets, and lead the category through open source.

In the long run, the scenario where both of us lose is if "closed RAG as-a-service" swallows the category. That's why, more than competition, we are grateful for healthy rivalry with the Hindsight team. Having teams with different answers in the same category is proof that the category itself is alive.

---

References

Hindsight paper: arXiv 2512.12818
Hindsight code: github.com/vectorize-io/hindsight (MIT)
AMCP spec: github.com/goldberg-aria/amcp (Apache 2.0)
Nexus: nunchiai.com

---

_This article is written from Nunchi AI's perspective. With respect for the Hindsight team's contributions, we juxtapose the two architectures._