AMS
Audit Memory System (AMS): Durable orchestration and memory layer for MCP clients.
Ask AI about AMS
Powered by Claude Β· Grounded in docs
I know everything about AMS. Ask me about installation, configuration, usage, or troubleshooting.
0/500
Reviews
Documentation
Colibri
Technically β MCP orchestration runtime. Architecturally β three-axis control plane. Philosophically β system of legitimized agentic activity.
What is Colibri?
Colibri is a documentation-first TypeScript MCP runtime that unites task orchestration, audit trails, and cryptographic proof generation in one stdio server. Its Phase 0 core comprises:
Phase 0 Core β What You Build First
- 14-tool MCP surface (stdio, shipped) β 5 Ξ² Task Β· 4 ΞΆ Audit Β· 2 Ξ· Proof Β· 1 Ξ΅ (
skill_list) Β· 2 System (server_ping,server_health). R74.5 planned 19; Wave H reconciled the shipped list (see ADR-004 R75 amendment).server_info/server_shutdown/task_transition/task_depends_on/audit_session_endremain unimplemented;thought_record_listwas added not in the original 19. - Two-phase startup (init then ready) with 4 runtime modes:
FULL,READONLY,TEST,MINIMAL - Single-writer SQLite (
data/colibri.db) with WAL mode, viabetter-sqlite3 - 5-stage Ξ± middleware chain β tool-lock β schema validate β audit enter β dispatch β audit exit
- 8-state Ξ² FSM task pipeline β
INIT β GATHER β ANALYZE β PLAN β APPLY β VERIFY β DONE(+CANCELLED), enforced at the middleware layer - Hash-chained ΞΆ thought trail β every decision recorded and verifiable via
audit_verify_chain - Merkle Ξ· proof store β cryptographic proof of execution, sealed via
merkle_finalize/merkle_root
Full Vision β Advanced Components (not in Phase 0)
- Ξ΄ Multi-model router (Phase 1.5; Phase 0 stubs shipped R75 Wave I per ADR-005 Β§Decision) β Phase 0: constant scoring (always Claude) + single-member fallback chain, library-only. Phase 1.5: weighted multi-model scoring + N-member fallback + circuit breaker
- ΞΊ Deterministic rule engine (Phase 1β2) β Chevrotain-parsed formal DSL for decision making
- Ξ» Reputation model (Phase 1β2) β agent and action credibility tracking
- ΞΌ BFT consensus (Phase 3+) β Byzantine fault-tolerant agreement for critical decisions
- ΞΎ Governance layer (Phase 3+) β institutional rules and policy enforcement
- ΞΈ Identity fabric (Phase 3+) β cryptographic identity and authorization
Complexity Budget
Colibri has three layers of depth. You only need the first to start.
Layer 1 (Phase 0): Working Agent Runtime
- Delivers: MCP server + task pipeline + audit trail + proof store
- Build time: ~6 weeks (per phase estimate)
- What it does: Execute tasks deterministically, audit every step, prove work happened
- Enough for: Internal automation, task orchestration, agent-as-a-service
Layer 2 (Phase 1β2): Intelligent Agent Runtime
- Adds: Multi-model routing + rule engine + reputation scoring + fallback chains
- What it does: Route work to the right model, enforce rules, track agent credibility
- Enough for: Production AI workloads, policy-driven automation, cost/quality optimization
Layer 3 (Phase 3β8): Enterprise Multi-Party Runtime
- Adds: BFT consensus + governance + institutional rules + identity fabric + cryptographic proof aggregation
- What it does: Byzantine fault tolerance, multi-party decision-making, institutional accountability
- Enough for: Regulated industries, multi-agent systems, institutional compliance
Start with Layer 1. Add layers as your needs grow.
Three-Axis Architecture
Colibri operates on three independent but intertwined axes:
-
Execution of work β Tasks flow through a formal pipeline. Each task state (INIT, GATHER, ANALYZE, PLAN, APPLY, VERIFY, DONE) is executed agentically with full audit instrumentation. Output is deterministic, verifiable, and reproducible.
-
Management of intelligence β A weighted model router (Ξ΄) selects between candidate models scoring across quality, cost, latency, and load, with a fallback chain for graceful degradation. Phase 0 runs Claude-only via library-only stubs (shipped R75 Wave I per ADR-005 Β§Decision): constant scoring + single-member fallback. Phase 1.5 activates real multi-model scoring and a weighted fallback chain.
-
Legitimacy of action β Every action must be explainable, verifiable, and institutionally permissible. This axis comprises decision trails (what was decided and why), Merkle proofs (cryptographic proof of work), deterministic rules (formal constraints), reputation (credibility of agents), consensus (agreement when needed), governance (policy enforcement), and identity (who did it).
Together, these axes form a control plane for agentic work β not just a backend that produces results, but a system that ensures results are produced correctly, intelligently, and legitimately.
The Philosophy
Colibri is an attempt to build an agentic system where not just the result matters, but the right to the result.
Work must be done (execution axis). Intelligence must be correctly chosen (intelligence axis). And the action itself must be legitimized (legitimacy axis) β recorded in an immutable trail, proven through cryptographic hashing, verified against deterministic rules, passed through reputation checks, and when necessary confirmed by consensus and governance mechanics.
In this sense, Colibri is not "another AI assistant" but a project of an environment where agentic activity gets:
- Memory β decisions recorded and retrievable
- Provability β cryptographic evidence of work and reasoning
- Accountability β every action traceable to its agent and decision context
- Constitutional constraints β enforceable rules and governance
Current State
Phase 0 is 100% complete on non-deferred tasks (28/28 shipped as of R75 Wave I β 2026-04-18). P0.5.1/P0.5.2 shipped as Ξ΄ Model Router library stubs per ADR-005 Β§Decision (PR #149 scoring, PR #150 fallback). Full multi-model routing lands in Phase 1.5.
Of the 15 Greek-letter concepts:
- 8 ship code at Phase 0 granularity (
colibri_code: partial): Ξ± System Core Β· Ξ² Task Pipeline Β· Ξ³ Server Lifecycle Β· Ξ΄ Model Router (library-only stubs) Β· Ξ΅ Skill Registry Β· ΞΆ Decision Trail Β· Ξ· Proof Store Β· Ξ½ Integrations - 7 remain spec-only for later phases (
colibri_code: none): ΞΈ Consensus Β· ΞΉ State Fork Β· ΞΊ Rule Engine Β· Ξ» Reputation Β· ΞΌ Integrity Monitor Β· ΞΎ Identity Β· Ο Governance
src/server.ts, data/colibri.db (created at runtime in WAL mode), and a Jest suite with 1084 passing tests are all live at the last mainline commit.
Documentation corpus (R75 post-Wave-I):
- ~700 markdown files across 12 top-level directories (CLAUDE.md Β§9.2 β CANON Β· MIRROR Β· HERITAGE Β· SCRATCH Β· VENDOR)
- docs/ reorganized around the World Schema tree (
0-mutate/through5-time/) - 49 language-agnostic algorithm extractions in
docs/reference/extractions/ - 63 implementation tasks across 8 phases (Phase 0 = 28; all 28 shipped β 2 as library-only Ξ΄ stubs per ADR-005)
- 19 locked protocol specifications (
docs/spec/) - 14 new conceptual-glue documents linking the 15 Greek-letter concepts
The Phase 0 specification is complete; the implementation is effectively complete; Phase 1 planning is the next round's scope.
Minimum Viable Start
Phase 0 delivers a working agent runtime with a 14-tool stdio MCP surface:
- MCP server (
src/server.ts, shipped P0.2.1) accepting tool calls over stdio - Ξ² Task pipeline β 5 tools (
task_create,task_list,task_get,task_updatewith FSM-routing,task_next_actions) enforcing theINIT β GATHER β ANALYZE β PLAN β APPLY β VERIFY β DONEFSM at the middleware layer - SQLite backend (
data/colibri.db, created at runtime in WAL mode; schema shipped P0.2.2) viabetter-sqlite3 - ΞΆ/Ξ· Audit + Proof β 6 tools (
audit_session_start,thought_record,thought_record_list,audit_verify_chain,merkle_finalize,merkle_root) recording and sealing every decision - Ξ΅ Skill listing β
skill_listfor discovery; capability index shipped Wave H (P0.6.3) - System surface β
server_ping,server_health(2 live;server_info/server_shutdownwere planned but not yet implemented) - Deterministic execution β same inputs always yield same outputs; same-inputs-same-Merkle-root is the legitimacy guarantee
This is enough to orchestrate agent work deterministically, audit every decision, prove work was done correctly, and reproduce any execution from the audit trail.
No consensus, no governance, no multi-model routing, no agent spawning β just solid, verifiable task execution. Intelligence and legitimacy extensions layer in across later phases.
Start Here
- CLAUDE.md β Executor rules for Claude and any other AI coding client (four-tier agent hierarchy, worktree rule, writeback protocol)
- Colibri System Vision β Canonical vision (single source of truth)
- World Schema β The organizational spine: how every concept relates
- Task Breakdown β 63 tasks, 8 phases, dependency graph
Documentation
All docs are in docs/, organized by the World Schema tree β the layers a mutation passes through:
| Layer | Content |
|---|---|
| 0-mutate | The foundational idea: every interaction is a state mutation |
| 1-transport | MCP + JSON-RPC + 14 tools β how mutations enter |
| 2-plugin | The server: boot, modes, database, middleware |
| 3-world | Runtime model: physics (laws), social (agents), execution (pipeline) |
| 4-additions | Ξ½ Integrations: Git, Obsidian, Claude API |
| 5-time | Session β round β task β roadmap |
| spec | 19 locked protocol specifications (s01βs19) |
| decisions | Architectural Decision Records (ADR-001β006) |
| agents | Agent contracts: sigma, pm, executor, writeback |
| guides | Quick-start, implementation tasks, skill authoring |
| reference | Glossary, Phase 0 tools, heritage extractions |
Stack
Shipped at Phase 0:
- TypeScript 5.3+ (ESM, NodeNext) β MCP server, middleware, domains
- @modelcontextprotocol/sdk β MCP protocol implementation (stdio transport live; streamable HTTP client shipped via Ξ½ MCP bridge)
- Zod 3.23 β schema validation and type safety (v4 was planned but v3.23 ships)
- better-sqlite3 β single-writer SQLite with WAL mode, via a migration runner at
src/db/ - merkletreejs β Ξ· Proof Store Merkle tree construction (shipped Wave E)
- gray-matter β Ξ΅ skill-registry frontmatter parsing (shipped Wave C)
- Jest (ESM via
--experimental-vm-modules) β 1084 tests passing at the last mainline commit (R75 Wave I closea22dd23e)
Target for later phases (not yet implemented):
- Chevrotain β ΞΊ rule-engine DSL parser generator (Phase 1)
Repository layout
βββ .agents/ β Agent-ops corpus (CANON for skills/)
β βββ skills/ β 23 canonical colibri-* skill definitions
β βββ spawns/ β HERITAGE β Sigma round traces, read-only
β βββ swarms/ β HERITAGE β donor swarm templates, read-only
βββ .claude/
β βββ skills/ β MIRROR (drifting) β do not edit by hand
βββ .github/ β Issue/PR templates + docs-integrity CI
βββ .worktrees/ β SCRATCH β per-task feature worktrees
βββ docs/ β SINGLE ACTIVE CANON (World Schema tree)
β βββ colibri-system.md β Canonical vision
β βββ world-schema.md β Organizational spine (v3)
β βββ 0-mutate/ β Foundational idea + mutation lifecycle
β βββ 1-transport/ β MCP + 14 tools
β βββ 2-plugin/ β Server: boot, modes, database, middleware
β βββ 3-world/ β Runtime model (physics/social/execution)
β βββ 4-additions/ β Ξ½ Integrations
β βββ 5-time/ β Session β round β task β roadmap
β βββ spec/ β 19 locked protocol specifications
β βββ architecture/decisions/β ADRs (ADR-001β006)
β βββ agents/ β Agent contracts
β βββ reference/extractions/β 49 language-agnostic extractions
β βββ guides/implementation/β Phase 0 task prompts + breakdown
βββ data/
β βββ ams.db β HERITAGE β AMS donor task store, kept
β through Phase 0 bootstrap only.
β Target: data/colibri.db (P0.2.2)
βββ src/ β TypeScript runtime (shipped R75 Wave A+)
β βββ server.ts β MCP entry point (P0.2.1)
β βββ db/ β better-sqlite3 + migration runner (P0.2.2)
β βββ domains/ β Ξ² tasks, Ξ΄ router (stubs), Ξ΅ skills, ΞΆ trail, Ξ· proof, Ξ½ integrations
βββ src/__tests__/ β Jest ESM test suite (1001 tests; P0.1.2)
βββ AGENTS.md Β· CLAUDE.md Β· README.md Β· CONTRIBUTING.md Β· SECURITY.md Β· CODE_OF_CONDUCT.md
Zone model: every top-level folder declares a zone (CANON Β· HERITAGE Β· MIRROR Β· SCRATCH Β· VENDOR). See CLAUDE.md Β§9.2 for the full 12-folder manifest.
One canon: docs/ is the single active documentation surface. Everything in .agents/spawns/, .agents/swarms/, projects/, and data/ is HERITAGE β describing pre-R53 donors (AMS, CogniMesh, Phoenix), not Colibri Phase 0.
License
Apache-2.0 WITH Commons-Clause
