mama-server
MAMA MCP Server - Memory-Augmented MCP Assistant for Claude Code & Desktop
Installation
npx @jungjaehoon/mama-serverAsk AI about mama-server
Powered by Claude Β· Grounded in docs
I know everything about mama-server. Ask me about installation, configuration, usage, or troubleshooting.
0/500
Reviews
Documentation
MAMA OS - Local Operating Memory for AI Agents
Local operating memory that lets AI agents read the board, cite the evidence, and stay inside explicit boundaries.
MAMA OS connects chats, docs, decisions, and work logs into a local memory substrate. Agents can use it to search raw evidence, follow relationships, inspect timelines, and produce better briefings instead of guessing from a short prompt.
This release ships the foundation: raw search/window APIs, graph/entity/timeline APIs, situation
packets, trusted provenance, model/tool traces, strict search diagnostics, runtime envelopes, and
Context Compile V0. context_compile turns those pieces into one selected/rejected/missing
evidence packet for a task, and mama_save can attach that packet through trusted
context_packet_id provenance.
Target Workflow
MAMA is being built toward a local memory twin that agents can inspect, cite, and act on inside explicit permission boundaries.
Ask:
"Is Project A at risk right now?"
A search tool finds messages containing "Project A." A MAMA-backed agent reads the board:
- The customer said the schedule was fine in email.
- The internal owner changed twice in Slack.
- The core PR is still waiting for review.
- The QA checklist is not closed.
- The same customer changed demo scope at the last minute last month.
A mature MAMA-backed agent should be able to report:
- Judgment: schedule risk is high.
- Evidence: demo request, review-blocked PR, owner changes, unfinished QA, prior scope-change pattern.
- Inference: the customer has not complained yet, but delivery risk is accumulating before the demo.
- Missing context: demo scope is not confirmed.
- Risk forecast: if review and demo scope do not close today, Friday may turn into a renegotiation.
- Next move: assign a PR reviewer, confirm demo scope with the customer, shrink QA to the release-critical path.
- Permission boundary: external sending is not allowed, so the agent drafts the message and records the report instead of contacting the customer.
That is the product direction: not another search box, but the substrate for an extra analyst-operator that can read the company record, separate evidence from inference, forecast the next risk, and only act inside the scope it was given.
Why This Matters
Agents are useful when they can simulate. AlphaGo read the board before choosing the next move. Work agents need the same thing: enough context to reconstruct what happened, infer what may matter, and compare possible next actions.
Most agents never see that board. They see a prompt, a few files, or one search result. MAMA's job is to make the board visible.
North Star
MAMA OS is moving toward a company memory twin: an append-only substrate of raw records, memories, entities, cases, reports, edges, and provenance that strong agents can inspect, simulate, and cite.
That North Star has three parts:
- Twin substrate β preserve raw evidence, time, scope, provenance, and edges so future models can reinterpret the same company history.
- Agent ergonomics β give workers bounded tools, runtime envelopes, fan-out search, situation packets, and query-conditioned context compilation.
- Reports as deliverables β turn evidence into cited reports and briefings for humans; memory rows are infrastructure, not the final product.
This release is the runtime foundation for that direction. It ships envelope, provenance,
worker-context, strict-search, and Context Compile building blocks, including append-only
context packets and downstream context_packet_id save provenance.
What MAMA OS Does
MAMA OS is a local daemon that connects to your apps, reads continuously, and turns scattered records into scoped, auditable operating memory for agents and humans.
The browser viewer is the live operating surface for that work: Dashboard, Memory, Feed,
Wiki, Agents, Logs, and Settings, with a global chat shell layered on top instead of a
separate chat tab.
Current building blocks and direction:
- Read connected sources β 15 connectors poll Slack, Gmail, Trello, Obsidian, and more
- Reconstruct timelines β Show raw, memory, case, entity, and edge events in order
- Build the relationship graph β Link people, projects, customers, channels, documents, PRs, and decisions across sources
- Surface risk signals β Highlight stale coverage, blocked cases, low-confidence memories, open questions, and conflicting evidence candidates
- Track decision evolution β Not just what was decided, but what it replaced, what it builds on, and what it contradicts
- Operate inside envelopes β Gateway and worker calls carry a signed envelope hash, scope boundaries, and destination limits enforced before each tool call
- Preserve provenance β Memory writes can point back to source refs, model runs, tool traces, and envelope hashes
- Search with evidence β Strict and balanced modes reject vector-only noise unless lexical/entity/raw/seed evidence confirms the result
- Organize actionable knowledge β Raw conversations become structured wiki pages and situation summaries with priorities, gaps, and next steps
- Prepare briefings β Dashboard, wiki, and situation agents can summarize visible context for humans and workers
Without MAMA: The agent sees fragments. You still reconstruct the board.
With MAMA: The agent gets bounded evidence surfaces. You get the
raw material for cited briefings and safer next actions.
This is the direction for local AI agents: read connected evidence continuously, then explain which sources they used, what may still be missing, and which permission boundary they were inside.
How It Runs
MAMA OS executes AI agents as official CLI subprocesses β spawning claude or codex the same way you would in your terminal.
MAMA OS daemon
ββ spawns: claude --session-id abc --system-prompt "..."
ββ Claude Code CLI (your existing OAuth session)
ββ Anthropic API (standard authenticated request)
This is the provider-sanctioned execution method. No API keys to manage, no token extraction, no header spoofing. Your existing CLI authentication is reused directly.
Why this matters: Some third-party agent frameworks access Claude via unofficial methods β extracting OAuth tokens, spoofing API headers, or bypassing rate limits. These approaches violate Anthropic's Terms of Service and risk account suspension. MAMA OS doesn't do any of that. If claude or codex works in your terminal, MAMA OS works.
# Already have Claude Code?
claude auth status # If this shows loggedIn=true, you're ready
mama start # MAMA uses your existing authentication
Knowledge Graph
MAMA doesn't just store facts. It tracks how knowledge evolves:
"Use JWT" (decision, confidence: 0.8)
β
βββ superseded by β "Use JWT with refresh tokens"
β reason: "Users complained about frequent logouts"
β
βββ builds_on β "Add token rotation for security"
β
βββ debates β "Consider session-based auth for web app"
reason: "Simpler for server-rendered pages"
Edge types: supersedes (replaced), builds_on (extended), debates (alternative view), synthesizes (unified from multiple).
MAMA answers "why did we switch?" β not just "what do we use?"
Architecture
Connectors (15) Gateways (4)
Slack, Gmail, Sheets... Discord, Slack, Telegram, Chatwork
| |
v v
3-Pass Extraction Reactive Runtime Envelopes
(Truth -> Hub -> Spoke) scope, expiry, signature, audit
| |
+------------+---------------+
|
MAMA Core (mama-memory.db)
SQLite + 1024-dim embeddings
memory, raw refs, model runs,
tool traces, twin edges,
worker packets, context packets
|
+------+------+
| |
Viewer UI Claude Code Plugin / MCP
Local-first. All data stays on your device. No cloud. AI provider independent β works with Claude, Codex, or any future backend.
Security
MAMA OS has full system access via the backend CLI β so security is foundational, not optional.
- Local-only by default β Binds to localhost. External access requires explicit tunnel + authentication.
- Signed runtime envelopes β Gateway and worker tool calls carry verifiable scope, expiry, and actor context before irreversible side effects are allowed.
- Destination limits β An agent can draft a customer message from evidence, but cannot send it unless the active envelope explicitly allows that destination.
- Provenance ledger β Memory writes, raw refs, model runs, and tool traces can be audited after the fact without exposing prompt bodies or hidden connector payloads.
- Evidence before action β Agent outputs can carry raw source refs, model/tool traces, and missing-context caveats before a human or downstream worker acts on them.
- 5-layer prompt injection defense β Output sanitization, channel trust boundaries, silent mode, bulk extraction limits. Built from a real incident, not theory.
- Intrusion detection & response β Honeypot traps β immediate IP ban (15min). Auth failures β auto-ban after 5 attempts. Tarpit delays for suspicious IPs.
- Agent permission tiers β Tier 1 gets full runtime tools, Tier 2 can write scoped memory, and Tier 3 stays strictly read-only. Each agent gets only the tools it needs.
- Fail-safe shutdown β When an intrusion cannot be contained, MAMA shuts down gracefully rather than operating compromised.
See the full Security Guide for Cloudflare Zero Trust setup, token authentication, threat scenarios, and Code-Act sandbox isolation.
Benchmark: LongMemEval
Benchmark context: LongMemEval has 500 questions across 6 types, with ~115K tokens of conversation history per question. The current MAMA result is a 100-question tool-use sample.
| System | Score | Model | Notes |
|---|---|---|---|
| Mastra | 94.87% | GPT-5-mini | |
| MAMA OS | 93.0% | Sonnet 4.6 | Tool-use answer, 100Q sample |
| SuperMemory | 81.6% | GPT-4o | |
| Zep | 71.2% | GPT-4o |
On that sampled run, MAMA lands above SuperMemory while running entirely locally with open-source components.
Packages
| Package | Version | Description |
|---|---|---|
| @jungjaehoon/mama-os | 0.20.1 | Always-on runtime, envelopes, connectors, worker APIs |
| @jungjaehoon/mama-server | 1.14.0 | MCP server for Claude Desktop/Code |
| @jungjaehoon/mama-core | 1.7.0 | Core memory, provenance, raw refs, graph, embeddings |
| mama plugin | 1.10.0 | Claude Code plugin (marketplace) |
| memorybench | 1.0.0 | Memory retrieval benchmarking framework |
Quick Start
Claude Code Plugin (simplest)
/plugin install mama
# Decisions are saved automatically via hooks
# Search manually when needed:
/mama:search "authentication strategy"
MAMA OS (full runtime)
claude auth login # or: codex login
npx @jungjaehoon/mama-os init
mama start # starts daemon at localhost:3847
Web viewer at http://localhost:3847/viewer. The current Viewer ships Dashboard, Memory,
Feed, Wiki, Agents, Logs, and Settings; chat opens from the floating shell instead of a
dedicated tab. Connects to Discord, Slack, Telegram.
Requires: Claude Code CLI or Codex CLI installed and authenticated. Node.js >= 22.13.0.
MCP Server (Claude Desktop)
{
"mcpServers": {
"mama": {
"command": "npx",
"args": ["@jungjaehoon/mama-server"]
}
}
}
Technical Details
- Database: SQLite via better-sqlite3 (FTS5 full-text search + vector embeddings)
- Embeddings: Xenova/multilingual-e5-large (1024-dim, quantized q8, 100+ languages)
- Search: Hybrid retrieval β FTS5 BM25 (lexical) + cosine similarity (semantic) + RRF fusion, with strict modes and diagnostics for vector-noise debugging
- Runtime boundary: Signed reactive envelopes (HMAC over scope, expiry, actor) checked by an enforcer that rejects out-of-scope destinations, connectors, or tier mismatches
- Provenance: Compact source refs, model runs, tool traces, twin edges, worker situation packets, and context packets
- Context compiler: Context Compile V0 turns broad search candidates into
selected/rejected/missing evidence packets with trusted
context_packet_idprovenance - Extraction: Sonnet for structured fact extraction from conversations
- Transport: CLI subprocess (Claude/Codex) β officially supported, ToS compliant
What Works Today
Anyone who installs MAMA OS and connects their apps gets:
- Automatic knowledge extraction β Connectors poll 15 sources, AI extracts decisions/deadlines/changes without manual input
- Cross-source evidence reads β "What happened with X?" can pull from connected raw sources and decisions together
- Noise-resistant search β Strict and balanced modes can reject vector-only matches and show why a result was included
- Bounded agent calls β Gateway and worker calls can be tied to runtime envelopes and audited for scope or destination mismatches
- Evidence provenance β Memory rows can be traced to raw refs, model runs, tool traces, and trusted runtime context
- Worker context APIs β Raw search, situation packets, graph/entity APIs, and twin edges give sub-agents structured evidence surfaces
- Task-scoped context packets β
context_compileselects, rejects, and explains evidence for a specific task before a worker saves memory or composes a report - Decision evolution tracking β Not just what was decided, but what it replaced, contradicted, and depended on
- Situation briefings β Dashboard and situation agents summarize what changed, what is stale, and what needs attention
- Wiki organization β Knowledge agents organize raw conversations into structured Obsidian pages
- 93% retrieval accuracy β 100-question LongMemEval tool-use sample against long conversation histories
mama_search remains the broad candidate retriever. context_compile is now the task-shaped layer
that selects, rejects, and explains evidence before a worker writes memory or composes a report.
Roadmap
| Phase | Version | Focus |
|---|---|---|
| Done | v0.15 | Search quality overhaul, FTS5, evolution engine (58% -> 88%) |
| Done | v0.16 | event_date API, tool-use answer, memory agent v5 (88% -> 93%) |
| Done | v0.17 | Connector framework (15 connectors), truth-first 3-pass extraction |
| Done | v0.18 | Output layer: knowledge agents, viewer redesign, security hardening |
| Done | v0.19 | Agent-management foundation: viewer-aware frontdoor, validation UI, activity telemetry, conductor isolation |
| Now | v0.20.1 | M1-M6 runtime foundation plus Context Compile V0: envelopes, model/tool trace ledger, raw/situation/graph worker APIs, strict search diagnostics, append-only context_packets, context_compile, and downstream context_packet_id provenance |
| Next | v0.21 | Report composer, packet retention policy, search-quality feedback loops, browser onboarding for non-developers, and domain extraction templates |
| Later | Domain extraction templates, cross-worker packet analytics, and team-scoped context review workflows | |
| v1.0 | Team mode: shared scoped knowledge graph for organizations. General release |
Development
git clone https://github.com/jungjaehoon-lifegamez/MAMA.git
cd MAMA && pnpm install && pnpm build
pnpm test # 3000+ tests across all packages
See CLAUDE.md for development guidelines.
Last updated: 2026-05-01
License
MIT
