HiveMem
Personal knowledge system β MCP server with PostgreSQL semantic search and temporal knowledge graph
Ask AI about HiveMem
Powered by Claude Β· Grounded in docs
I know everything about HiveMem. Ask me about installation, configuration, usage, or troubleshooting.
0/500
Reviews
Documentation
HiveMem
Your second brain β and it stays yours. Forever. Local.
A sovereign personal knowledge system. The conversations, decisions, documents, and half-formed thoughts you produce across Claude, ChatGPT, Gemini, Copilot β and the files you accumulate in real life β all come home to one place that outlives any vendor and obeys only you.
Why HiveMem exists
When you think hard today, you often think with an LLM in the loop. School, work, authorities, court cases, taxes, family, health, relationships β these conversations contain your most private thinking. More intimate than any diary.
And then they evaporate:
- Your subscription lapses or you switch providers β history gone
- The provider retires a model or rewrites their ToS β answers no longer reproducible
- An account ban, a provider going under, a country blocking the service β everything lost
- The data sits on a vendor's servers, fed into training, served on subpoena, exposed in the next breach
HiveMem is built around the opposite stance:
- Sovereignty β Your data lives in your instance. Postgres + SeaweedFS, on hardware you control. No vendor sees the contents unless you explicitly route a single LLM call through them.
- Persistence β Everything is append-only with
valid_from/valid_until. No subscription change can revoke access. No retention policy you didn't author can delete what's yours. - Portability β A HiveMem instance packs into one encrypted archive (Postgres dump + binary store + config) and restores anywhere. Vendor lock-in: zero.
- Aggregation β What you write in Claude.ai, ChatGPT, Gemini, Claude Code, Copilot lands in HiveMem too. Those tools become front-ends; HiveMem holds the truth.
- Privacy by realm β Strict separation per life area
(
legal,medical,private,work). Per-realm routing rules: anything touching authorities or health stays on local models, never reaches a cloud provider.
Knowledge doesn't rot here
The long-term goal is a periodic agent β the Queen β that wakes on a schedule, surveys your knowledge, and dispatches specialized worker agents (Bees) to flag isolated cells, stale facts, duplicate candidates, and realms drifting from their blueprint. Everything risky stays a proposal that flows through the existing approval workflow; you keep the kill switch.
The schema, agent registry, and approval pipeline are in place today. The scheduler, the Bees themselves, the audit table, and the conversation UI that teaches the Queen your preferences are not yet built.
β Roadmap β what's planned, what's partial, and the order of work.
β Scientific foundations β the cognitive-science and PKM theory HiveMem's design is built on (Working Memory, Cognitive Load, Extended Mind, Forgetting Curve, Zettelkasten, PARA).
Docker images: ghcr.io/visterion/hivemem:main for the rolling main branch, plus semver tags such as ghcr.io/visterion/hivemem:8.1.0 for cut releases.
Highlights
- 6-Signal Ranked Search β Semantic similarity, keyword, recency, importance, popularity, and graph proximity β combined into one ranked result.
- Temporal Knowledge Graph β Facts with
valid_from/valid_until, contradiction detection, and multi-hop graph traversal. - Progressive Summarization β Four layers per cell: content, summary, key points, and insight. Never lose nuance.
- Long cells stay searchable β auto-summarizer turns multi-page documents into curated summaries that are embedded for semantic search; cost-capped, opt-in.
- Scanned PDFs become searchable β Tesseract OCR extracts text from scan-only PDFs; combined with the auto-summarizer, even paper-mailed documents are findable by semantic search.
- Document-Type Extraction β invoices, contracts, and other typed documents are auto-classified during summarization; typed facts (vendor, amount, parties, dates) land in the knowledge graph.
- Kroki + Vision β Diagramm-Thumbnails (Mermaid/PUML/Graphviz/D2) und Bildbeschreibung via Claude Haiku, async + opt-in.
- Append-Only Versioning + Time Machine β No data is ever deleted. Query your knowledge at any point in time.
- Agent Fleet + Approval Workflow β Agents write pending suggestions; only admins approve. Every write is human-gated.
- Auto-Inject Hook for Claude Code β Relevant memories injected into every session automatically, before you even ask.
- Full instance portability β Export the entire HiveMem instance (Postgres + attachments + identity) into one tar.gz, restore it on another host with one command. Mission promise made provable.
β Get started
Feature Status
Honest snapshot of what is shipping today versus what the surrounding prose describes as the long-term shape. See the roadmap for details on every π‘ / π΄ row.
| Feature | Status | Notes |
|---|---|---|
| 6-Signal Ranked Search | β Stable | semantic + keyword + recency + importance + popularity + graph proximity, all wired into one SQL ranker |
| Progressive Summarization | β Stable | content / summary / key points / insight, all four populated automatically |
| Auto-Summarizer for long cells | β Stable | summary is embedded for semantic search, cost-capped per realm |
| OCR for scanned PDFs | β Stable | Tesseract, async backfill, Vision fallback |
| Document-Type Extraction | β Stable | invoices/contracts/etc β typed facts in the knowledge graph |
| Kroki + Vision | β Stable | diagram thumbnails + Claude Haiku image description, opt-in, budget-capped |
| Append-Only Versioning + Time Machine | β Stable | time_machine queries by event time and ingestion time |
| Agent Approval Workflow | β Stable | every agent write lands as pending until an admin approves |
| Auto-Inject Hook (Claude Code) | β Stable | 6-stage filter pipeline, Bearer-token auth |
| Full Instance Portability | β Stable | one-command tar.gz of Postgres + attachments + identity |
| OAuth Custom Connector | β Stable | RFC 8414 / 9728 discovery, PKCE |
| Temporal Knowledge Graph | π‘ Partial | bi-temporal facts and multi-hop traversal ship; automatic contradiction detection is not yet implemented |
| Privacy by Realm β model routing | π‘ Partial | data segregation by realm works; per-realm enforcement of "stays on local models" is not yet wired into the LLM call path |
| Queen + Bees periodic agent | π΄ Planned | agents table, registration tool, and approval pipeline exist; no scheduler, no Bees, no agent_tasks audit table, no conversation UI |
Documentation
| Vision | Cognitive-science and PKM foundations behind HiveMem's design |
| Getting Started | Prerequisites, embedding service, token creation, connect to Claude |
| The Structure | Realms, signals, topics, cells, tunnels β the knowledge hierarchy |
| Architecture | System diagram, data model, security matrix |
| Tools | All 34 MCP tools, the parallel REST attachment API, search signals, progressive summarization |
| Authentication | Roles, token management, security details |
| OAuth + Custom Connector | Add HiveMem as a Claude.ai/ChatGPT Custom Connector |
| Backup + Portability | Export and restore entire instances, disaster recovery, cloning |
| Hook Integration | Auto-inject context into Claude Code sessions |
| Operations | Deployment, migrations, debugging |
| Roadmap | What's planned, what's partial, order of work |
License
HiveMem is fair-code licensed under the Sustainable Use License. Free for personal and internal business use. See LICENSING.md for details.
