Contextplus Rs

Context+ MCP server rewritten in Rust — 17 semantic code analysis tools with Ollama embeddings, zero-copy caching, native tree-sitter

contextplus-rs

High-performance MCP server for semantic code analysis, written in Rust. Drop-in replacement for the original Context+ TypeScript implementation with 5-20x faster warm queries.

Why Rust?

Cold Start Benchmark (fresh process per call)

Measured against the workspace (~2,800 source files). Each call spawns a fresh process:

Tool	Rust	TypeScript	Grep	Rust vs TS
`get_file_skeleton`	1,712 ms	4,030 ms	1,315 ms*	2.4x faster
`get_context_tree`	1,857 ms	4,002 ms	1,382 ms*	2.2x faster
`get_blast_radius`	1,881 ms	4,148 ms	1,323 ms*	2.2x faster
`semantic_code_search`	1,684 ms	4,081 ms	2,448 ms*	2.4x faster
`semantic_identifier_search`	1,854 ms	4,128 ms	1,296 ms*	2.2x faster
Average	1,798 ms	4,078 ms	1,553 ms	2.3x faster

* Grep is fast but returns raw matches — see token comparison below.

Token Efficiency (MCP vs Grep)

MCP tools return ranked, structured results. Grep returns raw lines.

Query	MCP Output	Grep Output	Reduction
File skeleton (any module)	~400 tokens	~1,678 tokens	4x fewer
Context tree (any directory)	~1,250 tokens	~707K tokens	566x fewer
Blast radius (any symbol)	~300 tokens	~1,419 tokens	5x fewer
Semantic search ("form validation")	~500 tokens	~42.7M tokens	85,000x fewer
Identifier search ("any concept")	~625 tokens	~23.9M tokens	38,000x fewer

20-Search Session Cost

Engine	Wall Time	Tokens Consumed
Rust MCP	~28s	~10K tokens
TS MCP	~73s	~10K tokens
Grep	~96s	~268M tokens

Internal Bottleneck Comparison

Bottleneck	TypeScript	Rust	Improvement
Cache load (120MB)	115ms (VectorStore)	~2ms (mmap)	57x
Cosine scan 30K vectors	~50ms (JS loop)	~15ms (SIMD)	3x
Tree-sitter parse	5-20ms (WASM)	1-5ms (native)	4x
Warm semantic search	~1.5s	<100ms	15x
No-op file refresh	870ms	<10ms	87x

The Rust implementation removes most of this overhead through:

Zero-copy cache with rkyv + memmap2 (no deserialization)
SIMD cosine similarity via simsimd (AVX-512/AVX2 auto-dispatch)
Native tree-sitter (compiled in, no WASM VM) — 15 languages + regex fallback
Binary serialization (rkyv replaces JSON for memory graph)
Disk-persistent embedding cache with content-hash staleness detection
Adaptive embedding retry with exponential backoff and cancellation tokens
Automatic chunking for oversized embedding inputs (chunk → embed → merge)
Process lifecycle management — idle timeout, parent PID orphan detection, SIGTERM/SIGHUP handling
Memory graph disk persistence with debounced flush on mutation
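
The retry item above can be illustrated with a small, blocking sketch. It only shows the general shape of exponential backoff with a cancellation flag; the names `backoff_delay`, `retry_with_backoff`, and `RetryError` are hypothetical, the real server presumably runs this asynchronously with proper cancellation tokens, and whatever makes its retry "adaptive" is not modeled here.

```rust
use std::sync::atomic::{AtomicBool, Ordering};
use std::thread;
use std::time::Duration;

/// Why a retried operation gave up (hypothetical type for this sketch).
#[derive(Debug, PartialEq)]
enum RetryError<E> {
    Cancelled,
    Exhausted(E),
}

/// Delay before the next try: base * 2^attempt, capped at `max`.
fn backoff_delay(attempt: u32, base: Duration, max: Duration) -> Duration {
    base.saturating_mul(2u32.saturating_pow(attempt)).min(max)
}

/// Calls `op` until it succeeds, attempts run out, or `cancelled` is set.
fn retry_with_backoff<T, E>(
    max_attempts: u32,
    base: Duration,
    max: Duration,
    cancelled: &AtomicBool,
    mut op: impl FnMut(u32) -> Result<T, E>,
) -> Result<T, RetryError<E>> {
    let mut attempt = 0;
    loop {
        // Check the flag before every attempt so a shutdown is noticed quickly.
        if cancelled.load(Ordering::Relaxed) {
            return Err(RetryError::Cancelled);
        }
        match op(attempt) {
            Ok(value) => return Ok(value),
            Err(e) if attempt + 1 >= max_attempts => return Err(RetryError::Exhausted(e)),
            Err(_) => thread::sleep(backoff_delay(attempt, base, max)),
        }
        attempt += 1;
    }
}
```

With a 10 ms base and an 80 ms cap, the delays between attempts grow as 10, 20, 40, 80, 80 ms.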

Install

Build from source

git clone https://github.com/mrsufgi/contextplus-rs.git
cd contextplus-rs
cargo build --release

The binary is at `target/release/contextplus-rs`.

Prerequisites

Ollama running locally with an embedding model:

ollama pull snowflake-arctic-embed2   # embeddings
ollama pull qwen3.5:9b                # chat (for cluster labeling); set OLLAMA_CHAT_MODEL to use it, the default is llama3.2

Configuration

Ollama / embedding

Variable	Default	Description
`OLLAMA_HOST`	`http://127.0.0.1:11434`	Ollama server URL
`OLLAMA_EMBED_MODEL`	`snowflake-arctic-embed2`	Embedding model
`OLLAMA_CHAT_MODEL`	`llama3.2`	Chat model for cluster labels
`OLLAMA_API_KEY`	(none)	Optional API key
`CONTEXTPLUS_EMBED_BATCH_SIZE`	`50`	Document embedding batch size (clamped 5–512)
`CONTEXTPLUS_QUERY_BATCH_SIZE`	`1`	Query embedding batch size (number of query vectors sent per Ollama request)
`CONTEXTPLUS_EMBED_CHUNK_CHARS`	`2000`	Max chars per embedding input (clamped 256–8000). Oversized inputs are chunked and merged
`CONTEXTPLUS_MAX_EMBED_FILE_SIZE`	`51200` (50 KB)	Skip files larger than this (bytes) for embedding. Min 1 KB
`CONTEXTPLUS_IGNORE_DIRS`	(none)	Extra directories to ignore (comma-separated), appended to the built-in list
`CONTEXTPLUS_CACHE_TTL_SECS`	`300`	Embedding cache TTL in seconds
`CONTEXTPLUS_EMBED_NUM_GPU`	(none)	Ollama `num_gpu` option (GPU layer count)
`CONTEXTPLUS_EMBED_MAIN_GPU`	(none)	Ollama `main_gpu` option (primary GPU index)
`CONTEXTPLUS_EMBED_NUM_THREAD`	(none)	Ollama `num_thread` option
`CONTEXTPLUS_EMBED_NUM_BATCH`	(none)	Ollama `num_batch` option
`CONTEXTPLUS_EMBED_NUM_CTX`	(none)	Ollama `num_ctx` option
`CONTEXTPLUS_EMBED_LOW_VRAM`	(none)	Ollama `low_vram` option (`true`/`false`)
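
To make the clamping and the chunk → embed → merge behavior concrete, here is a minimal sketch. It is not the server's code: `clamped_setting`, `chunk_by_chars`, and `merge_mean` are hypothetical names, falling back to the default on unparseable input is an assumption, and merging by element-wise mean is one plausible strategy rather than the documented one.

```rust
/// Parses a raw env value, falls back to `default`, then clamps to [min, max].
/// Falling back on unparseable input is an assumption of this sketch.
fn clamped_setting(raw: Option<&str>, default: usize, min: usize, max: usize) -> usize {
    raw.and_then(|s| s.trim().parse::<usize>().ok())
        .unwrap_or(default)
        .clamp(min, max)
}

/// Splits text into pieces of at most `max_chars` characters (not bytes).
fn chunk_by_chars(text: &str, max_chars: usize) -> Vec<String> {
    let chars: Vec<char> = text.chars().collect();
    chars
        .chunks(max_chars.max(1))
        .map(|piece| piece.iter().collect())
        .collect()
}

/// Merges per-chunk embeddings by element-wise mean (assumed merge strategy).
fn merge_mean(vectors: &[Vec<f32>]) -> Vec<f32> {
    let Some(first) = vectors.first() else {
        return Vec::new();
    };
    let mut merged = vec![0.0f32; first.len()];
    for v in vectors {
        for (m, x) in merged.iter_mut().zip(v) {
            *m += x;
        }
    }
    let n = vectors.len() as f32;
    merged.iter_mut().for_each(|m| *m /= n);
    merged
}
```

For example, `clamped_setting(std::env::var("CONTEXTPLUS_EMBED_BATCH_SIZE").ok().as_deref(), 50, 5, 512)` yields 50 when the variable is unset and 512 when it is set to `1000`.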

Search / indexing

Variable	Default	Description
`CONTEXTPLUS_HNSW_EF_CONSTRUCTION`	`100`	HNSW `efConstruction` — higher values improve index quality at the cost of build time
`CONTEXTPLUS_HNSW_EF_SEARCH`	`32`	HNSW `ef_search` — higher values improve recall at the cost of query latency. Set explicitly when higher recall is needed
`CONTEXTPLUS_ANN_CANDIDATE_MULTIPLIER`	`10`	ANN candidate pool multiplier: fetches `top_k × N` HNSW candidates before re-ranking. Larger values improve recall; only applies when corpus exceeds 2,000 files
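
The candidate multiplier can be read as the following arithmetic. This is a sketch under assumptions: the function names are hypothetical, and scanning the whole corpus when it has 2,000 files or fewer is a guess at what "only applies when corpus exceeds 2,000 files" means in practice.

```rust
/// Corpus size above which the ANN path applies (threshold from the table above).
const ANN_MIN_CORPUS: usize = 2_000;

/// Number of candidates to fetch before re-ranking.
fn candidate_pool_size(top_k: usize, multiplier: usize, corpus_len: usize) -> usize {
    if corpus_len > ANN_MIN_CORPUS {
        // Large corpus: fetch top_k × N approximate candidates.
        top_k.saturating_mul(multiplier).min(corpus_len)
    } else {
        // Small corpus: assume an exact scan over everything.
        corpus_len
    }
}

/// Re-ranks (index, exact_score) pairs by descending score and keeps `top_k`.
fn rerank(mut candidates: Vec<(usize, f32)>, top_k: usize) -> Vec<(usize, f32)> {
    candidates.sort_by(|a, b| b.1.total_cmp(&a.1));
    candidates.truncate(top_k);
    candidates
}
```

With the default multiplier of 10, a `top_k` of 5 on a 30,000-file corpus fetches 50 candidates and returns the best 5 after re-ranking.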

Warmup

Variable	Default	Description
`CONTEXTPLUS_WARMUP_ON_START`	`true`	Warm the `SearchIndex` cache at server startup. Set to `false` / `0` / `no` / `off` to disable
`CONTEXTPLUS_WARMUP_CONCURRENCY`	`1`	Number of parallel Ollama embed requests during `warmup_embeddings` / `warmup_identifiers`. Set to match `OLLAMA_NUM_PARALLEL` on the host

Tracker (file-watcher)

Variable	Default	Description
`CONTEXTPLUS_EMBED_TRACKER`	`lazy`	Tracker mode: `lazy` (start on first search), `eager` / `startup` (start at boot), `off` / `false` (disabled)
`CONTEXTPLUS_EMBED_TRACKER_DEBOUNCE_MS`	`700`	File-watcher debounce window in milliseconds
`CONTEXTPLUS_EMBED_TRACKER_MAX_FILES`	`8`	Max files re-embedded per watcher tick
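
One way to picture the tracker settings is the sketch below. `TrackerMode`, `parse_tracker_mode`, and `next_batch` are hypothetical names, and treating unknown values as `lazy` and matching case-insensitively are assumptions.

```rust
use std::collections::VecDeque;

#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum TrackerMode {
    Lazy,
    Eager,
    Off,
}

/// Maps the raw CONTEXTPLUS_EMBED_TRACKER value to a mode.
fn parse_tracker_mode(raw: Option<&str>) -> TrackerMode {
    match raw.map(|s| s.trim().to_ascii_lowercase()).as_deref() {
        Some("eager") | Some("startup") => TrackerMode::Eager,
        Some("off") | Some("false") => TrackerMode::Off,
        // Default; unknown values also land here (assumption).
        _ => TrackerMode::Lazy,
    }
}

/// Takes at most `max_files` pending paths for one watcher tick.
fn next_batch(pending: &mut VecDeque<String>, max_files: usize) -> Vec<String> {
    let n = max_files.min(pending.len());
    pending.drain(..n).collect()
}
```

With the default limit of 8, ten changed files are re-embedded over two ticks: eight in the first and two in the second.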

Process lifecycle

Variable	Default	Description
`CONTEXTPLUS_IDLE_TIMEOUT_MS`	`900000`	Auto-shutdown after this many ms idle (0 or `off` to disable, min 60 s)
`CONTEXTPLUS_PARENT_POLL_MS`	`5000`	Poll interval for parent PID orphan detection in milliseconds (min 1 s)
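
The idle-timeout rules (default 900000 ms, `0` or `off` to disable, 60 s minimum) could be interpreted roughly as follows. This is a sketch, not the server's code: `parse_idle_timeout` and `is_idle` are hypothetical names, and falling back to the default on unparseable input is an assumption.

```rust
use std::time::{Duration, Instant};

const DEFAULT_IDLE_MS: u64 = 900_000;
const MIN_IDLE_MS: u64 = 60_000;

/// Returns `None` when the idle timeout is disabled.
fn parse_idle_timeout(raw: Option<&str>) -> Option<Duration> {
    let ms = match raw.map(str::trim) {
        None => DEFAULT_IDLE_MS,
        Some(v) if v.eq_ignore_ascii_case("off") => return None,
        // Unparseable values fall back to the default (assumption).
        Some(v) => v.parse::<u64>().unwrap_or(DEFAULT_IDLE_MS),
    };
    if ms == 0 {
        return None;
    }
    // Enforce the documented 60 s minimum.
    Some(Duration::from_millis(ms.max(MIN_IDLE_MS)))
}

/// True when the server has been idle for at least the timeout.
fn is_idle(last_activity: Instant, now: Instant, timeout: Option<Duration>) -> bool {
    match timeout {
        Some(limit) => now.duration_since(last_activity) >= limit,
        None => false,
    }
}
```

A value of `1000` is raised to the 60 s minimum rather than rejected.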

Usage

As an MCP server (stdio)

contextplus-rs --root-dir /path/to/project

Claude Code integration

Add to your MCP config (`~/.claude/mcp.json` or a project-level `.mcp.json`):

{
  "mcpServers": {
    "contextplus": {
      "command": "/path/to/contextplus-rs",
      "args": ["--root-dir", "/path/to/project"],
      "env": {
        "OLLAMA_EMBED_MODEL": "snowflake-arctic-embed2",
        "OLLAMA_CHAT_MODEL": "qwen3.5:9b",
        "OLLAMA_HOST": "http://127.0.0.1:11434",
        "CONTEXTPLUS_EMBED_BATCH_SIZE": "256",
        "CONTEXTPLUS_EMBED_TRACKER": "eager"
      }
    }
  }
}

Note: `think: false` is sent automatically to the chat model to avoid slow thinking-mode responses. Models like `qwen3.5:9b` produce cluster labels in under 1 s with thinking disabled, versus 45 s or more with it enabled.

CLI subcommands

# Generate MCP config for your editor (claude, cursor, vscode, windsurf, opencode)
contextplus-rs init claude
contextplus-rs init cursor

# Print file skeleton
contextplus-rs skeleton src/main.rs

# Print context tree
contextplus-rs tree --max-tokens 5000

MCP Resources

URI	Description
`contextplus://instructions`	Returns tool usage instructions fetched from the Context+ API. Cached in memory after first fetch

Tools (17)

Code Analysis

Tool	Description
`get_context_tree`	Token-aware file tree with symbols and line ranges. Supports `depth_limit` to cap directory depth and `max_tokens` (default 50K) for automatic pruning (Level 2 → 1 → 0)
`get_file_skeleton`	Function signatures and structure without full file read
`get_blast_radius`	Map every file that imports or references a symbol
`run_static_analysis`	Run available linters (tsc with `--build` for project references, eslint, cargo check, ruff)
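
The `max_tokens` pruning in `get_context_tree` (Level 2 → 1 → 0) can be pictured as rendering at the richest level that fits the budget. The sketch below is illustrative only: what each level contains (2 = symbols with line ranges, 1 = symbol names, 0 = paths only) and the 4-characters-per-token estimate are assumptions, and all names are hypothetical.

```rust
/// A file with its symbols as (name, start_line, end_line).
struct FileEntry {
    path: String,
    symbols: Vec<(String, usize, usize)>,
}

/// Renders the tree at a given detail level (level meanings are assumed).
fn render(files: &[FileEntry], level: u8) -> String {
    let mut out = String::new();
    for file in files {
        out.push_str(&file.path);
        out.push('\n');
        if level >= 1 {
            for (name, start, end) in &file.symbols {
                if level >= 2 {
                    out.push_str(&format!("  {name} (L{start}-L{end})\n"));
                } else {
                    out.push_str(&format!("  {name}\n"));
                }
            }
        }
    }
    out
}

/// Rough token estimate: about 4 characters per token (heuristic).
fn estimate_tokens(text: &str) -> usize {
    (text.len() + 3) / 4
}

/// Tries Level 2, then 1, and falls back to 0 if neither fits the budget.
fn render_within_budget(files: &[FileEntry], max_tokens: usize) -> (u8, String) {
    for level in [2u8, 1] {
        let out = render(files, level);
        if estimate_tokens(&out) <= max_tokens {
            return (level, out);
        }
    }
    // Last resort: paths only, even if this is still over budget.
    (0, render(files, 0))
}
```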

Semantic Search

Tool	Description
`semantic_code_search`	Hybrid semantic + keyword file search via Ollama embeddings
`semantic_identifier_search`	Find functions/classes by meaning with call-site ranking
`semantic_navigate`	Cluster files by semantic similarity (spectral clustering)

File Management

Tool	Description
`propose_commit`	Write files with validation and shadow restore points
`list_restore_points`	List all shadow restore points
`undo_change`	Restore files from a restore point

Memory Graph

Tool	Description
`upsert_memory_node`	Create or update a memory graph node
`create_relation`	Create or update edges between nodes
`search_memory_graph`	Semantic search with BFS traversal
`retrieve_with_traversal`	Retrieve node neighborhood via BFS
`add_interlinked_context`	Batch-add nodes with auto-linking
`prune_stale_links`	Remove decayed edges and orphan nodes
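
As a rough picture of what `retrieve_with_traversal` does, the sketch below runs a depth-limited breadth-first search over a plain adjacency map. It is a simplified stand-in: the real graph is built on petgraph, and the function name, the directed-edge treatment, and the `(node, depth)` output shape are assumptions.

```rust
use std::collections::{HashMap, HashSet, VecDeque};

/// Returns every node reachable from `start` within `max_depth` hops,
/// paired with its hop distance, in BFS order.
fn neighborhood(
    edges: &HashMap<&str, Vec<&str>>,
    start: &str,
    max_depth: usize,
) -> Vec<(String, usize)> {
    let mut seen = HashSet::from([start.to_string()]);
    let mut queue = VecDeque::from([(start.to_string(), 0usize)]);
    let mut out = Vec::new();

    while let Some((node, depth)) = queue.pop_front() {
        out.push((node.clone(), depth));
        if depth == max_depth {
            continue; // do not expand past the depth limit
        }
        for next in edges.get(node.as_str()).into_iter().flatten() {
            // `insert` returns false for nodes already visited, which breaks cycles.
            if seen.insert(next.to_string()) {
                queue.push_back((next.to_string(), depth + 1));
            }
        }
    }
    out
}
```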

Navigation

Tool	Description
`get_feature_hub`	Navigate Obsidian-style wikilinks between feature docs
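
Obsidian-style wikilinks take the form `[[target]]` or `[[target|alias]]`. A minimal extractor might look like the sketch below; `extract_wikilinks` is a hypothetical name, and heading anchors, embeds, and nested brackets are deliberately not handled.

```rust
/// Extracts link targets from `[[target]]` and `[[target|alias]]`.
fn extract_wikilinks(text: &str) -> Vec<String> {
    let mut links = Vec::new();
    let mut rest = text;
    while let Some(open) = rest.find("[[") {
        let after = &rest[open + 2..];
        // An unclosed link ends the scan.
        let Some(close) = after.find("]]") else {
            break;
        };
        let inner = &after[..close];
        // Keep the part before `|`; the alias is display text only.
        let target = inner.split('|').next().unwrap_or("").trim();
        if !target.is_empty() {
            links.push(target.to_string());
        }
        rest = &after[close + 2..];
    }
    links
}
```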

Architecture

See ARCHITECTURE.md for detailed internals (data flow, caching strategy, memory layout, performance architecture, and how to add new tools).

src/
  main.rs                    # CLI + MCP server entry point
  server_adapters.rs         # rmcp ServerHandler impl + tool dispatch
  server_definitions.rs      # Tool definitions (names, descriptions, JSON schemas)
  server_helpers.rs          # Shared handler utilities
  config.rs                  # Environment variable configuration
  error.rs                   # ContextPlusError enum (thiserror)
  core/
    embeddings.rs            # OllamaClient + adaptive retry + chunking + cancellation
    tree_sitter.rs           # Native multi-lang parser (15 languages)
    parser.rs                # Code symbol extraction + regex fallback for unsupported langs
    walker.rs                # gitignore-aware file walker (ignore crate)
    embedding_tracker.rs     # File watcher with lazy/eager/off modes
    clustering.rs            # Spectral clustering (nalgebra)
    memory_graph.rs          # petgraph + rkyv disk persistence with debounced flush
    hub.rs                   # Wikilink parser
    process_lifecycle.rs     # Idle timeout + parent PID orphan detection
    safe_path.rs             # Path traversal prevention
    utils.rs                 # Shared utilities
  tools/                     # One file per tool (context_tree supports depth_limit filtering)
  git/shadow.rs              # Restore points (file-based backup)
  cache/rkyv_store.rs        # Zero-copy rkyv+mmap VectorStore
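
For a sense of what path traversal prevention (as in `safe_path.rs`) involves, here is a purely lexical sketch. It is an assumption about the general technique, not a copy of the module: `resolve_within_root` is a hypothetical name, and symlinks are not resolved, which a production check would also need to consider.

```rust
use std::path::{Component, Path, PathBuf};

/// Joins `requested` onto `root`, rejecting absolute paths and any `..`
/// that would climb above the root. Purely lexical; no filesystem access.
fn resolve_within_root(root: &Path, requested: &str) -> Option<PathBuf> {
    let mut resolved = root.to_path_buf();
    let mut depth = 0usize; // how many components below the root we are
    for component in Path::new(requested).components() {
        match component {
            Component::Normal(part) => {
                resolved.push(part);
                depth += 1;
            }
            Component::CurDir => {}
            Component::ParentDir => {
                if depth == 0 {
                    return None; // would escape the root
                }
                resolved.pop();
                depth -= 1;
            }
            // Absolute paths and Windows prefixes are rejected outright.
            Component::RootDir | Component::Prefix(_) => return None,
        }
    }
    Some(resolved)
}
```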

Supported Languages (tree-sitter)

TypeScript, TSX, JavaScript, Python, Rust, Go, Java, C, C++, Bash, Ruby, PHP, C#, Kotlin, HTML, CSS

Unsupported file types fall back to regex-based symbol extraction.

Key Crates

Crate	Purpose
`rmcp`	MCP SDK with stdio transport
`simsimd`	SIMD-accelerated cosine distance
`rkyv` + `memmap2`	Zero-copy cache persistence
`tree-sitter`	Native code parsing (15 languages)
`petgraph`	Memory graph with stable indices
`nalgebra`	Spectral clustering (eigendecomposition)
`notify`	File system watching
`ignore`	gitignore-aware file walking

Benchmarks (`cargo bench`)

Nine Criterion benchmark suites cover the critical hot paths — no Ollama dependency, fully reproducible.

Cache Load (rkyv + mmap)

How fast the embedding cache loads from disk. This was the #1 bottleneck in TS (1,109ms raw, 115ms with VectorStore optimization).

Operation	1K vectors	5K vectors	30K vectors
rkyv read	0.36 ms	2.8 ms	181 ms
rkyv mmap	0.63 ms	3.1 ms	103 ms
to_store (HashMap build)	0.16 ms	0.97 ms	91 ms

At 5K vectors (typical project size), total load is ~4ms. At 30K vectors, mmap beats read by 43%.

Cosine Similarity (simsimd SIMD vs scalar)

Operation	1K×1024	5K×1024	30K×1024	SIMD speedup
simsimd scan	68 µs	404 µs	4.1 ms	—
naive scan	663 µs	3.4 ms	20 ms	~5-10x
Single pair (1024-dim)	0.08 µs	—	—	8x vs naive

30K-vector scan in 4.1ms — well under the 20ms target.
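
For reference, the "naive scan" baseline corresponds to something like the scalar loop below. This is a sketch of the baseline technique only; it does not use simsimd, and the signature given to `find_nearest` here is an assumption rather than the project's actual API.

```rust
/// Scalar cosine similarity; returns 0.0 if either vector has zero norm.
fn cosine_similarity(a: &[f32], b: &[f32]) -> f32 {
    let (mut dot, mut norm_a, mut norm_b) = (0.0f32, 0.0f32, 0.0f32);
    for (x, y) in a.iter().zip(b) {
        dot += x * y;
        norm_a += x * x;
        norm_b += y * y;
    }
    if norm_a == 0.0 || norm_b == 0.0 {
        0.0
    } else {
        dot / (norm_a.sqrt() * norm_b.sqrt())
    }
}

/// Brute-force scan: scores every vector and returns the `top_k` best
/// as (index, similarity), highest similarity first.
fn find_nearest(query: &[f32], vectors: &[Vec<f32>], top_k: usize) -> Vec<(usize, f32)> {
    let mut scored: Vec<(usize, f32)> = vectors
        .iter()
        .enumerate()
        .map(|(i, v)| (i, cosine_similarity(query, v)))
        .collect();
    scored.sort_by(|a, b| b.1.total_cmp(&a.1));
    scored.truncate(top_k);
    scored
}
```

A SIMD implementation computes the same three accumulations, several lanes at a time.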

Tree-sitter Parse (native, 15 languages)

Language	Parse time
TypeScript	113 µs
TSX	143 µs
JavaScript	102 µs
Python	141 µs
Rust	171 µs
Go	100 µs
Java	96 µs
C	98 µs
C++	91 µs
Bash	~83 µs
All 15 combined	~2.1 ms

All 15 languages parsed in ~2.1ms total — vs 50-200ms for WASM in TS.

Warm Search Pipeline

End-to-end: disk load → VectorStore build → find_nearest(top_5) → format results.

Scenario	1K files	5K files	30K files
Full pipeline (mmap + search)	0.9 ms	5.3 ms	224 ms
Warm search only (in-memory)	78 µs	418 µs	4.3 ms
Hash-check staleness (no-op)	23 µs	131 µs	975 µs

Warm search on 30K files: 4.3ms. Hash-check (no-op refresh): <1ms.
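
The hash-check staleness step amounts to comparing a stored content hash with a freshly computed one and re-embedding only on mismatch. The sketch below shows that idea with hypothetical names. It uses the standard library's `DefaultHasher` for brevity; its output is not guaranteed to be stable across Rust releases, so a disk-persistent cache would need a hash with a fixed algorithm instead.

```rust
use std::collections::hash_map::DefaultHasher;
use std::collections::HashMap;
use std::hash::{Hash, Hasher};

/// Hashes file content (illustrative; not stable across Rust versions).
fn content_hash(content: &str) -> u64 {
    let mut hasher = DefaultHasher::new();
    content.hash(&mut hasher);
    hasher.finish()
}

/// Returns the paths whose content hash is missing from the cache or differs
/// from the cached value. `files` holds (path, content) pairs.
fn stale_files<'a>(
    cache: &HashMap<String, u64>,
    files: &'a [(String, String)],
) -> Vec<&'a str> {
    files
        .iter()
        .filter(|(path, content)| cache.get(path) != Some(&content_hash(content)))
        .map(|(path, _)| path.as_str())
        .collect()
}
```

When nothing has changed, the result is empty and no embedding requests are needed, which is the no-op case measured above.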

Development

cargo test                  # 1050+ tests
cargo bench                 # 9 Criterion benchmark suites
cargo clippy --all-targets  # Lint
cargo fmt --check           # Format check

Credits

This is a Rust rewrite of Context+ (TypeScript), originally created by the Context+ community. The Rust port was built from the `fix/merge-cache-on-save` branch, which includes performance optimizations (hash-based cache invalidation, VectorStore extraction, parallel call-site ranking, tree-sitter dedup).

License

MIT