Channel
MCP (Model Context Protocol) server channel: JSON-RPC handler, tool serving
AgentRun is a self-hosted runtime that turns declarative YAML manifests into a fully operational AI agent – with tool calling, role-based access, session memory, and multi-channel delivery. Think of it as containerd for AI agents: you define what the agent can do, AgentRun handles the rest.
Key Features
- Manifest-driven – Define tools, workflows, use-cases, skills, and knowledge bases as Kubernetes-style YAML
- Multi-channel – Same agent brain serves Slack, MCP (Model Context Protocol), and future channels
- RBAC-gated – Identity resolution, role mapping, and per-role use-case access with budget controls
- Triple execution – Skills run as `direct` (deterministic, fast), `agent` (Claude Agent SDK reasoning loop), or `generic` (model-agnostic function calling with any LLM)
- Pack system – Extension bundles loaded from any ManifestStore (S3, GCS, local filesystem), grouping tools + workflows + use-cases + skills
- Session memory – Per-thread conversation persistence with configurable TTL
- RAG – Built-in vector search over ingested documents (pgvector)
Cloud-agnostic by design
Every infrastructure concern in AgentRun is behind a TypeScript interface defined in @agentrun-ai/core. The core package has zero cloud dependencies – all cloud-specific behavior is injected at startup via PlatformRegistry.
@agentrun-ai/aws and @agentrun-ai/gcp are the two production-ready implementations. To run on another cloud, implement the same interfaces and register your providers:
import { setProviderRegistrar, bootstrapPlatform } from "@agentrun-ai/core";
import { registerGcpProviders } from "@agentrun-ai/gcp";
// Use the GCP implementation (Vertex AI, Firestore, Cloud Storage, Pub/Sub, Secret Manager)
setProviderRegistrar(registerGcpProviders);
await bootstrapPlatform();
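A registrar for a third cloud is just a function that supplies implementations of those interfaces. A minimal sketch, assuming the registrar receives the PlatformRegistry and assigns one implementation per interface slot (the MyCloud classes and the property names below are hypothetical; check the actual registry API in @agentrun-ai/core):

```ts
import { setProviderRegistrar, bootstrapPlatform } from "@agentrun-ai/core";
// Hypothetical implementations of the core interfaces for another cloud.
import { MyCloudLlmProvider, MyCloudSessionStore } from "./mycloud-providers";

// Sketch: assign one implementation per interface slot. The property names
// (llm, sessionStore, ...) are assumptions, not the confirmed registry shape.
function registerMyCloudProviders(registry: any): void {
  registry.llm = new MyCloudLlmProvider();
  registry.sessionStore = new MyCloudSessionStore();
  // ...and so on for UsageStore, ManifestStore, QueueProvider, etc.
}

setProviderRegistrar(registerMyCloudProviders);
await bootstrapPlatform();
```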
Design Philosophy
AgentRun's harness design follows the principles outlined in Anthropic's Harness Design for Long-Running Apps engineering blog post. Every component in the harness encodes an assumption about what the model can't do on its own – and those assumptions are designed to be re-evaluated as models improve.
| Principle | Implementation |
|---|---|
| Task Decomposition | Catalog hierarchy: use-cases → workflows → tools → skills. Queries are routed to the minimal set of tools needed. |
| Separation of Generation from Evaluation | Optional evaluator in GenericAgentConfig – an independent LLM call that scores responses against explicit quality criteria before delivery. |
| Making Quality Gradable | Four default criteria with weights: factual accuracy (0.4), completeness (0.3), conciseness (0.2), actionability (0.1). Custom criteria supported. |
| Context Management | Session history with automatic summarization when context grows large. Structured handoffs preserve tool call metadata across turns. |
| Structured Handoffs | buildPromptWithHistory includes tool usage metadata so subsequent turns know what was already queried, preventing redundant calls. |
| Iterative Simplification | GenericAgentConfig accepts pluggable callLlm and executeTool β swap the LLM, add or remove the evaluator, or change tool schemas without touching the harness. |
The evaluator is opt-in and disabled by default. When enabled, it adds ~500 tokens of overhead per query but catches hallucinations, incomplete answers, and filler before the response reaches the user.
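As a hedged sketch, enabling it on a generic query might look like the following; only GenericAgentConfig's callLlm and executeTool are documented in this README, so the evaluator field name and criterion shape here are assumptions:

```ts
import { processGenericQuery } from "@agentrun-ai/core";

// Minimal stubs matching the pluggable shapes shown later in this README.
const myCallLlm = async (_req: any) => ({ text: "...", functionCalls: [] });
const myExecuteTool = async (_tool: string, _args: unknown) =>
  JSON.stringify({ status: "ok" });

const result = await processGenericQuery("summarize last night's incident", "U12345", "slack", {
  callLlm: myCallLlm,
  executeTool: myExecuteTool,
  // Hypothetical field: mirrors the documented default criteria and weights.
  evaluator: {
    enabled: true,
    criteria: [
      { name: "factual accuracy", weight: 0.4 },
      { name: "completeness", weight: 0.3 },
      { name: "conciseness", weight: 0.2 },
      { name: "actionability", weight: 0.1 },
    ],
  },
});
```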
Architecture
Channel Input → Identity Resolution → RBAC Gating → Routing
  → Execution (direct tool calls | agentic LLM loop | generic model-agnostic runner)
  → [Optional: Response Evaluation against quality criteria]
  → Session Persistence → Channel Delivery → Usage Tracking
Provider Interfaces
Every infrastructure concern is a TypeScript interface in @agentrun-ai/core. The @agentrun-ai/aws and @agentrun-ai/gcp packages provide production-ready implementations; additional providers can be built by implementing the same interfaces (see the sketch after the table).
| Interface | Purpose | AWS impl (@agentrun-ai/aws) | GCP impl (@agentrun-ai/gcp) |
|---|---|---|---|
| LlmProvider | LLM completions and summarization | Bedrock (BedrockLlmProvider) | Vertex AI (VertexAiLlmProvider) |
| CredentialProvider | Per-role scoped credentials | STS (StsCredentialProvider) | GCP IAM (GcpCredentialProvider) |
| SessionStore | Conversation history persistence | DynamoDB (DynamoSessionStore) | Firestore (FirestoreSessionStore) |
| UsageStore | Token and invocation tracking | DynamoDB (DynamoUsageStore) | Firestore (FirestoreUsageStore) |
| ManifestStore | Pack manifest storage and discovery | S3 (S3ManifestStore) | Cloud Storage (GcsManifestStore) |
| QueueProvider | Async message dispatch | SQS (SqsQueueProvider) | Pub/Sub (PubSubQueueProvider) |
| BootstrapSecretProvider | Secret retrieval at startup | Secrets Manager (SmSecretProvider) | Secret Manager (GcpSecretProvider) |
| EmbeddingProvider | Text embeddings for RAG | Bedrock Titan (BedrockEmbeddingProvider) | Vertex AI (VertexEmbeddingProvider) |
| VectorStore | Vector similarity search | pgvector (PgVectorStore) | pgvector (PgVectorStore) |
| KnowledgeBaseProvider | Managed RAG retrieval | Bedrock KB (BedrockKbProvider) | Vertex AI Search (VertexSearchProvider) |
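To add a provider, implement the interface and hand it to your registrar. A sketch of a custom SessionStore backed by an in-memory Map (the method names are assumptions; the authoritative interface lives in @agentrun-ai/core):

```ts
// Hedged sketch: getHistory/append are assumed method names; align them
// with the real SessionStore interface in @agentrun-ai/core before use.
class InMemorySessionStore {
  private sessions = new Map<string, { messages: unknown[]; expiresAt: number }>();

  async getHistory(threadId: string): Promise<unknown[]> {
    const s = this.sessions.get(threadId);
    return s && s.expiresAt > Date.now() ? s.messages : [];
  }

  async append(threadId: string, message: unknown, ttlMs = 3_600_000): Promise<void> {
    const s = this.sessions.get(threadId) ?? { messages: [], expiresAt: 0 };
    s.messages.push(message);
    s.expiresAt = Date.now() + ttlMs; // per-thread TTL, as described under Key Features
    this.sessions.set(threadId, s);
  }
}
```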
Packages
| Package | Description |
|---|---|
| @agentrun-ai/core | v0.4.0 – Orchestrator, model router, generic runner, catalog, RBAC, platform registry |
| @agentrun-ai/gcp | v0.4.0 – GCP providers: Vertex AI, Firestore, Cloud Storage, Pub/Sub, KMS token encryption |
| @agentrun-ai/aws | Bedrock LLM/embeddings, DynamoDB, S3, SQS, STS, Secrets Manager |
| @agentrun-ai/channel-slack | Slack adapter, Block Kit formatting, identity resolution |
| @agentrun-ai/channel-gchat | Google Chat adapter, Cards V2 formatting, Workspace Add-on support |
| @agentrun-ai/channel-mcp | MCP JSON-RPC server for Claude Code and other MCP clients |
| @agentrun-ai/tools-aws | AWS infrastructure tools (EKS, RDS, Lambda, CloudWatch, SQS) |
| @agentrun-ai/tools-gcp | GCP infrastructure tools (GKE, Cloud SQL, Cloud Functions, Cloud Logging, Pub/Sub) |
| @agentrun-ai/tools-github | GitHub tools (PRs, commits, reviews) |
| @agentrun-ai/tools-jira | Jira tools (issues, comments, transitions) |
| @agentrun-ai/cli | CLI: validate manifests, sync packs, ingest docs for RAG |
Dependency graph
@agentrun-ai/core            (zero external deps – pure TypeScript)
        ↑ every package below depends on core
@agentrun-ai/aws             @aws-sdk/*, @agentrun-ai/core
@agentrun-ai/gcp             @google-cloud/*, @agentrun-ai/core
@agentrun-ai/channel-slack   @slack/web-api, @agentrun-ai/core
@agentrun-ai/channel-gchat   @agentrun-ai/core
@agentrun-ai/channel-mcp     @agentrun-ai/core
@agentrun-ai/tools-aws       @aws-sdk/*, @agentrun-ai/core
@agentrun-ai/tools-gcp       @google-cloud/*, @agentrun-ai/core
@agentrun-ai/tools-github    @octokit/rest, @agentrun-ai/core
@agentrun-ai/tools-jira      @agentrun-ai/core
@agentrun-ai/cli             @agentrun-ai/core, commander
Quick Start
Pick the cloud provider package that matches your infrastructure, then wire it up:
# AWS (Bedrock, DynamoDB, S3, SQS, STS, Secrets Manager)
npm install @agentrun-ai/core @agentrun-ai/aws @agentrun-ai/channel-slack
# GCP (Vertex AI, Firestore, Cloud Storage, Pub/Sub, Secret Manager)
npm install @agentrun-ai/core @agentrun-ai/gcp @agentrun-ai/channel-slack
import { setProviderRegistrar, bootstrapPlatform, processRequest } from "@agentrun-ai/core";
import { SlackChannelAdapter } from "@agentrun-ai/channel-slack";
// Choose ONE provider package:
import { registerAwsProviders } from "@agentrun-ai/aws";
// import { registerGcpProviders } from "@agentrun-ai/gcp";
setProviderRegistrar(registerAwsProviders); // or registerGcpProviders
await bootstrapPlatform();
const adapter = new SlackChannelAdapter();
await processRequest(adapter, {
userId: "U12345",
channelId: "C12345",
text: "show me the cluster status",
threadTs: "1234567890.123456",
});
Model Router (v0.4.0)
Automatically select optimal LLM models based on query complexity and role permissions:
import { selectModel, classifyComplexity } from "@agentrun-ai/core";
// Zero-cost complexity classification
const complexity = classifyComplexity("analyze performance bottlenecks");
// → "complex"

// RBAC-gated model selection (picks the cheapest model meeting the requirement)
const models = {
  fast: { capability: "fast", inputCostPer1kTokens: 0.001 /* ...other model fields */ },
  pro: { capability: "advanced", inputCostPer1kTokens: 0.01 /* ...other model fields */ },
};
const selection = selectModel("analyze performance...", models, ["fast", "pro"]);
// → { name: "pro", reason: "complex query → advanced model (pro)" }
Complexity Tiers:
- simple: "list prs", "show status" → fast/cheap models
- moderate: multi-step synthesis
- complex: "design architecture", "analyze impact" → advanced models
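One way to wire the router into a harness is inside a pluggable callLlm: classify the query, pick a model, then dispatch to your client. A sketch (chatWithModel stands in for whatever LLM client you use; the model entries elide the other fields, as above):

```ts
import { selectModel } from "@agentrun-ai/core";

// Hypothetical client call; substitute Bedrock, Vertex AI, OpenAI, etc.
declare function chatWithModel(model: string, prompt: string): Promise<string>;

const models = {
  fast: { capability: "fast", inputCostPer1kTokens: 0.001 /* ...other fields */ },
  pro: { capability: "advanced", inputCostPer1kTokens: 0.01 /* ...other fields */ },
};

// Route each query to the cheapest model allowed for the caller's role.
async function routedCall(query: string, allowedForRole: string[]): Promise<string> {
  const { name, reason } = selectModel(query, models, allowedForRole);
  console.log(reason); // e.g. complex query → advanced model (pro)
  return chatWithModel(name, query);
}

await routedCall("design a multi-region failover architecture", ["fast", "pro"]);
```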
See examples/model-router-demo for a full example.
OpenAI-Compatible Gateway (v0.4.0)
Use any OpenAI-compatible LLM endpoint (OpenAI, Ollama, self-hosted gateway, Vertex AI):
import { createOpenAICaller, processGenericQuery } from "@agentrun-ai/core";
const caller = createOpenAICaller({
baseUrl: "https://api.openai.com", // or http://localhost:11434 for Ollama
defaultModel: "gpt-4o",
resolveToken: async (userId) => await tokenStore.get(userId, "openai"), // your per-user token store
});
const result = await processGenericQuery(
"show cluster status",
"U12345",
"slack",
{ callLlm: caller, executeTool: myTools } // myTools: your tool executor
);
Works with:
- OpenAI API
- Anthropic models on Vertex AI
- Local LLM servers (Ollama, vLLM, LM Studio)
- Self-hosted gateways
- Any OpenAI-compatible endpoint
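For example, pointing the same factory at a local Ollama server only changes configuration (a sketch; Ollama's OpenAI-compatible endpoint does not validate the bearer token, so a placeholder suffices):

```ts
import { createOpenAICaller } from "@agentrun-ai/core";

// Same options as above, aimed at a local Ollama server.
const localCaller = createOpenAICaller({
  baseUrl: "http://localhost:11434",
  defaultModel: "llama3.1",           // any model you have pulled locally
  resolveToken: async () => "ollama", // placeholder; ignored by Ollama
});
```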
See examples/openai-gateway-demo for a full example.
Model-agnostic (Generic Runner)
For non-Anthropic LLMs (Gemini, GPT, Ollama), use processGenericQuery – it accepts pluggable callLlm and executeTool functions and auto-derives allowed tools and KB context from the catalog:
import { processGenericQuery, bootstrapPlatform, setProviderRegistrar } from "@agentrun-ai/core";
import { registerGcpProviders } from "@agentrun-ai/gcp";
setProviderRegistrar(registerGcpProviders);
await bootstrapPlatform();
const result = await processGenericQuery(
"show cluster status",
"user@example.com",
"google",
{
callLlm: async ({ systemPrompt, contents, tools }) => {
// your LLM call (Gemini, GPT, Ollama, etc.)
return { text: "...", functionCalls: [] };
},
executeTool: async (toolName, args) => {
// route to your tool registry
return JSON.stringify({ status: "ok" });
},
// toolSchemas omitted β auto-derived from catalog workflows for the user's role
},
);
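Filling in the callLlm stub for Gemini might look like the sketch below. It assumes contents and the tool schemas arrive in (or have been mapped to) Gemini's native shapes; check the actual GenericAgentConfig types before relying on this:

```ts
import { GoogleGenerativeAI } from "@google/generative-ai";

const genAI = new GoogleGenerativeAI(process.env.GEMINI_API_KEY!);

// Hedged sketch of a Gemini-backed callLlm for processGenericQuery.
const callLlm = async ({ systemPrompt, contents, tools }: any) => {
  const model = genAI.getGenerativeModel({
    model: "gemini-1.5-pro",
    systemInstruction: systemPrompt,
    // Assumes tool schemas are already Gemini function declarations.
    tools: tools?.length ? [{ functionDeclarations: tools }] : undefined,
  });
  const result = await model.generateContent({ contents });
  return {
    text: result.response.text(),
    functionCalls: result.response.functionCalls() ?? [],
  };
};
```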
Deployment Examples
| Example | Description |
|---|---|
| aws-lambda | AWS serverless: API Gateway + Lambda + SQS + DynamoDB |
| gcp-cloud-functions | GCP serverless: Cloud Functions + Pub/Sub + Firestore |
| gchat-standalone | Google Chat bot via Fastify + HTTP endpoint |
| slack-standalone | Single Fastify server, no external dependencies |
| docker | Docker Compose with PostgreSQL (pgvector) + Redis |
Documentation
- AgentRun Book – Complete platform reference (governance, security, architecture)
- Contributing – Development setup and contribution guide
- Security – Vulnerability disclosure policy
- CLA – Contributor License Agreement
Manifest Examples
AgentRun uses 6 manifest kinds. All follow the apiVersion: agentrun/v1 pattern:
Tool – atomic capability
# tools/list-open-prs.yaml
apiVersion: agentrun/v1
kind: Tool
metadata:
name: list-open-prs
spec:
type: mcp-server # mcp-server | aws-sdk | http | lambda
mcpTool: list_open_prs # maps to MCP tool registry name
description: List open pull requests
category: development
readOnly: true
Workflow – composes tools
# workflows/review-pull-requests.yaml
apiVersion: agentrun/v1
kind: Workflow
metadata:
name: review-pull-requests
spec:
description: Review open PRs across repositories
tools:
- list-open-prs
- get-pr-details
- recent-commits
Workflow with steps – deterministic pipeline
# workflows/check-billing.yaml
apiVersion: agentrun/v1
kind: Workflow
metadata:
name: check-billing
spec:
description: Get AWS cost breakdown for the current month
tools:
- check-billing
steps:
- tool: check-billing
action: GetCostAndUsage
input:
TimePeriod:
Start: "{{ startDate }}"
End: "{{ endDate }}"
Granularity: MONTHLY
Metrics: ["UnblendedCost"]
outputTransform: "ResultsByTime[0].Total.UnblendedCost"
timeoutMs: 10000
UseCase – maps user intent to workflows
# use-cases/code-review.yaml
apiVersion: agentrun/v1
kind: UseCase
metadata:
name: code-review
spec:
description: Review PRs and recent commits
keywords: [pr, pull request, review, merge, commit, deploy]
workflows:
- review-pull-requests
scope: github # MCP server scope filtering
template: |
List open PRs with author, title, status, and highlights.
Skill – slash command with prompt + tools
# skills/health-check.yaml
apiVersion: agentrun/v1
kind: Skill
metadata:
name: health-check
spec:
command: /health-check
description: Full infrastructure health check
mode: direct # direct (fast) | agent (LLM reasoning)
tools:
- describe-eks-cluster
- describe-rds
- list-lambdas
- list-sqs-queues
prompt: |
Check all infrastructure components and report status.
Use OK/Warning/Critical for each service.
allowedRoles: [developer, operator, admin]
maxBudgetUsd: 0.15
KnowledgeBase – RAG document collection
# knowledge-bases/infra-runbooks.yaml
apiVersion: agentrun/v1
kind: KnowledgeBase
metadata:
name: infra-runbooks
spec:
description: Infrastructure runbooks and troubleshooting guides
source:
type: markdown
path: docs/runbooks/
chunking:
strategy: heading
maxTokens: 1500
overlap: 60
embedding:
model: titan-embed-v2
dimensions: 1024
tags: [infra-health, lambda-debug, aws] # scoped to matching use-cases/roles
Eval – test cases for skill routing
# evals/health-check.yaml
apiVersion: agentrun/v1
kind: Eval
metadata:
name: health-check
spec:
target:
kind: Skill
name: health-check
triggerCases:
- query: "how is the infrastructure?"
shouldTrigger: true
- query: "find the checkout lambda"
shouldTrigger: false
executionCases:
- id: full-health
prompt: "check infrastructure health"
expectations:
- type: tool_called
value: describe_eks_cluster
- type: tool_called
value: describe_rds
config:
passThreshold: 0.8
maxBudgetPerCaseUsd: 0.15
Using Evals
Evals validate that skills and workflows behave correctly by testing:
- Trigger routing – does this query trigger the skill (or not)?
- Execution quality – were the right tools called? Did the response meet quality criteria?
Running Evals
# CLI (validate skill routing and execution)
pnpm --filter @agentrun-ai/cli exec agentrun eval evals/health-check.yaml

// Programmatic (in your agent harness)
import { runEval } from "@agentrun-ai/core";

const results = await runEval({
  evalPath: "evals/health-check.yaml",
  skillCatalog: yourSkillCatalog, // your loaded skill catalog
  callLlm: yourLlmProvider,       // pluggable LLM call
  executeTool: yourToolExecutor,  // pluggable tool executor
});
console.log(`Passed: ${results.passedCases}/${results.totalCases}`);
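In CI, the same result object can gate a merge: compare the pass rate against the manifest's passThreshold and fail the build below it (a sketch; only passedCases and totalCases are shown above, so treat other result fields as unknown):

```ts
// Sketch of a CI gate built on the runEval result from the snippet above.
const passRate = results.passedCases / results.totalCases;
const passThreshold = 0.8; // mirrors config.passThreshold in the Eval manifest

if (passRate < passThreshold) {
  console.error(`Eval pass rate ${(passRate * 100).toFixed(1)}% is below ${passThreshold * 100}%`);
  process.exit(1); // fail the CI job
}
```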
Eval Structure
Trigger Cases – test skill routing logic:
- shouldTrigger: true – this query MUST activate the skill
- shouldTrigger: false – this query MUST NOT activate the skill

Execution Cases – test tool calls and output quality:
- expectations – tools that should (or shouldn't) be called
- assertions – custom checks on the response (regex, token count, cost)
Cost Control
The maxBudgetPerCaseUsd field limits spend per evaluation run. If an LLM call exceeds this, it's rejected and logged as a failure – useful for catching regressions where the agent starts making expensive queries.
License
GNU Affero General Public License v3.0 – see NOTICE for copyright.
AgentRun is free software: you can redistribute it and/or modify it under the terms of the AGPLv3. If you run a modified version of AgentRun as a network service, you must make the source code available to users of that service (AGPLv3 Section 13).
Extensions via Packs (YAML manifests) are configuration, not derivative works – no copyleft trigger.
