📦

Localollamamcpserver

Name: Localollamamcpserver
Rating: 2.0 (1 reviews)
Author: DimonSmart

MCP server: Localollamamcpserver

0 installs

2 stars

Trust: 41 — Fair

Installation

npx localollamamcpserver

Ask AI about Localollamamcpserver

I know everything about Localollamamcpserver. Ask me about installation, configuration, usage, or troubleshooting.

0/500

Loading tools...

Reviews

Documentation

DimonSmart.LocalOllamaMCPServer

A Model Context Protocol (MCP) server that provides tools to query local Ollama instances. Built with the official ModelContextProtocol SDK from Anthropic + Microsoft, it enables larger models (like Claude, GPT-4) to test prompts against smaller local models (like Llama 3, Mistral, etc.) running on Ollama.

Features

query_ollama - Send prompts to local Ollama models and get responses
query_ollama_with_files - Test a prompt template on one or more files from workspace roots (supports wildcards)
list_ollama_connections - List all configured Ollama server connections
list_ollama_models - Inspect which models are available on each Ollama server
Interactive elicitation fallback when a requested model is missing (MCP 0.4.1 preview)
Full MCP specification compliance with proper JSON-RPC 2.0 framing
Support for multiple Ollama server connections with authentication
Automatic tool schema generation from method signatures
SSL certificate validation control for self-signed certificates
1-hour default timeout for long-running model inference requests

Prerequisites

.NET 8.0 SDK
Ollama running locally (default: http://localhost:11434)

Installation

As a .NET Tool

Install the tool globally:

dotnet tool install --global DimonSmart.LocalOllamaMCPServer

To update to the latest version:

dotnet tool update --global DimonSmart.LocalOllamaMCPServer

One-click install in VS Code (MCP)

After installing the .NET tool, you can add this server to VS Code or VS Code Insiders with a single click:

VS Code will show you the MCP configuration and let you choose whether to add it to your user or workspace settings.

If you prefer the raw URLs, you can use:

VS Code:

vscode:mcp/install?%7B%22name%22%3A%20%22DimonSmart%20Local%20Ollama%20MCP%22%2C%20%22type%22%3A%20%22stdio%22%2C%20%22command%22%3A%20%22DimonSmart.LocalOllamaMCPServer%22%7D

VS Code Insiders:

vscode-insiders:mcp/install?%7B%22name%22%3A%20%22DimonSmart%20Local%20Ollama%20MCP%22%2C%20%22type%22%3A%20%22stdio%22%2C%20%22command%22%3A%20%22DimonSmart.LocalOllamaMCPServer%22%7D

And the underlying JSON configuration is:

{
  "name": "DimonSmart Local Ollama MCP",
  "type": "stdio",
  "command": "DimonSmart.LocalOllamaMCPServer"
}

You can also paste this JSON into VS Code via the MCP: Add Server command from the Command Palette.

From Source

Clone the repository.
Build the project:
```
dotnet build
```

Usage

Running the Server

You can run the server directly:

dotnet run --project src/DimonSmart.LocalOllamaMCPServer/DimonSmart.LocalOllamaMCPServer.csproj

Or if installed as a tool:

DimonSmart.LocalOllamaMCPServer

To check the version and configuration file location:

DimonSmart.LocalOllamaMCPServer --version

The server communicates via Standard Input/Output (stdio) using the MCP protocol. It is designed to be used by MCP clients such as:

Claude Desktop
GitHub Copilot in VS Code
Cursor
Custom agents and AI applications

Available Tools

`query_ollama`

Send a prompt to a local Ollama model and receive the response.

Parameters:

model_name (string, required) - Name of the Ollama model (e.g., llama3, mistral, phi4)
prompt (string, required) - The prompt text to send to the model
options (object, optional) - Model parameters such as temperature, top_p, etc.
connection_name (string, optional) - Name of the Ollama server connection. Uses default if omitted.

Example Request (JSON-RPC):

{
  "jsonrpc": "2.0",
  "method": "tools/call",
  "params": {
    "name": "query_ollama",
    "arguments": {
      "model_name": "llama3",
      "prompt": "Why is the sky blue?",
      "connection_name": "remote-gpu",
      "options": {
        "temperature": 0.7
      }
    }
  },
  "id": 1
}

Elicitation example when the model is missing

If you call query_ollama with a model that does not exist on the selected connection, the server now issues an MCP elicitation/create request asking the user to choose one of the available models. This keeps the conversation inside the same tool call instead of failing immediately.

Client sends an invalid tool call:

{
  "jsonrpc": "2.0",
  "method": "tools/call",
  "params": {
    "name": "query_ollama",
    "arguments": {
      "model_name": "llama3-invalid",
      "prompt": "hello",
      "connection_name": "local"
    }
  },
  "id": 42
}

The server looks up the real models (for example llama3 and mistral) and prompts the client:

{
  "jsonrpc": "2.0",
  "method": "elicitation/create",
  "params": {
    "message": "The model 'llama3-invalid' was not found on 'local'. Please choose another model to continue.",
    "requested_schema": {
      "required": ["modelName"],
      "properties": {
        "modelName": {
          "title": "Select an Ollama model",
          "description": "Pick a model available on 'local'.",
          "enum": ["llama3", "mistral"],
          "default": "llama3"
        }
      }
    }
  },
  "id": "elicitation-42"
}

After the user selects a model, the client replies:

{
  "jsonrpc": "2.0",
  "method": "elicitation/response",
  "params": {
    "request_id": "elicitation-42",
    "accepted": true,
    "content": {
      "modelName": "mistral"
    }
  }
}

The server re-runs the original tool call with the selected model (mistral) and returns the final tools/call result. If the user cancels or the client does not support elicitation, the tool call exits with a descriptive error message.

`query_ollama_with_files`

Test a prompt template against one or more files that live under the workspace roots advertised by the MCP host (via the roots/list request). The server requests those roots at runtime and refuses to read anything outside of them. Supports wildcards for batch processing multiple files.

Placeholders in the prompt template:

{{data}} - replaced with the file contents unless send_data_as_user_message=true

Parameters:

model_name (string, required) - Target model
prompt_template (string, required) - Template containing the placeholders above
file_path (string, optional) - File mask to apply. Use patterns like *.* for all files, *.cs for C# files, or a specific name for a single file
send_data_as_user_message (bool, optional) - Append file content as a separate user-style message instead of inline replacement
max_files (int, optional) - Limit how many files are processed when using wildcards (0 = no limit)
connection_name (string, optional) - Ollama server connection (default if omitted)

Example Request (process a single markdown file):

{
  "jsonrpc": "2.0",
  "method": "tools/call",
  "params": {
    "name": "query_ollama_with_files",
    "arguments": {
      "model_name": "llama3",
      "prompt_template": "Summarize {{file_name}} in 3 bullet points. Content: {{data}}",
      "file_path": "docs/example.md"
    }
  },
  "id": 99
}

Example Request (process all markdown files under a root):

{
  "jsonrpc": "2.0",
  "method": "tools/call",
  "params": {
    "name": "query_ollama_with_files",
    "arguments": {
      "model_name": "mistral",
      "prompt_template": "Classify the tone of {{file_path}}. Respond with 'positive', 'neutral', or 'negative'.",
      "file_path": "*.md",
      "max_files": 5
    }
  },
  "id": 100
}

If the connected host does not support roots/list or returns an empty list, the tool fails with an error instead of touching the filesystem.

`list_ollama_connections`

List all configured Ollama server connections. Passwords are masked for security.

Parameters: None

Example Request:

{
  "jsonrpc": "2.0",
  "method": "tools/call",
  "params": {
    "name": "list_ollama_connections",
    "arguments": {}
  },
  "id": 2
}

`list_ollama_models`

List all models that the selected Ollama server reports through /api/tags. This is especially handy when deciding which model to select during elicitation.

Parameters:

connection_name (string, optional) - Name of the Ollama server connection. Uses default if omitted.

Example Request:

{
  "jsonrpc": "2.0",
  "method": "tools/call",
  "params": {
    "name": "list_ollama_models",
    "arguments": {
      "connection_name": "remote-gpu"
    }
  },
  "id": 3
}

Configuration

The server supports multiple Ollama instances through configuration. You can configure connections using either appsettings.json or environment variables. If no configuration is provided, the server will automatically use default settings with a local Ollama server at http://localhost:11434.

Configuration via appsettings.json

Create or edit appsettings.json in the tool's installation directory. The DefaultServerName specifies which server to use when connection_name is not provided in tool calls.

{
  "Ollama": {
    "DefaultServerName": "local",
    "Servers": [
      {
        "Name": "local",
        "BaseUrl": "http://localhost:11434"
      },
      {
        "Name": "remote-gpu",
        "BaseUrl": "https://my-gpu-server.com:11434",
        "User": "admin",
        "Password": "secret-password",
        "IgnoreSsl": true
      }
    ]
  }
}

Server Configuration Properties:

Name - Unique identifier for the server connection
BaseUrl - Ollama server URL
User (optional) - Username for Basic authentication
Password (optional) - Password for Basic authentication
IgnoreSsl (optional) - Set to true to accept self-signed SSL certificates

Workspace roots for prompt testing

The MCP host (e.g., the VS Code MCP extension) is responsible for advertising filesystem boundaries via roots/list. This server requests the list of roots at runtime and restricts all file operations to that set. If the host does not expose any roots, tools such as query_ollama_with_files return an error instead of reading arbitrary paths.

Relative file_path values are resolved against the reported roots. Tools such as query_ollama_with_files return an error if the host does not expose any roots.

Configuration via Environment Variables

You can also configure servers using environment variables following standard .NET configuration naming conventions:

Ollama__DefaultServerName=remote-gpu
Ollama__Servers__0__Name=local
Ollama__Servers__0__BaseUrl=http://localhost:11434
Ollama__Servers__1__Name=remote-gpu
Ollama__Servers__1__BaseUrl=https://my-gpu-server.com:11434
Ollama__Servers__1__User=admin
Ollama__Servers__1__Password=secret
Ollama__Servers__1__IgnoreSsl=true

Development

Building from Source

Clone the repository:

git clone https://github.com/DimonSmart/LocalOllamaMCPServer.git
cd LocalOllamaMCPServer

Build the project:
```
dotnet build
```

Run locally:

dotnet run --project src/DimonSmart.LocalOllamaMCPServer/DimonSmart.LocalOllamaMCPServer.csproj

Running Tests

The project uses EasyVCR to record and replay HTTP interactions for reliable testing.

Run all tests:

dotnet test

Recording new cassettes:

Ensure Ollama is running locally
Pull the test model: ollama pull phi4:latest
Delete the existing cassette in tests/DimonSmart.LocalOllamaMCPServer.Tests/cassettes/
Run the tests to generate a new cassette

CI/CD

The project includes GitHub Actions workflows that:

Build and test on every push to main
Publish NuGet package to NuGet.org on version tags (e.g., v2.0.0)
Create GitHub releases with artifacts

Technology Stack

.NET 8.0 - Target framework
ModelContextProtocol SDK - Official MCP implementation
Microsoft.Extensions.Hosting - Application lifetime management
Microsoft.Extensions.Http - HTTP client factory
EasyVCR - HTTP recording for tests

Related Projects

Model Context Protocol - Official MCP documentation
Ollama - Run large language models locally
MCP Servers Registry - Collection of MCP servers

License

MIT