Nl Opendata MCP
MCP Server for Dutch government open data (CBS, data.overheid.nl)
Installation
npx nl-opendata-mcpAsk AI about Nl Opendata MCP
Powered by Claude Β· Grounded in docs
I know everything about Nl Opendata MCP. Ask me about installation, configuration, usage, or troubleshooting.
0/500
Reviews
Documentation
NL OpenData MCP Server
A comprehensive Model Context Protocol (MCP) server for accessing Dutch government open data from CBS (Centraal Bureau voor de Statistiek) and data.overheid.nl.
This is a small side project to experiment with MCP and see if it can be useful for accessing Dutch government open data. It is not production ready and should not be used for production.
I have tested it with Claude Desktop and LM Studio and it works wreasonably well. It should work with the most MCP clients. The results are always dependent on the quality of the model you use. Frontier models understand data and mcp tools well and produce good results. Local models may not understand the tools calling and produce unexpected results.
For local models I got the best results using the Devstral2 small model with gpt-oss 20b close second. Context length is a very limiting factor for local models. Yo will need at least 30k tokens context length for basic analysis to work.
β¨ Features
- π Access 4800+ CBS Datasets - Browse and query statistical data on population, economy, health, environment, and more
- π Smart Search - Search datasets by title, summary, or both with local caching for fast results
- π₯ Flexible Data Fetching - Query, filter, and download datasets in CSV or Parquet format
- π¦ DuckDB Integration - Save datasets directly to DuckDB for efficient SQL querying
- π Python Analysis - Execute Pandas code directly on datasets without downloading
- ποΈ Dual Source Support - Check availability across CBS OData and data.overheid.nl
π¦ Installation
Using uvx (Recommended)
# Run directly from PyPI
uvx nl-opendata-mcp
# Or run directly from GitHub
uvx --from git+https://github.com/soulnai/nl-opendata-mcp.git nl-opendata-mcp
Install with pip/uv
# Install from PyPI
uv pip install nl-opendata-mcp
# Install from GitHub
uv pip install git+https://github.com/soulnai/nl-opendata-mcp.git
From Source (Development)
# Clone the repository
git clone https://github.com/soulnai/nl-opendata-mcp.git
cd nl-opendata-mcp
# Install dependencies and package in editable mode
uv sync
uv pip install -e .
# Run the server
uv run nl-opendata-mcp
π§ Configuration
Claude Desktop
Add to your Claude Desktop configuration (claude_desktop_config.json):
{
"mcpServers": {
"nl-opendata-mcp": {
"command": "uvx",
"args": ["nl-opendata-mcp"]
}
}
}
LM Studio
Add to your LM Studio configuration (mcp.json):
{
"mcpServers": {
"nl-opendata-mcp": {
"command": "uvx",
"args": ["nl-opendata-mcp"]
}
}
}
With Environment Variables
The server supports different transport modes:
# Default: stdio transport
uvx nl-opendata-mcp
# HTTP transport (port 8000)
TRANSPORT=http uvx nl-opendata-mcp
# SSE transport (port 8000)
TRANSPORT=sse uvx nl-opendata-mcp
π οΈ Available Tools
Discovery Tools
| Tool | Description |
|---|---|
cbs_list_datasets | List available datasets from the CBS catalog |
cbs_search_datasets | Search datasets by keyword (in title, summary, or both) |
cbs_check_dataset_availability | Check if a dataset is available via CBS OData or data.overheid.nl |
cbs_estimate_dataset_size | Estimate dataset size before fetching (rows, columns, recommended strategy) |
cbs_inspect_dataset_details | Get comprehensive dataset summary (metadata, structure, sample data) |
Metadata Tools
| Tool | Description |
|---|---|
cbs_get_metadata | Unified metadata tool - get info, structure, dimension values, or custom endpoints |
cbs_get_metadata types:
metadata_type="info"- Dataset description (TableInfos)metadata_type="structure"- Column definitions and data types (DataProperties)metadata_type="endpoints"- Available metadata endpointsmetadata_type="dimensions"- Dimension values with codes for filtering (requiresendpoint_name)metadata_type="custom"- Query custom endpoint (requiresendpoint_name)
Data Fetching Tools
| Tool | Description |
|---|---|
cbs_query_dataset | Query data with filtering and column selection |
cbs_save_dataset | Save dataset to CSV (use fetch_all=True for complete dataset) |
cbs_save_dataset_to_duckdb | Save dataset to DuckDB for SQL queries |
Analysis Tools (disabled by default)
| Tool | Description |
|---|---|
cbs_analyze_remote_dataset | Execute Python/Pandas code on a remote dataset |
cbs_analyze_local_dataset | Execute Python/Pandas code on a local CSV file |
π Usage Examples
1. Discovering Datasets
List available datasets:
Use cbs_list_datasets with top=20 to see the first 20 datasets
Search for population data:
Use cbs_search_datasets with query="bevolking" to find population datasets
Search only in titles:
Use cbs_search_datasets with query="inflatie" and search_field="title"
2. Exploring a Dataset
Get a comprehensive overview (recommended first step):
Use cbs_inspect_dataset_details with dataset_id="85313NED"
# Returns:
# - Source confirmation (CBS OData or data.overheid.nl)
# - Title and description
# - Column definitions with types
# - Sample data (first 5 rows)
Check dataset size before fetching:
Use cbs_estimate_dataset_size with dataset_id="85313NED"
# Returns:
# - Estimated row count
# - Column count
# - Recommended fetch strategy
Get detailed column structure:
Use cbs_get_metadata with dataset_id="85313NED" and metadata_type="structure"
# Returns CSV with: Key, Type, Title, Description for each column
3. Querying Data
Basic query with pagination:
Use cbs_query_dataset with:
- dataset_id="85313NED"
- top=100
- skip=0
Query with OData filter:
Use cbs_query_dataset with:
- dataset_id="85313NED"
- filter="Perioden eq '2023JJ00'"
- top=50
Select specific columns:
Use cbs_query_dataset with:
- dataset_id="85313NED"
- select=["Perioden", "TotaleBevolking_1", "Mannen_2", "Vrouwen_3"]
- top=100
Query with multiple conditions:
Use cbs_query_dataset with:
- dataset_id="85313NED"
- filter="Perioden eq '2023JJ00' and Leeftijd eq '10000'"
4. Downloading Full Datasets
Save complete dataset to CSV:
Use cbs_save_dataset with:
- dataset_id="85313NED"
- file_name="population_data.csv"
- fetch_all=true
Save to DuckDB for SQL analysis (supports column selection):
Use cbs_save_dataset_to_duckdb with:
- dataset_id="85313NED"
- table_name="population"
- fetch_all=true
# Creates datasets.db with table 'population'
5. Analyzing Data
Note: Analysis tools are disabled by default. To enable them, set USE_PYTHON_ANALYSIS=true in the environment. It is security risk to let the model write and execute Python code on a client machine. Recommended for advanced users only. It's better to let the model query and save the data to a file, and let your CLI LLM coding tool to analyse it by writing and executing scripts. If you are running it in a non CLI environment (like LM Studio) and still want to give it the ability to analyse data, you can enable it by setting USE_PYTHON_ANALYSIS=true in the environment.
Analyze remote dataset with Pandas:
Use cbs_analyze_remote_dataset with:
- dataset_id="85313NED"
- analysis_code="print(df.describe())"
Calculate statistics:
Use cbs_analyze_remote_dataset with:
- dataset_id="85313NED"
- analysis_code="result = df['TotaleBevolking_1'].mean()"
Filter and aggregate:
Use cbs_analyze_remote_dataset with:
- dataset_id="85313NED"
- analysis_code="""
filtered = df[df['Perioden'].str.contains('2023')]
result = filtered.groupby('Leeftijd')['TotaleBevolking_1'].sum()
print(result)
"""
Analyze a local CSV file:
Use cbs_analyze_local_dataset with:
- dataset_path="downloads/85313NED_full.csv"
- analysis_code="print(df.info())"
6. Working with Metadata
Get classification codes (e.g., gender categories):
Use cbs_get_metadata with:
- dataset_id="85313NED"
- metadata_type="dimensions"
- endpoint_name="Geslacht"
Get time period definitions:
Use cbs_get_metadata with:
- dataset_id="85313NED"
- metadata_type="dimensions"
- endpoint_name="Perioden"
π― Common Workflows
Workflow 1: Quick Data Exploration
1. cbs_search_datasets(query="unemployment") # Find relevant datasets
2. cbs_inspect_dataset_details(dataset_id="82809NED") # Get overview
3. cbs_query_dataset(dataset_id="82809NED", top=10) # Preview data
Workflow 2: Full Dataset Analysis
1. cbs_estimate_dataset_size(dataset_id="85313NED") # Check size
2. cbs_save_dataset_to_duckdb(dataset_id="85313NED") # Save to DuckDB
3. Query using DuckDB CLI or cbs_analyze_local_dataset
Workflow 3: Filtered Data Export
1. cbs_get_metadata(dataset_id="85313NED", metadata_type="structure") # Get column names
2. cbs_get_metadata(dataset_id="85313NED", metadata_type="dimensions", endpoint_name="Perioden") # Get period codes
3. cbs_query_dataset(dataset_id="85313NED", filter="Perioden eq '2023JJ00'", select=["..."]) # Query
4. cbs_save_dataset(dataset_id="85313NED", file_name="filtered_data.csv") # Export
π Output Formats
| Format | Use Case |
|---|---|
| CSV | Universal compatibility, works with Excel, Pandas, etc. |
| DuckDB | SQL queries, joins across multiple datasets |
π OData Filter Examples
The CBS OData API uses OData v3 syntax for filtering:
# Exact match
Perioden eq '2023JJ00'
# Substring match
substringof('Amsterdam', RegioS)
# Multiple conditions
Perioden eq '2023JJ00' and Leeftijd eq '10000'
# OR conditions
Geslacht eq '1100' or Geslacht eq '2000'
# Numeric comparisons
TotaleBevolking_1 gt 100000
