Crawlbase MCP
Crawlbase MCP Server connects AI agents and LLMs with real-time web data. It powers Claude, Cursor, and Windsurf integrations with battle-tested web scraping, JavaScript rendering, and anti-bot protection enabling structured, live data inside your AI workflows.
Installation
npx crawlbase-mcpAsk AI about Crawlbase MCP
Powered by Claude Β· Grounded in docs
I know everything about Crawlbase MCP. Ask me about installation, configuration, usage, or troubleshooting.
0/500
Reviews
Documentation
What is Crawlbase MCP?
Crawlbase MCP is a Model Context Protocol (MCP) server that bridges AI agents and the live web. Instead of relying on outdated training data, your LLMs can now fetch fresh, structured, real-time content β powered by Crawlbaseβs proven crawling infrastructure trusted by 70,000+ developers worldwide.
It handles the complexity of scraping for you:
- JavaScript rendering for modern web apps
- Proxy rotation & anti-bot evasion
- Structured outputs (HTML, Markdown, screenshots)
How It Works
- Get Free Crawlbase Tokens β Sign up at Crawlbase βοΈ, get free Normal, and JavaScript tokens.
- Set Up MCP Configuration β Configure the MCP server in your preferred client (Claude, Cursor, or Windsurf) by updating the MCP Servers settings.
- Start Crawling β Use commands like crawl, crawl_markdown, or crawl_screenshot to bring live web data into your AI agent.
Setup & Integration
Claude Desktop
- Open Claude Desktop β Settings β Developer β Edit Config
- Add to
claude_desktop_config.json: - Replace
your_token_hereandyour_js_token_herewith the tokens from your dashboard.
{
"mcpServers": {
"crawlbase": {
"type": "stdio",
"command": "npx",
"args": ["@crawlbase/mcp@latest"],
"env": {
"CRAWLBASE_TOKEN": "your_token_here",
"CRAWLBASE_JS_TOKEN": "your_js_token_here"
}
}
}
}
Claude Code
Add to your claude.json configuration:
{
"mcpServers": {
"crawlbase": {
"type": "stdio",
"command": "npx",
"args": ["@crawlbase/mcp@latest"],
"env": {
"CRAWLBASE_TOKEN": "your_token_here",
"CRAWLBASE_JS_TOKEN": "your_js_token_here"
}
}
}
}
Cursor IDE
- Open Cursor IDE β File β Preferences β Cursor Settings β Tools and Integrations β Add Custom MCP
- Add to
mcp.json: - Replace
your_token_hereandyour_js_token_herewith the tokens from your dashboard.
{
"mcpServers": {
"crawlbase": {
"type": "stdio",
"command": "npx",
"args": ["@crawlbase/mcp@latest"],
"env": {
"CRAWLBASE_TOKEN": "your_token_here",
"CRAWLBASE_JS_TOKEN": "your_js_token_here"
}
}
}
}
Windsurf IDE
- Open WindSurf IDE β File β Preferences β WindSurf Settings β General β MCP Servers β Manage MCPs β View raw config
- Add to
mcp_config.json: - Replace
your_token_hereandyour_js_token_herewith the tokens from your dashboard.
{
"mcpServers": {
"crawlbase": {
"type": "stdio",
"command": "npx",
"args": ["@crawlbase/mcp@latest"],
"env": {
"CRAWLBASE_TOKEN": "your_token_here",
"CRAWLBASE_JS_TOKEN": "your_js_token_here"
}
}
}
}
HTTP Transport Mode
For scenarios where you need a shared MCP server accessible over HTTP (e.g., multi-user environments, custom integrations), you can run the server in HTTP mode:
# Clone and install
git clone https://github.com/crawlbase/crawlbase-mcp.git
cd crawlbase-mcp
npm install
# Start HTTP server with tokens (default port: 3000)
CRAWLBASE_TOKEN=your_token CRAWLBASE_JS_TOKEN=your_js_token npm run start:http
# Or with custom port
CRAWLBASE_TOKEN=your_token CRAWLBASE_JS_TOKEN=your_js_token MCP_PORT=8080 npm run start:http
The server exposes:
POST /mcp- MCP Streamable HTTP endpointGET /health- Health check endpoint
Per-Request Token Authentication
HTTP mode supports per-request tokens via headers, allowing multiple users to share a single server:
curl -X POST http://localhost:3000/mcp \
-H "Content-Type: application/json" \
-H "Accept: application/json, text/event-stream" \
-H "X-Crawlbase-Token: your_token" \
-H "X-Crawlbase-JS-Token: your_js_token" \
-d '{"jsonrpc": "2.0", "method": "tools/list", "id": 1}'
Headers:
X-Crawlbase-Token- Normal token for HTML requestsX-Crawlbase-JS-Token- JavaScript token for JS-rendered pages/screenshots
Headers override environment variables when provided, enabling multi-tenant deployments.
π Get your free tokens at Crawlbase βοΈ.
Usage
Once configured, use these commands inside Claude, Cursor, or Windsurf:
- crawl β Fetch raw HTML
- crawl_markdown β Extract clean Markdown
- crawl_screenshot β Capture screenshots
Example prompts:
- βCrawl Hacker News and return top stories in markdown.β
- βTake a screenshot of TechCrunch homepage.β
- βFetch Tesla investor relations page as HTML.β
Async Crawling with Cloud Storage
For larger jobs, Crawlbase MCP can push crawl results to Crawlbase Cloud Storage instead of returning them immediately. Your AI agent can then come back later to read, list, or clean up those pages β useful when crawling many URLs at once, revisiting a dataset across sessions, or keeping heavy HTML out of the chat until you actually need it.
Example prompts:
- βCrawl these 50 product pages and save them to my Crawlbase storage. Once they're saved, summarize each one.β
- βSave the Hacker News front page to storage so I can analyze it later.β
- βHow many pages do I have stored in Crawlbase right now? Show me the most recent 20.β
- βPull back everything I saved yesterday from my Crawlbase storage and give me a report.β
- βDelete all the pages I have in Crawlbase storage β I'm done with that project.β
Use Cases
- Market research β Pull live data from competitors, news, and reports
- E-commerce monitoring β Track products, reviews, and prices in real time
- News & finance feeds β Keep AI agents up-to-date with live events
- Autonomous AI agents β Give them vision to act on fresh web data
Resources & Next Steps
Looking to supercharge your AI agents with live web data? Get started here:
- βοΈ Learn More β See how MCP powers AI agents with real-time web data βοΈ
- π Crawlbase Website β Get free tokens & start crawling today βοΈ
Copyright 2026 Crawlbase

