Urltomarkdown MCP
MCP server for converting URLs and HTML to clean Markdown
Ask AI about Urltomarkdown MCP
Powered by Claude Β· Grounded in docs
I know everything about Urltomarkdown MCP. Ask me about installation, configuration, usage, or troubleshooting.
0/500
Reviews
Documentation
urltomarkdown-mcp
An MCP server that converts URLs and raw HTML to clean Markdown. Built on top of urltomarkdown.
Give any MCP-compatible AI assistant the ability to read web pages β just point it at a URL and get structured Markdown back, ready for summarization, analysis, or ingestion into your workflow.
Tools
convert_url_to_markdown
Fetches a URL and converts the web page to clean Markdown.
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
url | string (URL) | yes | - | URL to fetch and convert |
inline_title | boolean | no | true | Prepend page title as H1 |
ignore_links | boolean | no | false | Strip hyperlinks |
readability | boolean | no | true | Use Readability for cleaner output |
convert_html_to_markdown
Converts raw HTML to clean Markdown (no network request needed).
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
html | string | yes | - | Raw HTML to convert |
url | string (URL) | no | - | Source URL for relative links and domain filters |
inline_title | boolean | no | true | Prepend page title as H1 |
ignore_links | boolean | no | false | Strip hyperlinks |
readability | boolean | no | true | Use Readability for cleaner output |
Use cases
- Research β Have your AI assistant read and summarize articles, documentation, or blog posts
- Data extraction β Pull structured content from web pages for analysis
- Documentation ingestion β Convert API docs or reference pages into Markdown for context
- Content migration β Bulk convert web content to Markdown format
- RAG pipelines β Feed clean web content into retrieval-augmented generation systems
Installation
Using npx (no install needed)
{
"mcpServers": {
"urltomarkdown": {
"command": "npx",
"args": ["-y", "urltomarkdown-mcp"]
}
}
}
Global install
npm install -g urltomarkdown-mcp
Then add to your MCP config:
{
"mcpServers": {
"urltomarkdown": {
"command": "urltomarkdown-mcp"
}
}
}
From source
git clone https://github.com/SapienEx-AI/urltomarkdown-mcp.git
cd urltomarkdown-mcp
npm install
{
"mcpServers": {
"urltomarkdown": {
"command": "node",
"args": ["/path/to/urltomarkdown-mcp/src/index.js"]
}
}
}
Where to put the config
| Client | Config file |
|---|---|
| Claude Desktop | ~/Library/Application Support/Claude/claude_desktop_config.json (macOS) or %APPDATA%\Claude\claude_desktop_config.json (Windows) |
| Claude Code | ~/.claude/settings.local.json or project .mcp.json |
| Cursor | .cursor/mcp.json in your project root |
| Windsurf | ~/.codeium/windsurf/mcp_config.json |
How it works
This server wraps the urltomarkdown library and exposes it over the Model Context Protocol via stdio transport. Under the hood it uses:
- Turndown for HTML-to-Markdown conversion
- Mozilla's Readability for content extraction and noise removal
- JSDOM for DOM parsing
The Readability pass strips navigation, sidebars, ads, and other non-content elements before conversion, producing clean output suitable for LLM consumption.
Testing
npm test
License
MIT
SapienEx β AI strategy and consulting
