Browser MCP Lite
Minimal, auth-secured MCP server for real browser access. ~500 lines. Token auth, Chrome Extension MV3, accessibility tree.
Ask AI about Browser MCP Lite
Powered by Claude Β· Grounded in docs
I know everything about Browser MCP Lite. Ask me about installation, configuration, usage, or troubleshooting.
0/500
Reviews
Documentation
browser-mcp-lite
ηΉι«δΈζ | English
Give your AI assistant eyes into your real browser β in ~500 lines of code.
A minimal, auth-secured MCP server that lets AI assistants read pages, take screenshots, and run scripts in your actual Chrome browser β with your existing login sessions intact.
AI Assistant ββHTTP POSTβββΆ MCP Server (:12307) ββWebSocketβββΆ Chrome Extension βββΆ DOM
(token auth) (ws://) (minimal permissions)
Why build your own?
Existing browser MCP solutions are either:
- Too heavy β mcp-chrome requires
debugger,history,<all_urls>permissions, ships minified code you can't audit - Headless only β Playwright MCP launches a new browser, can't see your logged-in sessions
- Cloud-dependent β Browserbase, etc. route your pages through third-party servers
browser-mcp-lite takes a different approach:
| mcp-chrome | Playwright MCP | browser-mcp-lite | |
|---|---|---|---|
| Reads your real browser | Yes | No (headless) | Yes |
| Auth on MCP endpoint | No | No | Token auth |
| Extension permissions | 8+ | N/A | 5 (minimal) |
| Code you can audit | Minified | Yes | ~500 lines |
| Login sessions | Yes | No | Yes |
What it does
Four tools, exposed via standard MCP protocol:
| Tool | Description | Returns |
|---|---|---|
list_tabs | List all open tabs | [{id, url, title, active}] |
read_page | Read page as accessibility tree | Structured text (token-efficient) |
screenshot | Capture visible area | Base64 PNG image |
inject_script | Run custom JS in a tab | Script return value |
Quick start
1. Install and start the server
Option A β npx (no clone needed):
npx browser-mcp-lite
Option B β clone:
git clone https://github.com/notoriouslab/browser-mcp-lite.git
cd browser-mcp-lite/server
npm install
node index.js
On first run, a token is auto-generated. The server prints the full token and a ready-to-paste config block:
[browser-mcp-lite] MCP server: http://127.0.0.1:12307/mcp
[browser-mcp-lite] WebSocket: ws://127.0.0.1:12307/ws
β First run β new token generated
βββ Auth Token (paste into Chrome Extension popup) βββ
<full 64-char hex token>
βββββββββββββββββββββββββββββββββββββββββββββββββββββ
βββ MCP Client Config (save as .mcp.json) βββ
{
"mcpServers": {
"browser": {
"type": "http",
"url": "http://127.0.0.1:12307/mcp",
"headers": {
"Authorization": "Bearer <token>"
}
}
}
}
βββββββββββββββββββββββββββββββββββββββββββββββββββββ
[browser-mcp-lite] Waiting for Chrome Extension...
2. Load the Chrome Extension
- Open
chrome://extensions/ - Enable Developer mode (top right)
- Click Load unpacked β select the
extension/folder - Click the Browser MCP Lite icon in your toolbar
- Copy the token printed in the terminal and paste it into the Auth Token field
- Click Connect
The extension auto-connects. Click the toolbar icon to verify the green dot.
3. Connect your AI assistant
Note: The ready-to-paste
.mcp.jsonconfig is printed in the terminal when the server starts.
Claude Code
Add to your project's .mcp.json:
{
"mcpServers": {
"browser": {
"type": "http",
"url": "http://127.0.0.1:12307/mcp",
"headers": {
"Authorization": "Bearer YOUR_TOKEN_HERE"
}
}
}
}
Replace YOUR_TOKEN_HERE with the token printed in the terminal.
Claude Desktop
Add to ~/Library/Application Support/Claude/claude_desktop_config.json:
{
"mcpServers": {
"browser": {
"type": "http",
"url": "http://127.0.0.1:12307/mcp",
"headers": {
"Authorization": "Bearer YOUR_TOKEN_HERE"
}
}
}
}
Cursor / VS Code
In Cursor: Settings β MCP β Add Server:
- Name:
browser - Type:
http - URL:
http://127.0.0.1:12307/mcp - Headers:
Authorization: Bearer YOUR_TOKEN_HERE
Architecture
Three components
βββββββββββββββββββ ββββββββββββββββββββ ββββββββββββββββββββ
β AI Assistant β β MCP Server β β Chrome Extension β
β (any MCP client) ββββββΆβ (Fastify) ββββββΆβ (Manifest V3) β
β βHTTP β :12307/mcp β WS β β
β βPOST β Token auth β β tabs, scripting, β
β βββββββ βββββββ activeTab, alarms β
βββββββββββββββββββ ββββββββββββββββββββ ββββββββββββββββββββ
MCP Server (server/index.js, ~150 lines)
- Fastify HTTP server with
@modelcontextprotocol/sdkStreamable HTTP transport - Token auth on every request (Bearer token from
~/.browser-mcp-secrets.json) - WebSocket endpoint for Chrome Extension communication
- Per-session MCP transport management
Chrome Extension (extension/background.js, ~215 lines)
- Manifest V3 Service Worker
- WebSocket client with keepalive alarm (survives MV3 Service Worker termination)
- Implements the four tools directly via Chrome APIs
- Auto-reconnects when server restarts
Accessibility Tree Builder (extension/inject/accessibility-tree.js, ~195 lines)
- Injected into target pages to build a structured DOM representation
- Outputs
ref_*IDs for each interactive/structural element - Much more token-efficient than raw HTML β AI can reason about page structure without burning context
Why accessibility tree instead of raw HTML?
A typical web page has 50-200KB of HTML. An accessibility tree of the same page is 2-10KB β 10-50x smaller. It strips away styling, scripts, and decorative elements, keeping only what matters: headings, links, buttons, inputs, and their labels.
# Raw HTML: ~150KB
<div class="css-1dbjc4n r-1awozwy r-18u37iz r-1h0z5md" data-testid="...">
<div class="css-901oao r-1fmj7o5 r-37j5jr r-a023e6 r-b88u0q ...">
<span class="css-901oao ...">Settings</span>
</div>
</div>
# Accessibility tree: ~50 bytes
[ref_42 link "Settings" href=/settings]
Security design
- Token auth (both layers): MCP endpoint requires
Authorization: Bearer <token>. WebSocket endpoint requires token handshake on connect. Both use the same token from~/.browser-mcp-secrets.json. - Minimal permissions: Only
tabs,activeTab,scripting,alarms,storage. Nodebugger, nohistory, nocookies, nowebRequest. - Localhost only: Server binds to
127.0.0.1. Not exposed to the network. - No network proxying: The extension never makes HTTP requests using your browser cookies. It only reads DOM and takes screenshots.
- On-demand: Start the server when you need it, Ctrl+C when you're done. No daemon, no auto-start.
- Auditable: ~500 lines total. Read it all in 10 minutes.
Privacy considerations
Be aware of what you're exposing to your AI assistant.
list_tabssends all open tab URLs and titles to the AI assistant. If you have sensitive pages open (email, medical, private messages), their URLs and titles will be visible.screenshotcaptures whatever is on screen, including passwords, personal data, or private content.screenshotswitches tabs: Chrome'scaptureVisibleTabAPI can only capture the active tab. If you screenshot a background tab, it will be briefly switched to the foreground.
Prompt injection risk: A malicious web page could contain hidden text that tricks your AI assistant into calling inject_script with harmful code on other tabs. Always review tool calls before approving them. Do not auto-approve inject_script calls.
Mitigations: Only run the server when you need it. Close sensitive tabs before use. Review what your AI assistant can see via list_tabs before calling other tools. Never auto-approve tool calls in your MCP client.
MV3 Service Worker survival
Chrome's Manifest V3 terminates Service Workers after ~30 seconds of inactivity. The extension uses chrome.alarms with a 24-second interval to stay alive and reconnect the WebSocket if needed.
Customization
Change the port:
MCP_PORT=9999 node index.js
Update the WebSocket URL in extension/background.js line 5 and extension/popup.html accordingly.
Add new tools:
Edit server/tools.js to register new MCP tools, and add the corresponding handler in extension/background.js's handleToolRequest switch.
How it works (for the blog post curious)
- You start the MCP server (
node index.js). It listens on:12307for HTTP (MCP) and WebSocket (extension). - The Chrome Extension's Service Worker connects to
ws://127.0.0.1:12307/ws. - Your AI assistant sends an MCP request (e.g.,
read_page) tohttp://127.0.0.1:12307/mcpwith a Bearer token. - The server validates the token, translates the MCP call into a WebSocket message, and sends it to the extension.
- The extension executes the corresponding Chrome API call (e.g.,
chrome.scripting.executeScriptto inject the accessibility tree builder). - The result flows back: Extension β WebSocket β Server β HTTP response β AI assistant.
The whole round trip takes 100-500ms depending on page complexity.
Real-world example
See examples/web-architectures.md for a detailed walkthrough of reading data from different web architectures β iframe-based legacy portals, React/Angular SPAs, Vue.js dynamic sites, AJAX-heavy dashboards, and CSP-strict pages.
Running tests
node --test tests/server.test.js
Tests cover: health check, token auth (missing/wrong/valid), MCP protocol handshake, and session management.
File structure
browser-mcp-lite/
βββ server/
β βββ index.js # Fastify MCP Server + WebSocket hub
β βββ tools.js # Tool schemas + handlers (4 tools)
β βββ token.js # Shared token generation/loading
β βββ setup.js # One-time token generator
β βββ package.json
βββ extension/
β βββ manifest.json # Manifest V3, minimal permissions
β βββ background.js # Service Worker: WS client + tool implementations
β βββ popup.html # Connection status UI
β βββ popup.js
β βββ inject/
β β βββ accessibility-tree.js # DOM β structured text
β βββ icons/
βββ examples/
β βββ web-architectures.md # Handling different web architectures
βββ tests/
β βββ server.test.js # Server auth + protocol tests
βββ LICENSE
βββ README.md # English
βββ README.zh-TW.md # ηΉι«δΈζ
Requirements
- Node.js 18+
- Chrome or Chromium-based browser
- Any MCP-compatible AI client
License
MIT
Credits
Accessibility tree approach inspired by anthropics/anthropic-quickstarts.
