Agent Browser MCP Server
MCP server for agent-browser - complete browser automation with token-efficient @ref system
Ask AI about Agent Browser MCP Server
Powered by Claude ยท Grounded in docs
I know everything about Agent Browser MCP Server. Ask me about installation, configuration, usage, or troubleshooting.
0/500
Reviews
Documentation
Agent-Browser MCP
English | ็ฎไฝไธญๆ | ๆฅๆฌ่ช
Model Context Protocol (MCP) server for agent-browser - providing complete browser automation capabilities for AI agents.
This project is an independent MCP server implementation that wraps the excellent agent-browser CLI tool, making its powerful browser automation features available through the Model Context Protocol.
Features
- ๐ง 44 Tools - Complete coverage of agent-browser's functionality
- ๐ฏ Token-Efficient @ref System - Reduces token usage by caching element references
- ๐ Full Playwright API - Leverage the complete browser automation capabilities
- ๐ Auto-Launch - Browser starts automatically when needed
- ๐พ State Persistence - Save and restore browser state across sessions
- ๐ฌ Video Recording - Record browser sessions for debugging
- ๐ Network Interception - Monitor and modify network requests
- ๐ Session Management - Manage multiple tabs and windows
Installation
Using npm
npm install agent-browser-mcp-server
From Source
git clone https://github.com/hughedward/agent_browser_mcp.git
cd agent_browser_mcp
npm install
npm run build
Quick Start
For Claude Desktop
- Install the package
- Configure in Claude Desktop settings (
~/.claude/settings.json):
{
"mcpServers": {
"agent-browser-mcp-server": {
"command": "npx",
"args": ["agent-browser-mcp-server"],
"env": {
"HEADED": "false"
}
}
}
}
Standalone
agent-browser-mcp-server
Available Tools
Core Tools
browser_navigate- Navigate to a URLbrowser_snapshot- Capture page structure with @ref systembrowser_screenshot- Take screenshotsbrowser_close- Close browser/page
Navigation & History
browser_back- Go back in historybrowser_forward- Go forward in historybrowser_reload- Reload the current page
Element Interaction
browser_click- Click an elementbrowser_fill- Fill input fieldsbrowser_type- Type without clearingbrowser_select- Select dropdown optionsbrowser_check/browser_uncheck- Check/uncheck checkboxesbrowser_drag- Drag and dropbrowser_upload- Upload filesbrowser_dblclick- Double clickbrowser_focus- Focus elementsbrowser_hover- Hover over elementsbrowser_scroll- Scroll pagebrowser_press- Press keyboard keys
Element Discovery
browser_find- Semantic element search (role, text, label, placeholder, etc.)browser_get- Get element informationbrowser_is- Check element state
Tabs & Windows
browser_tab- Manage tabsbrowser_window- Manage windowsbrowser_frame- Switch to iframes
Advanced Features
browser_record- Record browser sessionsbrowser_network- Monitor network requestsbrowser_console- Access consolebrowser_errors- Track JavaScript errorsbrowser_trace- Performance tracingbrowser_profiler- Chrome DevTools profilingbrowser_evaluate- Execute JavaScriptbrowser_pdf- Export to PDFbrowser_dialog- Handle JavaScript dialogsbrowser_download- Manage downloads
State & Storage
browser_state- Save/load browser statebrowser_cookies- Manage cookiesbrowser_storage- Access localStorage/sessionStorage
Utilities
browser_wait- Wait for conditionsbrowser_set- Set element attributesbrowser_mouse- Mouse controlbrowser_diff- Compare pagesbrowser_highlight- Debug highlighting
Configuration
Environment Variables:
| Variable | Description | Default |
|---|---|---|
HEADED | Run in headed mode (visible browser) | false |
BROWSER | Browser to use (chromium/firefox/webkit) | chromium |
Development
# Install dependencies
npm install
# Build
npm run build
# Run in development mode (auto-rebuild)
npm run dev
# Run tests
npm test
# Watch mode
npm run test:watch
# Start server
npm start
Documentation
- CLAUDE.md - Development guide for Claude Code
- TESTING_GUIDE.md - Testing instructions
- QUICK_TEST_GUIDE.md - Quick reference
Related Projects
- agent-browser - Original CLI tool this project wraps
- Model Context Protocol - The protocol this server implements
License
Apache-2.0
Note: This project is an independent implementation and is not officially affiliated with Vercel or the original agent-browser project.
