Puppeteer Browser Automation
STDIOBrowser automation MCP server with Linux display server support for webpage interactions.
Browser automation MCP server with Linux display server support for webpage interactions.
A Model Context Protocol server that provides browser automation capabilities using Puppeteer, with full support for Linux display servers (X11 and Wayland). This server enables LLMs to interact with web pages, take screenshots, and execute JavaScript in a real browser environment.
This fork adds automatic detection and configuration for Linux display servers:
url
(string)name
(string, required): Name for the screenshotselector
(string, optional): CSS selector for element to screenshotwidth
(number, optional, default: 800): Screenshot widthheight
(number, optional, default: 600): Screenshot heightselector
(string): CSS selector for element to clickselector
(string): CSS selector for element to hoverselector
(string): CSS selector for input fieldvalue
(string): Value to fillselector
(string): CSS selector for element to selectvalue
(string): Value to selectscript
(string): JavaScript code to executeThe server provides access to two types of resources:
console://logs
)
screenshot://<name>
)
{ "mcpServers": { "puppeteer": { "command": "npx", "args": ["ts-node", "/path/to/index.ts"] } } }
To install Puppeteer Linux for Claude Desktop automatically via Smithery:
npx -y @smithery/cli install @PhialsBasement/mcp-puppeteer-linux --client claude
git clone https://github.com/PhialsBasement/MCP-Puppeteer-Linux.git
npm install
ts-node index.ts
The server automatically detects and configures the appropriate display environment:
WAYLAND_DISPLAY
WAYLAND_DISPLAY
QT_QPA_PLATFORM
GDK_BACKEND
MOZ_ENABLE_WAYLAND
XDG_SESSION_TYPE
DISPLAY
XAUTHORITY
This MCP server is licensed under the MIT License. This means you are free to use, modify, and distribute the software, subject to the terms and conditions of the MIT License. For more details, please see the LICENSE file in the project repository.