Scrapling Fetch
A web content fetching tool that bypasses bot detection
An MCP server that helps AI assistants access text content from websites that implement bot detection, bridging the gap between what you can see in your browser and what the AI can access.
This tool is optimized for low-volume retrieval of documentation and reference materials (text/HTML only) from websites that implement bot detection. It has not been designed or tested for general-purpose site scraping or data harvesting.
Note: This project was developed in collaboration with Claude Sonnet 3.7 and 4.5, using LLM Context.
# Install scrapling-fetch-mcp
uv tool install scrapling-fetch-mcp

# Install browser binaries (REQUIRED - large downloads)
uvx --from scrapling-fetch-mcp scrapling install
Important: The browser installation downloads hundreds of MB of data and must complete before first use. If the MCP server times out on first use, the browsers may still be installing in the background. Wait a few minutes and try again.
Add this configuration to your Claude Desktop MCP settings:
macOS: ~/Library/Application Support/Claude/claude_desktop_config.json
Windows: %APPDATA%\Claude\claude_desktop_config.json
{
  "mcpServers": {
    "scrapling-fetch": {
      "command": "uvx",
      "args": ["scrapling-fetch-mcp"]
    }
  }
}
After updating the config, restart Claude Desktop.
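If you would rather not edit the file by hand, the same entry can be merged in with a short script. The following is a minimal sketch and not part of scrapling-fetch-mcp: the helper names `config_path`, `add_server`, and `update_file` are hypothetical, and it assumes the config file is plain JSON at one of the two paths listed above.

```python
import json
import os
import sys
from pathlib import Path

# The server entry from the configuration shown above.
SERVER_ENTRY = {"command": "uvx", "args": ["scrapling-fetch-mcp"]}


def config_path() -> Path:
    """Return the Claude Desktop config path on macOS or Windows (hypothetical helper)."""
    if sys.platform == "darwin":
        return (Path.home() / "Library" / "Application Support"
                / "Claude" / "claude_desktop_config.json")
    if sys.platform == "win32":
        return Path(os.environ["APPDATA"]) / "Claude" / "claude_desktop_config.json"
    raise RuntimeError("only the macOS and Windows paths are documented")


def add_server(config: dict) -> dict:
    """Return a copy of config with the scrapling-fetch entry added.

    Other servers and unrelated top-level settings are left untouched.
    """
    merged = dict(config)
    servers = dict(merged.get("mcpServers", {}))
    servers["scrapling-fetch"] = dict(SERVER_ENTRY)
    merged["mcpServers"] = servers
    return merged


def update_file(path: Path) -> None:
    """Read the config at path (if it exists), merge the entry, and write it back."""
    current = json.loads(path.read_text(encoding="utf-8")) if path.exists() else {}
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text(json.dumps(add_server(current), indent=2), encoding="utf-8")
```

Calling `update_file(config_path())` and then restarting Claude Desktop should have the same effect as the manual edit. Merging rather than overwriting keeps any MCP servers you have already configured.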
This MCP server provides two tools that Claude can use automatically when you ask it to fetch web content:
The AI decides which tool to use based on your request. You just ask naturally:
"Can you fetch the docs at https://example.com/api"
"Find all mentions of 'authentication' on that page"
"Get me the installation instructions from their homepage"
The tools support three levels of bot detection bypass:
Claude automatically starts with basic mode and escalates if needed.
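The escalation behaviour can be pictured as a simple loop over protection levels. This is only an illustrative sketch: "basic" is the only mode named in this document, so the other two mode names, the `fetch_with_escalation` function, and the `fetch` callable are hypothetical placeholders rather than the server's actual API.

```python
from typing import Callable, Optional, Sequence

# Only "basic" appears in this README; the other two names are placeholders.
MODES: Sequence[str] = ("basic", "level-2", "level-3")


def fetch_with_escalation(
    url: str,
    fetch: Callable[[str, str], Optional[str]],
    modes: Sequence[str] = MODES,
) -> tuple[str, str]:
    """Try each mode in order and return (mode, content) for the first success.

    `fetch(url, mode)` is a caller-supplied function that returns the page
    text, or None when the request was blocked.
    """
    for mode in modes:
        content = fetch(url, mode)
        if content is not None:
            return mode, content
    raise RuntimeError(f"all modes failed for {url}")
```

Starting at the lowest level keeps most requests fast, and the heavier modes are only used for sites that actually block the lighter ones.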
Built with Scrapling for web scraping with bot detection bypass.
Apache 2.0