Vision
STDIOMCP server adding vision capabilities to AI models through OpenRouter's vision models
MCP server adding vision capabilities to AI models through OpenRouter's vision models
Ever wanted to use a model like GLM-4.6 or other great AI models that just don't have vision capabilities? This MCP server solves that problem by adding vision capabilities to any model through OpenRouter's vision models.
Some really good AI models don't support vision. You're stuck choosing between your preferred model or vision capabilities. This server bridges that gap by providing seamless vision capabilities through OpenRouter's vision models.
This MCP server provides a simple analyze_image tool that can:
Before installing, make sure you have:
node --version # Should show v18.0.0 or higher npm --version # Should show 8.0.0 or higher
If you need to install or update Node.js:
# Install nvm first, then: nvm install 20 nvm use 20
brew install nodewinget install OpenJS.NodeJSsudo apt install nodejs npmImportant: This server is written in TypeScript and uses dependencies (like
node-fetchv3) that require Node.js 18+. Older versions (like Node.js 16 or below) will not work.
sk-or-v1-...)npm install -g @thenomadinorbit/vision-mcp-server
Success! The package is now globally available as
vision-mcpcommand.
git clone https://github.com/TheNomadInOrbit/vision-mcp-server.git cd vision-mcp-server npm install npm run build npm install -g .
Note: Use this method if you want to modify the source code or contribute to the project.
Add this server configuration to your MCP client:
{ "mcpServers": { "vision-analyzer": { "command": "vision-mcp", "type": "stdio", "timeout": 60, "disabled": false, "autoApprove": [], "env": { "OPENROUTER_API_KEY": "your_api_key_here", "OPENROUTER_MODEL": "anthropic/claude-3-5-sonnet" } } } }
Important: The vision-mcp command requires an OpenRouter API key to run. You cannot test it directly without configuration.
OPENROUTER_API_KEY="your_api_key_here" vision-mcp --help
You should see the server start up with logs like:
Application initialized successfully
Starting Vision MCP Server...
MCP server started successfully
Vision MCP Server is running on stdio
Press Ctrl+C to stop the test.
vision-mcp without the API key?vision-mcp
You'll get this error (this is normal and expected):
Error: OPENROUTER_API_KEY environment variable is required
This means the installation worked! The server is just protecting you from running without proper configuration.
# Check if the command is available which vision-mcp # Check if the package is installed npm list -g @thenomadinorbit/vision-mcp-server
"vision-analyzer" - Server name (you can change this to anything you like)"command": "vision-mcp" - Required: The global command to run the server"type": "stdio" - Required: Communication protocol for MCP"timeout": 60 - Optional: Timeout in seconds (default: 60)"disabled": false - Optional: Set to true to disable the serverConfigure which tools can run without asking for permission:
"autoApprove": []
Options:
[] (empty) - Requires approval for all tools (safest)["list_models"] - Auto-approve listing available models only["analyze_image"] - Auto-approve vision analysis (convenient but less safe)["analyze_image", "list_models"] - Auto-approve all tools (most convenient)You can use any vision model from OpenRouter:
"env": { "OPENROUTER_API_KEY": "your_api_key_here", "OPENROUTER_MODEL": "anthropic/claude-3-5-sonnet" }
Popular Models:
anthropic/claude-3.5-sonnet (recommended - best for vision)openai/gpt-4o-2024-08-06 (excellent vision capabilities)google/gemini-2.0-flash-001 (fast and cost-effective)anthropic/claude-3-opus (most powerful for complex analysis){ "mcpServers": { "vision-analyzer": { "command": "vision-mcp", "type": "stdio", "timeout": 60, "disabled": false, "autoApprove": ["list_models"], "env": { "OPENROUTER_API_KEY": "sk-or-v1-your-actual-key-here", "OPENROUTER_MODEL": "anthropic/claude-3.5-sonnet", "MAX_IMAGE_SIZE": "10485760" } } } }
Once configured, your AI assistant can use these tools:
analyze_imageAnalyze images with AI vision models
list_modelsList all available vision models from OpenRouter
Once configured, you can ask your AI assistant to analyze images like this:
You: "Can you analyze this image: https://example.com/image.jpg"
What happens behind the scenes:
analyze_image tool from this MCP serverYou see: Detailed image analysis from your AI assistant You don't see: All the technical MCP communication happening behind the scenes
Analyze an image from URL:
"Can you analyze this image: https://example.com/image.jpg"
Analyze a local image:
"Please analyze the image at /Users/username/Pictures/photo.png"
Get available models:
"What vision models are available?"
Detailed analysis:
"Analyze this image and tell me about the objects, colors, and mood: https://example.com/artwork.jpg"
Compare images:
"Can you analyze these two images and tell me the differences: image1.jpg and image2.jpg"
You can customize the server with these environment variables:
| Variable | Description | Default | Required | 
|---|---|---|---|
OPENROUTER_API_KEY | Your OpenRouter API key | - | Yes | 
OPENROUTER_MODEL | AI model to use | anthropic/claude-3.5-sonnet | No | 
MAX_IMAGE_SIZE | Max image size in bytes | 10485760 (10MB) | No | 
vision-mcp gives an error!"The Error:
Error: OPENROUTER_API_KEY environment variable is required
Why this happens: You're trying to run vision-mcp directly from the command line. This MCP server is designed to be used through an MCP client (like Claude Code), not run directly.
The Fix:
vision-mcp directly in terminalQuick test only: If you want to test the installation, use:
OPENROUTER_API_KEY="your_key" vision-mcp --help
This server doesn't have a web interface or CLI commands. It's an MCP server that adds vision capabilities to your AI assistant through the MCP protocol.
Workflow:
npm install -g @thenomadinorbit/vision-mcp-server)vision-mcpnpm install -g @thenomadinorbit/vision-mcp-servernpm install -g . after buildingnpm list -g @thenomadinorbit/vision-mcp-server to verify installationvision-mcp (not a file path)npm run buildwhich vision-mcpsk-or-v1-Found a bug or have a suggestion? Feel free to open an issue with:
If you want to fork and modify for your own use:
git clone https://github.com/TheNomadInOrbit/vision-mcp-server.git cd vision-mcp-server npm install npm run build
If you discover a security vulnerability, please open a security issue with the "security" label.
See CHANGELOG.md for a detailed history of changes.
MIT - Use it however you want.