Think Tool
STDIOMCP server implementing think tool for improving Claude's complex reasoning capabilities.
MCP server implementing think tool for improving Claude's complex reasoning capabilities.
A Model Context Protocol (MCP) server that implements the "think" tool for enhancing complex reasoning capabilities in Large Language Models (LLMs). This tool provides LLMs with a dedicated space for structured thinking during problem-solving tasks, significantly improving performance in complex scenarios requiring policy adherence and multi-step reasoning.
The Think Tool MCP server is based on Anthropic's research demonstrating that providing LLMs with a dedicated "thinking space" dramatically improves performance on complex tasks. This tool allows any compatible LLM (Claude, GPT-4, and others) to:
As described in Anthropic's blog post, the think tool has shown significant improvements in tasks requiring complex reasoning and policy adherence across different language models.
Requirements: Cursor version 0.45.6 or higher
Cmd/Ctrl + ,
)think-tool-mcp
(or your preferred name)command
npx -y think-tool-mcp
Add to your claude_desktop_config.json
:
{ "mcpServers": { "think-tool": { "command": "npx", "args": ["-y", "think-tool-mcp"] } } }
Config file locations:
~/Library/Application Support/Claude/claude_desktop_config.json
%APPDATA%\Claude\claude_desktop_config.json
This server works with any platform supporting the Model Context Protocol. Refer to your platform's documentation for MCP server configuration.
Extensive research by Anthropic has demonstrated significant performance improvements when LLMs use the think tool. The following results showcase the measurable impact across different benchmarks and use cases.
τ-Bench is a comprehensive benchmark designed to test LLM tool usage in realistic customer service scenarios. It evaluates the ability to navigate complex conversations, follow detailed policy guidelines, and maintain consistency across multiple task trials.
The airline domain represents a complex policy-heavy environment where precise adherence to detailed rules is critical.
Configuration | k=1 | k=2 | k=3 | k=4 | k=5 |
---|---|---|---|---|---|
Think + Optimized Prompt | 0.584 | 0.444 | 0.384 | 0.356 | 0.340 |
Think Tool Alone | 0.404 | 0.254 | 0.186 | 0.140 | 0.100 |
Extended Thinking | 0.412 | 0.290 | 0.232 | 0.192 | 0.160 |
Baseline (No Think Tool) | 0.332 | 0.206 | 0.148 | 0.116 | 0.100 |
Key Findings:
The retail domain has simpler policies, allowing the think tool to show benefits even without extensive prompting.
Configuration | k=1 | k=2 | k=3 | k=4 | k=5 |
---|---|---|---|---|---|
Think Tool (No Prompt) | 0.812 | 0.735 | 0.685 | 0.650 | 0.626 |
Extended Thinking | 0.770 | 0.681 | 0.623 | 0.581 | 0.548 |
Baseline | 0.783 | 0.695 | 0.643 | 0.607 | 0.583 |
Key Findings:
SWE-Bench evaluates coding performance on real-world software engineering tasks. The think tool contributed to Claude 3.7 Sonnet achieving state-of-the-art performance.
Performance Impact:
Approach | Airline Domain (k=1) | Retail Domain (k=1) | Implementation Effort |
---|---|---|---|
Baseline | 0.332 | 0.783 | None |
Extended Thinking | 0.412 (+24%) | 0.770 (-1.7%) | Platform-dependent |
Think Tool | 0.404 (+22%) | 0.812 (+3.7%) | Minimal |
Think + Optimized Prompt | 0.584 (+76%) | N/A | Low |
Key Takeaway: The think tool provides substantial performance improvements with minimal implementation overhead, making it an excellent choice for enhancing LLM capabilities in complex reasoning scenarios.
The fastest way to get started:
npx -y think-tool-mcp
For persistent usage across projects:
npm install -g think-tool-mcp
For contributing or local development:
git clone https://github.com/abhinav-mangla/think-tool-mcp.git cd think-tool-mcp npm install npm run build npm start
Human: Help me optimize our company's supply chain with 5 warehouses and 12 retail locations.
LLM: I'll help you optimize your supply chain. Let me think through this systematically.
[Uses think tool]
Thought: This is a complex optimization problem requiring:
1. Analysis of warehouse-retail location distances
2. Shipping cost calculations between all pairs
3. Delivery time constraints evaluation
4. Inventory capacity considerations
5. Demand pattern analysis
I need to formulate this as a linear programming problem and consider both cost minimization and delivery time constraints...
[Proceeds with structured solution based on analysis]
Human: Can I return this opened product after 45 days without a receipt?
LLM: Let me carefully check our return policy.
[Uses think tool]
Thought: Analyzing this return request:
- Item: 45 days old (exceeds standard 30-day window)
- No receipt available
- Package opened
Policy considerations:
- Standard returns require 30 days + receipt
- Store credit might be possible with purchase verification
- Product category affects policy (electronics vs. clothing)
- Loyalty members may have extended windows
- Manager override possibilities
[Provides policy-compliant response with available options]
Human: Find inactive premium users (60+ days) and create a re-engagement campaign.
LLM: I'll search the database and create a targeted campaign.
[Runs database queries]
[Uses think tool]
Thought: Query results show 342 inactive premium users. Key patterns:
- 45% are in 6-12 month subscription cohort
- 72% last activity was "viewed dashboard"
- Only 8% used newest features
- 28% had support tickets before going inactive
Campaign strategy:
- Segment users by usage patterns
- Highlight unexplored features
- Address support-related pain points
- Personalize incentives based on history
[Creates targeted email campaign based on analysis]
think
Provides LLMs with a dedicated space for complex reasoning and analysis.
Parameters:
thought
(string, required): The thought process, reasoning, or analysis to recordDescription: The think tool accepts any structured thinking that an LLM needs to perform. This can include:
Usage Pattern: LLMs will automatically use this tool when they need to engage in complex reasoning. The tool does not retrieve new information or make changes—it simply provides a space for structured thinking.
think-tool-mcp/
├── src/
│ ├── index.ts # CLI entry point
│ └── server.ts # MCP server implementation
├── examples/
│ └── example_usage.md # Usage examples
├── dist/ # Compiled JavaScript
├── package.json
├── tsconfig.json
└── README.md
# Install dependencies npm install # Build the project npm run build # Run in development mode npm run dev # Start the built server npm start
Contributions are welcome! Please feel free to submit a Pull Request. For major changes, please open an issue first to discuss what you would like to change.
git checkout -b feature/amazing-feature
)git commit -m 'Add some amazing feature'
)git push origin feature/amazing-feature
)Server not starting:
Tool not appearing in AI platform:
Permission errors:
npx
instead of global installationFor development and debugging:
npm run dev
This runs the server with TypeScript directly and provides more detailed error information.
This project is licensed under the MIT License - see the LICENSE file for details.
Abhinav Mangla
Enhancing AI reasoning, one thought at a time.