plugins/claude-code-expert/skills/extended-thinking/SKILL.md
# Claude Code Extended Thinking
Complete guide to extended thinking modes and thinking budget configuration.
## Overview

Extended thinking lets Claude reason more deeply before responding; "ultrathink" is the trigger phrase for its highest level. It spends additional tokens on internal reasoning, which can improve quality on complex tasks.
## Thinking Modes

### Trigger Phrases (CLI)

Use these phrases in your prompts to activate thinking levels:
| Phrase | Thinking Tokens | Approx Cost |
|--------|----------------|-------------|
| think | ~4,000 tokens | ~$0.06 |
| think hard / megathink | ~10,000 tokens | ~$0.15 |
| ultrathink | ~32,000 tokens | ~$0.48 |
Example: "ultrathink about how to refactor the auth module"
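The cost column follows from simple arithmetic. The figures in the table are consistent with a rate of $15 per million thinking tokens; that rate is inferred from the table rather than stated in this guide, so treat it as an assumption and substitute the current output-token price for your model. A minimal sketch, with `estimateThinkingCost` as a hypothetical helper name:

```typescript
// Assumed rate, inferred from the table above ($0.06 for ~4,000 tokens).
// Replace it with the current output-token price for your model.
const ASSUMED_USD_PER_MILLION_TOKENS = 15;

// Thinking budgets copied from the trigger-phrase table.
const THINKING_BUDGETS: Array<{ phrase: string; tokens: number }> = [
  { phrase: "think", tokens: 4000 },
  { phrase: "think hard / megathink", tokens: 10000 },
  { phrase: "ultrathink", tokens: 32000 },
];

// Hypothetical helper: approximate cost in USD of a thinking budget.
function estimateThinkingCost(
  tokens: number,
  usdPerMillion: number = ASSUMED_USD_PER_MILLION_TOKENS
): number {
  return (tokens * usdPerMillion) / 1000000;
}

for (const { phrase, tokens } of THINKING_BUDGETS) {
  console.log(`${phrase}: ~$${estimateThinkingCost(tokens).toFixed(2)}`);
}
```

These are upper bounds for a single request: the budget is a ceiling, and a request that thinks less costs less.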
- `Option+T`: toggle extended thinking on/off in interactive mode.
- `CLAUDE_CODE_EFFORT_LEVEL=high`: set the effort level (`low`, `medium`, or `high`).
Standard reasoning: Claude thinks internally as needed.
More deliberate reasoning with configurable thinking budget.
Maximum reasoning depth for the most complex problems. Best for architecture decisions, debugging complex issues, and novel problem-solving.
```typescript
import Anthropic from "@anthropic-ai/sdk";

const client = new Anthropic();

const response = await client.messages.create({
  model: "claude-sonnet-4-6",
  max_tokens: 16000,
  thinking: {
    type: "enabled",
    budget_tokens: 10000 // Tokens allocated for thinking
  },
  messages: [
    { role: "user", content: "Solve this complex architecture problem..." }
  ]
});

// Access thinking content
for (const block of response.content) {
  if (block.type === "thinking") {
    console.log("Thinking:", block.thinking);
  } else if (block.type === "text") {
    console.log("Response:", block.text);
  }
}
```
| Budget (tokens) | Use Case |
|-----------------|----------|
| 2,000-5,000 | Simple analysis, code review |
| 5,000-10,000 | Architecture decisions, debugging |
| 10,000-20,000 | Complex multi-step reasoning |
| 20,000+ | Deep research, novel problem solving |
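Whatever budget you pick, the request still has to leave room for the visible answer: in the Messages API, `budget_tokens` must be at least 1,024 and smaller than `max_tokens`. The sketch below builds the two fields together so they stay consistent. `buildThinkingParams` and its `responseTokens` parameter are hypothetical names for illustration, not part of the SDK.

```typescript
// Minimum thinking budget accepted by the Messages API.
const MIN_BUDGET_TOKENS = 1024;

interface ThinkingParams {
  max_tokens: number;
  thinking: { type: "enabled"; budget_tokens: number };
}

// Hypothetical helper: pair a thinking budget with room for the answer.
function buildThinkingParams(
  budgetTokens: number,
  responseTokens: number
): ThinkingParams {
  if (!Number.isInteger(budgetTokens) || budgetTokens < MIN_BUDGET_TOKENS) {
    throw new RangeError(
      `budget_tokens must be an integer >= ${MIN_BUDGET_TOKENS}`
    );
  }
  if (!Number.isInteger(responseTokens) || responseTokens < 1) {
    throw new RangeError("responseTokens must be a positive integer");
  }
  return {
    // max_tokens has to cover the thinking budget plus the visible response.
    max_tokens: budgetTokens + responseTokens,
    thinking: { type: "enabled", budget_tokens: budgetTokens },
  };
}

// Same numbers as the request shown earlier: 10,000 for thinking, 6,000 for the answer.
const thinkingParams = buildThinkingParams(10000, 6000);
console.log(thinkingParams.max_tokens);
```

The returned object can be spread into the arguments of `client.messages.create` alongside `model` and `messages`.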
```typescript
const stream = await client.messages.stream({
  model: "claude-sonnet-4-6",
  max_tokens: 16000,
  thinking: {
    type: "enabled",
    budget_tokens: 10000
  },
  messages: [{ role: "user", content: "..." }]
});

for await (const event of stream) {
  if (event.type === "content_block_start") {
    if (event.content_block.type === "thinking") {
      console.log("--- Thinking started ---");
    }
  }
  if (event.type === "content_block_delta") {
    if (event.delta.type === "thinking_delta") {
      process.stdout.write(event.delta.thinking);
    } else if (event.delta.type === "text_delta") {
      process.stdout.write(event.delta.text);
    }
  }
}
```
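If you want the thinking and the answer as separate strings rather than interleaved on stdout, the same event handling can accumulate instead of print. The sketch below runs without network access on hand-written sample events. The `StreamEvent` types are simplified stand-ins covering only the fields used in the loop above, and `collectDeltas` is a hypothetical helper; real streams carry other event and delta types, which this sketch ignores.

```typescript
// Simplified event shapes: only the fields the streaming loop above reads.
type StreamDelta =
  | { type: "thinking_delta"; thinking: string }
  | { type: "text_delta"; text: string };

type StreamEvent =
  | { type: "content_block_start"; content_block: { type: string } }
  | { type: "content_block_delta"; delta: StreamDelta }
  | { type: "content_block_stop" };

// Hypothetical helper: gather thinking and text deltas into two strings.
function collectDeltas(events: StreamEvent[]): { thinking: string; text: string } {
  let thinking = "";
  let text = "";
  for (const event of events) {
    if (event.type !== "content_block_delta") continue;
    if (event.delta.type === "thinking_delta") {
      thinking += event.delta.thinking;
    } else if (event.delta.type === "text_delta") {
      text += event.delta.text;
    }
  }
  return { thinking, text };
}

// Hand-written sample events for illustration, not real API output.
const sampleEvents: StreamEvent[] = [
  { type: "content_block_start", content_block: { type: "thinking" } },
  { type: "content_block_delta", delta: { type: "thinking_delta", thinking: "Compare " } },
  { type: "content_block_delta", delta: { type: "thinking_delta", thinking: "both options." } },
  { type: "content_block_stop" },
  { type: "content_block_start", content_block: { type: "text" } },
  { type: "content_block_delta", delta: { type: "text_delta", text: "Use option A." } },
  { type: "content_block_stop" },
];

const collected = collectDeltas(sampleEvents);
console.log("Thinking:", collected.thinking);
console.log("Response:", collected.text);
```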
Within Claude Code sessions, extended thinking is managed automatically.
| Model | Extended Thinking |
|-------|-------------------|
| Claude Opus 4.6 | Full support |
| Claude Sonnet 4.6 | Full support |
| Claude Haiku 4.5 | Limited support |
- Use `/cost` to monitor thinking token usage.
- `/compact` preserves thinking conclusions but drops raw thinking.

```json
{
  "content": [
    {
      "type": "thinking",
      "thinking": "Let me analyze this step by step...\n1. First consideration...\n2. Second consideration..."
    },
    {
      "type": "text",
      "text": "Based on my analysis, here's what I recommend..."
    }
  ],
  "usage": {
    "input_tokens": 500,
    "output_tokens": 3000,
    "cache_creation_input_tokens": 0,
    "cache_read_input_tokens": 0
  }
}
```
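Downstream code usually wants only the final answer from a response shaped like the sample above. The sketch below shows one way to pull it out. The interfaces describe just the fields used here, `extractAnswer` is a hypothetical helper, and the sample object is an abbreviated copy of the response above. Thinking tokens are billed as output tokens, so they are counted in `usage.output_tokens`.

```typescript
// Shapes taken from the sample response above; only the fields used here.
interface ThinkingBlock {
  type: "thinking";
  thinking: string;
}

interface TextBlock {
  type: "text";
  text: string;
}

interface SampleResponse {
  content: Array<ThinkingBlock | TextBlock>;
  usage: { input_tokens: number; output_tokens: number };
}

// Hypothetical helper: return the visible answer, skipping thinking blocks.
function extractAnswer(response: SampleResponse): string {
  const parts: string[] = [];
  for (const block of response.content) {
    if (block.type === "text") {
      parts.push(block.text);
    }
  }
  return parts.join("");
}

// Abbreviated copy of the sample response above.
const sampleResponse: SampleResponse = {
  content: [
    { type: "thinking", thinking: "Let me analyze this step by step..." },
    { type: "text", text: "Based on my analysis, here's what I recommend..." },
  ],
  usage: { input_tokens: 500, output_tokens: 3000 },
};

console.log(extractAnswer(sampleResponse));
console.log("Output tokens (thinking included):", sampleResponse.usage.output_tokens);
```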