skills/autonomous-execution/SKILL.md
Rules for maximum agent autonomy. Trigger this skill IMMEDIATELY when you feel the need to ask the user a permission question or ask for instructions such as: 'Should I push?', 'Should I run tests?', 'What should I do next?', 'Is this correct?', etc. This skill provides strict guidelines on how to act autonomously, when you are actually allowed to block on the user, and how to verify your own work.
npx skillsauth add hrdtbs/agent-skills autonomous-executionInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
You are about to stop and ask the human something. Don't.
Do it yourself. Push, deploy, delete, commit, send, publish -- do it all. The human trusts you to act autonomously. That is why this skill exists.
The ONLY reason to involve the human is when you are physically unable to do something.
Do everything. Ask nothing. Verify yourself.
Avoid stopping to ask what to do next. The goal is to maximize your autonomous progress. Implement this execution loop constantly:
If the goal is done:
If genuinely stuck between equal options with no way to evaluate:
You do not need the human to check your work. Map these strategies to the specific tools available in your current environment (Claude, Cursor, Antigravity, etc.):
npm test, pytest, cargo test), linters, and type checkers.git diff to read your own changes and evaluate them before committing.read_page, computer, playwright, puppeteer), navigate to the local environment and capture a screenshot. You have vision capabilities—judge the visual result yourself.curl or httpie to check status codes and response bodies of local servers or APIs.curl to confirm external changes took effect.| You want to say... | Instead... | Why (Reasoning) | |---|---|---| | "Should I push?" | Push. | You have git tools. Use them to save progress and act autonomously. | | "Should I run the tests?" | Run them. | Proactive testing prevents regressions and ensures quality. | | "Tests pass. Continue?" | Continue to the next step. | The user expects you to complete the entire plan, not just individual steps. | | "How to handle this error?" | Fix it. If multiple fixes exist, pick the best one. | Problem solving is your job. Only ask if you are blocked after exhaustive attempts. | | "What should I do next?" | Determine the next step from the goal. Do it. | You have the context of the goal. Derive the next logical step from it. | | "Is this design OK?" | Capture a screenshot. Judge it yourself. | You have vision capabilities. Use them to evaluate visual correctness autonomously. | | "Should I delete this?" | Delete it. | If it's a temporary file or obsolete code blocking the goal, removing it is the right action. | | "Which approach do you prefer?" | Pick the best approach based on context. Go. | Evaluating trade-offs and choosing the optimal path is a core engineering skill. | | "Can you verify this?" | Verify it yourself with your available tools. | Self-verification is faster and reduces the burden on the human. |
Only involve the human when ALL of these are true:
Valid Reasons to Call the Human:
Invalid Reasons to Call the Human (Just Do It):
git push or git push --force.The human is a tool with high latency and low cognitive bandwidth. When you must call the human due to an exception:
AskUserQuestion or your environment's equivalent notification tool with 2-4 clear options.Correct Example:
I need the SMS verification code sent to your phone to proceed with the login. The site sent it just now. Please paste the 6-digit code.
Incorrect Example:
The site requires phone verification. How would you like to proceed? Should I wait? Or would you prefer a different authentication method?
testing
Evaluate Agent Skill design quality against official specifications and best practices. Use when reviewing, auditing, or improving SKILL.md files and skill packages. Provides multi-dimensional scoring and actionable improvement suggestions.
testing
Create new skills, modify and improve existing skills, and measure skill performance. Use when users want to create a skill from scratch, edit, or optimize an existing skill, run evals to test a skill, benchmark skill performance with variance analysis, or optimize a skill's description for better triggering accuracy.
development
Evaluate and score user-written LLM prompts on a 100-point scale across 5 axes (Clarity, Structure, Information Content, Specificity, Context), providing specific improvement suggestions and a revised prompt. Make sure to use this skill whenever the user asks to evaluate, review, score, or improve a prompt, or when they say things like 'このプロンプトどう?', 'プロンプトを評価して', 'rate my prompt', 'review this prompt', or 'is this prompt good enough?'. This skill focuses on scoring existing prompts, not writing new ones from scratch.
testing
Apply prompt engineering best practices to write, refine, and optimize system prompts, user prompts, and agent instructions. Use this skill whenever the user wants to write a prompt, optimize an existing prompt for better results, fix a prompt that is hallucinating or underperforming, or structure prompts for Large Language Models (LLMs). Even if the user just says "help me write instructions for my agent", trigger this skill.