skills/agents-autogpt/SKILL.md
Use when the user wants to build autonomous AI agents using AutoGPT Platform, design visual workflow agents, or evaluate AutoGPT against other agent frameworks. Covers platform vs classic architecture decisions, block design, execution model tradeoffs, and production deployment. Do NOT use for: general agent architecture theory (use ai-agents-architect), LangChain agent patterns (use agents-llamaindex or langchain-agent), CrewAI multi-agent orchestration (use agents-crewai), or agent memory system design (use agent-memory-systems).
npx skillsauth add sharkitect-solutions/sharkitect-claude-toolkit autogpt-agentsInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
| File | Load When | Do NOT Load |
|---|---|---|
| framework-comparison.md | User is choosing between agent frameworks, evaluating AutoGPT vs alternatives, or asking "should I use AutoGPT or..." | User has already committed to AutoGPT and needs implementation help |
| platform-architecture-gotchas.md | User is deploying AutoGPT Platform, hitting infrastructure issues, or designing production agent systems | User is doing local development or prototyping |
| block-development-patterns.md | User is building custom blocks, extending AutoGPT functionality, or debugging block execution | User is only using built-in blocks via the visual builder |
| Topic | This Skill | Other Skill | |---|---|---| | AutoGPT Platform setup and deployment | YES | - | | AutoGPT block design and execution model | YES | - | | AutoGPT vs other agent frameworks decision | YES | - | | General autonomous agent architecture | Mention only | ai-agents-architect | | LangChain agent chains and tools | NO | agents-llamaindex, langchain-agent | | CrewAI role-based collaboration | NO | agents-crewai | | Agent memory and context management | NO | agent-memory-systems | | Agent evaluation and benchmarking | NO | agent-evaluation | | MCP server integration for agents | NO | mcp-integration | | Docker containerization general | NO | docker-expert | | FastAPI backend patterns | NO | fastapi-pro | | General workflow automation (n8n, Make) | NO | n8n-workflow-patterns, make-builder |
AutoGPT has TWO distinct systems with different architectures, licenses, and use cases. Confusing them is the most common mistake.
| Signal | Use Platform | Use Classic (Forge) | |---|---|---| | Need visual drag-and-drop agent builder | YES -- React frontend with node-based editor | NO | | Non-technical users will build agents | YES -- low-code visual builder | NO -- requires Python | | Need persistent scheduled/webhook-triggered agents | YES -- built-in trigger system with queue-based execution | Possible but manual | | Building custom agent logic with full code control | NO -- constrained to block system | YES -- full Python agent with Forge toolkit | | Need to deploy to production at scale | Platform -- but read gotchas | Classic -- more deployment flexibility | | Want to benchmark agent performance | Classic has agbenchmark with VCR cassettes | Platform has no built-in benchmark | | License matters (commercial use) | Polyform Shield -- restrictive, review before commercial use | MIT -- permissive |
The license trap: Platform and Classic have DIFFERENT licenses. Platform uses Polyform Shield License 1.0.0, which restricts competitive use. Classic uses MIT. Many teams discover this after building on Platform. Check before committing.
First-match decision -- use the FIRST row where your situation matches:
| Your Situation | Best Framework | Why NOT AutoGPT | |---|---|---| | Need a visual builder for non-developers | AutoGPT Platform | -- (AutoGPT IS the answer) | | Building agents that call tools in a chain with full code control | LangChain/LangGraph | AutoGPT's block system constrains tool chaining patterns. LangChain gives direct control over agent loops | | Multi-agent collaboration with role specialization | CrewAI | AutoGPT agents are solo graph executors. Multi-agent coordination requires custom orchestration on top | | Simple hosted agent with file search, code interpreter | OpenAI Assistants API | Zero infrastructure. If your agent fits Assistants' constraints, don't build infrastructure | | Microsoft ecosystem (Azure, M365, Dynamics) | Semantic Kernel | Native Azure integration. AutoGPT has no Microsoft-specific connectors | | Production agent with custom logic, no visual builder needed | LangGraph or custom | AutoGPT Platform adds infrastructure overhead (Postgres+Redis+RabbitMQ+Supabase) for a visual builder you won't use | | Prototyping quickly, may switch frameworks later | LangChain | Largest ecosystem, most examples, easiest to start and migrate away from |
AutoGPT Platform executes agents as directed acyclic graphs (DAGs) through a queue-based system:
Trigger -> REST API -> Graph Validator -> RabbitMQ Queue -> Executor -> Node-by-Node Block Execution -> Output
| Constraint | Impact | Workaround |
|---|---|---|
| DAG only -- no cycles | Cannot build agents that loop back to earlier steps (common in ReAct pattern) | Use nested agent blocks or external orchestration for iterative reasoning |
| Queue-based execution | Latency floor of 100-500ms per node even for trivial operations (RabbitMQ overhead) | Acceptable for batch/async agents. Fatal for real-time conversational agents |
| Node isolation | Each node executes independently. No shared memory between nodes in same execution | Pass all needed context through node connections. State bloats quickly |
| Single-tenant executor | Default executor processes one graph at a time per worker | Scale with docker compose up -d --scale executor=N but each executor needs its own resources |
| No streaming | Block outputs yield complete results, not token streams | Cannot build streaming chat interfaces natively. Need WebSocket workaround |
| Cost Component | Local Dev | Production (Small) | Production (Scale) | |---|---|---|---| | Infrastructure (Postgres+Redis+RabbitMQ) | $0 (Docker) | $50-150/mo (managed services) | $200-800/mo | | Supabase (auth) | Free tier | $25/mo | $25-200/mo | | Executor compute | Local CPU | 1-2 vCPU, 2-4GB RAM per executor | 2-4 vCPU, 4-8GB RAM x N executors | | LLM API calls | Per-node, per-execution | Track via cost_config in blocks | At 100 executions/day with 5 LLM nodes each: 500 API calls/day. Model choice matters |
The hidden cost: Every LLM block in the graph is a separate API call. A 10-node agent with 3 LLM blocks running 50 times/day = 150 API calls/day = $15-45/day with GPT-4o. Graph complexity directly multiplies LLM spend.
Blocks are AutoGPT's unit of functionality. Design decisions here determine agent quality.
| Principle | Rule | Common Violation |
|---|---|---|
| Single responsibility | One block does one thing. Combine via graph, not via block internals | "Swiss army knife" blocks that do fetching + parsing + LLM call + formatting |
| Yield early, yield often | Use yield "output_name", value for each distinct output as soon as available | Collecting all results then yielding once at the end (breaks streaming and partial execution) |
| Schema everything | Input and output schemas (Pydantic) must be explicit. No dict or Any types | Loose schemas that accept anything -- blocks break silently when upstream changes |
| Credential isolation | Never hardcode API keys. Use credentials_required and get_credentials() | Storing keys in block config or environment variables directly |
| Idempotent execution | Blocks may re-execute on failure. Side effects must be safe to repeat | Blocks that create resources without checking for existing ones (duplicate records, double-sends) |
| Cost awareness | Set cost_config for any block making external API calls | Users have no visibility into which blocks drive costs without explicit tracking |
| Name | What Happens | Why It Fails | |---|---|---| | Monolith Graph | 30+ node agent with complex branching doing everything in one graph | Impossible to debug, test, or modify. Single failure cascades. Split into nested agent blocks | | Visual-Only Thinking | Designing agents in the visual builder without considering execution model | Drag-and-drop hides queue latency, memory constraints, and DAG limitations. Design on paper first, then implement | | Platform for Chat | Using AutoGPT Platform for real-time conversational agents | Queue-based execution adds 100-500ms per node. No native streaming. Use OpenAI Assistants or LangChain for chat | | License Blindspot | Building commercial product on Platform without reading Polyform Shield | Polyform Shield restricts competitive use. Discovered post-launch = rewrite or legal exposure | | Fork-and-Forget | Forking AutoGPT repo to customize, then never syncing upstream | AutoGPT ships breaking changes frequently (Prisma schema migrations, block API changes). Custom forks diverge quickly and become unmaintainable | | Infrastructure Overkill | Deploying full Platform stack (Postgres+Redis+RabbitMQ+Supabase) for a simple agent | If you don't need the visual builder or multi-tenant execution, use LangChain/LangGraph with 90% less infrastructure |
development
When the user wants help with paid advertising campaigns on Google Ads, Meta (Facebook/Instagram), LinkedIn, Twitter/X, or other ad platforms. Also use when the user mentions 'PPC,' 'paid media,' 'ad copy,' 'ad creative,' 'ROAS,' 'CPA,' 'ad campaign,' 'retargeting,' or 'audience targeting.' This skill covers campaign strategy, ad creation, audience targeting, and optimization.
testing
--- name: using-sharkitect-methodology description: Use when starting any conversation in a Sharkitect workspace OR before any task involving NEW pricing, positioning, proposal, strategy, plan-execution, or schema-design work — mandates invocation of Sharkitect-specific methodology skills (pricing-strategy, marketing-strategy-pmm, smb-cfo, hq-revenue-ops, executing-plans, brainstorming) under the same anti-rationalization discipline as using-superpowers. Documentation has failed 4 times across H
testing
Use when user says 'end session', 'wrap up', 'stop for the day', 'done for today', 'close out', 'save session', 'wrapping up', or invokes /end-session. Runs the full 9-step end-of-session protocol: resource audit, MEMORY.md update, lessons capture, plan status, pending items, workspace checklist, .tmp/ audit, git commit+push, Supabase brain sync, session brief, summary. Final step schedules a detached self-kill of the current session ONLY (3s delay) so the window closes cleanly. Other claude.exe processes (active workspaces) are NOT touched -- orphan cleanup is handled separately by Claude-Orphan-Cleanup-Hourly with proper age safeguards. Do NOT use for: mid-session quick saves (use session-checkpoint), skill syncing (use sync-skills.py), brain memory queries (use supabase-sync.py pull), document freshness reviews (use document-lifecycle), resource gap detection (use resource-auditor).
testing
Remove signs of AI-generated writing from text. Use when editing or reviewing text to make it sound more natural and human-written. Based on Wikipedia's comprehensive "Signs of AI writing" guide. Detects and fixes patterns including: inflated symbolism, promotional language, superficial -ing analyses, vague attributions, em dash overuse, rule of three, AI vocabulary words, passive voice, negative parallelisms, and filler phrases.