skills/ai-native-dev/SKILL.md
Comprehensive AI-Native system development and deployment planning through interactive Q&A. Use when users want to design AI-powered applications with AI Agents, plan microservice architectures, define APIs, select technology stacks, design agent workflows, and plan Kubernetes deployments. Guides users through structured decision-making from system design to deployment documentation.
npx skillsauth add alijilani-dev/claude ai-native-devInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
This skill provides a structured workflow for guiding users through the complete development lifecycle of AI-Native systems—from architecture design through Kubernetes deployment—via interactive Q&A. It ensures consistent decision-making at every stage.
Trigger conditions:
| Principle | Decision | Rationale | |-----------|----------|-----------| | Agent Autonomy | Single-responsibility agents | Each agent has one clear purpose; easier to test and scale | | LLM Abstraction | Provider-agnostic interface | Swap LLM providers without code changes | | Communication | Async-first, sync when needed | AI operations are inherently slow; async prevents blocking | | State Management | Stateless agents, external state | Agents don't hold state; enables horizontal scaling | | Error Handling | Graceful degradation | AI failures shouldn't crash the system | | Observability | Trace every LLM call | Debug AI behavior with full request/response logging | | Security | Secrets in vault, never in code | API keys for LLMs are high-value targets | | K8s Services | ClusterIP for internal traffic | No external exposure for service-to-service | | K8s RBAC | Namespace-scoped, least privilege | Minimize blast-radius |
┌─────────────────────────────────────────────────────────────────────────┐
│ AI-NATIVE SYSTEM DEVELOPMENT │
├─────────────────────────────────────────────────────────────────────────┤
│ PHASE 1: SYSTEM DESIGN │
│ ┌──────────────┐ ┌──────────────┐ ┌──────────────┐ │
│ │ Stage 1: │ │ Stage 2: │ │ Stage 3: │ │
│ │ System │─►│ AI Agent │─►│ API & │ │
│ │ Discovery │ │ Design │ │ Endpoints │ │
│ └──────────────┘ └──────────────┘ └──────────────┘ │
├─────────────────────────────────────────────────────────────────────────┤
│ PHASE 2: TECHNOLOGY & INTEGRATION │
│ ┌──────────────┐ ┌──────────────┐ │
│ │ Stage 4: │ │ Stage 5: │ │
│ │ Tech Stack │─►│ Integration │ │
│ │ Selection │ │ Patterns │ │
│ └──────────────┘ └──────────────┘ │
├─────────────────────────────────────────────────────────────────────────┤
│ PHASE 3: KUBERNETES DEPLOYMENT │
│ ┌──────────────┐ ┌──────────────┐ ┌──────────────┐ │
│ │ Stage 6: │ │ Stage 7: │ │ Stage 8: │ │
│ │ Manifest │─►│ Config & │─►│ Networking │ │
│ │ Planning │ │ Secrets │ │ & RBAC │ │
│ └──────────────┘ └──────────────┘ └──────────────┘ │
└─────────────────────────────────────────────────────────────────────────┘
Understand the overall system purpose, components, and user interactions.
System Purpose
Component Identification
User Interaction Model
External Dependencies
Data Flow
| Aspect | Description |
|--------|-------------|
| System Name | |
| Purpose | |
| Primary Users | |
| Interaction Model | |
| Component Count | |
| AI-Powered Components | |
| External Dependencies | |
| Component | Type | AI-Powered | Description |
|-----------|------|------------|-------------|
| | Frontend / Backend / Agent / Service | Yes/No | |
Design AI agents with clear responsibilities, capabilities, and interaction patterns.
| Agent Type | Characteristics | Use Case | |------------|-----------------|----------| | Conversational Agent | Handles natural language dialogue | Chat interfaces, Q&A systems | | Task Agent | Executes specific tasks autonomously | Automation, workflow execution | | Orchestrator Agent | Coordinates multiple agents | Complex multi-step processes | | Retrieval Agent | Fetches and synthesizes information | RAG systems, knowledge bases | | Tool-Using Agent | Calls external tools/APIs | Function calling, integrations |
For each AI agent identified:
Agent Identity
LLM Requirements
Agent Capabilities
Agent Boundaries
Agent Communication
| Agent Name | Type | LLM Provider | Model | Latency Req | Description |
|------------|------|--------------|-------|-------------|-------------|
| Agent Name | Capabilities (Actions) | Tools/Functions | Input Format | Output Format |
|------------|----------------------|-----------------|--------------|---------------|
| Agent | Communicates With | Communication Type | Purpose |
|-------|-------------------|-------------------|---------|
| | User / Agent / Service | Sync / Async / Stream | |
| Principle | Implementation | |-----------|----------------| | Single Responsibility | Each agent has ONE primary purpose | | Stateless Design | No in-memory state; use external storage | | Graceful Degradation | Return meaningful errors, never crash | | Timeout Handling | All LLM calls have timeouts (default: 30s) | | Retry Logic | Exponential backoff for transient failures | | Token Budgeting | Set max_tokens to control costs | | Prompt Versioning | Version control all system prompts |
Define all APIs and endpoints for both traditional services and AI agents.
| Component Type | Endpoint Pattern | Example |
|----------------|------------------|---------|
| Frontend | Static + Proxy | /, /assets/*, /api/* (proxy) |
| Backend REST | Resource-based | /api/v1/tasks, /api/v1/tasks/{id} |
| AI Agent (Sync) | Action-based | /agent/chat, /agent/analyze |
| AI Agent (Stream) | SSE endpoint | /agent/chat/stream |
| Notifications | WebSocket/SSE | /notifications/subscribe |
| Health Checks | Standard paths | /health, /ready, /live |
For each component:
Endpoint Inventory
Request/Response
Authentication
Rate Limiting
| Component | Endpoint | Method | Description | Auth Required |
|-----------|----------|--------|-------------|---------------|
| Agent | Endpoint | Method | Input | Output | Streaming | Timeout |
|-------|----------|--------|-------|--------|-----------|---------|
Select appropriate technologies with consistent decision-making rationale.
| Layer | Options | Decision Factors | |-------|---------|------------------| | Frontend | React, Vue, Next.js, Static | Interactivity needs, SEO, team expertise | | Backend | Node.js, Python, Go, Java | Performance, ecosystem, team expertise | | AI/Agent Framework | LangChain, LlamaIndex, Custom, Semantic Kernel | Complexity, flexibility, vendor lock-in | | LLM Provider | OpenAI, Anthropic, Azure OpenAI, Local | Cost, latency, compliance, capabilities | | Message Queue | None, Redis, RabbitMQ, Kafka | Scale, ordering needs, persistence | | Database | None, PostgreSQL, MongoDB, Redis | Data model, query patterns, scale | | Cache | None, Redis, Memcached | Read patterns, invalidation needs |
Language & Runtime
AI Infrastructure
Data Layer
Constraints
| Layer | Technology | Justification |
|-------|------------|---------------|
| Frontend | | |
| Backend Services | | |
| AI Agent Framework | | |
| LLM Provider | | |
| Database | | |
| Cache | | |
| Message Queue | | |
Define how components communicate and integrate.
| Pattern | When to Use | Implementation | |---------|-------------|----------------| | Sync REST | Simple request-response | HTTP client with timeout | | Async Queue | Fire-and-forget, long processing | Message queue (Redis, RabbitMQ) | | Streaming (SSE) | Real-time AI responses | Server-Sent Events | | WebSocket | Bidirectional real-time | WS connection with heartbeat | | Event-Driven | Loose coupling, multiple consumers | Pub/Sub pattern |
| Pattern | Description | Use Case | |---------|-------------|----------| | Request-Response | Sync call to LLM, wait for response | Simple chat, single-turn | | Streaming Response | Token-by-token streaming | Chat UX, long responses | | Tool Calling | LLM calls functions, agent executes | Actions, integrations | | Agent Chaining | Output of one agent feeds another | Complex workflows | | Human-in-the-Loop | Agent requests human approval | High-stakes decisions |
Service Communication
AI Integration
Event Handling
| From | To | Pattern | Protocol | Purpose |
|------|-----|---------|----------|---------|
| Flow Name | Steps | Pattern | Description |
|-----------|-------|---------|-------------|
| User Chat | UI → Agent → LLM → Agent → UI | Stream | User sends message, gets streamed response |
Determine required Kubernetes resources based on system design.
| Component Characteristic | Manifest Type | Rationale | |--------------------------|---------------|-----------| | Stateless service | Deployment | Rolling updates, scaling | | Stateful service (DB) | StatefulSet | Stable identity, ordered | | Background processor | Deployment + HPA | Scale based on queue depth | | Scheduled job | CronJob | Periodic execution | | Per-node requirement | DaemonSet | Logging, monitoring agents |
Workload Types
Namespace Strategy
Resource Requirements
| Category | Manifest Type | Count | Names |
|----------|--------------|-------|-------|
| Workloads | Deployment | | |
| Networking | Service | | |
| Networking | Ingress | | |
| Networking | NetworkPolicy | | |
| Config | ConfigMap | | |
| Config | Secret | | |
| RBAC | ServiceAccount | | |
| RBAC | Role | | |
| RBAC | RoleBinding | | |
| Deployment Name | Replicas | Purpose |
|-----------------|----------|---------|
Design ConfigMaps and Secrets with security best practices.
| Data Type | Resource | Security Level | |-----------|----------|----------------| | Service URLs | ConfigMap | Public | | Feature flags | ConfigMap | Public | | Log levels | ConfigMap | Public | | LLM API keys | Secret | Critical | | Database credentials | Secret | Critical | | JWT signing keys | Secret | Critical | | TLS certificates | Secret (TLS type) | High |
| Config Item | Resource | Description |
|-------------|----------|-------------|
| LLM_API_KEY | Secret | LLM provider API key |
| LLM_ORG_ID | Secret | LLM organization ID |
| LLM_MODEL_NAME | ConfigMap | Model identifier (not sensitive) |
| LLM_TIMEOUT_MS | ConfigMap | Request timeout |
| LLM_MAX_TOKENS | ConfigMap | Token limit per request |
| LLM_TEMPERATURE | ConfigMap | Model temperature setting |
| AGENT_SYSTEM_PROMPT | ConfigMap | Agent's system prompt (version controlled) |
Per-Component Config
Secrets Inventory
Security Approach
| ConfigMap Name | Environment Variables | Justification |
|----------------|----------------------|---------------|
| Secret Name | Data Type | Keys | Security Approach | Justification |
|-------------|-----------|------|-------------------|---------------|
Design secure networking and minimal-privilege access control.
| Scenario | Service Type | Rationale | |----------|--------------|-----------| | Internal communication | ClusterIP | Default; no external exposure | | External access | ClusterIP + Ingress | Centralized TLS, routing | | Development/testing | NodePort | Simple but not for production |
Consistency Rule: Default to ClusterIP for ALL services. Use single Ingress for external access.
| Policy Type | Purpose | |-------------|---------| | Default Deny | Block all traffic by default | | Allow Ingress | Whitelist specific ingress sources | | Allow Egress | Whitelist specific egress destinations | | LLM Egress | Allow agents to reach external LLM APIs |
| Scope | When to Use | |-------|-------------| | Namespace (Role) | Default; covers 99% of cases | | Cluster (ClusterRole) | Only for cluster-wide resources |
Service Exposure
Network Policies
RBAC Needs
| Service Name | Type | Justification |
|--------------|------|---------------|
| Host | Path | Backend Service | Description |
|------|------|-----------------|-------------|
| Policy Name | Pod Selector | Ingress From | Egress To | Justification |
|-------------|--------------|--------------|-----------|---------------|
| Type | Name | Scope | Justification |
|------|------|-------|---------------|
After completing all 8 stages, generate a comprehensive document with:
# [System Name] - Development & Deployment Plan
## 1. Design Principles
[Consistency guidelines table]
## 2. System Architecture
### 2.1 System Overview
### 2.2 Component Registry
## 3. AI Agent Design
### 3.1 Agent Registry
### 3.2 Agent Capabilities
### 3.3 Agent Communication Matrix
## 4. API & Endpoints
### 4.1 Endpoint Registry
### 4.2 AI Agent Endpoints
## 5. Technology Stack
[Technology decisions with justifications]
## 6. Integration Patterns
### 6.1 Communication Matrix
### 6.2 Data Flow Diagrams
## 7. Kubernetes Deployment
### 7.1 Manifest Inventory
### 7.2 Deployments
### 7.3 ConfigMaps
### 7.4 Secrets
### 7.5 Services
### 7.6 Ingress
### 7.7 Network Policies
### 7.8 RBAC
## 8. Architecture Diagram
[ASCII/Text diagram]
Before finalizing, verify these consistency rules are applied:
| Category | Check | |----------|-------| | Agents | Each agent has single responsibility? | | Agents | All agents are stateless? | | Agents | LLM calls have timeouts? | | APIs | Health endpoints on all components? | | APIs | Consistent naming convention? | | Services | All internal services use ClusterIP? | | Services | Single Ingress for external access? | | Secrets | No secrets in ConfigMaps? | | Secrets | LLM API keys in Secrets? | | RBAC | Namespace-scoped (not cluster)? | | RBAC | Each component has own ServiceAccount? | | Network | Default deny with explicit allows? | | Network | LLM egress explicitly allowed for agents? |
User: I want to build an AI-powered task manager with a chat interface.
Assistant: I'll help you design and plan your AI-Native task manager. Let's go through the 8 stages systematically.
Stage 1: System Discovery
Let me understand your system:
[User responds, continue through all 8 stages...]
This skill extends and incorporates:
k8s-planning - Kubernetes deployment planning (Stages 6-8)data-ai
Orchestrate complex tasks by delegating work to parallel subagent teams, preserving the main context window and preventing auto-compact. This skill should be used when users ask to apply subagent-teams, when performing complex multi-step tasks, when context window is getting large, or when independent subtasks can run in parallel.
development
Generate new Claude Code skills with proper structure and standards. Use when the user requests skill creation, wants to generate a new skill, or mentions creating custom Claude Code functionality. Activated by phrases like "create a skill", "generate a skill", "make a new skill", or "build a skill for".
testing
Generate comprehensive educational quizzes based on Bloom's Taxonomy methodology (Remember, Understand, Apply, Analyze, Evaluate, Create). Creates structured True/False quizzes with detailed answer keys and explanations. Use when user requests quiz generation, assessment creation, test materials, practice questions, mentions Bloom's Taxonomy, or provides educational topics for quiz creation. Activates for study topics, course materials, reference files (.md, .txt, .pdf), or educational content requiring systematic assessment.
content-media
Generate comprehensive educational notes using Bloom's Taxonomy methodology. Creates structured learning materials with summaries, practice questions, and visual diagrams. Use when user requests notes generation, study materials, learning resources, mentions Bloom's Taxonomy, or provides topics for educational note-taking. Activates for .md files, study topics, course materials, or educational content creation.