Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

CenredJun/ai-engineer

Name: ai-engineer
Author: CenredJun

skills/ai-engineer/SKILL.md

npx skillsauth add CenredJun/openclaw-claudecode-setup-kit ai-engineer

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

AI Engineer

Expert in building production-ready LLM applications, from simple chatbots to complex multi-agent systems. Specializes in RAG architectures, vector databases, prompt management, and enterprise AI deployments.

Quick Start

User: "Build a customer support chatbot with our product documentation"

AI Engineer:
1. Design RAG architecture (chunking, embedding, retrieval)
2. Set up vector database (Pinecone/Weaviate/Chroma)
3. Implement retrieval pipeline with reranking
4. Build conversation management with context
5. Add guardrails and fallback handling
6. Deploy with monitoring and observability

Result: Production-ready AI chatbot in days, not weeks

Core Competencies

1. RAG System Design

| Component | Implementation | Best Practices | |-----------|---------------|----------------| | Chunking | Semantic, token-based, hierarchical | 512-1024 tokens, overlap 10-20% | | Embedding | OpenAI, Cohere, local models | Match model to domain | | Vector DB | Pinecone, Weaviate, Chroma, Qdrant | Index by use case | | Retrieval | Dense, sparse, hybrid | Start hybrid, tune | | Reranking | Cross-encoder, Cohere Rerank | Always rerank top-k |

2. LLM Application Patterns

Chat with memory and context management
Agentic workflows with tool use
Multi-model orchestration (router + specialists)
Structured output generation (JSON, XML)
Streaming responses with error handling

3. Production Operations

Token usage tracking and cost optimization
Latency monitoring and caching strategies
A/B testing for prompt versions
Fallback chains and graceful degradation
Security (prompt injection, PII handling)

Architecture Patterns

Basic RAG Pipeline

// Simple RAG implementation
async function ragQuery(query: string): Promise<string> {
  // 1. Embed the query
  const queryEmbedding = await embed(query);

  // 2. Retrieve relevant chunks
  const chunks = await vectorDb.query({
    vector: queryEmbedding,
    topK: 10,
    includeMetadata: true
  });

  // 3. Rerank for relevance
  const reranked = await reranker.rank(query, chunks);
  const topChunks = reranked.slice(0, 5);

  // 4. Generate response with context
  const response = await llm.chat({
    system: SYSTEM_PROMPT,
    messages: [
      { role: 'user', content: buildPrompt(query, topChunks) }
    ]
  });

  return response.content;
}

Agent Architecture

// Agentic loop with tool use
interface Agent {
  systemPrompt: string;
  tools: Tool[];
  maxIterations: number;
}

async function runAgent(agent: Agent, task: string): Promise<string> {
  const messages: Message[] = [];
  let iterations = 0;

  while (iterations < agent.maxIterations) {
    const response = await llm.chat({
      system: agent.systemPrompt,
      messages: [...messages, { role: 'user', content: task }],
      tools: agent.tools
    });

    if (!response.toolCalls) {
      return response.content; // Final answer
    }

    // Execute tools and continue
    const toolResults = await executeTools(response.toolCalls);
    messages.push({ role: 'assistant', content: response });
    messages.push({ role: 'tool', content: toolResults });
    iterations++;
  }

  throw new Error('Max iterations exceeded');
}

Multi-Model Router

// Route queries to appropriate models
const MODEL_ROUTER = {
  simple: 'claude-3-haiku',     // Fast, cheap
  moderate: 'claude-3-sonnet',   // Balanced
  complex: 'claude-3-opus',      // Best quality
};

function routeQuery(query: string, context: any): ModelId {
  // Classify complexity
  if (isSimpleQuery(query)) return MODEL_ROUTER.simple;
  if (requiresReasoning(query, context)) return MODEL_ROUTER.complex;
  return MODEL_ROUTER.moderate;
}

Implementation Checklist

RAG System

[ ] Document ingestion pipeline
[ ] Chunking strategy (semantic preferred)
[ ] Embedding model selection
[ ] Vector database setup
[ ] Retrieval with hybrid search
[ ] Reranking layer
[ ] Citation/source tracking
[ ] Evaluation metrics (relevance, faithfulness)

Production Readiness

[ ] Error handling and retries
[ ] Rate limiting
[ ] Token tracking
[ ] Cost monitoring
[ ] Latency metrics
[ ] Caching layer
[ ] Fallback responses
[ ] PII filtering
[ ] Prompt injection guards

Observability

[ ] Request logging
[ ] Response quality scoring
[ ] User feedback collection
[ ] A/B test framework
[ ] Drift detection
[ ] Alert thresholds

Anti-Patterns

Anti-Pattern: RAG Everything

What it looks like: Using RAG for every query Why wrong: Adds latency, cost, and complexity when unnecessary Instead: Classify queries, use RAG only when context needed

Anti-Pattern: Chunking by Character

What it looks like: text.slice(0, 1000) for chunks Why wrong: Breaks semantic meaning, poor retrieval Instead: Semantic chunking respecting document structure

Anti-Pattern: No Reranking

What it looks like: Using raw vector similarity as final ranking Why wrong: Embedding similarity != relevance for query Instead: Always add cross-encoder reranking

Anti-Pattern: Unbounded Context

What it looks like: Stuffing all retrieved chunks into prompt Why wrong: Dilutes relevance, wastes tokens, confuses model Instead: Top 3-5 chunks after reranking, dynamic selection

Anti-Pattern: No Guardrails

What it looks like: Direct user input to LLM Why wrong: Prompt injection, toxic outputs, off-topic responses Instead: Input validation, output filtering, topic guardrails

Technology Stack

Vector Databases

| Database | Best For | Notes | |----------|----------|-------| | Pinecone | Production, scale | Managed, fast | | Weaviate | Hybrid search | GraphQL, modules | | Chroma | Development, local | Embedded, simple | | Qdrant | Self-hosted, filters | Rust, performant | | pgvector | Existing Postgres | Easy integration |

LLM Frameworks

| Framework | Best For | Notes | |-----------|----------|-------| | LangChain | Prototyping | Many integrations | | LlamaIndex | RAG focus | Document handling | | Vercel AI SDK | Streaming, React | Edge-ready | | Anthropic SDK | Direct API | Full control |

Embedding Models

| Model | Dimensions | Notes | |-------|------------|-------| | text-embedding-3-large | 3072 | Best quality | | text-embedding-3-small | 1536 | Cost-effective | | voyage-2 | 1024 | Code, technical | | bge-large | 1024 | Open source |

When to Use

Use for:

Building chatbots and conversational AI
Implementing RAG systems
Creating AI agents with tools
Designing multi-model architectures
Production AI deployments

Do NOT use for:

Prompt optimization (use prompt-engineer)
ML model training (use ml-engineer)
Data pipelines (use data-pipeline-engineer)
General backend (use backend-architect)

Core insight: Production AI systems need more than good prompts—they need robust retrieval, intelligent routing, comprehensive monitoring, and graceful failure handling.

Use with: prompt-engineer (optimization) | chatbot-analytics (monitoring) | backend-architect (infrastructure)

CenredJun/ai-engineer

skills/ai-engineer/SKILL.md

Build production-ready LLM applications, advanced RAG systems, and intelligent agents. Implements vector search, multimodal AI, agent orchestration, and enterprise AI integrations. Use PROACTIVELY for LLM features, chatbots, AI agents, or AI-powered applications.

development

Updated Apr 16, 2026

$ install --global

skillsauth

npx skillsauth add CenredJun/openclaw-claudecode-setup-kit ai-engineer

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 24, 2026, 8:54 PM1.8s1 file scanned

SKILL.md

name:: ai-engineer
description:: Build production-ready LLM applications, advanced RAG systems, and intelligent agents. Implements vector search, multimodal AI, agent orchestration, and enterprise AI integrations. Use PROACTIVELY
allowed-tools:: Read,Write,Edit,Glob,Grep,Bash,WebFetch,mcp__SequentialThinking__sequentialthinking
category:: AI & Machine Learning
- skill:: backend-architect
reason:: Design scalable AI service architecture

AI Engineer

Quick Start

User: "Build a customer support chatbot with our product documentation"

AI Engineer:
1. Design RAG architecture (chunking, embedding, retrieval)
2. Set up vector database (Pinecone/Weaviate/Chroma)
3. Implement retrieval pipeline with reranking
4. Build conversation management with context
5. Add guardrails and fallback handling
6. Deploy with monitoring and observability

Result: Production-ready AI chatbot in days, not weeks

Core Competencies

1. RAG System Design

2. LLM Application Patterns

Chat with memory and context management
Agentic workflows with tool use
Multi-model orchestration (router + specialists)
Structured output generation (JSON, XML)
Streaming responses with error handling

3. Production Operations

Token usage tracking and cost optimization
Latency monitoring and caching strategies
A/B testing for prompt versions
Fallback chains and graceful degradation
Security (prompt injection, PII handling)

Architecture Patterns

Basic RAG Pipeline

// Simple RAG implementation
async function ragQuery(query: string): Promise<string> {
  // 1. Embed the query
  const queryEmbedding = await embed(query);

  // 2. Retrieve relevant chunks
  const chunks = await vectorDb.query({
    vector: queryEmbedding,
    topK: 10,
    includeMetadata: true
  });

  // 3. Rerank for relevance
  const reranked = await reranker.rank(query, chunks);
  const topChunks = reranked.slice(0, 5);

  // 4. Generate response with context
  const response = await llm.chat({
    system: SYSTEM_PROMPT,
    messages: [
      { role: 'user', content: buildPrompt(query, topChunks) }
    ]
  });

  return response.content;
}

Agent Architecture

// Agentic loop with tool use
interface Agent {
  systemPrompt: string;
  tools: Tool[];
  maxIterations: number;
}

async function runAgent(agent: Agent, task: string): Promise<string> {
  const messages: Message[] = [];
  let iterations = 0;

  while (iterations < agent.maxIterations) {
    const response = await llm.chat({
      system: agent.systemPrompt,
      messages: [...messages, { role: 'user', content: task }],
      tools: agent.tools
    });

    if (!response.toolCalls) {
      return response.content; // Final answer
    }

    // Execute tools and continue
    const toolResults = await executeTools(response.toolCalls);
    messages.push({ role: 'assistant', content: response });
    messages.push({ role: 'tool', content: toolResults });
    iterations++;
  }

  throw new Error('Max iterations exceeded');
}

Multi-Model Router

// Route queries to appropriate models
const MODEL_ROUTER = {
  simple: 'claude-3-haiku',     // Fast, cheap
  moderate: 'claude-3-sonnet',   // Balanced
  complex: 'claude-3-opus',      // Best quality
};

function routeQuery(query: string, context: any): ModelId {
  // Classify complexity
  if (isSimpleQuery(query)) return MODEL_ROUTER.simple;
  if (requiresReasoning(query, context)) return MODEL_ROUTER.complex;
  return MODEL_ROUTER.moderate;
}

Implementation Checklist

RAG System

[ ] Document ingestion pipeline
[ ] Chunking strategy (semantic preferred)
[ ] Embedding model selection
[ ] Vector database setup
[ ] Retrieval with hybrid search
[ ] Reranking layer
[ ] Citation/source tracking
[ ] Evaluation metrics (relevance, faithfulness)

Production Readiness

[ ] Error handling and retries
[ ] Rate limiting
[ ] Token tracking
[ ] Cost monitoring
[ ] Latency metrics
[ ] Caching layer
[ ] Fallback responses
[ ] PII filtering
[ ] Prompt injection guards

Observability

[ ] Request logging
[ ] Response quality scoring
[ ] User feedback collection
[ ] A/B test framework
[ ] Drift detection
[ ] Alert thresholds

Anti-Patterns

Anti-Pattern: RAG Everything

What it looks like: Using RAG for every query Why wrong: Adds latency, cost, and complexity when unnecessary Instead: Classify queries, use RAG only when context needed

Anti-Pattern: Chunking by Character

What it looks like: text.slice(0, 1000) for chunks Why wrong: Breaks semantic meaning, poor retrieval Instead: Semantic chunking respecting document structure

Anti-Pattern: No Reranking

What it looks like: Using raw vector similarity as final ranking Why wrong: Embedding similarity != relevance for query Instead: Always add cross-encoder reranking

Anti-Pattern: Unbounded Context

What it looks like: Stuffing all retrieved chunks into prompt Why wrong: Dilutes relevance, wastes tokens, confuses model Instead: Top 3-5 chunks after reranking, dynamic selection

Anti-Pattern: No Guardrails

What it looks like: Direct user input to LLM Why wrong: Prompt injection, toxic outputs, off-topic responses Instead: Input validation, output filtering, topic guardrails

Technology Stack

Vector Databases

LLM Frameworks

Embedding Models

When to Use

Use for:

Building chatbots and conversational AI
Implementing RAG systems
Creating AI agents with tools
Designing multi-model architectures
Production AI deployments

Do NOT use for:

Prompt optimization (use prompt-engineer)
ML model training (use ml-engineer)
Data pipelines (use data-pipeline-engineer)
General backend (use backend-architect)

Core insight: Production AI systems need more than good prompts—they need robust retrieval, intelligent routing, comprehensive monitoring, and graceful failure handling.

Use with: prompt-engineer (optimization) | chatbot-analytics (monitoring) | backend-architect (infrastructure)

Related Skills

CenredJun/deep-research

development

VerifiedTrustedCommunity

Execute autonomous multi-step research using Google Gemini Deep Research Agent. Use for: market analysis, competitive landscaping, literature reviews, technical research, due diligence. Takes 2-10 ...

SKILL.mdUpdated Apr 16, 2026

CenredJun/deep-research

CenredJun/cost-optimizer

testing

VerifiedTrustedCommunity

Tracks cumulative LLM costs across DAG execution and makes real-time decisions to stay within budget. Downgrades models, skips optional nodes, or stops early when cost exceeds thresholds. Use when managing execution budgets, analyzing cost breakdowns, or optimizing model routing for cost. Activate on "cost budget", "too expensive", "reduce cost", "cost optimization", "model downgrade", "budget exceeded". NOT for LLM model selection logic (use llm-router), pricing comparisons across providers, or billing/invoicing.

SKILL.mdUpdated Apr 16, 2026

CenredJun/cost-optimizer

CenredJun/copywriting

development

VerifiedTrustedCommunity

When the user wants to write, rewrite, or improve marketing copy for any page — including homepage, landing pages, pricing pages, feature pages, about pages, or product pages. Also use when the user says "write copy for," "improve this copy," "rewrite this page," "marketing copy," "headline help," "CTA copy," "value proposition," "tagline," "subheadline," "hero section copy," "above the fold," "this copy is weak," "make this more compelling," or "help me describe my product." Use this whenever someone is working on website text that needs to persuade or convert. For email copy, see email-sequence. For popup copy, see popup-cro. For editing existing copy, see copy-editing.

SKILL.mdUpdated Apr 16, 2026

CenredJun/copywriting

CenredJun/content-marketer

testing

VerifiedTrustedCommunity

Elite content marketing strategist specializing in AI-powered content creation, omnichannel distribution, SEO optimization, and data-driven performance marketing.

SKILL.mdUpdated Apr 16, 2026

CenredJun/content-marketer

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/CenredJun/openclaw-claudecode-setup-kit.git

# Copy into Claude Code skills folder (global)
cp -r openclaw-claudecode-setup-kit/skills/ai-engineer ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

CenredJun/openclaw-claudecode-setup-kit

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT