Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

RaheesAhmed/ai-engineer

Name: ai-engineer
Author: RaheesAhmed

skills/ai-engineer/SKILL.md

npx skillsauth add RaheesAhmed/SajiCode ai-engineer

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

AI Engineer

Model Selection Matrix

| Model | Best For | Cost | Speed | Context | |-------|----------|------|-------|---------| | GPT-4o | General tasks, function calling | $$$ | Fast | 128K | | Claude 3.5 Sonnet | Code generation, analysis | $$$ | Fast | 200K | | GPT-4o-mini | Cost-sensitive tasks | $ | Very fast | 128K | | Claude 3.5 Haiku | High-volume, simple tasks | $ | Very fast | 200K | | Llama 3.1 70B | Self-hosted, privacy | Free* | Medium | 128K | | Mixtral 8x22B | Open-source, balanced | Free* | Medium | 64K | | Gemini 2.0 Flash | Multimodal, long context | $$ | Fast | 1M |

Decision: Start with cheapest model that meets quality bar. Upgrade only when quality fails.

RAG Pipeline Architecture

Ingestion Pipeline

Documents → Chunking → Embedding → Vector Store
                ↓
         Metadata extraction
         (title, source, date)

Retrieval Pipeline

Query → Query Understanding → Retrieval → Reranking → Generation
              ↓                    ↓           ↓
         Expansion/       Hybrid search    Cross-encoder
         Decomposition    (vector + BM25)   scoring

Chunking Strategies

// Recursive text splitter — best default
const splitter = new RecursiveCharacterTextSplitter({
  chunkSize: 1000,
  chunkOverlap: 200,
  separators: ["\n\n", "\n", ". ", " ", ""],
});

// Semantic chunking — for high-quality retrieval
const semanticSplitter = new SemanticChunker(embeddings, {
  breakpointThresholdType: "percentile",
  breakpointThresholdAmount: 95,
});

Vector Database Selection

| Database | Hosting | Best For | |----------|---------|----------| | Pinecone | Managed | Production, scale | | Qdrant | Self-hosted/Cloud | Hybrid search | | Chroma | Embedded | Prototyping, local | | pgvector | PostgreSQL ext | Existing Postgres | | Weaviate | Self-hosted/Cloud | Multimodal |

LangGraph Agent Patterns

ReAct Agent (tool-calling loop)

import { StateGraph, MessagesAnnotation } from "@langchain/langgraph";
import { ToolNode } from "@langchain/langgraph/prebuilt";

const agentNode = async (state: typeof MessagesAnnotation.State) => {
  const response = await model.invoke(state.messages);
  return { messages: [response] };
};

const shouldContinue = (state: typeof MessagesAnnotation.State) => {
  const lastMessage = state.messages[state.messages.length - 1];
  return lastMessage.tool_calls?.length ? "tools" : "__end__";
};

const graph = new StateGraph(MessagesAnnotation)
  .addNode("agent", agentNode)
  .addNode("tools", new ToolNode(tools))
  .addEdge("__start__", "agent")
  .addConditionalEdges("agent", shouldContinue)
  .addEdge("tools", "agent")
  .compile();

Multi-Agent Supervisor Pattern

const supervisorNode = async (state: AgentState) => {
  const response = await supervisorModel.invoke([
    { role: "system", content: "Route to the right specialist agent." },
    ...state.messages,
  ]);
  return { next: response.content }; // "researcher" | "coder" | "reviewer"
};

Prompt Engineering

Structured Output

const schema = z.object({
  sentiment: z.enum(["positive", "negative", "neutral"]),
  confidence: z.number().min(0).max(1),
  reasoning: z.string(),
});

const structuredLlm = model.withStructuredOutput(schema);
const result = await structuredLlm.invoke("Analyze: Great product!");

Chain-of-Thought

You are an expert analyst. Think through this step by step:

1. First, identify the key entities in the text
2. Then, determine the relationships between them
3. Finally, synthesize your findings into a structured answer

Text: {input}

Few-Shot Pattern

const fewShotPrompt = ChatPromptTemplate.fromMessages([
  ["system", "Extract structured data from text."],
  ["human", "John works at Google since 2020"],
  ["ai", '{"name": "John", "company": "Google", "year": 2020}'],
  ["human", "Sarah joined Meta in 2023"],
  ["ai", '{"name": "Sarah", "company": "Meta", "year": 2023}'],
  ["human", "{input}"],
]);

Cost Optimization

Token Reduction Strategies

Shorter prompts: Remove fluff, use terse instructions
Caching: Semantic cache with vector similarity threshold
Model routing: Use cheap model for simple tasks, expensive for complex
Streaming: Stream responses to reduce perceived latency
Batching: Group similar requests for batch API pricing

Semantic Caching

const cache = new SemanticCache({
  embeddings,
  vectorStore,
  similarityThreshold: 0.95,
});

async function cachedInvoke(prompt: string) {
  const cached = await cache.lookup(prompt);
  if (cached) return cached;
  const result = await model.invoke(prompt);
  await cache.store(prompt, result);
  return result;
}

AI Safety Checklist

[ ] Validate all user inputs before sending to LLM
[ ] Strip PII from prompts when possible
[ ] Set max token limits on all LLM calls
[ ] Implement rate limiting per user/API key
[ ] Add content moderation on LLM outputs
[ ] Log all LLM interactions for debugging (redact PII)
[ ] Use temperature=0 for deterministic tasks
[ ] Implement timeout and retry with exponential backoff
[ ] Never expose raw LLM errors to end users

RaheesAhmed/ai-engineer

skills/ai-engineer/SKILL.md

Build production-ready LLM applications, RAG systems, and intelligent agents. Covers model selection, vector search, prompt engineering, agent orchestration with LangGraph, cost optimization, and AI safety. Use for any LLM feature, chatbot, AI agent, or AI-powered application.

66 stars

development

Updated May 17, 2026

$ install --global

skillsauth

npx skillsauth add RaheesAhmed/SajiCode ai-engineer

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: May 17, 2026, 5:13 AM125.9s1 file scanned

SKILL.md

name:: ai-engineer
description:: Build production-ready LLM applications, RAG systems, and intelligent agents. Covers model selection, vector search, prompt engineering, agent orchestration with LangGraph, cost optimization, and AI safety. Use for any LLM feature, chatbot, AI agent, or AI-powered application.

AI Engineer

Model Selection Matrix

Decision: Start with cheapest model that meets quality bar. Upgrade only when quality fails.

RAG Pipeline Architecture

Ingestion Pipeline

Documents → Chunking → Embedding → Vector Store
                ↓
         Metadata extraction
         (title, source, date)

Retrieval Pipeline

Query → Query Understanding → Retrieval → Reranking → Generation
              ↓                    ↓           ↓
         Expansion/       Hybrid search    Cross-encoder
         Decomposition    (vector + BM25)   scoring

Chunking Strategies

// Recursive text splitter — best default
const splitter = new RecursiveCharacterTextSplitter({
  chunkSize: 1000,
  chunkOverlap: 200,
  separators: ["\n\n", "\n", ". ", " ", ""],
});

// Semantic chunking — for high-quality retrieval
const semanticSplitter = new SemanticChunker(embeddings, {
  breakpointThresholdType: "percentile",
  breakpointThresholdAmount: 95,
});

Vector Database Selection

LangGraph Agent Patterns

ReAct Agent (tool-calling loop)

import { StateGraph, MessagesAnnotation } from "@langchain/langgraph";
import { ToolNode } from "@langchain/langgraph/prebuilt";

const agentNode = async (state: typeof MessagesAnnotation.State) => {
  const response = await model.invoke(state.messages);
  return { messages: [response] };
};

const shouldContinue = (state: typeof MessagesAnnotation.State) => {
  const lastMessage = state.messages[state.messages.length - 1];
  return lastMessage.tool_calls?.length ? "tools" : "__end__";
};

const graph = new StateGraph(MessagesAnnotation)
  .addNode("agent", agentNode)
  .addNode("tools", new ToolNode(tools))
  .addEdge("__start__", "agent")
  .addConditionalEdges("agent", shouldContinue)
  .addEdge("tools", "agent")
  .compile();

Multi-Agent Supervisor Pattern

const supervisorNode = async (state: AgentState) => {
  const response = await supervisorModel.invoke([
    { role: "system", content: "Route to the right specialist agent." },
    ...state.messages,
  ]);
  return { next: response.content }; // "researcher" | "coder" | "reviewer"
};

Prompt Engineering

Structured Output

const schema = z.object({
  sentiment: z.enum(["positive", "negative", "neutral"]),
  confidence: z.number().min(0).max(1),
  reasoning: z.string(),
});

const structuredLlm = model.withStructuredOutput(schema);
const result = await structuredLlm.invoke("Analyze: Great product!");

Chain-of-Thought

You are an expert analyst. Think through this step by step:

1. First, identify the key entities in the text
2. Then, determine the relationships between them
3. Finally, synthesize your findings into a structured answer

Text: {input}

Few-Shot Pattern

const fewShotPrompt = ChatPromptTemplate.fromMessages([
  ["system", "Extract structured data from text."],
  ["human", "John works at Google since 2020"],
  ["ai", '{"name": "John", "company": "Google", "year": 2020}'],
  ["human", "Sarah joined Meta in 2023"],
  ["ai", '{"name": "Sarah", "company": "Meta", "year": 2023}'],
  ["human", "{input}"],
]);

Cost Optimization

Token Reduction Strategies

Shorter prompts: Remove fluff, use terse instructions
Caching: Semantic cache with vector similarity threshold
Model routing: Use cheap model for simple tasks, expensive for complex
Streaming: Stream responses to reduce perceived latency
Batching: Group similar requests for batch API pricing

Semantic Caching

const cache = new SemanticCache({
  embeddings,
  vectorStore,
  similarityThreshold: 0.95,
});

async function cachedInvoke(prompt: string) {
  const cached = await cache.lookup(prompt);
  if (cached) return cached;
  const result = await model.invoke(prompt);
  await cache.store(prompt, result);
  return result;
}

AI Safety Checklist

[ ] Validate all user inputs before sending to LLM
[ ] Strip PII from prompts when possible
[ ] Set max token limits on all LLM calls
[ ] Implement rate limiting per user/API key
[ ] Add content moderation on LLM outputs
[ ] Log all LLM interactions for debugging (redact PII)
[ ] Use temperature=0 for deterministic tasks
[ ] Implement timeout and retry with exponential backoff
[ ] Never expose raw LLM errors to end users

Related Skills

RaheesAhmed/web-research

development

VerifiedTrustedCommunity

Deep web research and data extraction skill. Systematically research ANY topic by fetching URLs, reading documentation, crawling API docs, evaluating npm/pypi packages, comparing technologies, and synthesizing findings into actionable recommendations. Use when researching libraries, frameworks, APIs, solutions, or any topic requiring web investigation.

66SKILL.mdUpdated May 17, 2026

RaheesAhmed/web-research

RaheesAhmed/testing-patterns

development

VerifiedTrustedCommunity

Design and implement comprehensive test suites. Covers unit testing, integration testing, E2E testing with Playwright, API testing, mocking strategies, test data factories, TDD workflow, snapshot testing, coverage targets, and CI integration. Use when writing tests, designing test architecture, or debugging test failures.

66SKILL.mdUpdated May 17, 2026

RaheesAhmed/testing-patterns

RaheesAhmed/superpowers

development

VerifiedTrustedCommunity

Core engineering workflow that activates on EVERY task. Enforces systematic plan-before-code methodology, multi-file refactoring safety, dependency-aware changes, pre-flight verification, and zero-placeholder quality standards. Use PROACTIVELY on all coding tasks.

66SKILL.mdUpdated May 17, 2026

RaheesAhmed/superpowers

RaheesAhmed/styling-patterns

tools

VerifiedTrustedCommunity

Implement production styling systems with Tailwind CSS, vanilla CSS, or CSS-in-JS. Covers CSS architecture (BEM, utility-first, modules), design tokens, responsive patterns, animation systems, dark mode, container queries, print styles, and performance optimization. Use when implementing designs or building CSS architectures.

66SKILL.mdUpdated May 17, 2026

RaheesAhmed/styling-patterns

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/RaheesAhmed/SajiCode.git

# Copy into Claude Code skills folder (global)
cp -r SajiCode/skills/ai-engineer ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

RaheesAhmed/SajiCode

66 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT