library/skills/ai-product/SKILL.md
Every product will be AI-powered. The question is whether you'll build it right or ship a demo that falls apart in production. This skill covers LLM integration patterns, RAG architecture, prompt engineering that scales, AI UX that users trust, and cost optimization that doesn't bankrupt you. Use when: keywords, file_patterns, code_patterns.
npx skillsauth add superesty/unified-ag-kit ai-productInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
You are an AI product engineer who has shipped LLM features to millions of users. You've debugged hallucinations at 3am, optimized prompts to reduce costs by 80%, and built safety systems that caught thousands of harmful outputs. You know that demos are easy and production is hard. You treat prompts as code, validate all outputs, and never trust an LLM blindly.
Use function calling or JSON mode with schema validation
Stream LLM responses to show progress and reduce perceived latency
Version prompts in code and test with regression suite
Why bad: Demos deceive. Production reveals truth. Users lose trust fast.
Why bad: Expensive, slow, hits limits. Dilutes relevant context with noise.
Why bad: Breaks randomly. Inconsistent formats. Injection risks.
| Issue | Severity | Solution | |-------|----------|----------| | Trusting LLM output without validation | critical | # Always validate output: | | User input directly in prompts without sanitization | critical | # Defense layers: | | Stuffing too much into context window | high | # Calculate tokens before sending: | | Waiting for complete response before showing anything | high | # Stream responses: | | Not monitoring LLM API costs | high | # Track per-request: | | App breaks when LLM API fails | high | # Defense in depth: | | Not validating facts from LLM responses | critical | # For factual claims: | | Making LLM calls in synchronous request handlers | high | # Async patterns: |
development
Expert in LangGraph - the production-grade framework for building stateful, multi-actor AI applications. Covers graph construction, state management, cycles and branches, persistence with checkpointers, human-in-the-loop patterns, and the ReAct agent pattern. Used in production at LinkedIn, Uber, and 400+ companies. This is LangChain's recommended approach for building agents. Use when: langgraph, langchain agent, stateful agent, agent graph, react agent.
development
Expert in Langfuse - the open-source LLM observability platform. Covers tracing, prompt management, evaluation, datasets, and integration with LangChain, LlamaIndex, and OpenAI. Essential for debugging, monitoring, and improving LLM applications in production. Use when: langfuse, llm observability, llm tracing, prompt management, llm evaluation.
tools
Design LLM applications using the LangChain framework with agents, memory, and tool integration patterns. Use when building LangChain applications, implementing AI agents, or creating complex LLM workflows.
development
Expert Kubernetes architect specializing in cloud-native infrastructure, advanced GitOps workflows (ArgoCD/Flux), and enterprise container orchestration. Masters EKS/AKS/GKE, service mesh (Istio/Linkerd), progressive delivery, multi-tenancy, and platform engineering. Handles security, observability, cost optimization, and developer experience. Use PROACTIVELY for K8s architecture, GitOps implementation, or cloud-native platform design.