skills/llm/SKILL.md
LLM engineering: agent frameworks (AI SDK, LangGraph), document and image processing, open-weight model serving (vLLM on AWS/GCP), RAG pipelines, and eval tooling (RAGAS, LangSmith). Triggers on: "llm", "llm engineering", "agent framework", "rag", "retrieval augmented generation", "model serving", "vllm", "langchain", "langgraph", "ragas", "langsmith", "llm eval".
npx skillsauth add cloudvoyant/codevoyant llmInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Context skill for LLM engineering: agent frameworks, document/image processing, open-weight model serving, retrieval-augmented generation, and evaluation tooling.
| You are working on... | Load recipe |
|---|---|
| Agent with Vercel AI SDK or Anthropic SDK | references/recipes/agents-ai-sdk.md |
| Agent with LangChain or LangGraph | references/recipes/agents-langgraph.md |
| Tool calling in any SDK | references/recipes/tool-calling.md |
| PDF text extraction, OCR, or document chunking | references/recipes/document-processing.md |
| Image tiling for Vision APIs or PDF rasterization | references/recipes/image-tiling.md |
| Serving open-weight models on AWS | references/recipes/serving-aws.md |
| Serving open-weight models on GCP | references/recipes/serving-gcp.md |
| RAG pipeline on AWS (OpenSearch, Bedrock) | references/recipes/rag-aws.md |
| RAG pipeline on GCP (Vertex AI, BigQuery) | references/recipes/rag-gcp.md |
| RAG with OSS vector stores (Qdrant, Chroma) | references/recipes/rag-oss.md |
| LLM evaluation, RAGAS, LangSmith, regression testing | references/recipes/llm-eval.md |
LLM engineering is systems engineering with a probabilistic core. The model is just one component in a pipeline that includes document ingestion, embedding, retrieval, prompt construction, tool execution, and output evaluation. These recipes treat each component as an independently testable, observable subsystem. Agent loops have explicit state machines. RAG pipelines have measurable retrieval quality. Serving infrastructure has latency budgets and cost models. Eval is not optional — it is the feedback loop that makes everything else improvable.
development
React patterns: Zustand state management, shadcn/ui + Tailwind CSS, React Three Fiber + Drei for 3D, folder structure, data fetching, and TypeScript conventions. Load when working on React projects (*.tsx) without SvelteKit.
development
QA workflows: investigate and document bugs, post issues to GitHub/GitLab/Linear, and run browser-agent smoke tests. Triggers on: 'qa debug', 'qa report', 'qa smoke', 'run smoke test', 'report bug', 'investigate issue'.
tools
Python project patterns: uv package/workspace management, MLflow experiment tracking, Ray distributed computing, Nvidia Warp GPU kernels, Pydantic validation, Click CLIs, and service architecture. Load when writing Python with pyproject.toml or uv.lock.
development
Code review workflows: create a draft PR/MR, generate AI-powered inline review comments, address change requests, or complete a draft review. Triggers on: "pr open", "pr new", "pr review", "pr address", "pr complete", "open a PR", "create a draft PR", "code review", "pr mr", "pr this PR", "address pr comments", "fix review comments", "complete draft review", "publish review".