plugins/rag-development/skills/rag-development/SKILL.md
Comprehensive RAG development knowledge base covering chunking, embeddings, vector databases, retrieval strategies, advanced patterns (Graph RAG, CRAG, Self-RAG, Agentic RAG), evaluation, and production deployment. TRIGGER WHEN: building, implementing, writing, coding, creating, optimizing, or auditing RAG systems. DO NOT TRIGGER WHEN: the task is outside the specific scope of this component.
npx skillsauth add acaprino/alfio-claude-plugins rag-developmentInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Comprehensive knowledge base for building production-grade Retrieval-Augmented Generation systems.
For 80% of use cases, start with:
text-embedding-3-small (best value) or Cohere embed-v4 (best accuracy)Then upgrade incrementally based on measured failures:
Detailed reference documents are in the references/ directory:
chunking-strategies.md -- all chunking approaches with code, benchmarks, and selection guideembedding-models.md -- model comparison, Matryoshka embeddings, fine-tuning, sparse/dense/multi-vectorretrieval-patterns.md -- hybrid search, HyDE, contextual retrieval, re-ranking, MMRadvanced-rag-patterns.md -- Graph RAG, RAPTOR, CRAG, Self-RAG, Agentic RAG, multi-modal RAGvector-databases.md -- Qdrant deep dive, database comparison, scaling strategiesproduction-guide.md -- evaluation, observability, caching, security, cost optimizationDocument Ingestion:
Raw Docs -> Preprocessing (Unstructured.io) -> Chunking -> Context Enrichment -> Embedding -> Vector DB
Query Pipeline:
User Query -> Query Transform -> Encode (Dense + Sparse) -> Hybrid Search -> Re-rank -> LLM Generation
Evaluation Loop:
Ground Truth + Predictions -> RAGAS/DeepEval -> Faithfulness, Relevancy, Precision, Recall
| Decision | Default | Upgrade When | |----------|---------|-------------| | Chunking | Recursive 512 tok | Structured docs -> markdown-aware; cross-refs -> late chunking | | Embedding | text-embedding-3-small | Need accuracy -> embed-v4; self-hosted -> NV-Embed-v2 | | Vector DB | Qdrant + INT8 | Already on Postgres -> pgvector; need managed -> Pinecone | | Search | Dense only | Keyword misses -> add sparse hybrid; poor diversity -> add MMR | | Re-ranking | None | Top-k results contain irrelevant items -> add Cohere Rerank | | Caching | None | Production latency/cost concerns -> semantic cache | | Evaluation | Manual spot checks | Any production use -> RAGAS automated metrics |
tools
Master memory forensics techniques including memory acquisition, process analysis, and artifact extraction using Volatility and related tools. Use when analyzing memory dumps, investigating incidents, or performing malware analysis from RAM captures.
development
Master binary analysis patterns including disassembly, decompilation, control flow analysis, and code pattern recognition. Use when analyzing executables, understanding compiled code, or performing static analysis on binaries.
development
Idiomatic Kotlin implementation patterns: coroutines and structured concurrency, Flow / StateFlow / SharedFlow, Kotlin Multiplatform (KMP) shared-code architecture, Jetpack Compose UI, Ktor server with JWT auth and Exposed, and type-safe DSL design (lambdas with receivers, delegated properties, inline reified, value classes). TRIGGER WHEN: building, writing, or reviewing Kotlin code using coroutines / Flow / suspend functions, expect/actual, Compose composables / ViewModels, Ktor routing, sealed-class state modeling, scope functions, or DSL builders. DO NOT TRIGGER WHEN: libGDX game work (use libgdx-development), Android Java without Kotlin, or pure JVM tuning unrelated to Kotlin language features.
tools
Strategic website planning skill that conducts structured client discovery, produces professional deliverables (website brief, sitemap, design direction, content strategy), and orchestrates frontend-design, frontend-layout, seo-specialist, and content-marketer agents automatically. TRIGGER WHEN: planning a new website or redesign before any code is written. DO NOT TRIGGER WHEN: the task is outside the specific scope of this component.