ai-rag-patterns/SKILL.md
Use when building features that answer questions from private data, documents, policies, or time-sensitive information — RAG architecture, chunking strategies, hybrid search, re-ranking, vector databases, evaluation, agentic RAG, multimodal RAG...
npx skillsauth add peterbamuhigire/skills-web-dev ai-rag-patternsInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
ai-rag-patterns or would be better handled by a more specific companion skill.SKILL.md first, then load only the referenced deep-dive files that are necessary for the task.RAG solves the core LLM limitation: they only know what they were trained on. Use RAG to inject private data (invoices, menus, policies, reports) into every AI response.
Core principle: RAG = look up a database + LLM synthesises the results. The LLM never needs to "know" your data.
| Condition | Action | |---|---| | Knowledge base < 200K tokens (~500 pages) | Include everything in context — no RAG needed | | Knowledge base > 200K tokens | Use RAG | | Data changes frequently (menus, prices, stock) | RAG (update documents, not model) | | Data is private/confidential | RAG (keeps data out of training pipelines) | | Need source citations | RAG (chunks are traceable to source) | | Model needs brand voice / domain jargon | Fine-tune instead |
| Factor | RAG | Fine-Tuning | |---|---|---| | Up-to-date content | ✅ Yes (add docs anytime) | ❌ Stale until retrained | | Hallucinations | ✅ Lower (document-grounded) | ❌ Higher | | Source citations | ✅ Yes | ❌ No | | Brand voice control | ❌ Weak | ✅ Strong | | Domain jargon | ❌ Weak | ✅ Strong | | Up-front cost | ✅ Lower | ❌ High |
Default: start with RAG. Fine-tune only when RAG + prompt engineering cannot deliver the required tone or vocabulary.
Extended guidance for ai-rag-patterns was moved to references/skill-deep-dive.md to keep this entrypoint compact and fast to load.
Use that deep dive for:
Pipeline ArchitectureChunking StrategiesEmbedding Model SelectionVector Database SelectionRetrieval AlgorithmsRe-RankingFull RAG Query AlgorithmQuery Rewriting (Multi-Turn)RAG Schema (Multi-Tenant)Evaluation FrameworkProduction PatternsAgentic RAGdata-ai
Use when adding AI-powered analytics to a SaaS platform — semantic search over business data, natural language queries, trend detection, anomaly alerts, and AI-generated insights for dashboards. Covers embeddings, NL2SQL, and per-tenant analytics...
data-ai
Design AI-powered analytics dashboards — what metrics to show, how to display AI predictions and confidence, drill-down patterns, KPI cards, trend visualisation, AI Insights panels, export design, and role-based dashboard variants. Invoke when...
development
Use when designing, building, reviewing, or upgrading production software systems that must be secure, performant, maintainable, scalable, and user-centered. Apply before writing specs, code, architecture, APIs, databases, mobile apps, SaaS platforms, or ERP systems.
development
Professional web app UI using commercial templates (Tabler/Bootstrap 5) with strong frontend design direction when needed. Use for CRUD interfaces, dashboards, admin panels with SweetAlert2, DataTables, Flatpickr. Clone seeder-page.php, use...