Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

itmegirish/.claude/skills/test-draft-pipeline

Name: .claude/skills/test-draft-pipeline
Author: itmegirish

.claude/skills/test-draft-pipeline/SKILL.md

npx skillsauth add itmegirish/boardingmcp-server .claude/skills/test-draft-pipeline

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

SKILL: test-draft-pipeline

Purpose

Run the drafting pipeline, evaluate output quality, and verify all 4 gates + review work correctly.

When to Use

After modifying any pipeline node, gate, or prompt
After creating or updating an exemplar or LKB entry
For regression testing across multiple scenarios
For debugging pipeline failures

Test Runners

Quick Test (single scenario)

agent_steer/Scripts/python.exe research/run_draft_live.py

Unit Tests

agent_steer/Scripts/python.exe -m pytest tests/drafting/ -v

Review Benchmark

agent_steer/Scripts/python.exe research/run_review_benchmark.py

Multi-Scenario Compare

agent_steer/Scripts/python.exe research/run_v5_compare.py

What to Check in Logs

[INTAKE+CLASSIFY] -> doc_type, law_domain, cause_type, facts extracted, parties
[RAG]             -> query terms, chunks retrieved, dedup count
[ENRICHMENT]      -> limitation article, verified_provisions count, LKB hit/miss
[LKB]             -> hit/miss/alias resolution, conditional field resolution
[DRAFT]           -> model used, prompt size, output chars, placeholders found
[EVIDENCE_ANCHORING] -> entities found, anchored, replaced with placeholder
[LKB_COMPLIANCE]     -> primary acts check, superseded law replacements
[POSTPROCESS]        -> formatting fixes applied
[CITATION_VALIDATOR] -> provisions verified, flagged, case citations found
[REVIEW]           -> skipped? cycle count, blocking_issues, inline fix, token usage

Scoring Framework

Universal Checks (ALL doc_types)

| # | Check | What to verify | |---|---|---| | 1 | Court heading present | Court name + place | | 2 | Title present | Document type stated | | 3 | Parties section | Primary + opposite party | | 4 | Jurisdiction section | Territorial + pecuniary + subject matter | | 5 | Facts section | Numbered paragraphs, chronological | | 6 | Legal basis section | At least one statutory provision | | 7 | Prayer section | Specific relief(s) requested | | 8 | Verification clause | Order VI Rule 15 for CPC | | 9 | Continuous numbering | Paragraphs numbered sequentially | | 10 | No fabricated citations | Zero AIR/SCC/ILR unless user-provided | | 11 | Proper placeholders | {{NAME}} format for unknowns | | 12 | Evidence referenced | Annexure labels used | | 13 | Formal language | Court-ready register |

Pipeline-Specific Checks

| # | Check | What to verify | |---|---|---| | 14 | LKB resolved | cause_type matched (direct or via alias) | | 15 | Primary acts cited | LKB primary_acts appear in draft | | 16 | Limitation correct | Matches LKB (or NONE when appropriate) | | 17 | No superseded acts | IPC/CrPC/Evidence Act replaced with BNS/BNSS/BSA | | 18 | Citation validator clean | No unverified provisions flagged | | 19 | Evidence anchoring clean | No unsupported tokens remain | | 20 | Review routing correct | Legal vs formatting severity handled properly |

Debug Checklist

When a draft scores below target:

Check intake — did it classify cause_type correctly? Check [INTAKE+CLASSIFY] log
Check LKB — did lookup succeed? Watch for [LKB] miss in logs. Check aliases
Check enrichment — did limitation resolve? Check [ENRICHMENT] log
Check conditional resolution — did resolve_entry flatten conditionals? Check [LKB] conditional
Check draft prompt — is LKB brief reaching the draft? Check _build_lkb_brief_context
Check RAG — are relevant chunks retrieved? Check [RAG] query terms
Check gates — are there false positives? Check each gate's log output
Check review — did review fix or break things? Check [REVIEW] blocking_issues

Common Issues

| Symptom | Cause | Fix | |---------|-------|-----| | 35 placeholders | LKB miss -> no acts/limitation fed to draft | Add alias in lkb/__init__.py or fix cause_type in intake prompt | | Wrong limitation article | Conditional field not resolved | Check resolve_entry + _INFERENCE_MAP keywords | | Hallucinated section content | LLM uses training memory not RAG | Check anti-hallucination instruction in LKB brief builder | | Review too slow | Too many tokens sent | Already fixed — slim payload (~3.5K tokens) | | Citation flagged incorrectly | Provision not in verified_provisions | Check enrichment RAG scan + user_cited_provisions | | Wrong model used | Settings override in .env | Check OLLAMA_DRAFT_MODEL / OLLAMA_REVIEW_MODEL | | Superseded act in draft | LKB compliance gate missed | Check superseded law patterns in lkb_compliance.py |

Hallucination Tests

| Test | Input | Verify | Fail if | |------|-------|--------|---------| | Date hallucination | No dates provided | All dates {{PLACEHOLDER}} | Concrete date fabricated | | Amount hallucination | Only principal amount | Principal matches exactly | Unrelated amount in draft | | Citation hallucination | No case law request | Zero AIR/SCC/ILR | Case citation in draft | | Name hallucination | No specific names | All names {{PLACEHOLDER}} | Invented name in draft | | Statute hallucination | Check cited provisions | In verified_provisions | Unverified citation not flagged |

Regression Testing

After ANY change, run scenarios covering:

Money recovery — formulaic, should score high
Breach of contract — standard complexity
Dealership damages — complex, multiple damage heads
Partition — complex, genealogy + property schedule
Recovery of possession — tests LKB alias resolution
Defamation — tests cause-type matching

Check:

No quality regression (gate issues count, placeholder count)
No timing regression (within expected bounds)
No new false positives in gates
LKB resolution succeeds for all cause types

Telemetry to Log

Total pipeline duration + per-stage breakdown
Model used + estimated tokens (input/output)
LKB hit/miss/alias
Gate results (per gate: pass/fail/fixes)
Review: skipped/triggered, blocking issues, inline fix applied
Placeholder count in final output
Draft text length (chars)

itmegirish/.claude/skills/test-draft-pipeline

.claude/skills/test-draft-pipeline/SKILL.md

# SKILL: test-draft-pipeline ## Purpose Run the drafting pipeline, evaluate output quality, and verify all 4 gates + review work correctly. ## When to Use - After modifying any pipeline node, gate, or prompt - After creating or updating an exemplar or LKB entry - For regression testing across multiple scenarios - For debugging pipeline failures ## Test Runners ### Quick Test (single scenario) ```bash agent_steer/Scripts/python.exe research/run_draft_live.py ``` ### Unit Tests ```bash agent_

development

Updated Apr 6, 2026

$ install --global

skillsauth

npx skillsauth add itmegirish/boardingmcp-server .claude/skills/test-draft-pipeline

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 6, 2026, 2:17 AM4.5s1 file scanned

SKILL.md

SKILL: test-draft-pipeline

Purpose

Run the drafting pipeline, evaluate output quality, and verify all 4 gates + review work correctly.

When to Use

After modifying any pipeline node, gate, or prompt
After creating or updating an exemplar or LKB entry
For regression testing across multiple scenarios
For debugging pipeline failures

Test Runners

Quick Test (single scenario)

agent_steer/Scripts/python.exe research/run_draft_live.py

Unit Tests

agent_steer/Scripts/python.exe -m pytest tests/drafting/ -v

Review Benchmark

agent_steer/Scripts/python.exe research/run_review_benchmark.py

Multi-Scenario Compare

agent_steer/Scripts/python.exe research/run_v5_compare.py

What to Check in Logs

[INTAKE+CLASSIFY] -> doc_type, law_domain, cause_type, facts extracted, parties
[RAG]             -> query terms, chunks retrieved, dedup count
[ENRICHMENT]      -> limitation article, verified_provisions count, LKB hit/miss
[LKB]             -> hit/miss/alias resolution, conditional field resolution
[DRAFT]           -> model used, prompt size, output chars, placeholders found
[EVIDENCE_ANCHORING] -> entities found, anchored, replaced with placeholder
[LKB_COMPLIANCE]     -> primary acts check, superseded law replacements
[POSTPROCESS]        -> formatting fixes applied
[CITATION_VALIDATOR] -> provisions verified, flagged, case citations found
[REVIEW]           -> skipped? cycle count, blocking_issues, inline fix, token usage

Scoring Framework

Universal Checks (ALL doc_types)

Pipeline-Specific Checks

Debug Checklist

When a draft scores below target:

Check intake — did it classify cause_type correctly? Check [INTAKE+CLASSIFY] log
Check LKB — did lookup succeed? Watch for [LKB] miss in logs. Check aliases
Check enrichment — did limitation resolve? Check [ENRICHMENT] log
Check conditional resolution — did resolve_entry flatten conditionals? Check [LKB] conditional
Check draft prompt — is LKB brief reaching the draft? Check _build_lkb_brief_context
Check RAG — are relevant chunks retrieved? Check [RAG] query terms
Check gates — are there false positives? Check each gate's log output
Check review — did review fix or break things? Check [REVIEW] blocking_issues

Common Issues

Hallucination Tests

Regression Testing

After ANY change, run scenarios covering:

Money recovery — formulaic, should score high
Breach of contract — standard complexity
Dealership damages — complex, multiple damage heads
Partition — complex, genealogy + property schedule
Recovery of possession — tests LKB alias resolution
Defamation — tests cause-type matching

Check:

No quality regression (gate issues count, placeholder count)
No timing regression (within expected bounds)
No new false positives in gates
LKB resolution succeeds for all cause types

Telemetry to Log

Total pipeline duration + per-stage breakdown
Model used + estimated tokens (input/output)
LKB hit/miss/alias
Gate results (per gate: pass/fail/fixes)
Review: skipped/triggered, blocking issues, inline fix applied
Placeholder count in final output
Draft text length (chars)

Related Skills

itmegirish/.claude/skills/v9-architecture

development

VerifiedTrustedCommunity

# SKILL: v9-architecture Use when: planning, building, or reviewing v11.0 architecture components (LKB 2-layer model, document schemas, structured prompt builder, gates, family migrations). ## v11.0 Architecture — Scalable Context-Driven Pipeline ### Core Principles 1. **Better context to LLM = better draft** — no complex engine needed 2. **Separate law from structure** — cause type (92) × document type (12) = 1,104 combinations 3. **Decide law before drafting, enforce law after drafting** #

SKILL.mdUpdated Apr 6, 2026

itmegirish/.claude/skills/v9-architecture

itmegirish/.claude/skills/template-builder

development

VerifiedTrustedCommunity

# SKILL: exemplar-builder ## Purpose Create, validate, and maintain document schemas and LKB Layer 2 data for the v11.0 scalable drafting pipeline. **v11.0 approach:** No exemplar documents in prompts. Instead: LKB 2-layer data + document schema → structured prompt → LLM drafts. ## When to Use - Creating a new document schema (e.g., written_statement, appeal_memo) - Enriching LKB entries with Layer 2 data (available_reliefs, jurisdiction_basis) - Reviewing schema quality against CPC rules - A

SKILL.mdUpdated Apr 6, 2026

itmegirish/.claude/skills/template-builder

itmegirish/.claude/skills/section-validator

development

VerifiedTrustedCommunity

# SKILL: section-validator ## Purpose Build and maintain the 4 deterministic verification gates (Stage 3). Gates run on the full draft text with zero LLM calls. They validate, auto-fix formatting, and flag issues for review. ## When to Use - Building or modifying any gate - Adding new entity extraction patterns - Debugging false positives / false negatives - Extending verified provisions coverage ## Architecture Context (v5.1 — what's running) 4 gates run sequentially on `draft.draft_artifac

SKILL.mdUpdated Apr 6, 2026

itmegirish/.claude/skills/section-validator

itmegirish/.claude/skills/section-drafter-prompt

development

VerifiedTrustedCommunity

# SKILL: draft-prompt ## Purpose Build and refine the draft prompt that produces a complete court-ready legal document in a single LLM call. v5.1 uses free-text drafting — the LLM outputs the entire document (not section-keyed JSON, not gap-fill). Exemplar-guided, LKB-informed. ## When to Use - Building or modifying `prompts/draft_prompt.py` - Debugging why draft quality is low - Tuning exemplars or context injection - Adapting prompt for a new cause type - Optimizing prompt token count ## Ar

SKILL.mdUpdated Apr 6, 2026

itmegirish/.claude/skills/section-drafter-prompt

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/itmegirish/boardingmcp-server.git

# Copy into Claude Code skills folder (global)
cp -r boardingmcp-server/.claude/skills/test-draft-pipeline ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

itmegirish/boardingmcp-server

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT