modelgas/skills/tests/SKILL.md
# Skill: Test Writing ## Purpose Design and generate a validation test suite that assesses conceptual soundness, implementation correctness, numerical stability, and outcome reasonableness. This skill converts model risk into executable tests. ## Inputs Required IR fields: - methodology outputs - ALW outputs - code evidence snippets Skill data inputs: - test_matrix.yaml (required test categories and patterns) ## Outputs - A test plan matrix (test name, purpose, category) - Generated pytest
npx skillsauth add gtylee/codexgas modelgas/skills/testsInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Design and generate a validation test suite that assesses conceptual soundness, implementation correctness, numerical stability, and outcome reasonableness.
This skill converts model risk into executable tests.
Required IR fields:
Skill data inputs:
You are a model validation engineer writing tests for a financial model. Design tests that would catch real failures, not just pass happy paths.
Using the model IR and ALW:
Return JSON matching the schema exactly.
development
# Skill: Risk Tiering ## Purpose Determine the governance risk tier of the model by assessing its financial impact, operational reliance, usage pattern, implementation complexity, and strength of existing risk mitigations. This skill establishes the downstream control requirements for all other skills. ## Inputs Required IR fields: - project metadata - symbols and public interfaces - imports and dependencies - commentary_md - evidence_index Skill data inputs: - rubric.yaml (axis definitions,
development
# Skill: Remediation Pack ## Purpose Convert CodexGAS findings into implementable remediation artifacts (patches, tests, config/control updates) that close evidenced weaknesses and governance gaps. ## Inputs Required inputs: - Prior skill outputs (1–7 at minimum; include 8/9 if present) - remediation rules (`data/remediation_rules.yaml`) - Optional `human_declarations` (only if provided; do not invent) - IR evidence index (for evidence ids) ## Outputs Produce a remediation pack containing: -
development
# Skill: Production Control ## Purpose Define technical controls that ensure safe, observable, and auditable operation of the model in batch and service environments. This skill turns governance into runtime behavior. ## Inputs Required IR fields: - model interfaces - deployment assumptions - risk tier output Skill data inputs: - monitors.yaml (control patterns and snippets) ## Outputs - Logging and lineage requirements - Monitoring hooks (input/output, drift, failures) - Audit artifacts -
development
# Skill: OPM Tailoring ## Purpose Define ongoing performance monitoring (OPM) metrics and thresholds that are proportional to the model’s risk tier and usage. This skill operationalizes “model performance” in production. ## Inputs Required IR fields: - risk tier output - test outputs (especially metrics) - model usage characteristics Skill data inputs: - thresholds.yaml (default metrics and bands per tier) ## Outputs - Selected monitoring metrics - Thresholds (green/amber/red) - Breach defi