skills/team/python-architecture-checklist/SKILL.md
Checklist executor for Python 3 architecture reviews. Detects the Python version, framework (FastAPI/Django/Flask), and package layout, then runs a systematic checklist covering clean-architecture boundaries, type-safety discipline, complexity, dependency direction, and config/secrets hygiene — producing a graded report with file:line evidence. Use to review or grade an existing Python codebase. Triggers on "review this python project", "python architecture checklist", "audit python code", "evaluate python codebase", "python code review", "grade this python architecture", "check python patterns". Do NOT use for a Socratic design critique — use architecture-review. Do NOT use for a security audit — use python-security-review. Do NOT use to write new code test-first — use tdd.
npx skillsauth add michaelalber/ai-toolkit python-architecture-checklistInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
"A checklist cannot fly a plane, but a pilot cannot fly safely without one." — Atul Gawande
Shared across the dotnet / python / php / rust architecture checklists — same values, language-specific checks.
| # | Value | What it means |
|---|-------|---------------|
| 1 | Detect before judge | Determine version/framework/structure before applying any item; context decides what is idiomatic. |
| 2 | Evidence over opinion | Every finding cites file:line and the offending pattern. "Overuses globals" is not a finding; "config.py:12 mutates a module-level global" is. |
| 3 | Feature cohesion | Organized by business capability, not technical layer. Cross-feature coupling is a violation. |
| 4 | Dependencies point inward | Domain logic does not import infrastructure (DB, HTTP, filesystem). Boundaries are explicit. |
| 5 | Explicit error handling | Failures handled at the right layer; no silent swallowing; errors carry diagnostic context. |
| 6 | Config & secrets hygiene | No hardcoded secrets; configuration injected, not reached for globally; secrets from env / secret-manager. |
| 7 | Version awareness | Recommendations are gated to the detected version; never suggest an API that does not exist there. |
| 8 | Tests gate change | Untested code is a finding; high-risk modules without tests are prioritized. |
| 9 | Graded, actionable output | A letter grade (A–F) from counted findings, plus prioritized, version-correct recommendations. |
Shared skeleton: DETECT → SCAN → REPORT → RECOMMEND.
DETECT Python version (pyproject.toml / .python-version), framework (FastAPI/Django/Flask),
layout (src/ vs flat, packages), and whether a test suite exists. Record findings; if a
version cannot be determined, ask — never assume 3.13.
SCAN Run the Python Checklist below section by section. Gather evidence with tooling:
ruff check . # style + lint + complexity drift
mypy <pkg> # type completeness and correctness
radon cc -s <pkg> # cyclomatic complexity hot spots
radon mi <pkg> # maintainability index
Every violation becomes a finding with file:line and a severity (critical/high/medium/low).
REPORT Emit the graded report (Output Template). Grade = function of counted findings.
RECOMMEND Prioritize: critical → quick wins → modernization. Version-gate every recommendation;
never suggest an API absent from the detected Python version.
| # | Check | Severity |
|---|-------|----------|
| 1 | Clean boundaries — domain modules import zero infrastructure; boundaries use typing.Protocol; no circular imports | Critical |
| 2 | Type safety — all public functions fully annotated; mypy clean; no bare # type: ignore (each needs a reason) | High |
| 3 | YAGNI / Rule of Three — abstractions justified by ≥3 concrete uses; no speculative base classes or plugin layers | High |
| 4 | Complexity — radon cc grade ≥ B per function; maintainability index ≥ 70; no god-modules | High |
| 5 | Dependency discipline — versions pinned in pyproject.toml (not setup.py); stdlib preferred; each dep justified | Medium |
| 6 | Async discipline — async def for I/O-bound work; no blocking calls (time.sleep, sync I/O) inside async; ProcessPoolExecutor for CPU-bound | High |
| 7 | Resource lifecycle — file handles / sessions / connections via context managers, not manual open/close | Medium |
| 8 | Config & secrets — settings via pydantic BaseSettings; secrets from env; no hardcoded credentials | Critical |
| 9 | Test coverage — behavioral tests exist for business logic; AAA; high-risk untested modules flagged | High |
Tooling config: ruff · mypy. Full section-by-section list: review checklist.
<arch-checklist-state>
language: python
mode: DETECT | SCAN | REPORT | RECOMMEND | COMPLETE
detected: [python 3.x | framework | layout | tests:yes/no]
issues_found: [critical:N high:N medium:N low:N]
last_action: [what was just done]
next_action: [what should happen next]
</arch-checklist-state>
Shared across all four architecture checklists.
## Architecture Checklist: [project] (Python)
**Version**: [3.x] | **Framework**: [FastAPI/Django/Flask/none] | **Tests**: [yes/no]
| Section | Pass | Fail | Warn |
|---------|------|------|------|
| Boundaries / Types / Complexity / Deps / Async / Config / Tests | … | … | … |
### Grade: [A–F]
Grading: **A** 0 crit/0 high/≤3 med · **B** 0 crit/≤2 high · **C** 0 crit, gaps in one area ·
**D** 1+ crit · **F** fundamental problems (no boundaries, secrets in code, untyped public API).
| Severity | Location | Finding | Recommendation |
|----------|----------|---------|----------------|
| CRITICAL | file:line | [pattern] | [version-gated fix] |
**Quick wins**: [low-effort, high-impact] · **Modernization**: [larger items with effort estimate]
match statements or tomllib to a 3.9 project produces invalid findings.file:line. Never grade on vibes.python-security-review — note them and route there.tdd drives the fix.architecture-review — When the grade is D/F, escalate to the Socratic critic: this checklist finds what is wrong; architecture-review builds why.python-security-review — Companion for the security dimension (OWASP, bandit, pip-audit).python-feature-slice — Correct-pattern reference when the checklist flags structural violations.tdd — Methodology for adding tests the checklist flags as missing, and for driving any refactor.dotnet / php / rust-architecture-checklist — Sibling skills sharing this exact Core Values + workflow + output.development
Federal / government security overlay applied ON TOP OF a base language security review (dotnet/python/php/rust/react). Language-agnostic: adds NIST SP 800-53 control mapping, FIPS 140-2/3 cryptographic compliance (with a per-language crypto table), CUI handling, EO 14028 supply-chain requirements, and DOE Order 205.1B, and emits POA&M-ready findings with FIPS 199 impact levels. Use for federal/DOE/DOD/national-laboratory systems. Triggers on "federal security review", "NIST compliance", "NIST 800-53", "FISMA", "CUI", "FIPS audit", "DOE security", "POA&M", "ATO review". Do NOT use alone — run the matching <lang>-security-review FIRST; this overlay maps and extends it.
tools
OWASP-based security review of React / TypeScript front-end applications. Detects the framework (Vite/CRA/Next), entry points, and data flows, scans against the OWASP Top 10 (2025) mapped to React client-side patterns (XSS via raw HTML, URL/protocol injection, secrets in the bundle, insecure token storage, dependency CVEs, missing CSP, open redirects), and produces a manager-friendly executive summary plus a graded technical findings table. Use to audit React code for vulnerabilities. Triggers on "react security review", "frontend security audit", "audit react for vulnerabilities", "owasp react", "react xss", "react security posture", "npm audit review". For federal / gov / DOE / NIST / FIPS / CUI context, run security-review-federal after this base review. Do NOT use to grade architecture/structure — use react-architecture-checklist.
tools
Analyzes legacy React codebases and produces actionable modernization plans. Primary migration paths include class components to function components + hooks, Create React App to Vite, React 16/17 to 18 to 19, JavaScript to TypeScript, Enzyme to React Testing Library, legacy Redux to Redux Toolkit / Zustand / Context, and deprecated lifecycle/API removal. Does NOT perform the migration — assesses, quantifies risk, and plans. Triggers on phrases like "modernize react", "class to hooks", "upgrade react", "migrate CRA to vite", "react legacy migration", "react 17 to 18", "react js to typescript", "react technical debt", "enzyme to RTL".
development
Scaffolds feature-based React / TypeScript architecture using feature folders, presentational + container components, custom hooks, a typed data layer, and structural CQRS (query hooks vs mutation hooks). React analog of dotnet-vertical-slice and python-feature-slice — no DI framework; uses props/context for dependency injection and a query cache for server state. Use when creating feature-based React projects, adding React features, organizing components by feature rather than by technical type, or scaffolding a feature's data layer. Triggers on phrases like "scaffold react feature", "create react slice", "react feature folder", "react vertical slice", "add react feature", "react feature architecture", "organize react by feature".