Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

fatih-developer/output-critic

Name: output-critic
Author: fatih-developer

skills/output-critic/SKILL.md

npx skillsauth add fatih-developer/fth-skills output-critic

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Output Critic Protocol

Evaluate the output by type, score each criterion, make an accept/reject decision, and suggest concrete improvements. Goal: prevent weak output from reaching the next step.

Workflow

1. Detect output type
2. Apply type-specific criteria
3. Score each criterion
4. Calculate overall score
5. Make accept / conditional / reject decision
6. Suggest improvements

Acceptance Threshold

| Overall Score | Decision | Action | |---|---|---| | 8-10 | ACCEPT | Proceed | | 6-7 | CONDITIONAL | Apply minor fixes, then proceed | | 0-5 | REJECT | Apply improvements, re-evaluate |

Type-Specific Criteria

Code

| Criterion | Weight | Question | |-----------|--------|----------| | Correctness | 30% | Produces expected output? Handles edge cases? | | Readability | 20% | Meaningful names? Clean indentation? | | Security | 20% | SQL injection? Hardcoded secrets? Unsafe input? | | Performance | 15% | Unnecessary loops? N+1 queries? Memory leaks? | | Testability | 15% | Functions independently testable? |

Report / Written Content

| Criterion | Weight | Question | |-----------|--------|----------| | Accuracy | 30% | Claims supported? Misleading statements? | | Coverage | 25% | All requested topics addressed? Missing sections? | | Clarity | 20% | Target audience can understand? Jargon explained? | | Structure | 15% | Logical flow? Consistent headings? | | Actionability | 10% | Reader knows what to do next? |

Plan / Task List

| Criterion | Weight | Question | |-----------|--------|----------| | Completeness | 30% | All necessary steps present? Critical step missing? | | Atomicity | 25% | Each step does one thing? Overly broad steps? | | Dependency accuracy | 20% | Order makes sense? Circular dependencies? | | Verifiability | 15% | Each step has clear "done" criteria? | | Realism | 10% | Steps are achievable? Overly optimistic estimates? |

Data / Table

| Criterion | Weight | Question | |-----------|--------|----------| | Accuracy | 35% | Numbers consistent? Calculations correct? | | Completeness | 25% | Missing rows/columns? Nulls explained? | | Format consistency | 20% | Units, date formats, currency consistent? | | Readability | 20% | Meaningful headers? Proper sorting? |

Output Format

OUTPUT CRITIC
Type     : [output type]
Decision : ACCEPT / CONDITIONAL / REJECT
Score    : [X/10]

## Criterion Scores

| Criterion | Score | Note |
|-----------|-------|------|
| [Criterion 1] | X/10 | [short note] |
| [Criterion 2] | X/10 | [short note] |
| **Overall** | **X/10** | |

## Strengths
- [What was done well — specific]

## Weaknesses
- [What is missing / wrong — specific]

## Improvement Suggestions
1. [Concrete action — what to do, where]
2. [Concrete action]

## Next Step
[Accept -> proceed | Conditional -> fix X | Reject -> apply suggestions, resubmit]

Re-Evaluation

When improved output is resubmitted:

RE-EVALUATION
Previous score: X/10
New score     : Y/10
Change        : +N points
Improved      : [which criteria]
Still open    : [remaining issues if any]

When to Skip

User said "quick and dirty, doesn't need to be perfect"
Prototype / draft stage (user explicitly stated)
Single-line simple output

Guardrails

Never accept security issues — hardcoded secrets = automatic REJECT regardless of other scores.
Be specific in suggestions — "improve code" is useless; "move API key to env var at line 12" is actionable.
Cross-skill: works with task-decomposer (validates plan quality), output-critic is the quality gate before task completion.

fatih-developer/output-critic

skills/output-critic/SKILL.md

Evaluate every produced output (code, report, plan, data, API response) against type-specific quality criteria, score 1-10, make accept/reject decisions, and provide actionable improvement suggestions. Triggers on 'evaluate', 'check', 'review', 'quality control', 'is this good enough', 'score it', or before passing output to the next step in an agentic workflow.

4 stars

development

Updated Apr 13, 2026

$ install --global

skillsauth

npx skillsauth add fatih-developer/fth-skills output-critic

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 20, 2026, 1:33 PM100.0s1 file scanned

SKILL.md

name:: output-critic
description:: Evaluate every produced output (code, report, plan, data, API response) against type-specific quality criteria, score 1-10, make accept/reject decisions, and provide actionable improvement suggestions. Triggers on 'evaluate', 'check', 'review', 'quality control', 'is this good enough', 'score it', or before passing output to the next step in an agentic workflow.

Output Critic Protocol

Evaluate the output by type, score each criterion, make an accept/reject decision, and suggest concrete improvements. Goal: prevent weak output from reaching the next step.

Workflow

1. Detect output type
2. Apply type-specific criteria
3. Score each criterion
4. Calculate overall score
5. Make accept / conditional / reject decision
6. Suggest improvements

Acceptance Threshold

| Overall Score | Decision | Action | |---|---|---| | 8-10 | ACCEPT | Proceed | | 6-7 | CONDITIONAL | Apply minor fixes, then proceed | | 0-5 | REJECT | Apply improvements, re-evaluate |

Type-Specific Criteria

Code

Report / Written Content

Plan / Task List

Data / Table

Output Format

OUTPUT CRITIC
Type     : [output type]
Decision : ACCEPT / CONDITIONAL / REJECT
Score    : [X/10]

## Criterion Scores

| Criterion | Score | Note |
|-----------|-------|------|
| [Criterion 1] | X/10 | [short note] |
| [Criterion 2] | X/10 | [short note] |
| **Overall** | **X/10** | |

## Strengths
- [What was done well — specific]

## Weaknesses
- [What is missing / wrong — specific]

## Improvement Suggestions
1. [Concrete action — what to do, where]
2. [Concrete action]

## Next Step
[Accept -> proceed | Conditional -> fix X | Reject -> apply suggestions, resubmit]

Re-Evaluation

When improved output is resubmitted:

RE-EVALUATION
Previous score: X/10
New score     : Y/10
Change        : +N points
Improved      : [which criteria]
Still open    : [remaining issues if any]

When to Skip

User said "quick and dirty, doesn't need to be perfect"
Prototype / draft stage (user explicitly stated)
Single-line simple output

Guardrails

Never accept security issues — hardcoded secrets = automatic REJECT regardless of other scores.
Be specific in suggestions — "improve code" is useless; "move API key to env var at line 12" is actionable.
Cross-skill: works with task-decomposer (validates plan quality), output-critic is the quality gate before task completion.

Related Skills

fatih-developer/prompt-crafter

tools

VerifiedTrustedCommunity

Create, optimize, critique, and structure prompts for AI systems. Use this skill whenever the user is designing or improving a prompt, system prompt, coding prompt, image prompt, evaluation rubric, agent prompt, workflow prompt, or MCP-oriented prompt package. Also use it when the user asks to turn vague AI behavior into a precise instruction set, tool policy, agent spec, or prompt architecture.

5SKILL.mdUpdated Jun 4, 2026

fatih-developer/prompt-crafter

fatih-developer/plan-hardener

testing

VerifiedTrustedCommunity

Assumption-first architecture review skill to stress-test project plans and expose hidden risks.

5SKILL.mdUpdated Jun 4, 2026

fatih-developer/plan-hardener

fatih-developer/design-md-enforcer

testing

VerifiedTrustedCommunity

Enforce and manage DESIGN.md specifications, extract design systems from URLs, and combine design reasoning with token roles to prevent drift.

5SKILL.mdUpdated Jun 4, 2026

fatih-developer/design-md-enforcer

fatih-developer/claude-style-coding

testing

VerifiedTrustedCommunity

Forces the agent to act with a Claude-like product mindset, prioritizing user journey, UX states, and visual quality before coding.

5SKILL.mdUpdated Jun 4, 2026

fatih-developer/claude-style-coding

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/fatih-developer/fth-skills.git

# Copy into Claude Code skills folder (global)
cp -r fth-skills/skills/output-critic ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

fatih-developer/fth-skills

4 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT