Overview

This document is the authoritative rubric definition used by the tech-claim-examiner agent to evaluate technical claims in high-depth content section units (예: 문제 해결 / 상세 프로젝트 / 경력 기술서). It does NOT perform evaluation itself — it defines the evaluation contract that the examiner follows.

The rubric consists of:

Five evaluation axes (A1–A5): The core evaluation framework. Each axis produces one of three verdicts (PASS / FAIL / P1). All five axes use an absolute standard. A5 is additionally structure-agnostic.
Two critical authenticity rules (R-Phys, R-Cross): Integrity gates evaluated separately from the axes. Both can trigger automatic REQUEST_CHANGES regardless of A1–A4 verdicts and structural_verdict. Verb-scope inflation (previously a separate rule, retired in v4) is now caught by A4 integrity_suspected sub-flag (see a4-ownership-scope.md).

Evaluation structure:

| Layer | Components | Verdict contribution | |-------|------------|---------------------| | Depth | A1, A2 | Absolute | | Significance | A3, A4 | Absolute | | Presentation | A5 | Absolute (structure-agnostic) | | Integrity | R-Phys, R-Cross | Auto-fail triggers (verb-scope inflation rule retired in v4 — see A4 integrity_suspected) |

Downstream consumers: resume-forge and review-resume skills consume the examiner's output schema (see output-schema.md).

Why 5 Axes (Rationale)

The previous 11-axis system (v1) was retired by agent-council unanimous vote. The core problems with v1:

Over-engineered: Sub-axes (E3a, E3b, sub-dimensions with weighted formulas) added evaluation overhead without improving signal quality. Evaluators spent more time computing scores than assessing claims.
Backend-biased: The v1 depth criteria implicitly favored distributed systems and infrastructure work. Frontend, data, and product-engineering bullets systematically underscored due to structural mismatch.
Structurally rigid: Phase A/B/C routing imposed a fixed evaluation order that created decision-tree overhead. Simple bullets required the same ceremonial path as complex ones.
Catch-22 default-FAIL: The "FAIL unless proven" stance combined with the prohibition on inference created a rubric where legitimately strong bullets failed due to formatting choices rather than substance gaps.

The 5-axis redesign compresses the evaluation into five clear questions:

| Question | Axis | |----------|------| | Does this bullet reveal technical judgment? | A1 | | Is the cause-and-effect logic and arithmetic internally consistent? | A2 | | Is there a stated outcome that matters (tech or business)? | A3 | | Is the ownership claim coherent with the described scope? | A4 | | Can a recruiter extract the key message within 6–30 seconds of scanning? | A5 |

These five questions cover the full evaluation surface without sub-weighting formulas or cascading phase routing.

Anti-pattern → Axis Traceability Matrix

Common resume bullet anti-patterns, the axis that catches them, and the verdict tier produced.

| Anti-pattern | Axis | Verdict | |---|---|---| | Name-only mention (도구 이름만 나열, 메커니즘 없음) | A1 | FAIL or P1 | | Vanity outcome (팀 만족도 향상 등 정량 metric 없는 결과) | A3 | FAIL | | Verb inflation (주도/총괄 + scope marker 없음) | A4 | FAIL or A4 integrity_suspected | | Missing baseline (응답 시간 80% 단축 with no before/after window) | A2 (Rule 1) | P1 (Soft) | | Fuzzy noun outcome (성능 개선 / 처리량 향상 정량화 없음) | A3 (or A2 Rule 6) | P1 | | Offline-as-production (load-test 수치를 production metric으로 표기) | A2 (Rule 3) | FAIL (Hard) | | Arithmetic error (claimed delta math 일치 안 함) | A2 (Rule 1) | FAIL (Hard) |

Five Axes — Quick Reference

| Axis | Standard | One-line | Reference file | |------|----------|----------|----------------| | A1 Technical Credibility | Absolute | 기술적 판단이 드러나는가 (5/5 signals strict) | a1-technical-credibility.md | | A2 Causal Honesty | Absolute | 원인→결과 logic + arithmetic 일관성 | a2-causal-honesty.md | | A3 Outcome Presence & Clarity | Absolute | tech OR business 결과 명시 (so what?) | a3-outcome-presence-clarity.md | | A4 Ownership & Scope | Absolute | 동사-scope coherence (led/built/contributed) | a4-ownership-scope.md | | A5 Scanability | Absolute (structure-agnostic) | 6-30s scan에 핵심 파악 가능 | a5-scanability.md |

Axis Verdicts

Each axis (A1–A5) produces one of three verdicts:

| Verdict | Meaning | |---------|---------| | PASS | The claim satisfies the axis criterion with evidence present in the bullet text | | FAIL | The claim clearly violates the axis criterion; revision required | | P1 | The claim partially satisfies the criterion; improvement recommended but not blocking |

All five axes (A1–A5) use an Absolute standard: the passing bar does not move with experience level. candidate_context.years is referenced only by A4 for scope evaluation, not by A1.

A5 is structure-agnostic: it does not require a specific format (e.g., "action verb + metric + outcome"). It asks only whether the core message is extractable by a scanning reader within 6–30 seconds, regardless of how the bullet is structured.

A1 Evaluation Criteria

Question: Does this bullet reveal technical judgment? (이 bullet에 기술적 판단이 드러나는가?)

PASS — Bullet body explicitly shows all 5 of the following signals:

Constraint awareness — technical constraint to solve (throughput bottleneck, race condition, consistency gap, legacy coupling, cost ceiling, etc.)
Technology selection — a specific system/algorithm/pattern deliberately chosen
Mechanism — how the chosen technology works (partitioning key, memoization strategy, eviction policy, cutover path, etc.)
Trade-off / risk — cost/risk accepted, or rejection-reason for a rejected alternative
Rationale — context-based "why X over Y"

FAIL:

Tool/library name drop only (no rationale)
Outcome metrics only, no selection or mechanism grounding
Generic verbs ("도입", "구축", "개선", "활용") with no what/how/why
Feature noun-phrases ("seamless multi-step flow") masquerading as mechanism

P1 (canonical — a1-technical-credibility.md defers to this): Exactly 4 of 5 signals present (one signal absent), OR all 5 of 5 signals present but at least one signal is at name-level only (no mechanism or rationale behind it). Either condition fails the 5/5 strict PASS bar. Not vacuous enough to FAIL — examiner returns improvement hint targeting the shallowest signal.

Section-wide signal mapping: real-world에서 signal이 sub-bullets로 분산 가능 — section 전체에서 매핑 허용. 단일 bullet line에 모든 signal이 집중되지 않아도 section 전체에서 5 signals 모두 확인되면 PASS 판정 가능 — 단일 bullet line에 집중되지 않아도 허용.

Years are not referenced for A1. Ownership signals belong to A4, not A1.

Authenticity as Critical Rules

Authenticity is NOT a scored axis. It is a set of integrity gates evaluated as critical rules by the examiner. Rules are checked after all five axes are scored, and their triggered status feeds directly into final_verdict.

R-Phys — Physical Impossibility

Trigger condition: A numeric claim is physically or mathematically impossible given the described context.

Examples of R-Phys violations:

"Reduced latency by 50,000%" (percentage improvement cannot exceed 100% of the baseline for latency reduction)
"Increased throughput from 10 RPS to 10,000,000 RPS with a config change" (plausible magnitude would require hardware, not config)

Effect: Automatic REQUEST_CHANGES regardless of A1–A4 verdicts and structural_verdict. The examiner must name the specific number and explain why it is physically incoherent.

R-Cross — Cross-Entry Contradiction

Trigger condition: The claim directly contradicts another entry on the same resume, and both cannot be simultaneously true.

Example: Entry A claims "Designed and implemented the entire payment microservice from scratch." Entry B claims "Contributed to payment microservice API design alongside a team of 8 engineers." These contradict each other on ownership scope.

Applicability: R-Cross는 항상 emit된다. cross-entry context가 제공되지 않은 경우 triggered: false로 기록하고, reasoning에 "cross-entry context not provided"와 같이 absence를 명시한다.

Effect: Automatic REQUEST_CHANGES regardless of A1–A4 verdicts and structural_verdict. The examiner must cite both entries and identify the specific contradiction.

Verb-Scope Inflation — Retired in v4

The verb-scope inflation check (previously a standalone critical rule) is retired as a separate rule in v4. Detection is now handled by the A4 integrity_suspected sub-flag (see a4-ownership-scope.md). Detailed worked examples for R-Phys and R-Cross are in agents/tech-claim-examiner.md.

Evaluation Protocol

The examiner evaluates axes sequentially: A1 → A2 → A3 → A4 → A5. After all five axes, critical rules are checked.

Reasoning-Before-Verdict

For each axis, the examiner MUST:

Write technical reasoning first — what evidence is present, what is absent, what questions the claim raises
Provide an evidence_quote — a direct excerpt from the bullet text supporting the verdict
Assign the verdict last — PASS / FAIL / P1 derived from the reasoning, not assumed upfront

If the reasoning does not support the verdict, the verdict is wrong. Verdict-first reasoning is prohibited.

Critical Rule Invariant

After A1–A4 are scored and A5 emits structural_verdict, the following invariant is applied before generating final_verdict:

IF r_phys.triggered == true OR r_cross.triggered == true
THEN final_verdict = REQUEST_CHANGES
(regardless of A1–A4 verdicts and structural_verdict)

Final Verdict Derivation (when invariant does not apply)

| Condition | final_verdict | |-----------|---------------| | No A1–A4 axis is FAIL AND count(P1 across A1-A4) < 3 AND structural_verdict ∈ {PASS, P1} | APPROVE | | Any of A1–A4 is FAIL | REQUEST_CHANGES | | count(P1 across A1-A4) ≥ 3 | REQUEST_CHANGES | | structural_verdict == FAIL | REQUEST_CHANGES |

Note: A5 verdict는 structural_verdict로 노출된다. structural_verdict == FAIL AND A1-A4 모두 PASS/P1 AND count(P1 across A1-A4) < 3인 경우 final_verdict = REQUEST_CHANGES를 트리거하되, consumer routing은 source-extraction이 아닌 readability-fix lane으로 처리된다. See output-schema.md §A5 Co-failure Disambiguation and a5-scanability.md.

P1 verdicts do not block APPROVE but are surfaced in interview_hints as improvement recommendations.

Output schema details (field names, types, required/optional) are in output-schema.md.

Output Format

The examiner's full output schema is defined in output-schema.md. Key fields:

INTERNAL fields (reasoning trace, not shown to candidates):

| Field | Description | |-------|-------------| | verdicts.a1_* through verdicts.a5_* | Per-axis reasoning, evidence_quote, verdict | | verdicts.a4_ownership_scope.integrity_suspected | A4 verb-scope structural overclaim flag (bool) | | critical_rule_flags.r_phys | triggered (bool), reasoning (string) | | critical_rule_flags.r_cross | triggered (bool), reasoning (string) |

PUBLIC fields (returned to downstream caller):

Canonical definition: output-schema.md §Output Schema. 이 테이블은 downstream consumer를 위한 요약이며 schema 변경 시 반드시 동기 수정.

| Field | Description | |-------|-------------| | schema_version | string — output schema contract version (e.g., v4.0) | | final_verdict | APPROVE or REQUEST_CHANGES | | structural_verdict | PASS/P1/FAIL — A5 axis verdict, readability routing key | | interview_hints | string[] — actionable improvement suggestions |

interview_hints Rules

Written in the same language as the source bullet (Korean bullet → Korean hints)
Do NOT include axis identifiers (A1, A2, etc.) in the hint text — hints are candidate-facing
Each hint is actionable and specific, not generic ("add more technical detail" is prohibited)
P1 verdicts from any axis generate a hint even when final_verdict is APPROVE

Migration from v1 (11-axis)

| v1 concept | v4 equivalent | |------------|---------------| | E1–E6 (depth axes) | A1 + A2 (depth + causal) | | R1–R5 (readability axes) | A5 alone (structure-agnostic) | | Phase A/B/C protocol | Sequential A1→A4 + A5 (structural) + 2 critical rules | | Constraint Cascade Score | A2 causal_honesty + A3 outcome_significance |

All v1 axis tokens (E1–E6, R1–R5, E3b, Constraint Cascade, CASCADING, LISTED, FLAT, Narrative Necessity, Layer 2, Phase A/B/C) are retired. They must not appear in examiner output or downstream skill prompts outside of this migration reference.

Completion Checklist

The examiner verifies all of the following before delivering output:

[ ] A1 Technical Credibility: reasoning written, evidence_quote included, verdict assigned
[ ] A2 Causal Honesty: reasoning written, evidence_quote included, verdict assigned
[ ] A3 Outcome Presence & Clarity: reasoning written, evidence_quote included, verdict assigned
[ ] A4 Ownership & Scope: reasoning written, evidence_quote included, verdict assigned
[ ] A5 Scanability: reasoning written, evidence_quote included, verdict assigned
[ ] R-Phys: triggered status explicitly stated (true / false)
[ ] R-Cross: triggered status explicitly stated (true / false). cross-entry context not provided인 경우 false로 기록하고 reasoning에 absence 명시
[ ] Critical rule invariant applied: if r_phys or r_cross triggered, final_verdict is REQUEST_CHANGES
[ ] interview_hints written in source bullet language, no axis identifiers in hint text
[ ] final_verdict determined and recorded

Overview

The rubric consists of:

Five evaluation axes (A1–A5): The core evaluation framework. Each axis produces one of three verdicts (PASS / FAIL / P1). All five axes use an absolute standard. A5 is additionally structure-agnostic.
Two critical authenticity rules (R-Phys, R-Cross): Integrity gates evaluated separately from the axes. Both can trigger automatic REQUEST_CHANGES regardless of A1–A4 verdicts and structural_verdict. Verb-scope inflation (previously a separate rule, retired in v4) is now caught by A4 integrity_suspected sub-flag (see a4-ownership-scope.md).

Evaluation structure:

Downstream consumers: resume-forge and review-resume skills consume the examiner's output schema (see output-schema.md).

Why 5 Axes (Rationale)

The previous 11-axis system (v1) was retired by agent-council unanimous vote. The core problems with v1:

Over-engineered: Sub-axes (E3a, E3b, sub-dimensions with weighted formulas) added evaluation overhead without improving signal quality. Evaluators spent more time computing scores than assessing claims.
Backend-biased: The v1 depth criteria implicitly favored distributed systems and infrastructure work. Frontend, data, and product-engineering bullets systematically underscored due to structural mismatch.
Structurally rigid: Phase A/B/C routing imposed a fixed evaluation order that created decision-tree overhead. Simple bullets required the same ceremonial path as complex ones.
Catch-22 default-FAIL: The "FAIL unless proven" stance combined with the prohibition on inference created a rubric where legitimately strong bullets failed due to formatting choices rather than substance gaps.

The 5-axis redesign compresses the evaluation into five clear questions:

These five questions cover the full evaluation surface without sub-weighting formulas or cascading phase routing.

Anti-pattern → Axis Traceability Matrix

Common resume bullet anti-patterns, the axis that catches them, and the verdict tier produced.

Five Axes — Quick Reference

Axis Verdicts

Each axis (A1–A5) produces one of three verdicts:

All five axes (A1–A5) use an Absolute standard: the passing bar does not move with experience level. candidate_context.years is referenced only by A4 for scope evaluation, not by A1.

A1 Evaluation Criteria

Question: Does this bullet reveal technical judgment? (이 bullet에 기술적 판단이 드러나는가?)

PASS — Bullet body explicitly shows all 5 of the following signals:

Constraint awareness — technical constraint to solve (throughput bottleneck, race condition, consistency gap, legacy coupling, cost ceiling, etc.)
Technology selection — a specific system/algorithm/pattern deliberately chosen
Mechanism — how the chosen technology works (partitioning key, memoization strategy, eviction policy, cutover path, etc.)
Trade-off / risk — cost/risk accepted, or rejection-reason for a rejected alternative
Rationale — context-based "why X over Y"

FAIL:

Tool/library name drop only (no rationale)
Outcome metrics only, no selection or mechanism grounding
Generic verbs ("도입", "구축", "개선", "활용") with no what/how/why
Feature noun-phrases ("seamless multi-step flow") masquerading as mechanism

Section-wide signal mapping: real-world에서 signal이 sub-bullets로 분산 가능 — section 전체에서 매핑 허용. 단일 bullet line에 모든 signal이 집중되지 않아도 section 전체에서 5 signals 모두 확인되면 PASS 판정 가능 — 단일 bullet line에 집중되지 않아도 허용.

Years are not referenced for A1. Ownership signals belong to A4, not A1.

Authenticity as Critical Rules

R-Phys — Physical Impossibility

Trigger condition: A numeric claim is physically or mathematically impossible given the described context.

Examples of R-Phys violations:

"Reduced latency by 50,000%" (percentage improvement cannot exceed 100% of the baseline for latency reduction)
"Increased throughput from 10 RPS to 10,000,000 RPS with a config change" (plausible magnitude would require hardware, not config)

Effect: Automatic REQUEST_CHANGES regardless of A1–A4 verdicts and structural_verdict. The examiner must name the specific number and explain why it is physically incoherent.

R-Cross — Cross-Entry Contradiction

Trigger condition: The claim directly contradicts another entry on the same resume, and both cannot be simultaneously true.

Effect: Automatic REQUEST_CHANGES regardless of A1–A4 verdicts and structural_verdict. The examiner must cite both entries and identify the specific contradiction.

Verb-Scope Inflation — Retired in v4

Evaluation Protocol

The examiner evaluates axes sequentially: A1 → A2 → A3 → A4 → A5. After all five axes, critical rules are checked.

Reasoning-Before-Verdict

For each axis, the examiner MUST:

Write technical reasoning first — what evidence is present, what is absent, what questions the claim raises
Provide an evidence_quote — a direct excerpt from the bullet text supporting the verdict
Assign the verdict last — PASS / FAIL / P1 derived from the reasoning, not assumed upfront

If the reasoning does not support the verdict, the verdict is wrong. Verdict-first reasoning is prohibited.

Critical Rule Invariant

After A1–A4 are scored and A5 emits structural_verdict, the following invariant is applied before generating final_verdict:

IF r_phys.triggered == true OR r_cross.triggered == true
THEN final_verdict = REQUEST_CHANGES
(regardless of A1–A4 verdicts and structural_verdict)

Final Verdict Derivation (when invariant does not apply)

Note: A5 verdict는 structural_verdict로 노출된다. structural_verdict == FAIL AND A1-A4 모두 PASS/P1 AND count(P1 across A1-A4) < 3인 경우 final_verdict = REQUEST_CHANGES를 트리거하되, consumer routing은 source-extraction이 아닌 readability-fix lane으로 처리된다. See output-schema.md §A5 Co-failure Disambiguation and a5-scanability.md.

P1 verdicts do not block APPROVE but are surfaced in interview_hints as improvement recommendations.

Output schema details (field names, types, required/optional) are in output-schema.md.

Output Format

The examiner's full output schema is defined in output-schema.md. Key fields:

INTERNAL fields (reasoning trace, not shown to candidates):

PUBLIC fields (returned to downstream caller):

Canonical definition: output-schema.md §Output Schema. 이 테이블은 downstream consumer를 위한 요약이며 schema 변경 시 반드시 동기 수정.

interview_hints Rules

Written in the same language as the source bullet (Korean bullet → Korean hints)
Do NOT include axis identifiers (A1, A2, etc.) in the hint text — hints are candidate-facing
Each hint is actionable and specific, not generic ("add more technical detail" is prohibited)
P1 verdicts from any axis generate a hint even when final_verdict is APPROVE

Migration from v1 (11-axis)

Completion Checklist

The examiner verifies all of the following before delivering output:

[ ] A1 Technical Credibility: reasoning written, evidence_quote included, verdict assigned
[ ] A2 Causal Honesty: reasoning written, evidence_quote included, verdict assigned
[ ] A3 Outcome Presence & Clarity: reasoning written, evidence_quote included, verdict assigned
[ ] A4 Ownership & Scope: reasoning written, evidence_quote included, verdict assigned
[ ] A5 Scanability: reasoning written, evidence_quote included, verdict assigned
[ ] R-Phys: triggered status explicitly stated (true / false)
[ ] R-Cross: triggered status explicitly stated (true / false). cross-entry context not provided인 경우 false로 기록하고 reasoning에 absence 명시
[ ] Critical rule invariant applied: if r_phys or r_cross triggered, final_verdict is REQUEST_CHANGES
[ ] interview_hints written in source bullet language, no axis identifiers in hint text
[ ] final_verdict determined and recorded

Adoption

toongri/skills/tech-claim-rubric

$ install --global

Security Scan Results

SKILL.md

Overview

Why 5 Axes (Rationale)

Anti-pattern → Axis Traceability Matrix

Five Axes — Quick Reference

Axis Verdicts

A1 Evaluation Criteria

Authenticity as Critical Rules

R-Phys — Physical Impossibility

R-Cross — Cross-Entry Contradiction

Verb-Scope Inflation — Retired in v4

Evaluation Protocol

Reasoning-Before-Verdict

Critical Rule Invariant

Final Verdict Derivation (when invariant does not apply)

Output Format

interview_hints Rules

Migration from v1 (11-axis)

Completion Checklist

Related Skills

toongri/manage-ticket

toongri/goal

toongri/pin-wrap-up

toongri/pin-setup

toongri/skills/tech-claim-rubric

$ install --global

Security Scan Results

SKILL.md

Overview

Why 5 Axes (Rationale)

Anti-pattern → Axis Traceability Matrix

Five Axes — Quick Reference

Axis Verdicts

A1 Evaluation Criteria

Authenticity as Critical Rules

R-Phys — Physical Impossibility

R-Cross — Cross-Entry Contradiction

Verb-Scope Inflation — Retired in v4

Evaluation Protocol

Reasoning-Before-Verdict

Critical Rule Invariant

Final Verdict Derivation (when invariant does not apply)

Output Format

interview_hints Rules

Migration from v1 (11-axis)

Completion Checklist

Related Skills

toongri/manage-ticket

toongri/goal

toongri/pin-wrap-up

toongri/pin-setup