skills/25-HosungYou-Diverga/skills/c5/SKILL.md
Meta-Analysis Master with Data Integrity, Effect Size, Error Prevention & Sensitivity Multi-gate validation and workflow orchestration for meta-analysis. Absorbed C6 (Data Integrity Guard), C7 (Error Prevention Engine), B3 (Effect Size Extractor), E5 (Sensitivity Analysis - Meta) capabilities Triggers: meta-analysis, pooled effect, heterogeneity, forest plot, funnel plot, Hedges g, data integrity, effect size extraction, sensitivity analysis
npx skillsauth add brycewang-stanford/Awesome-Agent-Skills-for-Empirical-Research c5Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
diverga_check_prerequisites("c5") → must return approved: true
If not approved → AskUserQuestion for each missing checkpoint (see .claude/references/checkpoint-templates.md)
diverga_mark_checkpoint("CP_ANALYSIS_PLAN", decision, rationale)Read .research/decision-log.yaml directly to verify prerequisites. Conversation history is last resort.
Orchestrate complete meta-analysis workflows with multi-gate validation. This agent owns gate progression decisions and coordinates other agents (B2, B3, C6, C7) throughout the meta-analysis pipeline.
C5 is the decision authority for meta-analysis workflows:
Activate C5-MetaAnalysisMaster when user mentions:
┌─────────────────────────────────────────────────────────────┐
│ GATE VALIDATION PIPELINE │
├─────────────────────────────────────────────────────────────┤
│ Gate 1: EXTRACTION VALIDATION │
│ - Required fields present (Study_ID, ES_ID, Outcome_Name) │
│ - Data completeness score ≥ Tier 2 threshold (40%) │
│ - No duplicate ES_IDs │
├─────────────────────────────────────────────────────────────┤
│ Gate 2: CLASSIFICATION VALIDATION │
│ - ES type classified (post-test, ANCOVA, change, pre-post)│
│ - ES hierarchy enforced (post-test > ANCOVA > change) │
│ - Multiple ES from same study: use highest priority │
├─────────────────────────────────────────────────────────────┤
│ Gate 3: STATISTICAL VALIDATION │
│ - Hedges' g calculated or calculable │
│ - SE_g available or calculable │
│ - Values within reasonable range (|g| ≤ 3.0) │
├─────────────────────────────────────────────────────────────┤
│ Gate 4: INDEPENDENCE VALIDATION │
│ - 4a: Temporal Classification (NO pre-test outcomes) │
│ - 4b: Study Independence (no double-counting) │
│ - 4c: Effect Independence (handle dependent ES) │
└─────────────────────────────────────────────────────────────┘
| Phase | Name | Entry Criteria | Exit Criteria | Calls | |-------|------|----------------|---------------|-------| | 1 | Study Selection | Search terms defined | Eligible studies identified | B1 | | 2 | Data Extraction | PDFs available | All ES extracted | B3, C6 | | 3 | Effect Size Calc | Raw data available | Hedges' g computed | C6 | | 4 | Quality Assessment | ES computed | Risk of bias rated | B2, C7 | | 5 | Analysis Execution | Data validated | Model results | - | | 6 | Sensitivity | Primary analysis done | Robustness checked | - | | 7 | Reporting | All analyses done | PRISMA diagram | - |
When multiple effect sizes are available from the same study-outcome:
| Priority | ES Type | Use When | Code |
|----------|---------|----------|------|
| 1 (Best) | Post-test between-groups | Control group exists | POST_BETWEEN |
| 2 | ANCOVA-adjusted | Pre-test as covariate | ANCOVA |
| 3 | Change score | No between-group post | CHANGE |
| 4 (Last) | Single-group pre-post | No control group | PRE_POST |
| NEVER | Pre-test as outcome | - | PRE_TEST → REJECT |
| Parameter | Threshold | Action | |-----------|-----------|--------| | |g| > 3.0 | Anomaly | Flag for human review | | |g| > 5.0 | Extreme outlier | Auto-exclude with log | | Data completeness < 40% | Tier 3 | STOP: Human review required | | Missing Hedges' g > 30% | High | Trigger C6 SD recovery | | Pre-test pattern detected | - | Auto-REJECT |
effect_size_record:
Study_ID: str # Required
ES_ID: str # Required
Outcome_Name: str # Required
M_Treatment: float # Optional
SD_Treatment: float # Optional
n_Treatment: int # Optional
M_Control: float # Optional
SD_Control: float # Optional
n_Control: int # Optional
validated_effect_size:
Study_ID: str
ES_ID: str
Outcome_Name: str
ES_Type: str # POST_BETWEEN, ANCOVA, CHANGE, PRE_POST
Hedges_g: float
SE_g: float
Data_Tier: int # 1, 2, or 3
Gates_Passed: list[str]
Validation_Notes: str
def handle_gate_failure(gate_id, record, reason):
if gate_id == "4a": # Pre-test
action = "REJECT" # Always reject pre-test
elif record.Data_Tier == 3:
action = "HUMAN_REVIEW"
elif anomaly_severity == "extreme":
action = "REJECT"
else:
action = "FLAG_AND_CONTINUE"
log_decision(gate_id, record, reason, action)
return action
Automatic rollback to previous phase if:
50% of records fail any single gate
| Checkpoint | Trigger | Requires |
|------------|---------|----------|
| META_TIER3_REVIEW | Any Tier 3 data | Confirm include/exclude |
| META_ANOMALY_REVIEW | |g| > 3.0 | Verify or exclude |
| META_PRETEST_CONFIRM | Ambiguous pre/post | Classify temporality |
| META_MULTIGROUP_CHOICE | Multiple ES available | Select ES to use |
User: "메타분석을 위해 추출된 효과크기를 검증해 줘"
C5 Response:
1. [PHASE 2 CHECK] Data extraction completeness
- Calling C6-DataIntegrityGuard for completeness report
2. [GATE 1] Extraction Validation
- 365 records submitted
- 3 records missing Study_ID → REJECT
- 362 records pass Gate 1
3. [GATE 2] Classification Validation
- ES type assigned to 362 records
- 10 records classified as PRE_TEST → flagged for Gate 4a
4. [GATE 3] Statistical Validation
- C6 reports: 243 have Hedges_g, 119 missing
- Missing > 30% → Triggering C6 SD recovery
- After recovery: 275 have Hedges_g (75.9%)
- 5 records with |g| > 3.0 → flagged for review
5. [GATE 4a] Temporal Classification
- C7 advisory: "10 records match pre-test pattern"
- C5 decision: REJECT 10 pre-test records
- Final validated: 265 effect sizes
[CHECKPOINT] META_ANOMALY_REVIEW triggered for 5 records
Waiting for human confirmation...
C5 owns the final validation phase of the Universal Codebook workflow:
def validate_final(verified_data, require_all_verified=True, require_all_signed_off=True):
"""
Final validation before dataset is ready for analysis.
Used in Phase 4 of Universal Codebook workflow.
Returns:
{status, issues, can_proceed}
"""
issues = []
# Check verification status
pending_count = sum(1 for r in verified_data if r["verified_status"] == "PENDING")
if pending_count > 0 and require_all_verified:
issues.append({
"type": "VERIFICATION_INCOMPLETE",
"count": pending_count,
"message": f"{pending_count} records still PENDING verification"
})
# Check sign-off
unsigned_count = sum(1 for r in verified_data if not r.get("sign_off", False))
if unsigned_count > 0 and require_all_signed_off:
issues.append({
"type": "SIGNOFF_INCOMPLETE",
"count": unsigned_count,
"message": f"{unsigned_count} records missing sign-off"
})
# Run gate validation on verified data
for record in verified_data:
gate_results = run_all_gates(record)
if not all(gate_results.values()):
failed_gates = [g for g, passed in gate_results.items() if not passed]
issues.append({
"type": "GATE_FAILURE",
"es_id": record["es_id"],
"failed_gates": failed_gates
})
return {
"status": "APPROVED" if not issues else "BLOCKED",
"issues": issues,
"can_proceed": len(issues) == 0,
"summary": {
"total_records": len(verified_data),
"verified": len(verified_data) - pending_count,
"signed_off": len(verified_data) - unsigned_count,
"gates_passed": len(verified_data) - len([i for i in issues if i["type"] == "GATE_FAILURE"])
}
}
def run_all_gates(record):
"""Run all 4 gates on a single record."""
return {
"gate_1_extraction": validate_gate_1(record),
"gate_2_classification": validate_gate_2(record),
"gate_3_statistical": validate_gate_3(record),
"gate_4_independence": validate_gate_4(record)
}
def orchestrate_codebook_workflow(pdf_folder, project_name):
"""
Full Universal Codebook workflow orchestration.
Phases:
1. AI Extraction (C6)
2. Triage (C7)
3. Human Review (Manual, generates queue)
4. Final Validation (C5)
"""
# Phase 1: AI Extraction
print(f"[PHASE 1] Starting AI extraction from {pdf_folder}")
extraction_result = c6.extract_with_provenance(
pdf_folder=pdf_folder,
methods=["rag", "ocr"],
reconciliation="hierarchy"
)
print(f" Extracted: {len(extraction_result)} records")
# Phase 2: Triage
print("[PHASE 2] Triaging extractions")
triage_result = c7.triage_extractions(extraction_result)
queue = c7.generate_review_queue(triage_result)
print(f" Review queue: {len(queue)} records need review")
print(f" Priority 1 (conflicts): {sum(1 for q in queue if q['priority'] == 1)}")
print(f" Priority 2 (low conf): {sum(1 for q in queue if q['priority'] == 2)}")
# Phase 3: Human Review
print("[PHASE 3] Generating review queue for human reviewers")
export_review_queue(queue, f"{project_name}_review_queue.xlsx")
print(" Queue exported. Waiting for human verification...")
# Return queue for human review
return {
"status": "AWAITING_HUMAN_REVIEW",
"extraction_result": extraction_result,
"triage_result": triage_result,
"review_queue": queue,
"next_step": "Complete human verification, then call c5.validate_final()"
}
| Code | Message | Action |
|------|---------|--------|
| C5_GATE1_FAIL | Missing required field: {field} | Reject record |
| C5_GATE2_NOTYPE | Cannot classify ES type | Flag for review |
| C5_GATE3_NOCALC | Cannot calculate Hedges' g | Trigger SD recovery |
| C5_GATE4A_PRETEST | Pre-test outcome detected | Auto-reject |
| C5_ANOMALY | Extreme value detected: g={value} | Human review |
| C5_TIER3 | Data completeness below 40% | Human review required |
| C5_VERIFY_INCOMPLETE | Records still PENDING verification | Block final |
| C5_SIGNOFF_MISSING | Records missing sign-off | Block final |
development
Conduct rigorous thematic analysis (TA) of qualitative data following Braun and Clarke's (2006) six-phase framework. Use whenever the user mentions 'thematic analysis', 'TA', 'Braun and Clarke', 'qualitative coding', 'identifying themes', or asks for help analysing interviews, focus groups, open-ended survey responses, or transcripts to identify patterns. Also trigger for questions about inductive vs theoretical coding, semantic vs latent themes, essentialist vs constructionist epistemology, building a thematic map, or writing up a qualitative findings section. Covers all six phases, the four upfront analytic decisions, the 15-point quality checklist, and the five common pitfalls. Produces a Word document write-up and an annotated thematic map. Does NOT cover IPA, grounded theory, discourse analysis, conversation analysis, or narrative analysis — use a different method for those.
development
Guide users through writing a systematic literature review (SLR) following the PRISMA 2020 framework. Use this skill whenever the user mentions 'systematic review', 'systematic literature review', 'SLR', 'PRISMA', 'PRISMA 2020', 'PRISMA flow diagram', 'PRISMA checklist', or asks for help writing, structuring, or auditing a literature review that follows reporting guidelines. Also trigger when the user asks about inclusion/exclusion criteria for a review, search strategies for databases like Scopus/WoS/PubMed, study selection processes, risk of bias assessment, or narrative synthesis for a review paper. This skill covers the full PRISMA 2020 checklist (27 items), produces a Word document manuscript in strict journal article format, generates an annotated PRISMA flow diagram, and enforces APA 7th Edition referencing throughout. It does NOT cover meta-analysis or statistical pooling. By Chuah Kee Man.
testing
Performs placebo-in-time sensitivity analysis with hierarchical null model and optional Bayesian assurance. Use when checking model robustness, verifying lack of pre-intervention effects, or estimating study power.
data-ai
Fit, summarize, plot, and interpret a chosen CausalPy experiment. Use after the causal method has been selected, including when configuring PyMC/sklearn models and scale-aware custom priors.