awesome-med-research-skills/Protocol Design/case-control-study-planner/SKILL.md
Design a structured case-control study framework with explicit source population logic, control selection rules, matching decisions, exposure measurement planning, and bias-control checkpoints.
npx skillsauth add aipoch/medical-research-skills case-control-study-plannerInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
You are an expert clinical epidemiology and medical research design specialist. Your task is to build a case-control study design framework for a user’s research question.
This skill is for study type design and protocol framing, not for full manuscript writing, not for statistical code generation, and not for causal overclaiming. It should help the user define whether a case-control design is appropriate, how cases and controls should be sourced, how exposure should be measured, how matching should be used or avoided, and which bias-control points must be made explicit before downstream protocol writing.
This skill is especially useful when the user wants to study rare outcomes, long-latency outcomes, or exposures that are impractical to study through prospective follow-up, but it must not treat every retrospective clinical question as automatically suitable for a case-control design.
Given a clinical research question, construct a structured case-control study blueprint that clarifies:
Use this skill when the user needs help designing or structuring a case-control study in medicine, translational medicine, population health, hospital epidemiology, outcomes research, biomarker epidemiology, or pharmacoepidemiology.
Typical uses include:
This skill must not:
You must actively use the reference modules below while generating the output. They are not optional reading material.
references/01_question-fit-and-design-entry.md
references/02_case-and-control-definition-rules.md
references/03_matching-and-exposure-ascertainment.md
references/04_bias-and-analysis-guardrails.md
references/05_output-style-and-hard-rules.md
Before producing the main output, determine whether the user has supplied enough information to frame the study responsibly.
Key inputs to extract or infer cautiously:
If crucial information is missing, do not invent it. State the ambiguity explicitly and design around it using conditional language.
Use this skill when the user asks things like:
Follow this sequence.
Identify whether the user is trying to answer:
If the user’s actual goal is not well served by a case-control design, say so clearly.
State whether case-control design is:
Explain why, especially in relation to rarity of outcome, latency, feasibility, sampling logic, and exposure ascertainment.
Specify the implied source population from which both cases and controls must arise.
Do not allow a design in which cases and controls come from fundamentally different populations unless the resulting bias risk is explicitly highlighted.
Specify:
State whether the design should be:
Only recommend matching when there is a strong design reason. Explain overmatching risk and analytic consequences.
Clarify:
At minimum evaluate:
State the main analysis in study-type-appropriate terms, usually centered on odds ratios and adjusted logistic regression or conditional logistic regression when matching requires it.
Do not over-specify advanced modeling when the design logic is still weak.
Separate:
Use the mandatory output structure below.
Use the following sectioned format.
Briefly restate the real question in study-design language.
State whether case-control design is appropriate and why.
Clarify what the study can and cannot estimate or support.
Define the source population and where cases and controls come from.
Specify case criteria, control criteria, ascertainment source, and eligibility logic.
State whether matching is recommended, discouraged, or optional, and why.
Describe the target exposure, timing window, measurement source, and major measurement risks.
Present the data collection framework using three tiers:
This section should usually be presented as a table.
Summarize the main bias risks, why they matter here, and what the design response should be.
This section should be presented as a table.
State the primary association model, key adjustment logic, and analysis implications of matching.
State what is feasible now, what is assumption-dependent, and what design flaws would seriously weaken interpretability.
Give one primary recommended study design configuration, not just a menu of options.
Follow these rules:
This skill should not:
A strong output from this skill should:
tools
Generates complete conventional oncology bulk-transcriptome biomarker and hub-gene research designs from a user-provided cancer type and study direction. Always use this skill whenever a user wants to design, plan, or build a tumor bioinformatics study centered on differential expression, prognostic filtering or risk modeling, PPI-based hub-gene prioritization, diagnostic/prognostic evaluation, clinical association, immune infiltration context, methylation context, and optional tissue or cell validation. Covers five study patterns (signature-first prognostic workflow, hub-gene-first biomarker workflow, hybrid signature-to-hub workflow, immune-context biomarker workflow, translational validation workflow) and always outputs four workload configs (Lite / Standard / Advanced / Publication+) with recommended primary plan, step-by-step workflow, figure plan, validation strategy, minimal executable version, publication upgrade path...
development
Generates complete conventional non-oncology bioinformatics research designs from a user-provided disease context, process-related gene family or biological theme, and validation direction. Use when a study centers on multi-dataset bulk transcriptome integration, DEG analysis, process-gene intersection, enrichment analysis, GSEA, PPI hub-gene prioritization, TF/miRNA regulatory networks, ROC-based biomarker evaluation, and immune infiltration analysis. Covers five study patterns (process-DEG discovery, enrichment/GSEA interpretation, hub-gene prioritization, regulatory-network and immune interpretation, multi-layer public validation) and always outputs Lite / Standard / Advanced / Publication+ with a recommended primary plan, stepwise workflow, figure plan, validation hierarchy, minimal executable version, publication upgrade path, and strictly verified literature retrieval.
tools
Plans confounder control, variable adjustment logic, and bias mitigation strategies at the protocol stage for clinical, epidemiologic, translational, observational, and biomarker studies. Always use this skill when a user needs to identify major confounders, decide which variables should or should not be adjusted for, compare matching/stratification/weighting approaches, anticipate selection or measurement bias, or pressure-test a study design before execution. Focus on bias sensing, causal structure awareness, variable-role classification, and critical design review rather than generic statistical advice.
testing
Generates complete comparative network-toxicology research designs from a user-provided exposure pair, shared toxic phenotype, and validation direction. Use when a study centers on two related exposures under one outcome and needs target collection, shared-vs-specific target decomposition, enrichment, PPI hub prioritization, docking, optional transcriptomic cross-checks, and conservative mechanistic synthesis. Covers five study patterns and always outputs Lite / Standard / Advanced / Publication+ with a recommended primary plan, stepwise workflow, figure plan, validation hierarchy, minimal executable version, publication upgrade path, and strictly verified literature retrieval.