awesome-med-research-skills/Evidence Insight/evidence-level-ranker/SKILL.md
Ranks papers by evidence family, methodological quality tier, validation depth, and claim discipline; assigns anchor, context-setting, mechanistic support, or caution citation roles; prevents prestige-based or design-label-based ranking errors.
npx skillsauth add aipoch/medical-research-skills evidence-level-rankerInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Use this skill to rank papers by evidence strength, methodological quality, and citation priority within one explicit comparison framework.
This skill should identify what kind of evidence each paper provides, how much methodological trust it deserves, how much validation or corroboration it contains, and whether it should be treated as a high-priority anchor citation, context-setting citation, mechanistic support citation, or low-priority / caution citation.
This skill must not equate study design labels with true evidentiary value automatically. A meta-analysis is not automatically decisive, an RCT is not automatically well-conducted, a cohort is not automatically weak, and a mechanism study is not automatically non-informative. The skill must rank literature based on the combination of design family, execution quality, validation depth, bias control, and claim discipline.
This skill is especially useful when the user needs to:
Use reference modules as execution dependencies, not decoration.
references/evidence-family-taxonomy.md → use when identifying study design family in Step 2.references/methodological-quality-audit-rules.md → use when assessing execution quality in Step 3.references/validation-depth-rules.md → use when judging internal vs. external vs. orthogonal validation in Step 4.references/claim-discipline-rules.md → use when separating what a paper shows from what it claims in Step 5.references/citation-priority-rules.md → use when assigning citation roles in Step 6.references/cross-design-ranking-framework.md → use when comparing papers across different evidence families in Steps 6–7.references/literature-integrity-rules.md → governs all citation handling and evidence statement accuracy in Section J.references/output-section-guidance.md → enforces section-level output format for Sections A–J.references/workflow-step-template.md → structures the workflow explanation.If the paper set includes mixed evidence families, this skill should explicitly use all relevant modules rather than collapsing all papers into one generic score.
This skill accepts: one paper, a set of papers, or a literature shortlist for evidence-strength ranking and citation-priority assignment.
If the user's request does not involve ranking papers by evidence quality — for example, asking to write a literature review, retrieve papers, summarize clinical guidelines, or make treatment recommendations — do not proceed with the ranking pipeline. Instead respond:
"Evidence Level Ranker is designed to rank a provided set of papers by evidence family, methodological quality, validation depth, and claim discipline, and to assign citation roles. Your request ([restatement]) appears to be outside this scope. Please provide the set of papers you want ranked, or use a more appropriate tool."
Before ranking, confirm what the user is actually asking to compare.
Required or strongly preferred inputs:
If the input is incomplete, this skill should still proceed by ranking based on the available materials, but it must label major uncertainty sources explicitly.
This skill should distinguish between:
The core function of this skill is to convert a mixed literature set into a transparent evidence ranking, with each paper positioned on four linked but non-identical dimensions:
This skill should rank papers comparatively, but it must also explain why each paper occupies its position. Rankings without explicit reasoning are incomplete.
Determine whether the papers address:
Do not force a false apples-to-apples comparison when papers are serving different evidentiary purposes.
For each paper, identify the actual study design using methods, not just the authors’ self-description.
Separate:
Do not confuse data source, assay type, model type, or platform type with study design.
Assess how strong the methods actually are.
Review at least these dimensions when relevant:
Do not use statistical significance or reported effect size magnitude as evidence of good methods. Assess sampling logic, bias control, and study design independently of reported p-values. A p < 0.05 result in a poorly executed study is not evidence of methodological strength.
A higher-level design should not be ranked highly if execution is weak.
Assess how much the results are supported beyond the initial finding.
Distinguish:
Do not over-credit repeated analysis of closely related datasets as if it were independent validation.
Check whether the paper’s conclusions stay inside the evidence boundary.
Flag overclaim patterns such as:
A paper with good methods but overextended conclusions should lose citation priority for strong claims.
For each paper, assign all of the following:
Recommended citation roles:
Rank the papers explicitly and explain the ranking logic.
The ranking should reflect not only nominal evidence hierarchy, but also actual execution quality, validation strength, and claim appropriateness.
Make explicit where the ranking is uncertainty-limited.
Examples:
Use the following structure every time.
State what is being ranked, for what question, and for what downstream use.
List each paper with its true study design / evidence family.
For each paper, summarize the main strengths and weaknesses affecting trustworthiness.
State what validation exists and how much confidence it adds.
State whether the paper’s stated conclusions stay within the evidence boundary.
Provide a ranked list from strongest to weakest for the stated purpose, with clear reasoning.
For each paper, assign one citation role:
State the main factors that drove the order.
Explain where incomplete information or mixed evidence roles limit certainty.
When the user provides or references specific papers, preserve verified bibliographic details accurately. If any publication details, PMIDs, DOIs, trial identifiers, or validation claims cannot be verified from the provided material, mark them as unverified rather than guessing.
This skill should not:
A high-quality output from this skill should:
tools
Generates complete conventional oncology bulk-transcriptome biomarker and hub-gene research designs from a user-provided cancer type and study direction. Always use this skill whenever a user wants to design, plan, or build a tumor bioinformatics study centered on differential expression, prognostic filtering or risk modeling, PPI-based hub-gene prioritization, diagnostic/prognostic evaluation, clinical association, immune infiltration context, methylation context, and optional tissue or cell validation. Covers five study patterns (signature-first prognostic workflow, hub-gene-first biomarker workflow, hybrid signature-to-hub workflow, immune-context biomarker workflow, translational validation workflow) and always outputs four workload configs (Lite / Standard / Advanced / Publication+) with recommended primary plan, step-by-step workflow, figure plan, validation strategy, minimal executable version, publication upgrade path...
development
Generates complete conventional non-oncology bioinformatics research designs from a user-provided disease context, process-related gene family or biological theme, and validation direction. Use when a study centers on multi-dataset bulk transcriptome integration, DEG analysis, process-gene intersection, enrichment analysis, GSEA, PPI hub-gene prioritization, TF/miRNA regulatory networks, ROC-based biomarker evaluation, and immune infiltration analysis. Covers five study patterns (process-DEG discovery, enrichment/GSEA interpretation, hub-gene prioritization, regulatory-network and immune interpretation, multi-layer public validation) and always outputs Lite / Standard / Advanced / Publication+ with a recommended primary plan, stepwise workflow, figure plan, validation hierarchy, minimal executable version, publication upgrade path, and strictly verified literature retrieval.
tools
Plans confounder control, variable adjustment logic, and bias mitigation strategies at the protocol stage for clinical, epidemiologic, translational, observational, and biomarker studies. Always use this skill when a user needs to identify major confounders, decide which variables should or should not be adjusted for, compare matching/stratification/weighting approaches, anticipate selection or measurement bias, or pressure-test a study design before execution. Focus on bias sensing, causal structure awareness, variable-role classification, and critical design review rather than generic statistical advice.
testing
Generates complete comparative network-toxicology research designs from a user-provided exposure pair, shared toxic phenotype, and validation direction. Use when a study centers on two related exposures under one outcome and needs target collection, shared-vs-specific target decomposition, enrichment, PPI hub prioritization, docking, optional transcriptomic cross-checks, and conservative mechanistic synthesis. Covers five study patterns and always outputs Lite / Standard / Advanced / Publication+ with a recommended primary plan, stepwise workflow, figure plan, validation hierarchy, minimal executable version, publication upgrade path, and strictly verified literature retrieval.