bundled/skills/literature-matrix/SKILL.md
Systematic research idea discovery through paper combination matrix. Use when finding research ideas, evaluating paper combinations, building unified theoretical frameworks, or generating code skeletons from combined methods.
npx skillsauth add foryourhealth111-pixel/vco-skills-codex literature-matrixInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Systematic research idea discovery: collect N papers, evaluate all N×(N-1)/2 combinations via a 5-dimension scoring matrix, deep-analyze top candidates with full-text evidence, build unified theoretical frameworks, and generate code skeletons.
Trigger when any of these applies:
/literature-matrixWill NOT:
Required inputs (ask if missing):
Phase 0: Init → Phase 1: Collect Papers → Phase 2: Build Matrix → Phase 3: Deep Analysis → Phase 4: Framework → Phase 5: Code
↑ |
└─────────────────────────── Checkpoint resume (pause/resume at any phase) ───────────────────────────────────────────────┘
1. Check ./paper_matrix/checkpoint.json for existing progress
2. Ask: domain, keywords, timerange, paper count, source mode, weight preset
3. Create directory: ./paper_matrix/{papers,analysis,ideas,frameworks,code}/
4. Save checkpoint
GET https://api.semanticscholar.org/graph/v1/paper/search
?query={keywords}&year={range}&fieldsOfStudy={domain}
&fields=title,authors,venue,year,citationCount,openAccessPdf,externalIds
Each paper scored on 4 criteria:
✅ Open-source (GitHub repo exists)
✅ Accessible (clear method description)
✅ Trending (high citation velocity)
✅ Recognized (top venue: oral/spotlight)
| Dimension | Default Weight | What it measures |
|--------------------|---------------|-------------------------------------|
| Complementarity | 0.25 | A's method solves B's limitation? |
| Data Compatibility | 0.20 | Shared data types/formats? |
| Theory Unifiability| 0.20 | Natural unified framework exists? |
| Innovation Delta | 0.20 | 1+1>2 effect? |
| Implementation | 0.15 | Code integration difficulty? |
Weight presets:
- 理论导向: 0.20, 0.15, 0.30, 0.25, 0.10
- 工程导向: 0.25, 0.25, 0.10, 0.15, 0.25
- 快速发表: 0.30, 0.20, 0.15, 0.20, 0.15
- 自定义: user specifies all 5 weights
Layer 1 (Rule): Exclude same-author, same-subfield, already-cited pairs → ~50% removed
Layer 2 (AI): Score remaining pairs on 5 dimensions via abstracts → rank by weighted sum
Layer 3 (User): Discuss top-30 with user → narrow to 15-20 candidates
L1 Auto: arXiv PDF → PMC → Unpaywall → Semantic Scholar openAccessPdf
L2 Assist: Provide DOI + download path, ask user to fetch via library
L3 Fallback: Abstract-only analysis, mark as ⚠️ low confidence
Parallel: f(x) = α·A(x) + (1-α)·B(x) → convex combination
Serial: f(x) = B(A(x)) → pipeline framework
Nested: f(x) = A(x; module=B) → modular architecture
Extension: f(x) = α·A + β·B + (1-α-β)·C → simplex constraint
Theoretical: interaction term α(1-α)·h(A,B) exists
Experimental: performance at α∈(0,1) exceeds linear interpolation
Problem: A+B solves what neither A nor B can alone
Computational: combination requires novel optimization
L1 Metadata: [来源: API元数据] → high confidence
L2 Content: [来源: 论文全文, Section X] → medium-high confidence
L3 Inference:[推断: 基于[来源], 置信度: X] → low-medium confidence
{"version":"1.0", "current_phase":2, "config":{...},
"phase_0":{"status":"completed"},
"phase_2":{"status":"in_progress","evaluated":450,"total":780}}
Act as a proactive, patient, rigorous research mentor throughout the entire workflow.
Behavioral principles:
Dialogue patterns by phase:
See references/dialogue-templates.md for complete dialogue examples.
/literature-matrix 多组学融合 耐药性检测 --papers 40 --timerange 2024-2026./paper_matrix/ directory, configure domain=bioinformatics, preset=快速发表base_framework.py, experiment.py with α grid search/literature-matrix --resume./paper_matrix/checkpoint.json: Phase 2 in progress, 450/780 evaluated/literature-matrix 脂质组学 机器学习 --link-project| Symptom | Diagnosis | Fix | |---------|-----------|-----| | Semantic Scholar API returns empty | Keywords too specific or API rate limit | Broaden keywords, add retry with backoff | | Too few open-access papers | Domain has low OA rate | Use L2 acquisition (ask user to download), expand time range | | All combinations score low | Papers too similar or too different | Adjust paper selection: mix methods papers with application papers | | Checkpoint corrupted | Interrupted during write | Delete checkpoint.json, restart from Phase 0 | | α=0.5 not optimal | Combination is serial, not parallel | Switch to pipeline framework (serial type), not convex combination |
Detailed implementation guides:
references/index.md — Navigation hubreferences/workflow-phases.md — Complete Phase 0-5 behavioral instructionsreferences/evaluation-system.md — 5-dimension scoring, weight presets, prompt templatesreferences/paper-acquisition.md — 3-level acquisition strategy with API detailsreferences/theoretical-framework.md — Combination types, non-trivial templates, α analysisreferences/provenance-system.md — 3-layer tracing, confidence levels, link requirementsreferences/checkpoint-system.md — JSON schema, resume flow, error recoveryreferences/dialogue-templates.md — Socratic dialogue examples per phasereferences/output-templates.md — Idea card, framework draft, code skeleton templatespaper_matrix/REQUIREMENTS.md), Semantic Scholar API docs, academic publishing conventionsdevelopment
Model interpretability and explainability using SHAP (SHapley Additive exPlanations). Use this skill when explaining machine learning model predictions, computing feature importance, generating SHAP plots (waterfall, beeswarm, bar, scatter, force, heatmap), debugging models, analyzing model bias or fairness, comparing models, or implementing explainable AI. Works with tree-based models (XGBoost, LightGBM, Random Forest), deep learning (TensorFlow, PyTorch), linear models, and any black-box model.
development
Use when the user asks to inspect Sentry issues or events, summarize recent production errors, or pull basic Sentry health data via the Sentry API; perform read-only queries with the bundled script and require `SENTRY_AUTH_TOKEN`.
development
World-class prompt engineering skill for LLM optimization, prompt patterns, structured outputs, and AI product development. Expertise in Claude, GPT-4, prompt design patterns, few-shot learning, chain-of-thought, and AI evaluation. Includes RAG optimization, agent design, and LLM system architecture. Use when building AI products, optimizing LLM performance, designing agentic systems, or implementing advanced prompting techniques.
development
World-class ML engineering skill for productionizing ML models, MLOps, and building scalable ML systems. Expertise in PyTorch, TensorFlow, model deployment, feature stores, model monitoring, and ML infrastructure. Includes LLM integration, fine-tuning, RAG systems, and agentic AI. Use when deploying ML models, building ML platforms, implementing MLOps, or integrating LLMs into production systems.