skills/scientific-writing/scientific-manuscript-writing/SKILL.md
Scientific manuscript writing: IMRAD, citation styles (APA/AMA/Vancouver/IEEE), figures/tables, reporting guidelines (CONSORT/STROBE/PRISMA/ARRIVE), writing principles (clarity/conciseness/accuracy), venue-specific style. For LaTeX see companion assets.
npx skillsauth add jaechang-hits/sciagent-skills scientific-manuscript-writingInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Scientific manuscript writing is the discipline of communicating research findings with precision, clarity, and reproducibility. This knowhow covers the complete lifecycle of manuscript preparation: from planning and structuring a paper using IMRAD format, through applying correct citation styles and designing effective figures, to ensuring compliance with study-specific reporting guidelines. It applies across biomedical, social science, engineering, and computational fields.
This entry consolidates writing principles, manuscript structure, citation systems, figure/table design, and reporting standards into a unified reference for producing publication-ready scientific documents.
The three pillars of scientific writing govern all manuscript text:
Additional principles include objectivity (present results without bias, acknowledge conflicting evidence), consistency (same term for same concept, uniform notation), and logical organization (clear "red thread" connecting sections). See references/writing_principles_style.md for detailed guidance with examples.
IMRAD (Introduction, Methods, Results, And Discussion) is the standard organizational structure for original research articles, adopted across most scientific disciplines since the 1970s. It mirrors the scientific method:
| Section | Question Answered | Primary Tense | Typical Length | |---------|-------------------|---------------|----------------| | Title | What is this about? | N/A | 10-15 words | | Abstract | Complete summary | Mixed | 100-250 words | | Introduction | Why did you study this? | Present/past | 4-5 paragraphs | | Methods | How did you do it? | Past | 2-4 pages | | Results | What did you find? | Past | 2-4 pages | | Discussion | What does it mean? | Past/present | 3-5 pages | | Conclusion | Take-home message | Present | 1-2 paragraphs |
The Introduction follows a funnel structure: broad context, narrowing literature review, gap identification, and study objectives. Methods must provide sufficient detail for replication. Results present findings objectively without interpretation. Discussion interprets findings, compares with prior work, acknowledges limitations, and proposes future directions.
Venue variations: Nature/Science use modified IMRAD with methods in supplement. ML conferences (NeurIPS/ICML) use Introduction-Method-Experiments-Conclusion with numbered contributions and ablation studies. See references/manuscript_structure.md for section-by-section guidance.
Five major citation styles serve different disciplines:
| Style | Format | Primary Disciplines | |-------|--------|---------------------| | AMA | Superscript numbers | Medicine, health sciences | | Vancouver | Numbers in brackets [1] | Biomedical sciences | | APA | Author-date (Smith, 2023) | Psychology, social sciences | | Chicago | Notes-bibliography or author-date | Humanities, some sciences | | IEEE | Numbers in brackets [1] | Engineering, computer science |
Best practices across all styles: Cite primary sources when possible. Include recent literature (within 5-10 years for active fields, 2-3 years for ML). Balance citation distribution across Introduction and Discussion. Verify all citations against original sources. Keep self-citations below 20%. Always check the target journal's author guidelines for the required style.
See references/citation_guide.md for complete format specifications, journal-specific requirements, and DOI formatting rules.
Figures and tables are the backbone of a scientific paper; many readers examine them before reading the text.
Decision rule: Can the information be conveyed in 1-2 sentences? If yes, use text only. If no and precise values are needed, use a table. If no and patterns/trends matter most, use a figure.
Core design principles:
Error bar rule: Always state which measure is shown (SD for data spread, SEM for measurement precision, 95% CI for significance assessment). 95% CI is generally preferred because non-overlapping CIs indicate significant differences.
See references/figures_tables_guide.md for figure type selection, table formatting, statistical presentation, and journal-specific requirements.
What type of scientific document are you writing?
|
+-- Original research article
| |
| +-- Clinical trial --> IMRAD + CONSORT reporting guideline
| +-- Observational study --> IMRAD + STROBE reporting guideline
| +-- Animal study --> IMRAD + ARRIVE reporting guideline
| +-- Diagnostic accuracy --> IMRAD + STARD reporting guideline
| +-- Prediction model --> IMRAD + TRIPOD reporting guideline
| +-- ML/CS research --> Intro-Method-Experiments-Conclusion
|
+-- Review / synthesis
| +-- Systematic review --> PRISMA reporting guideline
| +-- Narrative review --> Thematic structure
| +-- Meta-analysis --> PRISMA + forest plots
|
+-- Case report --> CARE guideline
|
+-- Study protocol --> SPIRIT guideline
|
+-- Quality improvement --> SQUIRE guideline
|
+-- Economic evaluation --> CHEERS guideline
|
+-- Professional report / white paper --> scientific_report.sty (see assets/)
| Venue Type | Structure | Citation Style | Key Adaptation | |-----------|-----------|---------------|----------------| | Nature/Science | Modified IMRAD, methods in supplement | Numbered superscript | Accessible language, broad significance, story-driven | | Medical journals (NEJM, JAMA) | Strict IMRAD | Vancouver/AMA | Structured abstracts, clinical focus, CONSORT/STROBE | | Field-specific journals | Standard IMRAD | Varies by field | Full technical detail, field terminology | | ML conferences (NeurIPS, ICML) | Intro-Method-Experiments-Conclusion | Numbered or author-year | Numbered contributions, ablations, pseudocode | | Professional reports | Chapter-based | N/A | Use scientific_report.sty for formatting |
Overstating conclusions beyond the evidence: Claiming causation from observational data or generalizing beyond the study population.
Inconsistent terminology: Using different words for the same concept (switching between "medication," "drug," and "pharmaceutical").
Mixing verb tenses inappropriately: Using present tense for your specific results or past tense for established facts.
Insufficient methods detail for reproducibility: Omitting sample sizes, software versions, statistical test justifications, or ethical approval.
Redundant data presentation: Repeating all table values in the text, or duplicating information between figures and tables.
Missing or poorly designed error bars: Omitting error bars entirely, or not specifying whether they represent SD, SEM, or CI.
Abbreviation overload: Defining too many abbreviations, or abbreviating terms used only once or twice.
Ignoring journal-specific requirements: Submitting with wrong citation style, exceeding word limits, or using incorrect figure formats.
Planning Phase
Literature Review
Outline Construction
Drafting (section order: Methods, Results, Discussion, Introduction, Abstract, Title)
Figure and Table Finalization
Revision and Quality Control
Submission Preparation
| File | Content | Original Source |
|------|---------|----------------|
| references/writing_principles_style.md | Clarity/conciseness/accuracy strategies, verb tense rules, word choice, paragraph structure, revision checklist, venue-specific writing styles | Condensed from writing_principles.md (825 lines) |
| references/manuscript_structure.md | Complete IMRAD guide, section-by-section content and common mistakes, venue variations, ML conference structure | Condensed from imrad_structure.md (659 lines) |
| references/citation_guide.md | AMA, Vancouver, APA, Chicago, IEEE format specifications, journal-specific styles, citation best practices, DOI formatting | Condensed from citation_styles.md (721 lines) |
| references/figures_tables_guide.md | Figure type selection, table formatting, statistical presentation, accessibility, journal-specific requirements, submission checklist | Condensed from figures_tables.md (807 lines) |
| references/reporting_guidelines.md | CONSORT, STROBE, PRISMA, SPIRIT, STARD, TRIPOD, ARRIVE, CARE, SQUIRE, CHEERS checklist summaries and usage workflow | Condensed from reporting_guidelines.md (749 lines) |
Omitted: professional_report_formatting.md (665 lines) -- LaTeX-specific formatting details are covered directly by the scientific_report.sty template and its companion REPORT_FORMATTING_GUIDE.md in the assets/ directory. Key concepts (box environments, scientific notation commands) are noted in the assets section below.
| File | Description |
|------|-------------|
| assets/scientific_report.sty | LaTeX style package for professional reports (not journal manuscripts). Provides Helvetica typography, colored box environments (keyfindings, methodology, limitations, etc.), alternating-row tables, and scientific notation commands (\pvalue, \effectsize, \CI, \meansd). Compile with XeLaTeX or LuaLaTeX. |
| assets/scientific_report_template.tex | Complete LaTeX template demonstrating all style features: title page, executive summary, chapters with box environments, statistical tables, figure formatting, appendices. |
| assets/REPORT_FORMATTING_GUIDE.md | Quick reference guide for the style package: color palette (hex values), box environment syntax, scientific notation command reference, table/figure formatting patterns, compilation instructions. |
matplotlib-scientific-plotting -- Create publication-quality figures for manuscriptsseaborn-statistical-plots -- Statistical visualizations with automatic CI and aggregationplotly-interactive-plots -- Interactive figures for supplementary materialsstatsmodels-statistical-modeling -- Statistical models whose outputs you report in manuscriptspymc-bayesian-modeling -- Bayesian analysis results for reportingpeer-review-methodology -- Structured peer review of manuscripts (complements this entry)latex-research-posters -- Poster preparation using LaTeXscientific-slides -- Conference presentation preparationscientific-brainstorming -- Ideation methods for framing research questionsassets/scientific_report.sty -- LaTeX style package for professional scientific reportsassets/scientific_report_template.tex -- Complete report template with all style features demonstratedassets/REPORT_FORMATTING_GUIDE.md -- Quick reference card for box environments, color palette, and scientific notation commandstools
Fast short-read DNA aligner for WGS/WES/ChIP-seq. 2× faster BWA-MEM successor; outputs SAM/BAM with read group headers for GATK. Primary plus supplementary records for chimeric reads. Use STAR for RNA-seq splice-aware alignment; Bowtie2 is a comparable alternative.
tools
smina molecular docking CLI. AutoDock Vina fork with customizable scoring functions, native SDF/MOL2/PDB ligand input, autoboxing, local energy minimization, and per-atom score breakdowns. Pipeline: receptor PDBQT prep -> ligand prep (RDKit/OpenBabel) -> dock via autobox or explicit grid -> rescore/minimize with custom scoring -> rank poses by affinity. Choose smina over Vina when you need custom scoring terms (--custom_scoring), local optimization of an existing pose (--local_only), per-atom contributions (--atom_term_data), or SDF/MOL2 ligands without manual PDBQT conversion. For unknown binding sites use diffdock-blind-docking; for the Python-bindings/Vinardo workflow use autodock-vina-docking.
development
mdtraj molecular dynamics trajectory analysis (Python). Reads DCD/XTC/TRR/NetCDF/H5/PDB topologies and trajectories; computes RMSD vs time, radius of gyration, per-residue RMSF, residue-residue contact frequency maps, phi/psi torsions for Ramachandran plots (general + Gly/Pro), and 8-state DSSP secondary structure. Modules: trajectory I/O, geometry (distances/angles/dihedrals), structural analysis (RMSD/Rg/RMSF/SASA), contacts, hydrogen bonds, secondary structure (DSSP), NMR observables. For broader atom-selection grammar use mdanalysis-trajectory; for running MD simulations use OpenMM/GROMACS.
development
Programmatic PubMed access via NCBI E-utilities REST API. Covers Boolean/MeSH queries, field-tagged search, endpoints (ESearch, EFetch, ESummary, EPost, ELink), history server for batches, citation matching, systematic review strategies. Use for biomedical literature search or automated pipelines.