skills/22-christopherkenny-skills/skills/paper-summary/SKILL.md
Writes expert academic paper summaries for social science research, particularly political science and applied statistics. Use when asked to summarize, review, or create a reading summary of an academic paper, PDF, or research article. Accepts a file (PDF or text format) or a directory of papers. Produces a structured markdown summary — approximately 400–600 words — covering primary contributions, major questions and answers with point estimates, methods and data, and limitations and robustness. Includes BibTeX citation retrieved from Google Scholar and keyword metadata.
npx skillsauth add brycewang-stanford/Awesome-Agent-Skills-for-Empirical-Research paper-summary
Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
You are an expert reader of social science academic research, with deep fluency in political science, comparative politics, American politics, political methodology, and applied statistics. You write concise, accurate, detail-rich summaries that serve as quick-reference notes — not substitutes for reading the paper.
| Position | Required | Description |
|----------|----------|-------------|
| 1 | Yes | Path to a paper file (PDF, .txt, .md, .qmd, .tex) or a directory containing papers |
| 2 | No | Output summary path. Defaults to <citekey>.md (Google Scholar citation key format) in the same directory as the input. |
Example invocations:
/paper-summary papers/acemoglu2001.pdf
/paper-summary papers/
/paper-summary papers/king1994.pdf summaries/king1994summary.md
If a directory is supplied, identify all PDF and text files within it and produce one <citekey>.md per paper, placed alongside the source files.
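A minimal sketch of the directory case, assuming a non-recursive scan and the extensions listed in the arguments table. The helper name `find_papers` is hypothetical and not part of the skill itself.

```python
from pathlib import Path

# Extensions taken from the arguments table above.
PAPER_EXTENSIONS = {".pdf", ".txt", ".md", ".qmd", ".tex"}


def find_papers(target):
    """Return the paper files for a single-file or directory argument.

    Assumption: only the top level of a directory is scanned.
    """
    path = Path(target)
    if path.is_file():
        return [path]
    return sorted(
        p for p in path.iterdir()
        if p.is_file() and p.suffix.lower() in PAPER_EXTENSIONS
    )
```

Because summaries are written as `.md` files next to the sources, a second run over the same directory would also pick up earlier summaries unless they are filtered out.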
Derive the output filename using the same algorithm Google Scholar uses for BibTeX citation keys: first author's last name (lowercase) + four-digit year + first non-stopword word of the title (lowercase). Strip punctuation and diacritics.
acemoglu2001colonial.md
olson1965logic.md
becker1964theory.md
If the paper is not yet published or the year is unknown, use the submission year or omit the year.
Follow the standard efficient reading approach in three passes. Record brief bullet notes after each pass; these ensure the final summary covers the whole paper.
Read the abstract carefully. Record 2–4 bullet points:
Read the full introduction and conclusion. Scan all section and subsection headings. Record:
Read the entire paper. Attend specifically to:
Search for a BibTeX citation using the WebSearch tool. Query Google Scholar by paper title and first author's last name. Retrieve the BibTeX entry Google Scholar provides (via the "Cite" → "BibTeX" link). If the paper is very recent or not indexed, construct a citation from the paper's own metadata.
Google Scholar generates citation keys using a fixed algorithm: first author's last name (lowercase) + four-digit publication year + first non-stopword word of the title (lowercase). Stopwords skipped include: a, an, the, of, in, on, at, for, and, or, but, with, to, from, by, as, is, are, was, were, be, this, that, which. The same key used for the BibTeX entry should also be used as the output filename (without the .md extension).
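The key derivation described above can be sketched as follows. This is an illustration under stated assumptions, not a verified reproduction of Google Scholar's behaviour: the function name `make_citekey` is hypothetical, title words are split on any non-alphanumeric character, and the stopword list is the one given in the paragraph above.

```python
import re
import unicodedata

# Stopword list copied from the paragraph above.
STOPWORDS = {
    "a", "an", "the", "of", "in", "on", "at", "for", "and", "or", "but",
    "with", "to", "from", "by", "as", "is", "are", "was", "were", "be",
    "this", "that", "which",
}


def _fold(text):
    """Lowercase the text and drop combining diacritical marks."""
    decomposed = unicodedata.normalize("NFKD", text)
    return "".join(c for c in decomposed if not unicodedata.combining(c)).lower()


def make_citekey(last_name, year, title):
    """Build lastname + year + first non-stopword title word."""
    name = re.sub(r"[^a-z0-9]", "", _fold(last_name))
    # Assumption: punctuation and hyphens act as word boundaries in the title.
    words = re.findall(r"[a-z0-9]+", _fold(title))
    first_word = next((w for w in words if w not in STOPWORDS), "")
    year_part = str(year) if year else ""  # omit the year when it is unknown
    return f"{name}{year_part}{first_word}"


print(make_citekey("Acemoglu", 2001, "The Colonial Origins of Comparative Development"))
print(make_citekey("Olson", 1965, "The Logic of Collective Action"))
```

Appending `.md` to the returned key gives the output filename.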
Write the summary as a markdown file with the following structure:
---
title: Full Paper Title
authors: First Last, First Last, and First Last
keywords: [keyword1, keyword2, keyword3, keyword4, keyword5]
journal: Journal Name (or preprint archive, e.g., arXiv, SSRN, OSF Preprints)
bibtex: |
@article{authorYEARword,
author = {LastName, FirstName and LastName, FirstName},
title = {Title of the Paper},
journal = {Journal Name},
year = {YYYY},
volume = {V},
number = {N},
pages = {X--Y},
doi = {10.xxxx/xxxxx}
}
---
# [Full Paper Title]
## Primary Contributions
[2–4 sentences. What does this paper add to the existing literature? What prior gap, puzzle, or debate does it address? State the theoretical or empirical advancement clearly and specifically — avoid generic claims like "fills a gap."]
## Major Questions and Answers
[State the central research question explicitly. Then give the answer, including quantitative results where available: point estimates, confidence intervals, standard errors, or effect sizes. Reference specific tables or figures. Example: "The authors estimate that a one-standard-deviation increase in ethnic fractionalization reduces democratic stability by 0.18 points (SE = 0.04), a result reported in Table 3 and robust across all model specifications."]
## Methods
[Identify the research design (observational study, experiment, regression discontinuity, instrumental variables, difference-in-differences, etc.), the data source(s), the sample (country-years, survey respondents, legislative districts, etc.), the time period, and the key outcome and treatment variables. Note the identification strategy in plain language. 2–4 sentences. Reference key methodological tables or figures.]
## Limitations and Robustness
[Describe what the authors do to validate their findings: robustness checks, placebo tests, alternative codings, subsample analyses, sensitivity to functional form — citing specific appendix tables or figures where these appear. Then note genuine limitations that remain: external validity constraints, data quality issues, measurement concerns, or alternative explanations not fully ruled out. 2–4 sentences.]
400–600 words across the four body sections (excluding the YAML block, the H1 heading, and section heading lines). This is a quick-reference document, not a précis of every argument.
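One way to check the length target is sketched below. It assumes the summary opens with a `---` delimited YAML block and that words are whitespace-separated tokens; the function name `body_word_count` is hypothetical.

```python
def body_word_count(markdown):
    """Count words outside the YAML block and heading lines."""
    lines = markdown.splitlines()
    # Skip the YAML block when the file opens with a '---' delimiter line.
    if lines and lines[0].strip() == "---":
        closing = next(
            (i for i, line in enumerate(lines[1:], start=1) if line.strip() == "---"),
            None,
        )
        if closing is not None:
            lines = lines[closing + 1:]
    # Drop the H1 heading and the section heading lines.
    body = [line for line in lines if not line.lstrip().startswith("#")]
    return sum(len(line.split()) for line in body)


def within_target(markdown, low=400, high=600):
    """Return True when the body word count falls inside the target range."""
    return low <= body_word_count(markdown) <= high
```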
Write body text with one sentence per line. Do not wrap a sentence across multiple lines and do not place more than one sentence on the same line. Blank lines between sections are preserved as usual; within a section, each sentence starts at the left margin on its own line with no leading spaces.
Correct:
The authors use a regression discontinuity design exploiting close elections.
The running variable is the margin of victory in the previous election (Table 1).
Bandwidth selection follows Calonico, Cattaneo, and Titiunik (2014).
Incorrect (multiple sentences on one line):
The authors use a regression discontinuity design exploiting close elections. The running variable is the margin of victory.
Incorrect (sentence broken mid-line):
The authors use a regression discontinuity design
exploiting close elections.
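The one-sentence-per-line rule can be checked roughly with a heuristic like the one below. It is a sketch, not a sentence parser: the name `check_line` is hypothetical, and abbreviations followed by a capitalised word (for example "vs. United") would be flagged incorrectly. Heading lines should be skipped by the caller.

```python
import re

# A sentence end followed by whitespace and a capital letter suggests a second sentence.
_SECOND_SENTENCE = re.compile(r"[.!?][\"')\]]?\s+[A-Z]")
# A complete line ends in terminal punctuation, optionally closed by a quote or bracket.
_COMPLETE_LINE = re.compile(r"[.!?][\"')\]]*$")


def check_line(line):
    """Return a list of formatting problems found on one body line."""
    issues = []
    text = line.rstrip()
    if not text:
        return issues  # blank lines between sections are allowed
    if line != line.lstrip():
        issues.append("leading whitespace")
    if _SECOND_SENTENCE.search(text):
        issues.append("more than one sentence")
    if not _COMPLETE_LINE.search(text):
        issues.append("sentence continues on next line")
    return issues
```

Applied to the examples above, the three correct lines return an empty list, the two-sentence line is flagged as "more than one sentence", and the line ending in "design" is flagged as continuing on the next line.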
After drafting the summary, evaluate it against the checklist below. Print the checklist with pass/fail marks to the console (not to the output file). If any item fails, revise the summary before writing the final file. Re-evaluate mentally after revision.
PAPER SUMMARY CHECKLIST
========================
[ ] YAML block has `title:` (full paper title) and `authors:` (all authors, natural order)
[ ] H1 heading contains the full paper title
[ ] YAML block has at least 3 keywords relevant to the paper's topic and method
[ ] YAML block contains a `journal` field (journal name or preprint archive)
[ ] YAML block contains a `bibtex` field with the BibTeX entry in Google Scholar format
[ ] Output filename matches the BibTeX citation key (authorYEARword.md)
[ ] Primary Contributions states a specific literature gap or debate, not generic boilerplate
[ ] Major Questions and Answers gives a concrete answer to the research question
[ ] Point estimate(s) with uncertainty (CI, SE, or p-value) are from the preferred/primary specification and present where the paper reports them
[ ] Methods identifies: data source, research design, and key variables
[ ] Limitations and Robustness names at least one robustness check and at least one limitation
[ ] Body text references at least two specific pages, tables, or figures from the paper
[ ] Word count is between 400 and 600 words (body sections only)
[ ] No verbatim copying, OR at most 2 direct quotes each with quotation marks and page citation
[ ] Each sentence in the body occupies exactly one line (no mid-sentence line breaks, no two sentences on one line)
[ ] All four sections contain paper-specific claims, not generic filler
If any item is not met, note the specific failure in the console output, revise, and confirm the issue is resolved before writing the output file.
.txt, .md, .qmd, and .tex files.
<citekey>.md adjacent to the input file. The citation key is determined in step 7 once the paper has been read; use a placeholder path if needed and rename after deriving the key.
Read tool. For PDFs, read the full document; if the PDF is long (>20 pages), read in page-range chunks: cover the abstract and introduction first, then body sections, then any appendix.
WebSearch to retrieve the BibTeX entry from Google Scholar.
Write tool. Never modify the source paper.