skills/paper-research/SKILL.md
End-to-end paper research support for arXiv/literature surveys, reproducibility-focused paper shortlisting, and experiment design. Use when you need to (1) search arXiv with complex queries, (2) download PDFs, extract text/sections, and fetch BibTeX, (3) dedupe/cluster results into a structured report, and (4) turn findings into a lit-review plan, benchmark/evaluation suite, and representation/probing experiment checklist (e.g., implicit reasoning, hidden-CoT, multilingual reasoning, cross-lingual alignment).
npx skillsauth add LiYu0524/Auto-Reasearch-Skills paper-researchInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Run a fast, reproducible “survey → shortlist → synthesize” loop for research topics, backed by small scripts that fetch arXiv metadata/PDFs/BibTeX, extract text, and generate structured Markdown briefs.
notes/implicit-reasoning-survey/python3 scripts/arxiv_survey.py --terms "implicit reasoning" "hidden chain-of-thought" "multilingual reasoning" --max-results 100 --download-pdfs --pdf-dir ./pdfs --out ./arxiv.jsonlpython3 scripts/pdf_extract.py --pdf-dir ./pdfs --out-dir ./texts --sectionspython3 scripts/arxiv_bibtex.py --from-jsonl ./arxiv.jsonl --out ./refs.bibpython3 scripts/generate_report.py --jsonl ./arxiv.jsonl --out ./REPORT.mdThen ask Codex to synthesize (taxonomy/benchmarks/experiments) using REPORT.md + your notes.
Do this:
REPORT.md (table + clusters) and refs.bib.When relevant, include “fastest path to reproduce” (datasets, eval harnesses, probing code).
Prioritize:
Do this:
arxiv_survey.py with stricter terms and fewer results (e.g., 30–80).REPORT.md by reproducibility criteria:
Use this structure:
Use assets/experiment_checklist.md as the backbone checklist.
Copy and fill these as working docs:
assets/research_brief.md → one-topic brief (taxonomy + top papers + open questions)assets/paper_comparison_table.md → consistent per-paper extraction fieldsassets/experiment_checklist.md → step-by-step experimental checklistAll scripts are pure-Python (stdlib) where possible. pdf_extract.py supports optional extractors; if none are available, it prints a clear install hint.
scripts/arxiv_survey.pySearch arXiv via the official Atom API, write results to JSONL, and optionally download PDFs.
scripts/arxiv_bibtex.pyFetch BibTeX from arxiv.org for a list of arXiv IDs or a JSONL produced by arxiv_survey.py.
scripts/pdf_extract.pyExtract text from PDFs into .txt and optionally produce rough section splits (heuristics).
scripts/dedupe_jsonl.pyDedupe a JSONL file by arxiv_id and near-duplicate titles (useful when iterating queries).
scripts/generate_report.pyGenerate a structured Markdown report (table + clusters + TODO note slots) from arxiv.jsonl.
Read when you need query patterns or a report schema:
references/arxiv_query_guide.mdreferences/report_fields.mdtesting
Review research papers (especially PDFs). Use when the user asks to read/通读/讲解/总结/审稿 a paper and wants a Chinese-first explanation of what it does, what is novel (创新点), plus reviewer-style strengths/weaknesses, major/minor concerns, and questions to authors.
research
学术插图生成 - 使用 PaperBanana 多智能体框架从方法文本自动生成框架图和统计图
development
Manage Google Docs and Google Drive with full document operations and file management. Includes Markdown support for creating formatted documents with headings, bold, italic, lists, tables, and checkboxes. Also supports Drive operations (upload, download, share, search).
data-ai
Generate draw.io diagrams as .drawio files, optionally export to PNG/SVG/PDF with embedded XML