
Author high-quality deep-research prompts (Gemini / ChatGPT Pro / Perplexity Deep Research), then capture the resulting documents into the PKB — including figure extraction, agent-transcribed alt-text for load-bearing images, frontmatter, and wikilink wiring to the sourcing task.
Create diagrams in either of two styles: Mermaid flowcharts (structured, code-based) or Excalidraw (hand-drawn, organic). Use the style parameter to select.
General extraction/ingestion skill that routes to specific workflows based on input type. Extracts structured information from documents, emails, reviews, feedback, and other sources.
Scan and process inline HTML comments for human-agent collaboration. Finds <!-- @nic: --> or <!-- @ns: --> comments and responds with dated <!-- @claude: --> replies. Works on markdown, Python, and other text files.
Generate morning briefing bundle with decision coversheets, email drafts, and annotation targets from the daily note. Run /daily first.
Extract pending decisions from task queue, prioritize by blocking count, output to daily note for batch processing.
Archive information extraction - assess archival documents and identify information worth preserving in the knowledge base.
Incremental PKM maintenance - weeding, pruning, linking, consolidating. Tends the knowledge base bit by bit.
Write production-quality Python code following a fail-fast philosophy, type safety, and modern best practices. Enforces rigorous standards for academic and research code where correctness and replicability are paramount.
Assist the user in reviewing academic work (papers, dissertations, drafts). Focuses on preparation, navigation, and synthesis, NOT replacing critical judgment.
Extract structured training examples from document sets to build datasets for teaching LLMs specific tasks or styles. Use when processing review documents, feedback annotations, or revision histories.
Daily note lifecycle - briefing and progress sync. Reports the state of the day; does not prioritise or recommend. SSoT for daily note structure.
Unified memory skill: immediate mode (/remember) persists knowledge via PKB MCP; maintenance mode (/sleep, GHA cron) runs periodic consolidation — transcript mining, knowledge synthesis, data quality, brain sync.
Survey a corpus, classify, and dispatch outputs. Three modes: retro (transcript review → issues), trend (longitudinal performance analysis), sweep (GitHub issue triage → fix-epics). Delegates execution to junior/jr to keep main context clean.
Academic research methodology guardian. Ensures agents working on empirical research maintain methodological integrity: research questions drive all design decisions, methods are appropriate and justified, data collection quality is verified before proceeding, and convenience shortcuts that compromise validity are caught and refused.
Multi-agent strategic review of documents, plans, and proposals. Commissions review agents and iterates until the review meets quality standards. Use --critic for a fast pauli-only pre-hoc critique.
Judgement-based QA pass. Does this artifact meet its goal and serve its user? Demands excellence, not compliance. Owned by marsha; reads the spec's Fitness Rubric (designed upstream via /design-rubric).
Canonical session close — commit, push, PR, release_task, reflection blocks, handover. Use /dump for emergency bail (no commit/PR/reflection).
Mirror PKB tasks onto the Cowork native task list at claim time and sync completion back to PKB. Cowork-only; ships only in the cowork build of aops-core.
Periodic consolidation agent — unified into the /remember skill (maintenance mode). This stub exists for backwards compatibility with installed GHA workflows.
Emergency session bail — fast resume task + short handover, no commit/PR/reflection. For when you (or the user) need a clean context now. Use /end-session for canonical close.
Peer review of research funding applications and academic submissions. Scheme-agnostic — fetches current criteria from the relevant handbook each round, since weights and language change. Covers Detailed Assessor and College-of-Experts / General Assessor roles, plus collegial draft review.
Iterative improvement protocol — declare a measurable target, run cycles via /loop, experiment, measure, learn, and accumulate work in a DRAFT PR.
Assess hydrator quality using real session data
Framework index curation and acceptance testing. Ensures documentation stays in sync with implementation.
Long-running iterative supervisor — dispatch work items to polecat workers across multiple bursts with state recovery.
Process annotated decisions from daily note, update task statuses, and unblock dependent tasks.
Email triage workflow with mandatory archive receipt logging to task body
Generic reflective execution loop — learn from doing, capture friction, improve instructions
Agent session quality assessment — merged into /qa as Agent Session Evaluation mode
Verify factual claims in documents against authoritative sources. Catches hallucinations, fabricated quotes, and misattributed claims.
Generate draft OSB case decisions with IRAC analysis, position variants, and precedent support. Use when preparing case analysis for Oversight Board deliberation.
Process annotated briefing bundle — execute decisions, stage email drafts, create tasks from annotations. Never auto-sends email.
Establish and refine ground truth labels for evaluation datasets. Use when creating, reviewing, or updating labels for any judgment/reasoning task.
Reconstruct plain-English narrative of recent work from session summaries
Review Training Data Extraction Skill. Extract training pairs (review feedback → source evidence) from matched peer review/source document pairs to build a dataset for teaching LLMs to perform academic peer review in Nic's style. Processes matched review/source pairs to capture review feedback units (specific comments, suggestions, critiques) and source evidence (the text/pattern in the s…
Generate comprehensive session insights from transcripts using a Claude subagent
Convert markdown documents to professionally formatted PDFs with academic-style typography, Roboto fonts, proper page layouts, and styling suitable for research documents, reviews, reports, and academic writing.
Project-local framework development skill — workflow routing, task lifecycle, and categorical conventions for working on academicOps
Batch convert documents (DOCX, PDF, XLSX, TXT, PPTX, MSG, DOC) to markdown, preserving tracked changes and comments.
Creating visually compelling, hand-drawn diagrams with organic mind-map layouts and accessibility-focused design.
Creating clear, readable, and attractive Mermaid flowcharts with best practices for accessibility, layout, and maintainability.
Strategic planning agent — graph structure ownership, task decomposition, knowledge-building, and PKM maintenance. Works on WHAT exists and HOW it relates.
QA verification, qualitative assessment, criteria design, and test planning
Program / portfolio supervision — the autonomous top loop above /supervisor. "Ready the release" → discover and decompose the constituent epics → run /supervisor on each → surface only escalations + merge-ready PRs. Stateless tick driven by /loop; all cross-tick state lives in the program task body.
Epic-level task supervisor — owns an epic from decomposition through the review surface. Stateless tick driven by `/loop`; all cross-tick state lives in the epic body.
Scaffold research project repositories with smart defaults — repo creation, directory structure, CI/CD, documentation, and PKB integration in one pass.
Delegated instruction testing — write instructions, commission contextless execution, observe friction, iterate, review quality, codify.
Instruction quality gate — reviews agent instructions (task bodies, workflow steps, skill procedures, self-test protocols) for shallow-execution vulnerabilities before deployment. Two modes: author (pre-hoc review) and audit (trace a failure back to the instruction gap). The bar is excellence, not compliance.
Core academicOps skill — institutional memory, strategic coordination, workflow routing, and framework governance. Merges butler (chief-of-staff) with framework development conventions.
Analyze writing samples and create a comprehensive personal writing style guide
Design-stage fitness rubric — persona immersion, scenario design, dimensions that define what excellence looks like for the people a feature serves. Two modes — author (produce a rubric for a new spec) and critique (red-team an existing spec). Output lives on the spec, not in the verification brief. Owned by pauli.
Debug Claude Code or Gemini CLI in headless mode with full output capture.
HDR (Higher Degree Research) student task conventions, reference letter workflows, and document access patterns.
Support academic research data analysis using dbt and Streamlit. Use this skill when working on computational research projects (identified by a dbt/ directory, Streamlit apps, or empirical data pipelines). The skill enforces academicOps best practices for reproducible, transparent, self-documenting research with a collaborative single-step workflow.
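
As an illustration of the inline-comment entry above (finding <!-- @nic: --> or <!-- @ns: --> comments and answering with dated <!-- @claude: --> replies), here is a minimal sketch of how such a scan might look. The function names, the ISO date format, and placing the reply on the line after the comment are all assumptions made for the example; the skill's actual behaviour is not described in this list.

```python
import re
from datetime import date

# Matches <!-- @nic: ... --> or <!-- @ns: ... -->, the two markers named in the entry.
COMMENT_RE = re.compile(r"<!--\s*@(nic|ns):\s*(.*?)\s*-->", re.DOTALL)


def find_comments(text: str) -> list[tuple[str, str]]:
    """Return (author, body) pairs for each human comment found in text."""
    return [(m.group(1), m.group(2)) for m in COMMENT_RE.finditer(text)]


def add_replies(text: str, reply: str, today: date) -> str:
    """Insert a dated <!-- @claude: --> reply after every human comment.

    Assumption: the date is written in ISO format and the reply goes on
    the line immediately following the comment it answers.
    """
    stamp = today.isoformat()

    def _append(match: re.Match) -> str:
        # Keep the original comment untouched and add the reply below it.
        return f"{match.group(0)}\n<!-- @claude: {stamp} {reply} -->"

    return COMMENT_RE.sub(_append, text)
```

Because the pattern only relies on the HTML comment delimiters, the same scan should apply to markdown, Python, or any other text file that carries these comments.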