Skip to main content

About Getting Started

Verify Personas AI News Submit Get In Touch

Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub

Goose Amp Cursor Claude Code

Letta OpenCode Claude OpenAI Codex

Factory VS Code Gemini CLI GitHub

Goose Amp Cursor Claude Code

Letta OpenCode Claude OpenAI Codex

Get in touch

Let's build the future of AI skills together

We're building this out of love for the community and would really love your feedback, suggestions, and ideas. Whether you're a skill author, an enterprise team, or just curious — we want to hear from you.

Community-driven

Built by developers, for developers. Your feedback directly shapes the platform.

Security-first

Every skill passes a 4-layer security scan before it's published.

Open ecosystem

Contribute skills, report issues, or suggest new features on GitHub.

Stockholm, Sweden

Name

Email

Phone (optional)

Message

The marketplace for AI agent skills. Discover, download, and share.

Browse

All Skills
Blog
Publishers
Personas
Trending This Week
Security Scanning
Submit a Skill
Claude Code vs Cursor
Best Skills for Cursor
Python Skills
TypeScript Skills
Token Economics

Popular

Skills for Claude Code
Skills for Cursor
Skills for Codex CLI
Skills for Windsurf
Skills for Gemini CLI
Skills for Copilot
Skills for Product Managers
Skills for Superpower
Skills for Graphify

Top Skills

Anthropic Skills
OpenAI Skills
Cloudflare Skills
Vercel Skills
Hugging Face Skills
Most Installed

About

About Us
Docs
Getting Started
News
Blog
GitHub
Discord

© 2026 SkillsAuth. All rights reserved.

Terms of Service Privacy

Home/Skills/a-green-hand-jack

a-green-hand-jack

74 verified skills275 total stars

github.com/a-green-hand-jack

model-card-writer

Generate model cards, reproducibility statements, and datasheet documentation for ML models and datasets. Use when releasing a model, completing venue-required artifact documentation, or writing a reproducibility/datasheet section for NeurIPS, ICLR, ICML, or artifact evaluation.

submit-paper

Check LaTeX academic papers before submission. Use for readiness, final mode, camera-ready preparation, source hygiene, and conference deadlines.

reference-corpus-analyzer

Produce a multi-paper comparison matrix across a literature corpus with tiered read depth. Use when multiple papers need to be compared side-by-side for method differences, performance gaps, closest-work ranking, or trend identification — distinct from per-paper source cards (reference-reading-summarizer) and single-paper project linking (reference-project-synthesizer).

result-diagnosis

Use when results are valid but surprising, negative, unstable, or ambiguous — to decide debug/rerun/ablate/revise/park. Not for engineering failures like NaN/OOM (use experiment-debugger). Not for confound or claim-drift audit before locking results into the paper (use research-results-auditor).

auto-paper-improvement-loop

Run multi-round review-implement-recompile improvement cycles on a paper draft. Use when a draft needs iterative writing quality passes with reviewer independence (fresh context per review round), edit-whitelist gating, and crash-resumable state. Distinct from paper-reviewer-simulator (report only) and paper-draft-consistency-editor (single pass).

paper-introduction-argument-writer

Plan and draft ML/AI introductions as venue-aware argument chains. Use for hook, gap, insight, method, result, contribution flow, and paragraph roles.

paper-positioning-planner

Decide what an ML/AI paper should strategically sell. Use for contribution choice, claim scope, paper archetype, novelty framing, audience, and claims to avoid.

reference-project-synthesizer

Connect structured reference source cards to the active ML project. Use when papers, collaborator docs, Markdown notes, specs, scripts, BibTeX files, or source bundles should inform claims, risks, baselines, benchmarks, experiments, algorithm design, implementation, writing contracts, citations, collaborator actions, project initialization, or memory writeback.

statistical-analysis-planner

Plan and report statistical rigor for ML experiment results. Use when significance testing, effect size reporting, confidence intervals, seed variance analysis, or multiple-comparison corrections are needed before including results in a paper or rebuttal.

sidecar-task-runner

Run artifact-driven sidecar agent tasks through one-shot Codex CLI sessions. Use when a main agent should delegate bounded scans, drafts, audits, pre-reviews, or mechanical repo tasks to a fast isolated sidecar model such as gpt-5.3-codex-spark while keeping final decisions with the main agent.

table-results-review

Review ML/AI result tables, LaTeX table files, captions, provenance, and paper table style. Use for benchmark, ablation, metric, model-spec, and compute tables.

experiment-story-writer

Turn ML/AI tables, figures, ablations, and metrics into claim-aware results prose. Use for result paragraphs, figure/table narrative, and provisional metrics.

reference-library-manager

Manage project reference sources under reference/. Use when scanning, ingesting, indexing, deduplicating, monitoring, or tracking processing status for papers, PDFs, Word docs, Markdown notes, BibTeX files, scripts, specs, or source bundles without deeply reading them.

memory-publication-auditor

Audit private skills, memories, notes, or operational logs before turning them into public skills, templates, docs, or reusable patterns. Use when scanning personal/private memory for publishable knowledge, redaction needs, privacy risks, source-visibility leaks, or PR-ready public skill candidates.

appendix-organizer

Plan and write appendix or supplementary material for ML papers. Use when the appendix needs to be structured, main-paper claim boundaries need to be enforced, NeurIPS/ICLR reproducibility checklists need sections, or cross-references between paper and supplement need to be aligned.

project-pivot-planner

Plan mid-project direction changes when consistent negative results or novelty challenges require scope narrowing, angle change, or kill decisions. Use after multiple result-diagnosis cycles fail to recover the original claim. Distinct from research-idea-validator (project start) and result-diagnosis (per-experiment).

reference-reading-summarizer

Read and summarize project reference sources into structured source cards. Use for skimming papers, PDFs, Word docs, Markdown notes, BibTeX files, scripts, specs, collaborator feedback, or source bundles; extract writing patterns, methods, theory, benchmarks, baselines, implementation hints, risks, constraints, and project seeds without yet deciding project implications.

code-reviewer

Run isolated code reviews for core algorithm or production code changes. Use when the user asks for a fresh-context reviewer, writer/reviewer separation, Spark pre-review, code review, implementation audit, review bundle, independent review, or review artifacts under `.agent/code-reviews/`.

token-usage-auditor

Audit project token usage from local Codex, Codex sidecar, and Claude Code logs. Use when the user asks to measure token burn, token consumption, project attention, agent usage, Codex/Claude Code usage, sidecar usage, token efficiency, or lifecycle telemetry for a project.

ml-research-router

Project-local router for ML research skill selection. Use inside an initialized ML research project, or while maintaining this skill repo, when the user describes an ML research/paper/experiment/discovery/ops/release workflow and may not know the skill; route to a domain router or high-signal leaf. Do not use for generic non-ML projects.

project-ops-router

Route project operations tasks — git, memory, bootstrap, remote, workspace, code review, timeline, ops — to the correct skill. Use when the task involves commits, pushes, worktrees, project memory, enabling project-local skills, SSH/server coordination, sidecar runners, or audits. Do not solve the ops task directly.

paper-writing-assistant

Use when writing or revising actual paper prose — sections, result narratives, venue-aware style, provisional metrics. Not for planning the writing contract before drafting (use paper-writing-contract-planner). Not for tracking section status or edit-state across drafting sessions (use paper-writing-memory-manager).

paper-writing-router

Route ML/AI paper writing tasks to the correct skill — contract planning, prose drafting, section writing, consistency editing, review simulation, rebuttal, submission, or citation work. Use when the task involves writing, revising, reviewing, or submitting a paper instead of guessing between paper-writing-assistant, paper-writing-contract-planner, paper-reviewer-simulator, auto-paper-improvement-loop, or citation skills. Do not draft prose directly.

ml-research-bootstrap

Bootstrap project-local ml-research-skills. Use from global installs when creating a new ML research project, enabling this collection in an existing ML research repo, or deciding whether to install the full bundle locally. Route to project-init for new projects; do not handle paper or experiment work directly.

method-section-explainer

Plan and draft ML/AI method sections. Use for notation flow, module ordering, algorithm boxes, overview figures, design rationale, and appendix boundaries.

limitations-scope-writer

Draft ML/AI limitations, scope, failure cases, ethics, and conclusion caveats. Use to control claim boundaries and reduce overclaiming.

camera-ready-finalizer

Finalize accepted ML/AI papers for camera-ready submission. Use for de-anonymization, rebuttal promises, supplement updates, final LaTeX checks, and release handoff.

personalization-memory

Maintain automatic personalization writeback from agent trajectories, logs, sidecar artifacts, and repeated user preferences. Use when a task produces reusable preferences, lessons, private user memory, project contracts, or candidate public skill rules without interrupting the user.

experiment-report-writer

Write structured experiment reports from notes, configs, logs, metrics, tables, and figures. Use for result analysis, research updates, and presentation-ready summaries.

figure-results-review

Review ML/AI result figures, captions, LaTeX wrappers, and visual style. Use for paper plots, figure screenshots, result narratives, and venue-ready figure polish.

init-latex-project

Initialize a LaTeX academic paper project. Use for new conference or journal papers needing templates, macros, venue preambles, and writing guidance.

paper-result-asset-builder

Build paper-facing tables and figures from CSV experiment outputs. Use to inventory evidence, aggregate seeds, select result slices, generate LaTeX assets, and record provenance.

latex-layout-issue-bundler

Create repo-local LaTeX layout issue bundles from a PDF page, crop, source snippet, and compile log. Use when the user wants to avoid manual PDF screenshots, capture page-specific layout problems, or hand Codex/Claude Code a reproducible paper layout debugging artifact.

paper-draft-consistency-editor

Edit ML/AI paper drafts for internal consistency. Use after sections exist to align claims, terminology, figures, tables, captions, limitations, and conclusion.

safe-git-ops

Perform Git operations safely with sandbox-aware failure handling. Use for commit, push, merge, rebase, stash, worktree, conflicts, lock files, permission errors, or Git state diagnosis.

compute-budget-planner

Estimate GPU compute budget before running ML experiments. Use when planning how much compute an experiment, ablation matrix, or sweep will cost, sizing smoke tests, finding cheaper alternatives, or deciding whether a planned run fits available resources.

feedback-synthesizer

Turn inbound advisor, collaborator, or reviewer feedback into structured project updates. Use when meeting notes, emails, or review comments need to become claim updates, risk entries, action items, and experiment decisions — distinct from rebuttal writing for formal reviews.

abstract-title-contribution-writer

Draft ML/AI paper titles, abstracts, and contribution lists. Use for title options, abstract structure, contribution bullets, and claim-strength calibration.

discovery-router

Route research discovery tasks — idea validation, literature review, reference reading, corpus comparison, or project synthesis — to the correct skill. Use when the task involves exploring ideas, surveying literature, reading papers, comparing multiple papers, or connecting references to the project. Do not perform the review or synthesis directly.

init-python-project

Initialize or enhance a Python/ML project. Use for new repos or forks needing production structure, uv environment setup, and research evidence docs.

experiment-evidence-router

Route ML experiment planning, execution, debugging, result interpretation, and evidence packaging tasks to the correct skill. Use this when the task involves experiments, compute, results, or evidence — instead of guessing between run-experiment, run-status-monitor, experiment-debugger, result-diagnosis, research-results-auditor, statistical-analysis-planner, or paper packaging skills. Do not solve the task directly.

paper-writing-memory-manager

Use to track nonlinear drafting state — section status, claim-text dependencies, stale prose, style decisions, and edit impact across sessions. Not for writing prose (use paper-writing-assistant). Not for planning the initial writing contract (use paper-writing-contract-planner).

experiment-debugger

Use when training has engineering failures — NaN/gradient issues, GPU OOM, slow data loading, wrong metrics, reproducibility failures. Not for checking job queue/status (use run-status-monitor). Not for valid-but-surprising scientific results (use result-diagnosis). Not for confound or claim audit before writing (use research-results-auditor).

new-workspace

Create Git branches or worktrees for research code and paper versions. Use for experiments, baselines, rebuttal fixes, arXiv/camera-ready branches, and worktree memory.

research-results-auditor

Use when auditing completed results for confounds, claim-drift, protocol integrity, or attribution before locking claims into the paper. Not for deciding what to do after a surprising result (use result-diagnosis). Not for significance tests or effect sizes (use statistical-analysis-planner). Not for engineering failures (use experiment-debugger).

remote-project-control

Coordinate local, Git remote, and SSH/HPC/RunAI research projects. Use for server state, sync safety, job submission, interactive sessions, logs, artifact lookup, context recovery, raw SSH commands, remote shell one-liners, SSH quoting issues, remote-cmd, remote-bash, or avoiding local shell expansion of remote variables.

research-project-memory

Maintain hierarchical ML research project memory. Use for claim, evidence, provenance, risk, action, handoff, worktree, phase, source-visibility, paper/code/slides, review, and rebuttal state.

run-experiment

Use when launching or preparing a new ML experiment job — local, SLURM, or RunAI. Not for checking existing job status (use run-status-monitor). Not for NaN/OOM/crash debugging (use experiment-debugger). Not for computing costs before deciding to run (use compute-budget-planner).

project-init

Initialize an ML research project control root. Use for paper/code/slides repos, shared memory, GitHub Project alignment, agent guidance, worktree policy, and lifecycle handoffs.

paper-writing-contract-planner

Use before drafting starts to lock venue, archetype, section order, paragraph roles, evidence slots, and forbidden claims. Not for writing actual prose (use paper-writing-assistant). Not for tracking section status during drafting (use paper-writing-memory-manager).

run-status-monitor

Use when probing the status of an existing job — queued, stuck, running, or finished — across local, SLURM, RunAI, or SSH. Not for launching new jobs (use run-experiment). Not for debugging NaN/OOM/engineering failures (use experiment-debugger). Not for interpreting valid but surprising results (use result-diagnosis).

skill-system-auditor

Audit a skill collection for consistency, lifecycle coverage, routing, documentation drift, memory writeback, stale references, helper paths, and validation readiness.

data-pipeline-manager

Manage ML dataset pipelines before training. Use when the user needs to acquire, preprocess, split, or version datasets, design train/val/test protocols, audit data quality, check for train/test contamination, or make data decisions that affect experimental validity and reviewer trust.

work-timeline-planner

Build retrospective or forward-looking work timelines from git history, docs, notes, or chat records. Use for progress summaries, mentor reports, and phase planning.

paper-evidence-gap-miner

Mine existing results for paper evidence gaps before new compute. Use when claims lack support, CSVs may already contain evidence, or tables/figures can be derived.

baseline-selection-audit

Audit ML/AI experimental baselines for necessity, fairness, currency, and reviewer risk. Use when choosing baselines or checking SOTA comparisons.

project-sync

Sync verified code-side experiment results into paper memory. Use when logs, reports, run docs, or user-confirmed metrics should become paper-facing evidence.

related-work-positioning-writer

Draft ML/AI related work as novelty-boundary writing. Use for closest-work grouping, citation roles, paragraph plans, boundary statements, and safe novelty wording.

paper-reviewer-simulator

Simulate target-conference reviewers for an ML/AI paper. Use for reviewer critique, predicted scores, reject risks, meta-review, and pre-submission risk audit.

release-code

Prepare research code repositories for public release. Use for open-source cleanup, README/LICENSE/CITATION, GitHub releases, tags, and reproducibility packages.

rebuttal-strategist

Plan and write ML/AI rebuttals after real reviews arrive. Use for reviewer intent, response strategy, follow-up experiments, point-by-point replies, and revision promises.

research-idea-validator

Validate rough CS/AI research ideas with the FIVE+C framework. Use to decide pursue, revise, park, or kill based on novelty, feasibility, evidence, and risks.

update-docs

Refresh project documentation after code changes. Use after implementing features, changing behavior, or preparing a milestone commit.

research-slide-deck-builder

Design and write reusable research slide decks. Use for advisor updates, lab talks, reading reports, proposals, conference talks, Slidev content, and slide structure.

add-git-tag

Create annotated Git milestone tags. Use when completing a phase, releasing a version, marking a research checkpoint, or generating a milestone summary from git history before tagging.

algorithm-design-planner

Turn an ML/AI research idea into a concrete method design. Use for objectives, architecture, inference, assumptions, ablations, and implementation handoff.

advisor-update-writer

Write advisor, mentor, lab-meeting, or collaborator updates. Use for weekly updates, progress memos, decision requests, blocker summaries, and next actions.

artifact-evaluation-prep

Prepare research artifact packages for evaluation or public release. Use for reproduction commands, environment checks, data packaging, and artifact forms.

citation-coverage-audit

Audit whether a paper cites necessary classic, closest, benchmark, and recent work. Use for missing-citation checks and related-work coverage before submission.

citation-audit

Audit LaTeX citations and references before submission. Use for BibTeX metadata, unresolved keys, invalid labels, DOI/arXiv data, and citation-claim support.

experiment-design-planner

Design hypothesis-driven ML/AI experiments before running. Use for ablations, baselines, metrics, controls, seeds, logging, and claim-evidence matrices.

conference-writing-adapter

Adapt ML/AI paper writing to a target venue. Use for venue style, structure, positioning, reviewer-friendly prose, and section or paragraph guidance.

literature-review-sprint

Run a focused CS/AI literature review sprint. Use to survey a topic, map related work, check novelty, rank papers, assess closest-work risk, and derive next actions.

paper-evidence-board

Maintain a paper-facing evidence board. Use to align claims, experiments, figures, tables, sections, reviewer risks, evidence gaps, and next actions during writing.