Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

a-green-hand-jack/sidecar-task-runner

Name: sidecar-task-runner
Author: a-green-hand-jack

skills/sidecar-task-runner/SKILL.md

npx skillsauth add a-green-hand-jack/ml-research-skills sidecar-task-runner

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Sidecar Task Runner

Use this skill to run bounded helper tasks from a main agent without sharing the main conversation as the task state. The sidecar produces artifacts; the main agent verifies, integrates, rejects, or escalates them.

Skill Directory Layout

<installed-skill-dir>/
├── SKILL.md
├── templates/
│   ├── precommit-classifier.md
│   └── personalization-scanner.md
└── scripts/
    └── prepare_sidecar_task.py

Core Contract

The main agent owns planning, integration, final judgment, and user-facing actions.
The sidecar gets a small prompt artifact, not the main chat transcript.
Default to gpt-5.3-codex-spark for fast local scans, first drafts, pre-reviews, consistency checks, and other low/medium-risk tasks.
Prefer deterministic scripts and direct shell commands for facts; use the sidecar for synthesis, ranking, summarization, mismatch detection, and candidate text.
Default sandbox is read-only. Use workspace-write only when the sidecar must write its own artifacts or make explicitly bounded edits.
Keep human gates before external or irreversible actions such as git tag, git push, release upload, job submission, or public issue creation.
Do not use a sidecar as the only reviewer for high-risk algorithm design, core correctness, security/privacy, final merge approval, or final submission decisions.

Artifact Layout

Use a repo-local directory:

.agent/sidecars/<task-id>/
├── prompt.md
├── input-manifest.md
├── output.md
├── findings.md
├── decision.md
├── model.md
└── model.json

Meanings:

prompt.md: exact prompt given to the sidecar.
input-manifest.md: files, commands, diffs, or project state the sidecar may inspect.
output.md: raw sidecar final response, usually written with codex exec -o.
findings.md: distilled findings if the sidecar writes structured results.
decision.md: main-agent decision after reading the sidecar output.
model.md: human-readable command, model, sandbox, and token notes.
model.json: structured run metadata for token and lifecycle audits.

Prepare A Task

Create the artifact directory with:

python3 <installed-skill-dir>/scripts/prepare_sidecar_task.py \
  --repo . \
  --task-id <task-id> \
  --title "<short task title>" \
  --phase tooling \
  --task-type audit \
  --prompt-file <prompt-file>

If no prompt file exists, pass --prompt "<task instructions>".

For a fast precommit classification sidecar, use the bundled preset:

python3 <installed-skill-dir>/scripts/prepare_sidecar_task.py \
  --repo . \
  --title "Precommit classifier" \
  --phase maintenance \
  --task-type audit \
  --preset precommit-classifier \
  --input "git status --short" \
  --input "git diff"

For a personalization scan that extracts reusable preferences from sanitized artifacts without asking the user:

python3 <installed-skill-dir>/scripts/prepare_sidecar_task.py \
  --repo . \
  --title "Personalization scan" \
  --phase maintenance \
  --task-type audit \
  --preset personalization-scanner \
  --input "memory/current-status.md" \
  --input ".agent/sidecars/*/decision.md"

Run Codex Spark

For a read-only sidecar:

codex exec --ephemeral \
  -m gpt-5.3-codex-spark \
  -C . \
  -s read-only \
  -o .agent/sidecars/<task-id>/output.md \
  "$(cat .agent/sidecars/<task-id>/prompt.md)"

For a sidecar that must write its own findings.md or make tightly scoped artifact edits:

codex exec --ephemeral \
  -m gpt-5.3-codex-spark \
  -C . \
  -s workspace-write \
  -o .agent/sidecars/<task-id>/output.md \
  "$(cat .agent/sidecars/<task-id>/prompt.md)"

Avoid codex resume, codex fork, claude --continue, or claude --resume for first-pass sidecar work. Those are continuation mechanisms, not clean sidecar boundaries.

Prompt Requirements

Every sidecar prompt should state:

task goal and non-goals
files or directories it may inspect
whether code edits are allowed
required output path and format
escalation criteria for strong reviewer or main-agent judgment
privacy boundary: do not copy raw private logs or unrelated conversation text into repo artifacts

Keep prompts narrow. If the task needs several independent analyses, create separate sidecar runs rather than one broad prompt.

Model Routing

Good Spark sidecar tasks:

first-pass code review before a strong review
precommit path classification: Fast Path, Skill Path, Code Path, or Risk Path
personalization scans over trajectories, sidecar artifacts, diffs, and memory to propose reusable preferences
git milestone proposal from recent commits and docs
README / AGENTS / CLAUDE consistency scan
test-gap scan after a focused implementation
experiment log triage and config mismatch inventory
evidence inventory across CSVs, reports, tables, and figures
reviewer-comment clustering and action-item extraction
citation placeholder or TODO scans

Use the main agent or a stronger fresh reviewer for:

core algorithm design decisions
final code approval for high-risk changes
security, privacy, or data-governance decisions
large refactors with unclear blast radius
final paper positioning, rebuttal strategy, or submission judgment

Precommit Classifier Contract

Use the precommit-classifier preset when a commit/push closeout would otherwise be slowed down by deciding which gates to run. The sidecar inspects only read-only Git state and affected public repo files, then recommends:

the commit path: Fast Path, Skill Path, Code Path, or Risk Path
the minimal validation commands
whether reinstall is needed
whether reinstall can be limited to specific changed skills

The main agent must still stage files, commit, push, reinstall, and report the outcome. A sidecar must not perform external or irreversible actions.

Personalization Scanner Contract

Use the personalization-scanner preset when a main agent wants low-cost automatic memory writeback candidates from trajectories or repo artifacts. The sidecar may inspect only explicitly listed inputs and returns candidate preferences with scope, confidence, evidence, suggested target, and privacy notes.

The main agent must still decide what to write, keep private facts out of public repo memory, and perform any project-memory or skill-repo edits. The sidecar must not ask the user, quote raw logs, or promote public skill rules by itself. After accepting sidecar output, the main agent writes: accepted findings to the relevant memory board (claim/risk/action/decision), gate decision to decision.md, and sidecar token metadata stays in .agent/sidecars/<task-id>/model.json — not in project memory.

Token Telemetry

Because --ephemeral may avoid normal persisted session logs, record run metadata in model.json. If Codex CLI reports token usage in the terminal, copy the numeric fields into model.json or model.md before running token-usage-auditor.

If exact usage is unavailable, leave usage values as null and record:

Token usage: unavailable from ephemeral run output.

Never store raw prompts from private chat logs just to recover token counts.

a-green-hand-jack/sidecar-task-runner

skills/sidecar-task-runner/SKILL.md

Run artifact-driven sidecar agent tasks through one-shot Codex CLI sessions. Use when a main agent should delegate bounded scans, drafts, audits, pre-reviews, or mechanical repo tasks to a fast isolated sidecar model such as gpt-5.3-codex-spark while keeping final decisions with the main agent.

4 stars

tools

Updated May 16, 2026

$ install --global

skillsauth

npx skillsauth add a-green-hand-jack/ml-research-skills sidecar-task-runner

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: May 16, 2026, 4:27 AM116.2s5 files scanned

SKILL.md

name:: sidecar-task-runner
description:: Run artifact-driven sidecar agent tasks through one-shot Codex CLI sessions. Use when a main agent should delegate bounded scans, drafts, audits, pre-reviews, or mechanical repo tasks to a fast isolated sidecar model such as gpt-5.3-codex-spark while keeping final decisions with the main agent.
allowed-tools:: Read, Write, Edit, Bash, Glob

Sidecar Task Runner

Skill Directory Layout

<installed-skill-dir>/
├── SKILL.md
├── templates/
│   ├── precommit-classifier.md
│   └── personalization-scanner.md
└── scripts/
    └── prepare_sidecar_task.py

Core Contract

The main agent owns planning, integration, final judgment, and user-facing actions.
The sidecar gets a small prompt artifact, not the main chat transcript.
Default to gpt-5.3-codex-spark for fast local scans, first drafts, pre-reviews, consistency checks, and other low/medium-risk tasks.
Prefer deterministic scripts and direct shell commands for facts; use the sidecar for synthesis, ranking, summarization, mismatch detection, and candidate text.
Default sandbox is read-only. Use workspace-write only when the sidecar must write its own artifacts or make explicitly bounded edits.
Keep human gates before external or irreversible actions such as git tag, git push, release upload, job submission, or public issue creation.
Do not use a sidecar as the only reviewer for high-risk algorithm design, core correctness, security/privacy, final merge approval, or final submission decisions.

Artifact Layout

Use a repo-local directory:

.agent/sidecars/<task-id>/
├── prompt.md
├── input-manifest.md
├── output.md
├── findings.md
├── decision.md
├── model.md
└── model.json

Meanings:

prompt.md: exact prompt given to the sidecar.
input-manifest.md: files, commands, diffs, or project state the sidecar may inspect.
output.md: raw sidecar final response, usually written with codex exec -o.
findings.md: distilled findings if the sidecar writes structured results.
decision.md: main-agent decision after reading the sidecar output.
model.md: human-readable command, model, sandbox, and token notes.
model.json: structured run metadata for token and lifecycle audits.

Prepare A Task

Create the artifact directory with:

python3 <installed-skill-dir>/scripts/prepare_sidecar_task.py \
  --repo . \
  --task-id <task-id> \
  --title "<short task title>" \
  --phase tooling \
  --task-type audit \
  --prompt-file <prompt-file>

If no prompt file exists, pass --prompt "<task instructions>".

For a fast precommit classification sidecar, use the bundled preset:

python3 <installed-skill-dir>/scripts/prepare_sidecar_task.py \
  --repo . \
  --title "Precommit classifier" \
  --phase maintenance \
  --task-type audit \
  --preset precommit-classifier \
  --input "git status --short" \
  --input "git diff"

For a personalization scan that extracts reusable preferences from sanitized artifacts without asking the user:

python3 <installed-skill-dir>/scripts/prepare_sidecar_task.py \
  --repo . \
  --title "Personalization scan" \
  --phase maintenance \
  --task-type audit \
  --preset personalization-scanner \
  --input "memory/current-status.md" \
  --input ".agent/sidecars/*/decision.md"

Run Codex Spark

For a read-only sidecar:

codex exec --ephemeral \
  -m gpt-5.3-codex-spark \
  -C . \
  -s read-only \
  -o .agent/sidecars/<task-id>/output.md \
  "$(cat .agent/sidecars/<task-id>/prompt.md)"

For a sidecar that must write its own findings.md or make tightly scoped artifact edits:

codex exec --ephemeral \
  -m gpt-5.3-codex-spark \
  -C . \
  -s workspace-write \
  -o .agent/sidecars/<task-id>/output.md \
  "$(cat .agent/sidecars/<task-id>/prompt.md)"

Avoid codex resume, codex fork, claude --continue, or claude --resume for first-pass sidecar work. Those are continuation mechanisms, not clean sidecar boundaries.

Prompt Requirements

Every sidecar prompt should state:

task goal and non-goals
files or directories it may inspect
whether code edits are allowed
required output path and format
escalation criteria for strong reviewer or main-agent judgment
privacy boundary: do not copy raw private logs or unrelated conversation text into repo artifacts

Keep prompts narrow. If the task needs several independent analyses, create separate sidecar runs rather than one broad prompt.

Model Routing

Good Spark sidecar tasks:

first-pass code review before a strong review
precommit path classification: Fast Path, Skill Path, Code Path, or Risk Path
personalization scans over trajectories, sidecar artifacts, diffs, and memory to propose reusable preferences
git milestone proposal from recent commits and docs
README / AGENTS / CLAUDE consistency scan
test-gap scan after a focused implementation
experiment log triage and config mismatch inventory
evidence inventory across CSVs, reports, tables, and figures
reviewer-comment clustering and action-item extraction
citation placeholder or TODO scans

Use the main agent or a stronger fresh reviewer for:

core algorithm design decisions
final code approval for high-risk changes
security, privacy, or data-governance decisions
large refactors with unclear blast radius
final paper positioning, rebuttal strategy, or submission judgment

Precommit Classifier Contract

the commit path: Fast Path, Skill Path, Code Path, or Risk Path
the minimal validation commands
whether reinstall is needed
whether reinstall can be limited to specific changed skills

The main agent must still stage files, commit, push, reinstall, and report the outcome. A sidecar must not perform external or irreversible actions.

Personalization Scanner Contract

Token Telemetry

If exact usage is unavailable, leave usage values as null and record:

Token usage: unavailable from ephemeral run output.

Never store raw prompts from private chat logs just to recover token counts.

Related Skills

a-green-hand-jack/ml-research-bootstrap

testing

VerifiedTrustedCommunity

Bootstrap project-local ml-research-skills. Use from global installs when creating a new ML research project, enabling this collection in an existing ML research repo, or deciding whether to install the full bundle locally. Route to project-init for new projects; do not handle paper or experiment work directly.

4SKILL.mdUpdated May 26, 2026

a-green-hand-jack/ml-research-bootstrap

a-green-hand-jack/project-ops-router

development

VerifiedTrustedCommunity

Route project operations tasks — git, memory, bootstrap, remote, workspace, code review, timeline, ops — to the correct skill. Use when the task involves commits, pushes, worktrees, project memory, enabling project-local skills, SSH/server coordination, sidecar runners, or audits. Do not solve the ops task directly.

4SKILL.mdUpdated May 19, 2026

a-green-hand-jack/project-ops-router

a-green-hand-jack/paper-writing-router

testing

VerifiedTrustedCommunity

Route ML/AI paper writing tasks to the correct skill — contract planning, prose drafting, section writing, consistency editing, review simulation, rebuttal, submission, or citation work. Use when the task involves writing, revising, reviewing, or submitting a paper instead of guessing between paper-writing-assistant, paper-writing-contract-planner, paper-reviewer-simulator, auto-paper-improvement-loop, or citation skills. Do not draft prose directly.

4SKILL.mdUpdated May 19, 2026

a-green-hand-jack/paper-writing-router

a-green-hand-jack/ml-research-router

data-ai

VerifiedTrustedCommunity

Project-local router for ML research skill selection. Use inside an initialized ML research project, or while maintaining this skill repo, when the user describes an ML research/paper/experiment/discovery/ops/release workflow and may not know the skill; route to a domain router or high-signal leaf. Do not use for generic non-ML projects.

4SKILL.mdUpdated May 19, 2026

a-green-hand-jack/ml-research-router

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/a-green-hand-jack/ml-research-skills.git

# Copy into Claude Code skills folder (global)
cp -r ml-research-skills/skills/sidecar-task-runner ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

a-green-hand-jack/ml-research-skills

4 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT