Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

linzhe001/baseline-repro

Name: baseline-repro
Author: linzhe001

.agents/skills/baseline-repro/SKILL.md

npx skillsauth add linzhe001/Harness-Research baseline-repro

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Baseline Repro

References

Read these first:

../../../.agents/references/workflow-guide.md
../../../.agents/references/pre-training-rule.md
../../../.agents/references/language-policy.md
./references/baseline-report.md
../../../PROJECT_STATE.json

When To Use

Use this skill for WF5 when the user wants baselines reproduced fairly before new method implementation.

Required Work

Read the baseline list from docs/Technical_Spec.md.
Create or refresh the first runnable project environment for the baselines and sync the ## Environment section in CLAUDE.md.
Use docs/Dataset_Stats.md and project context to align data and evaluation conditions.
Resolve the canonical evaluation protocol from the reproduced baselines and persist the tracked metric names for WF8.
Reproduce each requested baseline with minimal environment-specific changes.
Compare reproduced metrics against paper-reported metrics.
Write docs/Baseline_Report.md using the canonical template.
Update:
- PROJECT_STATE.json baseline metrics
- PROJECT_STATE.json evaluation protocol or tracked metrics for later WF8 comparison
- project_map.json baseline status and entry point
- CLAUDE.md environment facts and baseline reference

Output Rules

Keep adaptation notes, training config notes, and reproduced-versus-paper comparison.
Treat environment creation here as part of the canonical WF5 gate, not as a separate pre-workflow step.
Use the canonical pre-training commit rule for baseline code changes.
Treat template wording as structure-only; localize headings and narrative text according to ../../../.agents/references/language-policy.md unless a field is explicitly English-only.

Codex Adaptation

Treat natural-language requests as the canonical $baseline-repro flow.
Preserve the original expectations around faithful reproduction and minimal baseline edits.
Use the Codex toolchain, but keep the canonical output files and state updates.

Execution Rule

Follow the local prompt, baseline report template, and language policy instead of simplifying the reproduction stage.

linzhe001/baseline-repro

.agents/skills/baseline-repro/SKILL.md

Codex wrapper for WF5 baseline reproduction. Use when the user wants baseline adaptation, reproduction tracking, and `docs/Baseline_Report.md` following the original workflow contract.

1 stars

development

Updated Apr 17, 2026

$ install --global

skillsauth

npx skillsauth add linzhe001/Harness-Research baseline-repro

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 17, 2026, 1:58 AM25.5s3 files scanned

SKILL.md

name:: baseline-repro
description:: Codex wrapper for WF5 baseline reproduction. Use when the user wants baseline adaptation, reproduction tracking, and `docs/Baseline_Report.md` following the original workflow contract.

Baseline Repro

References

Read these first:

../../../.agents/references/workflow-guide.md
../../../.agents/references/pre-training-rule.md
../../../.agents/references/language-policy.md
./references/baseline-report.md
../../../PROJECT_STATE.json

When To Use

Use this skill for WF5 when the user wants baselines reproduced fairly before new method implementation.

Required Work

Read the baseline list from docs/Technical_Spec.md.
Create or refresh the first runnable project environment for the baselines and sync the ## Environment section in CLAUDE.md.
Use docs/Dataset_Stats.md and project context to align data and evaluation conditions.
Resolve the canonical evaluation protocol from the reproduced baselines and persist the tracked metric names for WF8.
Reproduce each requested baseline with minimal environment-specific changes.
Compare reproduced metrics against paper-reported metrics.
Write docs/Baseline_Report.md using the canonical template.
Update:
- PROJECT_STATE.json baseline metrics
- PROJECT_STATE.json evaluation protocol or tracked metrics for later WF8 comparison
- project_map.json baseline status and entry point
- CLAUDE.md environment facts and baseline reference

Output Rules

Keep adaptation notes, training config notes, and reproduced-versus-paper comparison.
Treat environment creation here as part of the canonical WF5 gate, not as a separate pre-workflow step.
Use the canonical pre-training commit rule for baseline code changes.
Treat template wording as structure-only; localize headings and narrative text according to ../../../.agents/references/language-policy.md unless a field is explicitly English-only.

Codex Adaptation

Treat natural-language requests as the canonical $baseline-repro flow.
Preserve the original expectations around faithful reproduction and minimal baseline edits.
Use the Codex toolchain, but keep the canonical output files and state updates.

Execution Rule

Follow the local prompt, baseline report template, and language policy instead of simplifying the reproduction stage.

Related Skills

linzhe001/validate-run

development

VerifiedTrustedCommunity

WF7.5 training pipeline validation. Before entering WF8 iteration, first use Codex to review code for baseline equivalence, then run a 100-step smoke test to verify end-to-end pipeline functionality.

1SKILL.mdUpdated Apr 17, 2026

linzhe001/validate-run

linzhe001/survey-idea

business

VerifiedTrustedCommunity

WF1 Inspiration survey and gap analysis. Takes the user's research idea, performs literature search, gap analysis, competitor analysis, and feasibility scoring, then outputs Feasibility_Report.md. Use when the user has a new CV research idea that needs a feasibility assessment.

1SKILL.mdUpdated Apr 17, 2026

linzhe001/survey-idea

linzhe001/release

tools

VerifiedTrustedCommunity

WF10 Submission/Release Tool. Multi-scene training, result packaging, filename validation, dry-run submission checks. Used after ablation experiments are complete and before competition submission.

1SKILL.mdUpdated Apr 17, 2026

linzhe001/refine-arch

development

VerifiedTrustedCommunity

WF2 Architecture refinement and MVP design. Reads the feasibility report, analyzes the base codebase architecture, designs plug-and-play new modules, defines the MVP, provides A/B/C alternative plans, and outputs Technical_Spec.md. Use when a research idea needs to be translated into a concrete technical architecture design.

1SKILL.mdUpdated Apr 17, 2026

linzhe001/refine-arch

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/linzhe001/Harness-Research.git

# Copy into Claude Code skills folder (global)
cp -r Harness-Research/.agents/skills/baseline-repro ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

linzhe001/Harness-Research

1 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT