.claude/skills/release/SKILL.md
WF10 Submission/Release Tool. Multi-scene training, result packaging, filename validation, dry-run submission checks. Used after ablation experiments are complete and before competition submission.
npx skillsauth add linzhe001/Harness-Research releaseInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Competition/release requirements are read from PROJECT_STATE.json project_meta or CLAUDE.md ## Challenge Quick Ref.
Typical requirements include: submission file format, filename conventions, evaluation metrics, etc.
For language behavior, see ../../shared/language-policy.md.
</context>
validate — Check submission package completenesstransforms_test.json to determine all test viewpoints that need to be renderedpackage — Generate submission package{EVAL_SCRIPT} from CLAUDE.md ## Entry Scripts:
python {EVAL_SCRIPT} --checkpoint {best_ckpt} --split test --output_dir submission/
submission/README.md (method description)validate to confirm completenesssubmit — Multi-scene training + packaging (full pipeline){MULTI_SCENE_SCRIPT} from CLAUDE.md ## Entry Scripts:
python {MULTI_SCENE_SCRIPT} --scenes {scene_list} --config {best_config}
c. Evaluate and record metricspackagevalidateValidation reports, packaging summaries, and user-facing release notes should follow ../../shared/language-policy.md, while manifest keys, file names, paths, commands, and intent labels remain in English.
Update PROJECT_STATE.json:
current_stage.status → "completed"artifacts.submission_package → package pathhistory append completion record
</instructions>
development
WF7.5 training pipeline validation. Before entering WF8 iteration, first use Codex to review code for baseline equivalence, then run a 100-step smoke test to verify end-to-end pipeline functionality.
business
WF1 Inspiration survey and gap analysis. Takes the user's research idea, performs literature search, gap analysis, competitor analysis, and feasibility scoring, then outputs Feasibility_Report.md. Use when the user has a new CV research idea that needs a feasibility assessment.
development
WF2 Architecture refinement and MVP design. Reads the feasibility report, analyzes the base codebase architecture, designs plug-and-play new modules, defines the MVP, provides A/B/C alternative plans, and outputs Technical_Spec.md. Use when a research idea needs to be translated into a concrete technical architecture design.
testing
--- name: orchestrator description: CV research project orchestrator. Coordinates the 10-stage research workflow (WF1-WF10 + WF7.5 gate), tracks progress, and manages PROJECT_STATE.json. Supported commands: init (initialize), status (view state), next (advance stage), rollback (revert), decision (record decisions). Use when the user wants to manage CV research project progress, initialize a project, view status, or switch workflow stages. argument-hint: "[command: init|status|next|rollback|decis