jamesgray-ai/improve

Name: improve
Author: jamesgray-ai

.claude/skills/improve/SKILL.md

npx skillsauth add jamesgray-ai/handsonai improve

Improve Workflow

Evaluate and evolve running AI workflows. Review how a deployed workflow is performing against its original baseline, identify degradation or growth signals, and recommend whether to tune, redesign, or evolve the orchestration mechanism.

Workflow

1. Load workflow context

Read the Building Block Spec (including Evaluation Criteria), Run Guide, and original Test Results (baseline scores). Understand what was built, how it was designed to work, and what quality bar was established.

2. Current state assessment

Interview the user with concrete questions:

How often are you running this workflow?
How much manual editing does the output typically need?
Have your requirements or business context changed?
Are there new steps or decisions that have emerged since deployment?
What's working well that you want to preserve?

3. Quality evaluation

Identify signals of degradation or opportunity:

| Signal | What It Means |
|--------|---------------|
| Increasing manual edits | Context may need updating (stale examples, changed standards) |
| New decision types appearing | May need additional skills or agent capabilities |
| Steps being skipped | Workflow coverage gap — missing steps need to be added |
| Output quality inconsistent | Prompt or context needs tuning |
| User adding steps manually | Workflow scope has grown beyond original design |

4. Graduation assessment

Should the orchestration mechanism evolve?

Prompt → Skill-Powered Prompt — if repeatable sub-routines have emerged that deserve codification
Skill-Powered Prompt → Agent — if AI needs to make sequencing decisions rather than follow a fixed order
Single Agent → Multi-Agent — if complexity has grown to require specialized sub-agents

Only recommend graduation when there's a concrete capability gap, not just because "it could be more sophisticated."
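
The graduation rule above can be sketched as a small lookup, shown here as one possible reading and not as part of the skill itself. The transitions and their triggers come from the list above; the function name `graduation_target` and the `concrete_gap_observed` flag are hypothetical.

```python
# Transitions named in the text above. Keys and wording follow the source;
# the function name and the boolean flag are hypothetical.
GRADUATIONS = {
    "Prompt": ("Skill-Powered Prompt", "repeatable sub-routines deserve codification"),
    "Skill-Powered Prompt": ("Agent", "AI must make sequencing decisions"),
    "Single Agent": ("Multi-Agent", "complexity requires specialized sub-agents"),
}


def graduation_target(current_mechanism, concrete_gap_observed):
    """Return (next mechanism, trigger), or None if no graduation is warranted."""
    # Without a concrete capability gap, never recommend graduation.
    if not concrete_gap_observed:
        return None
    # Mechanisms with no listed successor (e.g. "Multi-Agent") yield None.
    return GRADUATIONS.get(current_mechanism)


print(graduation_target("Prompt", True))
print(graduation_target("Prompt", False))
```

Gating on the flag first keeps "it could be more sophisticated" from ever producing a recommendation on its own.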

5. Regression evaluation

Re-run the eval suite from Step 5 (Test) of the Business-First AI Framework:

Run the same test scenarios from the original baseline
Score on the same dimensions
Compare to baseline scores
Identify areas of degradation or improvement
Determine if the eval criteria themselves need updating (requirements may have shifted)
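
The comparison steps above can be sketched as follows. This is a minimal illustration under stated assumptions: the skill does not define a score scale or a tolerance, so the numeric scores, the `tolerance` of 0.25, the dimension names, and the function name `compare_to_baseline` are all hypothetical.

```python
def compare_to_baseline(baseline, current, tolerance=0.25):
    """Compare current scores to baseline scores on the same dimensions.

    Returns a list of (dimension, baseline, current, status) tuples, sorted by
    dimension name. The tolerance is an assumed noise band, not a value from
    the skill.
    """
    rows = []
    for dimension in sorted(baseline):
        if dimension not in current:
            # The dimension was not re-scored; flag it instead of guessing.
            rows.append((dimension, baseline[dimension], None, "missing"))
            continue
        delta = current[dimension] - baseline[dimension]
        if delta < -tolerance:
            status = "degraded"
        elif delta > tolerance:
            status = "improved"
        else:
            status = "stable"
        rows.append((dimension, baseline[dimension], current[dimension], status))
    return rows


# Hypothetical scores for illustration only.
baseline_scores = {"accuracy": 4.5, "tone": 4.0, "completeness": 3.5}
current_scores = {"accuracy": 3.5, "tone": 4.0, "completeness": 4.5}
for row in compare_to_baseline(baseline_scores, current_scores):
    print(row)
```

A "missing" status is a prompt to check whether the eval criteria themselves have shifted, which is the last item in the list above.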

6. Operationalization review (organizational workflows)

For workflows used by teams (not just individuals), assess:

Adoption — Is the team actually using it? What's the usage frequency?
Training — Do new team members know how to use it?
Governance — Are outputs being reviewed appropriately? Are there quality controls?

Skip this step for individual/personal workflows.

7. Recommendation

Produce one of the following:

No changes needed — workflow is performing at or above baseline, requirements haven't shifted
Tune — specific building blocks to adjust (identify which ones and what to change) → loop back to Build (Step 4) and Test (Step 5)
Redesign — requirements have changed enough that the workflow structure needs rethinking → loop back to Design (Step 3)
Evolve — graduate to a more capable orchestration mechanism → loop back to Design (Step 3) with an explicit graduation recommendation
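
One way to read the four outcomes above is as a priority-ordered decision. The outcome labels and loop-back targets are taken from the list; the ordering (capability gap first, then changed requirements, then regressions) and the three boolean inputs are assumptions made for this sketch.

```python
# Loop-back targets as listed above; None means no follow-up step.
NEXT_STEP = {
    "No changes needed": None,
    "Tune": "Build (Step 4) and Test (Step 5)",
    "Redesign": "Design (Step 3)",
    "Evolve": "Design (Step 3), with an explicit graduation recommendation",
}


def recommend(has_regressions, requirements_changed, capability_gap):
    """Return (outcome, next step) from three hypothetical yes/no findings."""
    if capability_gap:
        outcome = "Evolve"
    elif requirements_changed:
        outcome = "Redesign"
    elif has_regressions:
        outcome = "Tune"
    else:
        outcome = "No changes needed"
    return outcome, NEXT_STEP[outcome]


print(recommend(has_regressions=True, requirements_changed=False, capability_gap=False))
```

In practice the judgment is qualitative; the sketch only makes the precedence between outcomes explicit.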

Output

Write results to outputs/[workflow-name]-improvement-plan.md.

Include:

Current performance summary — how the workflow is being used and performing
Regression scores — comparison table of baseline vs. current scores
Issues identified — specific problems with diagnosed root causes
Recommendation — No changes / Tune / Redesign / Evolve, with rationale
Action items — concrete next steps if changes are recommended
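
As a rough sketch, the plan file could be assembled like this. Only the output path pattern and the five section names come from the text above; the function names, the argument shapes, and the markdown layout are hypothetical.

```python
from pathlib import Path


def plan_path(workflow_name, root="outputs"):
    """Build outputs/[workflow-name]-improvement-plan.md for a workflow name."""
    return Path(root) / f"{workflow_name}-improvement-plan.md"


def render_plan(summary, scores, issues, recommendation, rationale, actions):
    """Render the five required sections as markdown text (layout is assumed)."""
    lines = ["## Current performance summary", "", summary, ""]
    lines += ["## Regression scores", "", "| Dimension | Baseline | Current |", "|---|---|---|"]
    for dimension, (baseline, current) in scores.items():
        lines.append(f"| {dimension} | {baseline} | {current} |")
    lines += ["", "## Issues identified", ""]
    lines += [f"- {issue}" for issue in issues]
    lines += ["", "## Recommendation", "", f"{recommendation}: {rationale}", ""]
    lines += ["## Action items", ""]
    lines += [f"- {action}" for action in actions]
    return "\n".join(lines) + "\n"


def write_plan(workflow_name, text, root="outputs"):
    """Write the plan, creating the output directory if it does not exist."""
    path = plan_path(workflow_name, root)
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text(text, encoding="utf-8")
    return path
```

For a workflow named `weekly-report` (a made-up name), `plan_path("weekly-report")` resolves to `outputs/weekly-report-improvement-plan.md`.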

Guidelines

Don't prompt for information the user can't answer. If they don't track usage metrics, work with qualitative signals instead.
Focus on concrete signals, not abstract evaluation. "Your context file references Q3 goals but it's Q1" beats "your context may be stale."
This step is typically invoked weeks or months after initial deployment, in a separate conversation from the original build.
Not every workflow needs improvement. If it's working, say so and move on.

Evaluate a running AI workflow for quality, relevance, and evolution opportunities. Use when the user wants to review how a deployed workflow is performing, check if it needs tuning, or assess whether it should graduate to a more capable orchestration mechanism. This is Step 7 (Improve) of the Business-First AI Framework.

3 stars

development

Updated Apr 15, 2026

npx skillsauth add jamesgray-ai/handsonai improve

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Trivy: Container and dependency vulnerability scanner. Clean, 95%
Semgrep: Static code analysis for vulnerabilities. Clean, 95%
mcp-scan (Snyk): Model Context Protocol security validation. Clean, 95%
Snyk (dep): Open source security scanning. Skipped, 50%
Socket.dev: Supply chain security analysis. Skipped, 50%
VirusTotal: Multi-engine malware detection. Skipped, 50%
CrowdStrike: Advanced threat intelligence. Skipped, 50%
OSV-Scanner: Open Source Vulnerability database check. Skipped, 50%
OWASP Dep-Check: Skipped, 50%

Last scanned: Apr 15, 2026, 2:05 PM · 28.6s · 1 file scanned

SKILL.md

name: improve
description: >
user-invocable: true

Related Skills

jamesgray-ai/writing-workflow-sops

documentation

Verified · Trusted · Community

Write Standard Operating Procedure documentation for workflows and save as markdown files. Selects full or lightweight SOP template based on autonomy level (deterministic vs. guided/autonomous), then adapts for workflow type (Manual, Augmented, Automated). Use when the user asks to write an SOP, document a workflow, create procedure documentation, or capture how a workflow is executed. Triggers on "write an SOP", "document this workflow", "create operating instructions", "how is this workflow executed".

3 · SKILL.md · Updated Apr 15, 2026

jamesgray-ai/writing-process-guides

documentation

Verified · Trusted · Community

Write Business Process Guide documentation that explains when, why, and how to execute a complete business process with its component workflows, and save as markdown files. Use when documenting a business process end-to-end, creating playbooks, or explaining how multiple workflows fit together. Triggers on "write process guide", "document this process", "create a playbook for", "how do these workflows connect".

3 · SKILL.md · Updated Apr 15, 2026

jamesgray-ai/syncing-skills-to-github

development

Verified · Trusted · Community

This skill should be used when the user wants to sync skills to GitHub, push skill changes to a remote repository, or back up local skills. Syncs Claude Agent Skills from ~/.claude/skills/ (local) to GitHub repository using git commands. Commits changes, pushes to remote, and updates Notion AI Building Blocks with GitHub URLs.

3 · SKILL.md · Updated Apr 15, 2026

jamesgray-ai/registering-building-blocks

development

Verified · Trusted · Community

This skill should be used when the user wants to register or update AI building blocks (Skills, Agents, Prompts, Context MDs) in the Notion AI Building Blocks database. Triggers after skill creation, agent creation, prompt authoring, context MD updates, or when the user asks to register, add, or track a building block in Notion.

3 · SKILL.md · Updated Apr 15, 2026

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Install manually

# Clone the repo
git clone https://github.com/jamesgray-ai/handsonai.git

# Copy into Claude Code skills folder (global)
cp -r handsonai/.claude/skills/improve ~/.claude/skills/

Repository

jamesgray-ai/handsonai

3 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT