Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

athola/utility

Name: utility
Author: athola

plugins/leyline/skills/utility/SKILL.md

npx skillsauth add athola/claude-night-market utility

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Utility Skill

Overview

A decision framework for agent orchestration based on Liu et al., "Utility-Guided Agent Orchestration for Efficient LLM Tool Use" (arXiv:2603.19896). Each candidate action is scored by subtracting weighted costs from expected gain, producing a single utility value that guides action selection. The framework prevents over-calling tools and premature stopping by making both errors costly. Utility range is [-2.3, 1.0].

When To Use

Deciding whether to dispatch another agent or tool call
Gating expensive tool calls (search, code execution, delegation)
Selecting the right model tier for a sub-task
Continuation decisions after receiving partial results
Verification gating before writing or committing output

When NOT to Use

Single-step operations with one obvious action
Trivial tasks where cost of scoring exceeds benefit
Already-committed actions that cannot be undone

Action Space

A = {respond, retrieve, tool_call, verify, delegate, stop}

| Action | Description | |-----------|------------------------------------------------------| | respond | Emit a final answer from current context | | retrieve | Fetch additional information (search, read, lookup) | | tool_call | Execute a tool (code runner, API, file write) | | verify | Check a prior result for correctness or completeness | | delegate | Spawn a sub-agent or hand off to a specialist | | stop | Terminate the loop and return current state |

Utility Function

U(a | s_t) = Gain(a | s_t)
           - λ₁ · StepCost(a | s_t)
           - λ₂ · Uncertainty(a | s_t)
           - λ₃ · Redundancy(a | s_t)

| Parameter | Default | Rationale | |-----------|---------|---------------------------------------------------| | λ₁ | 1.0 | Cost baseline; all other weights relative to this | | λ₂ | 0.5 | Weak empirical correlation with outcome (r=0.0131) | | λ₃ | 0.8 | Redundancy pruning yields ~10% token savings |

Utility range: [-2.3, 1.0]. Positive values indicate the action is worth taking. Values below the floor (-0.5 default) indicate the action should be skipped.

Termination Conditions

Stop the loop when any of the following is true:

(a) Selected action is stop
(b) Step budget exhausted (default: 10 steps)
(c) All non-stop actions score below the floor (default: -0.5)

High-gain override: If Gain >= 0.7 for any action, condition (c) may be overridden. Document the override and the gain value in your reasoning trace.

Quick Start

Minimal 4-step advisory pattern:

Construct state: gather task context per modules/state-builder.md
Score candidates: evaluate each action in A per modules/action-selector.md
Prefer highest utility: select the action with the maximum U(a | s_t), subject to termination conditions
Log score and decision: record the winning action, its utility value, and step count before executing

Detailed Resources

State Builder: modules/state-builder.md, how to populate s_t from task context
Gain: modules/gain.md, estimating expected information or progress gain
Step Cost: modules/step-cost.md, token, latency, and monetary cost tables
Uncertainty: modules/uncertainty.md, confidence estimation and calibration
Redundancy: modules/redundancy.md, detecting duplicate or low-delta actions
Action Selector: modules/action-selector.md, scoring loop and tie-breaking rules
Integration: modules/integration.md, wiring utility scoring into existing orchestration loops

Exit Criteria

[ ] State constructed with task goal and prior steps
[ ] All six actions scored before selecting one
[ ] Termination condition checked after each step
[ ] Score and decision logged for each step taken
[ ] High-gain overrides documented with gain value

athola/utility

plugins/leyline/skills/utility/SKILL.md

Scores agent actions by expected gain, cost, uncertainty, and redundancy. Use when deciding whether to dispatch an agent or invoke a tool.

304 stars

tools

Updated Jun 8, 2026

$ install --global

skillsauth

npx skillsauth add athola/claude-night-market utility

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Jun 8, 2026, 3:31 AM143.3s8 files scanned

SKILL.md

name:: utility
description:: Scores agent actions by expected gain, cost, uncertainty, and redundancy. Use when deciding whether to dispatch an agent or invoke a tool.
alwaysApply:: false
category:: infrastructure
dependencies:: []
complexity:: intermediate
model_hint:: standard
estimated_tokens:: 600
progressive_loading:: true

Utility Skill

Overview

When To Use

Deciding whether to dispatch another agent or tool call
Gating expensive tool calls (search, code execution, delegation)
Selecting the right model tier for a sub-task
Continuation decisions after receiving partial results
Verification gating before writing or committing output

When NOT to Use

Single-step operations with one obvious action
Trivial tasks where cost of scoring exceeds benefit
Already-committed actions that cannot be undone

Action Space

A = {respond, retrieve, tool_call, verify, delegate, stop}

Utility Function

U(a | s_t) = Gain(a | s_t)
           - λ₁ · StepCost(a | s_t)
           - λ₂ · Uncertainty(a | s_t)
           - λ₃ · Redundancy(a | s_t)

Utility range: [-2.3, 1.0]. Positive values indicate the action is worth taking. Values below the floor (-0.5 default) indicate the action should be skipped.

Termination Conditions

Stop the loop when any of the following is true:

(a) Selected action is stop
(b) Step budget exhausted (default: 10 steps)
(c) All non-stop actions score below the floor (default: -0.5)

High-gain override: If Gain >= 0.7 for any action, condition (c) may be overridden. Document the override and the gain value in your reasoning trace.

Quick Start

Minimal 4-step advisory pattern:

Construct state: gather task context per modules/state-builder.md
Score candidates: evaluate each action in A per modules/action-selector.md
Prefer highest utility: select the action with the maximum U(a | s_t), subject to termination conditions
Log score and decision: record the winning action, its utility value, and step count before executing

Detailed Resources

State Builder: modules/state-builder.md, how to populate s_t from task context
Gain: modules/gain.md, estimating expected information or progress gain
Step Cost: modules/step-cost.md, token, latency, and monetary cost tables
Uncertainty: modules/uncertainty.md, confidence estimation and calibration
Redundancy: modules/redundancy.md, detecting duplicate or low-delta actions
Action Selector: modules/action-selector.md, scoring loop and tie-breaking rules
Integration: modules/integration.md, wiring utility scoring into existing orchestration loops

Exit Criteria

[ ] State constructed with task goal and prior steps
[ ] All six actions scored before selecting one
[ ] Termination condition checked after each step
[ ] Score and decision logged for each step taken
[ ] High-gain overrides documented with gain value

Related Skills

athola/architecture-paradigm-domain-driven

data-ai

VerifiedTrustedCommunity

Models a business in its own language. Use when the domain has real business rules to capture.

323SKILL.mdUpdated Jul 15, 2026

athola/architecture-paradigm-domain-driven

athola/ideate

research

VerifiedTrustedCommunity

Generate diverse solution candidates with category-spanning ideation methods and rotation. Use when stuck on a design or fighting repetitive LLM output.

323SKILL.mdUpdated Jun 8, 2026

athola/validate-pr

development

VerifiedTrustedCommunity

Generates and self-executes a diff-derived test plan for a PR. Use when validating PR changes before merge. Do not use for code review; use sanctum:pr-review.

323SKILL.mdUpdated Jun 8, 2026

athola/graduated-implementation

development

VerifiedTrustedCommunity

Ramps implementation ambition a notch only after the prior increment is understood. Use when building a feature you must understand, not just ship.

323SKILL.mdUpdated Jun 8, 2026

athola/graduated-implementation

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/athola/claude-night-market.git

# Copy into Claude Code skills folder (global)
cp -r claude-night-market/plugins/leyline/skills/utility ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

athola/claude-night-market

304 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT