Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

0x-shashi/skills/scoring

Name: skills/scoring
Author: 0x-shashi

skills/scoring/SKILL.md

npx skillsauth add 0x-shashi/web3-audit-skills skills/scoring

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Audit Scoring

Purpose

This directory provides the quantitative scoring framework for measuring audit quality. Use these metrics to objectively evaluate performance, track improvement over time, and identify areas needing attention.

Available Files

| File | Description | |------|-------------| | AUDIT_SCORING.md | Complete scoring system — detection, precision, severity accuracy, coverage, efficiency metrics, composite score formula, reward schema, tracking templates, and industry benchmarks |

Core Metrics at a Glance

| Metric | Weight | What It Measures | |--------|--------|-----------------| | Detection Score | 35% | Vulnerabilities correctly identified vs. total real vulnerabilities | | Precision Score | 25% | Valid findings vs. total findings submitted (false positive rate) | | Severity Accuracy | 15% | Correct severity classification vs. total findings | | Coverage Score | 15% | Functions/entry points audited vs. total codebase | | Efficiency Score | 10% | Weighted findings produced per hour spent |

Composite Score = (0.35 × Detection) + (0.25 × Precision) + (0.15 × Severity) + (0.15 × Coverage) + (0.10 × Efficiency)

Severity Weights for Efficiency Scoring

These weights connect the scoring system to the severity classification:

| Severity | Points | Reference | |----------|--------|-----------| | Critical | 10 | Escalation required (not in standard severity files) | | High | 5 | high-severity.md | | Medium | 2 | medium-severity.md | | Low | 1 | low-severity.md | | Informational | 0.5 | Best-practice suggestions | | Gas | 0 | gas-optimizations.md |

How to Use

After an audit → Fill out the Score Card Template in AUDIT_SCORING.md
Classify findings → Use severity/ files + severity-scoring decision tree
Track monthly → Use the Monthly Score Tracking template
Identify gaps → Category-specific scores highlight weak areas
Improve → Low category scores → update checklists and patterns

Related Skills

Severity Classification — HIGH / MEDIUM / LOW / GAS finding databases
Severity Scoring Decision Tree — How to assign severity levels
Feedback Loop — Scores feed back into skill improvement
Audit Report Templates — Report structure with severity sections
Prompt Evolution — Higher-scoring prompts get promoted

Prerequisites

Scoring requires completed audit findings with severity classifications. The Severity Classification skill MUST be applied before scoring.

Validation

To verify scoring accuracy, compare computed composite scores against known benchmarks:

# Example composite score calculation
detection = 0.85   # 85% of real vulns found
precision = 0.80   # 80% valid findings
severity_acc = 0.90 # 90% correct severity
coverage = 0.75    # 75% codebase covered
efficiency = 0.70  # Weighted findings per hour

composite = (0.35 * detection + 0.25 * precision + 0.15 * severity_acc + 0.15 * coverage + 0.10 * efficiency)
print(f"Composite Score: {composite:.2f}")  # Expected: 0.81

# Score thresholds for audit quality tiers
tiers:
  elite: 0.90+       # Top-tier competitive auditor
  proficient: 0.75+  # Solid professional auditor
  developing: 0.60+  # Learning auditor
  needs_work: <0.60  # Consider additional training

# Validate scoring data integrity
python scripts/quality-check.py skills/scoring/SKILL.md

Behavior Guidelines

Detection and Precision scores are required for every engagement
Coverage tracking is optional for quick scans but MUST be included in full audits
Efficiency scoring should be used for self-improvement, never to rush audits

References

Scoring References - Industry benchmarks and calibration data

0x-shashi/skills/scoring

skills/scoring/SKILL.md

Quantitative scoring framework for measuring audit quality with objective metrics to evaluate performance, track improvement over time, and identify areas needing attention. Use when benchmarking audit thoroughness, comparing engagement quality, or building quality gates into CI pipelines.

45 stars

development

Updated Mar 23, 2026

$ install --global

skillsauth

npx skillsauth add 0x-shashi/web3-audit-skills skills/scoring

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

70%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Mar 23, 2026, 10:59 PM183.2s3 files scanned

SKILL.md

id:: scoring
title:: Audit Scoring Skill
category:: methodology
difficulty:: intermediate
last_updated:: 2026-02-26
description:: >-

Audit Scoring

Purpose

Available Files

Core Metrics at a Glance

Composite Score = (0.35 × Detection) + (0.25 × Precision) + (0.15 × Severity) + (0.15 × Coverage) + (0.10 × Efficiency)

Severity Weights for Efficiency Scoring

These weights connect the scoring system to the severity classification:

How to Use

After an audit → Fill out the Score Card Template in AUDIT_SCORING.md
Classify findings → Use severity/ files + severity-scoring decision tree
Track monthly → Use the Monthly Score Tracking template
Identify gaps → Category-specific scores highlight weak areas
Improve → Low category scores → update checklists and patterns

Related Skills

Severity Classification — HIGH / MEDIUM / LOW / GAS finding databases
Severity Scoring Decision Tree — How to assign severity levels
Feedback Loop — Scores feed back into skill improvement
Audit Report Templates — Report structure with severity sections
Prompt Evolution — Higher-scoring prompts get promoted

Prerequisites

Scoring requires completed audit findings with severity classifications. The Severity Classification skill MUST be applied before scoring.

Validation

To verify scoring accuracy, compare computed composite scores against known benchmarks:

# Example composite score calculation
detection = 0.85   # 85% of real vulns found
precision = 0.80   # 80% valid findings
severity_acc = 0.90 # 90% correct severity
coverage = 0.75    # 75% codebase covered
efficiency = 0.70  # Weighted findings per hour

composite = (0.35 * detection + 0.25 * precision + 0.15 * severity_acc + 0.15 * coverage + 0.10 * efficiency)
print(f"Composite Score: {composite:.2f}")  # Expected: 0.81

# Score thresholds for audit quality tiers
tiers:
  elite: 0.90+       # Top-tier competitive auditor
  proficient: 0.75+  # Solid professional auditor
  developing: 0.60+  # Learning auditor
  needs_work: <0.60  # Consider additional training

# Validate scoring data integrity
python scripts/quality-check.py skills/scoring/SKILL.md

Behavior Guidelines

Detection and Precision scores are required for every engagement
Coverage tracking is optional for quick scans but MUST be included in full audits
Efficiency scoring should be used for self-improvement, never to rush audits

References

Scoring References - Industry benchmarks and calibration data

Related Skills

0x-shashi/skills/variant-analysis

development

VerifiedTrustedCommunity

Systematically hunt for every variant of a discovered vulnerability across the entire codebase. Use when a bug is found and all instances of the same root cause pattern must be identified, or when performing variant analysis during competitive audits on Code4rena or Sherlock.

45SKILL.mdUpdated Mar 23, 2026

0x-shashi/skills/variant-analysis

0x-shashi/skills/ton-scanner

testing

VerifiedTrustedCommunity

Use when the user wants to audit TON smart contracts for security vulnerabilities, scan FunC or Tact contracts for message chain replay, bounce handling, or gas issues, review TON DeFi protocols for actor-model concurrency flaws, or analyze asynchronous message passing security.

45SKILL.mdUpdated Mar 23, 2026

0x-shashi/skills/ton-scanner

0x-shashi/skills/token-analyzer

tools

VerifiedTrustedCommunity

Analyze ERC20/ERC721/ERC1155 token implementations for non-standard behavior, fee-on-transfer mechanics, rebasing logic, blacklists, pausability, and integration risks. Use when reviewing protocols that interact with external tokens or implementing token-related features.

45SKILL.mdUpdated Mar 23, 2026

0x-shashi/skills/token-analyzer

0x-shashi/skills/sui-scanner

testing

VerifiedTrustedCommunity

Use when the user wants to audit Sui Move smart contracts, scan Sui-specific patterns including object ownership, shared objects, or dynamic fields, review Sui DeFi protocols for object model security issues, or analyze Sui-specific transaction and consensus patterns.

45SKILL.mdUpdated Mar 23, 2026

0x-shashi/skills/sui-scanner

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/0x-shashi/web3-audit-skills.git

# Copy into Claude Code skills folder (global)
cp -r web3-audit-skills/skills/scoring ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

0x-shashi/web3-audit-skills

45 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT