0xhoneyjar/red-team

Name: red-team
Author: 0xhoneyjar

.claude/skills/red-teaming/SKILL.md

npx skillsauth add 0xhoneyjar/loa-freeside red-team

Red Team — Generative Adversarial Security Design

Purpose

Use the Flatline Protocol's red team mode to generate creative attack scenarios against design documents. Produces structured attack scenarios with consensus classification and architectural counter-designs.

Invocation

/red-team grimoires/loa/sdd.md
/red-team grimoires/loa/sdd.md --focus "agent-identity,token-gated-access"
/red-team grimoires/loa/sdd.md --mode quick
/red-team grimoires/loa/sdd.md --depth 2 --mode deep
/red-team --spec "Users authenticate via wallet signature and receive a JWT"

Arguments

| Argument | Flag | Default | Description |
|----------|------|---------|-------------|
| document | positional | required | Path to document to red-team |
| spec | --spec | — | Inline spec text (creates temp document) |
| focus | --focus | all | Comma-separated attack surface categories |
| section | --section | all | Specific document section to target |
| depth | --depth | 1 | Attack-counter_design iterations |
| mode | --mode | standard | Execution mode: quick, standard, deep |
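The argument table can be mirrored in a small parser. The following is a minimal sketch in Python using `argparse`; the skill itself is invoked as a slash command and its real parsing is not shown here, so `build_parser`, `parse_args`, and the rule that either a document path or `--spec` must be present are assumptions made for illustration.

```python
import argparse

MODES = ("quick", "standard", "deep")


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical parser mirroring the argument table."""
    parser = argparse.ArgumentParser(prog="red-team")
    parser.add_argument("document", nargs="?", help="Path to document to red-team")
    parser.add_argument("--spec", help="Inline spec text")
    parser.add_argument("--focus", default="all",
                        help="Comma-separated attack surface categories")
    parser.add_argument("--section", default="all",
                        help="Specific document section to target")
    parser.add_argument("--depth", type=int, default=1,
                        help="Number of attack/counter-design iterations")
    parser.add_argument("--mode", choices=MODES, default="standard")
    return parser


def parse_args(argv):
    parser = build_parser()
    args = parser.parse_args(argv)
    # Assumption: the document may be omitted only when --spec is given.
    if args.document is None and args.spec is None:
        parser.error("provide a document path or --spec")
    # Split the comma-separated focus string into a list of categories.
    args.focus = [item.strip() for item in args.focus.split(",") if item.strip()]
    return args
```

For example, `parse_args(["grimoires/loa/sdd.md", "--depth", "2", "--mode", "deep"])` yields a namespace with `depth == 2` and `mode == "deep"`.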

Workflow

1. Validate Config: Check red_team.enabled: true in .loa.config.yaml
2. Input Handling: Load document or create temp file from --spec
3. Surface Loading: Load attack surfaces from registry, filter by --focus
4. Invoke Orchestrator: Call flatline-orchestrator.sh --mode red-team
5. Present Results: Show attack summary with consensus categories
6. Human Gate: If any severity >800, require human acknowledgment

Execution Modes

| Mode | Models | Cross-Validation | Counter-Design | Budget |
|------|--------|-------------------|----------------|--------|
| Quick | 2 (primary only) | Skip | Inline only | 50K tokens |
| Standard | 4 (primary + secondary) | Full | Full synthesis | 200K tokens |
| Deep | 4 + iteration | Full | Full + multi-depth | 500K tokens |
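To make the trade-off between modes concrete, the table can be held as data and queried. This is a sketch only: `ModeProfile`, `within_budget`, and `cheapest_mode_for` are hypothetical names, and treating the budget as an inclusive upper bound is an assumption.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class ModeProfile:
    models: int
    cross_validation: bool
    max_tokens: int


# Values copied from the Execution Modes table, ordered cheapest first.
MODE_PROFILES = {
    "quick": ModeProfile(models=2, cross_validation=False, max_tokens=50_000),
    "standard": ModeProfile(models=4, cross_validation=True, max_tokens=200_000),
    "deep": ModeProfile(models=4, cross_validation=True, max_tokens=500_000),
}


def within_budget(mode: str, tokens_used: int) -> bool:
    """Assumption: a run is within budget up to and including the limit."""
    return tokens_used <= MODE_PROFILES[mode].max_tokens


def cheapest_mode_for(estimated_tokens: int) -> Optional[str]:
    """Return the first mode whose budget covers the estimate, or None."""
    for name, profile in MODE_PROFILES.items():
        if estimated_tokens <= profile.max_tokens:
            return name
    return None
```

A helper like `cheapest_mode_for` is one way to choose a mode before a run instead of reacting to a "Budget exceeded" error afterwards.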

Quick Mode Restrictions

Outputs labeled UNVALIDATED
Cannot produce CONFIRMED_ATTACK — all findings are THEORETICAL or CREATIVE_ONLY
No cross-validation performed
For exploratory use only, not for gating decisions

Consensus Categories

| Category | Criteria | Meaning |
|----------|----------|---------|
| CONFIRMED_ATTACK | Both models score >700 | Attack is realistic and should be addressed |
| THEORETICAL | One model >700, other ≤700 | Plausible but models disagree |
| CREATIVE_ONLY | Neither model scores >700 | Novel but neither model finds it convincing |
| DEFENDED | Both models >700 AND counter-design exists | Attack is real but already has effective defense |

Score Examples:

GPT=850, Opus=900 → CONFIRMED_ATTACK (both >700)
GPT=800, Opus=400 → THEORETICAL (one >700, other ≤700)
GPT=650, Opus=750 → THEORETICAL (Opus >700, GPT ≤700)
GPT=500, Opus=600 → CREATIVE_ONLY (neither >700)
GPT=300, Opus=200 → CREATIVE_ONLY (neither >700)
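The classification rule behind these examples can be written as a short function. This is a minimal sketch of the table's criteria, not the orchestrator's actual code: `classify` and its parameter names are hypothetical, and the `theoretical: 400` value from the configuration is left out because the table does not say how it is applied.

```python
def classify(score_a: int, score_b: int, threshold: int = 700,
             has_counter_design: bool = False) -> str:
    """Map two model scores (0-1000) to a consensus category."""
    # Count how many of the two models score strictly above the threshold.
    above = int(score_a > threshold) + int(score_b > threshold)
    if above == 2:
        # Both models agree; an existing counter-design marks it DEFENDED.
        return "DEFENDED" if has_counter_design else "CONFIRMED_ATTACK"
    if above == 1:
        return "THEORETICAL"
    return "CREATIVE_ONLY"


examples = [(850, 900), (800, 400), (650, 750), (500, 600), (300, 200)]
for gpt, opus in examples:
    print(f"GPT={gpt}, Opus={opus} -> {classify(gpt, opus)}")
```

Because the comparison is strict, a score of exactly 700 does not count as above the threshold, which matches the "≤700" wording in the table.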

Human Validation Gate

When any attack scores severity >800:

Interactive mode: Present attack details and require acknowledgment:

HUMAN REVIEW REQUIRED

ATK-003: Confused Deputy in Ensemble Routing
Severity: 920/1000
Consensus: CONFIRMED_ATTACK

[A]cknowledge / [D]ismiss / [E]scalate

Autonomous mode: Write to pending-review.json for later human review.
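The autonomous branch of the gate can be sketched as a filter plus a file write. The attack record fields (`id`, `severity`), the function names, and the JSON layout are assumptions for illustration; only the >800 rule and the `pending-review.json` file name come from the description above.

```python
import json
from pathlib import Path


def attacks_needing_review(attacks, gate=800):
    """Return the attacks whose severity is strictly above the gate."""
    return [attack for attack in attacks if attack["severity"] > gate]


def queue_for_review(attacks, path, gate=800):
    """Write flagged attacks to a JSON file for later human review."""
    flagged = attacks_needing_review(attacks, gate)
    if flagged:
        # Only create the file when something actually needs review.
        Path(path).write_text(json.dumps(flagged, indent=2), encoding="utf-8")
    return flagged
```

With a hypothetical record such as `{"id": "ATK-003", "severity": 920}`, calling `queue_for_review(attacks, "pending-review.json")` would write that single entry and return it.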

Output Files

| File | Permissions | Content |
|------|-------------|---------|
| .run/red-team/rt-{id}-result.json | 0644 | Full JSON result |
| .run/red-team/rt-{id}-report.md | 0600 | Full report (restricted) |
| .run/red-team/rt-{id}-summary.md | 0644 | Safe summary for PR/CI |
| .run/red-team/.ci-safe | 0644 | Manifest of CI-safe files |
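The permission split matters because the full report is restricted while the summary is meant to be shared. Below is a sketch of writing the three per-run files with the listed modes on a POSIX system; `write_with_mode` and `write_outputs` are hypothetical helpers, and the `.ci-safe` manifest is omitted because its format is not described.

```python
import os
from pathlib import Path

# Permissions copied from the Output Files table.
FILE_MODES = {"result.json": 0o644, "report.md": 0o600, "summary.md": 0o644}


def write_with_mode(path: Path, text: str, mode: int) -> None:
    """Create or truncate the file, then enforce the exact permission bits."""
    fd = os.open(path, os.O_WRONLY | os.O_CREAT | os.O_TRUNC, mode)
    with os.fdopen(fd, "w", encoding="utf-8") as handle:
        handle.write(text)
    # The process umask can clear bits at creation time, and an existing
    # file keeps its old mode, so set the mode explicitly afterwards.
    os.chmod(path, mode)


def write_outputs(run_dir: Path, run_id: str, contents: dict) -> list:
    """Write rt-{id}-result.json, rt-{id}-report.md and rt-{id}-summary.md."""
    run_dir.mkdir(parents=True, exist_ok=True)
    written = []
    for suffix, mode in FILE_MODES.items():
        path = run_dir / f"rt-{run_id}-{suffix}"
        write_with_mode(path, contents[suffix], mode)
        written.append(path)
    return written
```

Opening the file with the restrictive mode first, rather than writing and then tightening, avoids a window in which the full report is readable by other users.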

Error Handling

| Error | Cause | Resolution |
|-------|-------|------------|
| "red_team.enabled is not true" | Config toggle off | Set red_team.enabled: true |
| "Input blocked by sanitizer" | Credentials in document | Remove credentials from input |
| "Budget exceeded" | Token limit hit | Use lower execution mode |
| "Orchestrator failed" | Model invocation error | Check API keys, retry |

Configuration

red_team:
  enabled: true
  mode: standard
  thresholds:
    confirmed_attack: 700
    theoretical: 400
    human_review_gate: 800
  budgets:
    quick_max_tokens: 50000
    standard_max_tokens: 200000
    deep_max_tokens: 500000
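As an illustration of how these settings might be consumed, the sketch below reads them from an already-parsed mapping (for example, the result of loading .loa.config.yaml with a YAML library). `RedTeamSettings` and `load_settings` are hypothetical, and falling back to the values shown above when a key is missing is an assumption, not documented behavior.

```python
from dataclasses import dataclass

# Budgets from the configuration example, keyed by mode.
DEFAULT_BUDGETS = {"quick": 50_000, "standard": 200_000, "deep": 500_000}


@dataclass(frozen=True)
class RedTeamSettings:
    mode: str
    confirmed_attack: int
    theoretical: int
    human_review_gate: int
    max_tokens: int


def load_settings(config: dict) -> RedTeamSettings:
    section = config.get("red_team", {})
    # Mirrors the first workflow step: refuse to run unless enabled is true.
    if section.get("enabled") is not True:
        raise ValueError("red_team.enabled is not true")
    mode = section.get("mode", "standard")
    if mode not in DEFAULT_BUDGETS:
        raise ValueError(f"unknown mode: {mode}")
    thresholds = section.get("thresholds", {})
    budgets = section.get("budgets", {})
    return RedTeamSettings(
        mode=mode,
        confirmed_attack=thresholds.get("confirmed_attack", 700),
        theoretical=thresholds.get("theoretical", 400),
        human_review_gate=thresholds.get("human_review_gate", 800),
        max_tokens=budgets.get(f"{mode}_max_tokens", DEFAULT_BUDGETS[mode]),
    )
```

Failing early on a disabled toggle keeps the check in one place, so later steps can assume a valid, enabled configuration.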

Simstim Integration

When red_team.simstim.auto_trigger: true, the red team automatically runs as Phase 4.5 (RED TEAM SDD) during the simstim workflow, after FLATLINE SDD review and before PLANNING.

/flatline-review — Standard Flatline Protocol quality review
/audit — Codebase security audit (implementation-level)
.claude/data/attack-surfaces.yaml — Attack surface registry
.claude/data/red-team-golden-set.json — Calibration corpus

7 stars

testing

Updated Mar 27, 2026

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

npx skillsauth add 0xhoneyjar/loa-freeside red-team

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

| Scanner | Description | Status | Percentage |
|---------|-------------|--------|------------|
| Trivy | Container and dependency vulnerability scanner | Clean | 95% |
| Semgrep | Static code analysis for vulnerabilities | Clean | 95% |
| mcp-scan (Snyk) | Model Context Protocol security validation | Clean | 95% |
| Snyk (dep) | Open source security scanning | Skipped | 50% |
| Socket.dev | Supply chain security analysis | Skipped | 50% |
| VirusTotal | Multi-engine malware detection | Skipped | 50% |
| CrowdStrike | Advanced threat intelligence | Skipped | 50% |
| OSV-Scanner | Open Source Vulnerability database check | Skipped | 50% |
| OWASP Dep-Check | | Skipped | 50% |

Last scanned: Mar 28, 2026, 4:25 AM · 36.6s · 2 files scanned

SKILL.md

name: red-team
description: Red Team — Generative Adversarial Security Design
schema_version: 1
read_files: true
search_code: true
write_files: true
execute_commands: false
web_access: true
user_interaction: false
agent_spawn: false
task_management: false
cost-profile: heavy

Related Skills

0xhoneyjar/evals/fixtures/loa-skill-dir/.claude/skills/test-skill

Category: development
Verified · Trusted · Community
Test Skill: A minimal skill for framework testing. Constraints: C-PROC-001 (never write code outside implement); C-PROC-005 (always complete full review cycle).
7 · SKILL.md · Updated Mar 24, 2026

0xhoneyjar/.claude/tests/fixtures/registry/skills/test-vendor/valid-skill

Category: testing
Verified · Trusted · Community
valid-skill: Test skill with valid license for unit testing. Purpose: used in test_constructs_loader.bats to verify correct handling of valid licenses.
7 · SKILL.md · Updated Mar 24, 2026

0xhoneyjar/.claude/tests/fixtures/registry/skills/test-vendor/grace-skill

Category: testing
Verified · Trusted · Community
grace-skill: Test skill in license grace period for unit testing. Purpose: used in test_constructs_loader.bats to verify correct handling of licenses in grace period.
7 · SKILL.md · Updated Mar 24, 2026

0xhoneyjar/.claude/tests/fixtures/registry/skills/test-vendor/expired-skill

Category: testing
Verified · Trusted · Community
expired-skill: Test skill with expired license for unit testing. Purpose: used in test_constructs_loader.bats to verify correct handling of expired licenses.
7 · SKILL.md · Updated Mar 24, 2026

Install manually

# Clone the repo
git clone https://github.com/0xhoneyjar/loa-freeside.git

# Copy into Claude Code skills folder (global)
cp -r loa-freeside/.claude/skills/red-teaming ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

0xhoneyjar/loa-freeside

7 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT