Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

cursor/interrogate

Name: interrogate
Author: cursor

pstack/skills/interrogate/SKILL.md

npx skillsauth add cursor/plugins interrogate

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Interrogate

Spawn four reviewers on four different models to adversarially review code changes. Each model gets the same prompt and rubric. The adversarial signal comes from model diversity, not assigned personas. Models differ in blind spots, priors, and reasoning patterns. Agreement across models is high-confidence signal; lone-model findings are worth reading but lower confidence.

The deliverable is a synthesized verdict. Do NOT auto-apply changes.

Step 1, Determine Scope

Identify what to review from context:

If the user points at specific files or a diff, use that
If on a feature branch, run git diff main...HEAD (or the appropriate base branch) for the full changeset
If the user's message references recent work, gather the relevant files

Package the diff (or file contents) plus any surrounding context files the reviewers need to understand the code.

Step 2, State the Intent

Before spawning reviewers, state the intent explicitly. What is this code trying to accomplish? Derive this from:

The user's message
Commit messages
PR description if one exists
The code itself

Write one clear paragraph. Reviewers challenge whether the work achieves the intent well, not whether the intent itself is correct. If you're unsure about the intent, ask the user before proceeding.

Step 3, Spawn Reviewers

Launch all four in a single message using the Task tool, each with a different model.

| Subagent | Model | |----------|-------| | Reviewer A | claude-opus-4-8-thinking-xhigh | | Reviewer B | gpt-5.3-codex-high-fast | | Reviewer C | gpt-5.5-high-fast | | Reviewer D | composer-2.5-fast |

For each reviewer:

subagent_type: generalPurpose
model: the model from the table
readonly: true

If a model slug in the table is rejected as unresolvable when you try to spawn the subagent, check the valid slugs in the Task tool's error message, pick the closest equivalent (prefer the highest-reasoning tier of the same family), spawn with the valid slug, and open a separate PR to update this table. Do not block the review on the slug issue.

Read references/reviewer-prompt.md and fill in the template with:

The stated intent
The diff or file contents
The review rubric from references/rubric.md
The code-quality lens from references/code-quality-review.md

The same filled template goes to all four reviewers, so every model applies the code-quality lens.

Each reviewer produces structured findings as described in the prompt template.

Step 4, Synthesize

As results come back, build a unified picture:

Parse all findings from the four reviewers
Identify consensus. Findings raised by 2+ models independently are highest signal.
Identify lone-model findings. Still worth reading, but weight accordingly.
Deduplicate. Different models may describe the same issue differently. Merge these and note which models raised it.
Note disagreements. If one model flags something and another explicitly says the opposite, that's useful context for the verdict.

Step 5, Lead Judgment

You are the lead reviewer, a pragmatic senior engineer, not a neutral aggregator.

Read references/lead-judgment.md for the full framework. Reviewers only see a slice of the codebase. You have the full context (the goal, the constraints, the timeline, which tradeoffs were already considered). Use that context aggressively.

Categorize every finding into one of four buckets:

Act on. Real issues affecting correctness, security, or maintainability given the actual goals. These would block a real PR.
Consider. Legitimate points, but you're not sure they outweigh the cost of addressing them right now. Worth the user's attention.
Noted. Technically valid but not actionable. Context-dependent, premature optimization, or low-impact given the current stage.
Dismissed. Wrong, nitpicky, or missing context. Brief explanation why.

For each finding, include:

Which model(s) raised it
The category (act on / consider / noted / dismissed)
A one-line rationale for the categorization

Output Format

Present the verdict in this structure:

Intent

[The stated intent paragraph from Step 2]

Reviewers

Model A: [model name], [N findings]
Model B: [model name], [N findings]
Model C: [model name], [N findings]
Model D: [model name], [N findings]

Act On

[Findings that should be addressed. For each: description, which models raised it, why it matters.]

Consider

[Findings worth thinking about. For each: description, which models raised it, tradeoff involved.]

Noted

[Valid but low-priority. Brief list.]

Dismissed

[Rejected findings with brief rationale. This shows the user what was filtered out and why, so they can override your judgment if they disagree.]

Agreement Map

[Where did models agree, where did they diverge, and what does the pattern of agreement/disagreement tell us?]

cursor/interrogate

pstack/skills/interrogate/SKILL.md

Use for "interrogate", "adversarial review", "multi-model review", "challenge this", "stress test this code", "find blind spots", or "tear this apart". Four LLM reviewers challenge changes from independent angles.

1,309 stars

development

Updated May 30, 2026

$ install --global

skillsauth

npx skillsauth add cursor/plugins interrogate

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: May 30, 2026, 2:41 AM140.0s5 files scanned

SKILL.md

name:: interrogate
description:: Use for \"interrogate\", \"adversarial review\", \"multi-model review\", \"challenge this\", \"stress test this code\", \"find blind spots\", or \"tear this apart\". Four LLM reviewers challenge changes from independent angles.
disable-model-invocation:: true

Interrogate

The deliverable is a synthesized verdict. Do NOT auto-apply changes.

Step 1, Determine Scope

Identify what to review from context:

If the user points at specific files or a diff, use that
If on a feature branch, run git diff main...HEAD (or the appropriate base branch) for the full changeset
If the user's message references recent work, gather the relevant files

Package the diff (or file contents) plus any surrounding context files the reviewers need to understand the code.

Step 2, State the Intent

Before spawning reviewers, state the intent explicitly. What is this code trying to accomplish? Derive this from:

The user's message
Commit messages
PR description if one exists
The code itself

Write one clear paragraph. Reviewers challenge whether the work achieves the intent well, not whether the intent itself is correct. If you're unsure about the intent, ask the user before proceeding.

Step 3, Spawn Reviewers

Launch all four in a single message using the Task tool, each with a different model.

For each reviewer:

subagent_type: generalPurpose
model: the model from the table
readonly: true

Read references/reviewer-prompt.md and fill in the template with:

The stated intent
The diff or file contents
The review rubric from references/rubric.md
The code-quality lens from references/code-quality-review.md

The same filled template goes to all four reviewers, so every model applies the code-quality lens.

Each reviewer produces structured findings as described in the prompt template.

Step 4, Synthesize

As results come back, build a unified picture:

Parse all findings from the four reviewers
Identify consensus. Findings raised by 2+ models independently are highest signal.
Identify lone-model findings. Still worth reading, but weight accordingly.
Deduplicate. Different models may describe the same issue differently. Merge these and note which models raised it.
Note disagreements. If one model flags something and another explicitly says the opposite, that's useful context for the verdict.

Step 5, Lead Judgment

You are the lead reviewer, a pragmatic senior engineer, not a neutral aggregator.

Categorize every finding into one of four buckets:

Act on. Real issues affecting correctness, security, or maintainability given the actual goals. These would block a real PR.
Consider. Legitimate points, but you're not sure they outweigh the cost of addressing them right now. Worth the user's attention.
Noted. Technically valid but not actionable. Context-dependent, premature optimization, or low-impact given the current stage.
Dismissed. Wrong, nitpicky, or missing context. Brief explanation why.

For each finding, include:

Which model(s) raised it
The category (act on / consider / noted / dismissed)
A one-line rationale for the categorization

Output Format

Present the verdict in this structure:

Intent

[The stated intent paragraph from Step 2]

Reviewers

Model A: [model name], [N findings]
Model B: [model name], [N findings]
Model C: [model name], [N findings]
Model D: [model name], [N findings]

Act On

[Findings that should be addressed. For each: description, which models raised it, why it matters.]

Consider

[Findings worth thinking about. For each: description, which models raised it, tradeoff involved.]

Noted

[Valid but low-priority. Brief list.]

Dismissed

[Rejected findings with brief rationale. This shows the user what was filtered out and why, so they can override your judgment if they disagree.]

Agreement Map

[Where did models agree, where did they diverge, and what does the pattern of agreement/disagreement tell us?]

Related Skills

cursor/principle-encode-lessons-in-structure

development

VerifiedTrustedCommunity

Apply when you catch yourself writing the same instruction a second time, or notice a recurring correction. Encode the rule as a lint, metadata flag, runtime check, or script instead of more text.

1,824SKILL.mdUpdated May 24, 2026

cursor/principle-encode-lessons-in-structure

cursor/principle-build-the-lever

tools

VerifiedTrustedCommunity

Apply to any non-trivial work, not just bulk work: edits, migrations, analyses, checks. Build the tool that does it or proves it (codemod, script, generator, or a skill your subagents follow) instead of working by hand. The tool is the artifact a reviewer can rerun.

1,309SKILL.mdUpdated May 27, 2026

cursor/principle-build-the-lever

cursor/why

tools

VerifiedTrustedCommunity

Use for 'why does X work this way', 'why we picked Y', design rationale, regressions, postmortems, or data-backed thresholds. Discovers available MCPs and queries each evidence category (source control, issue tracker, long-form docs, real-time chat, infrastructure observability, error tracking, product analytics warehouse) in parallel, then returns a cited read on decisions and tradeoffs. Use how for runtime behavior.

1,309SKILL.mdUpdated May 24, 2026

cursor/unslop

data-ai

VerifiedTrustedCommunity

Cut AI tells from any writing. Must always apply.

1,309SKILL.mdUpdated May 24, 2026

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/cursor/plugins.git

# Copy into Claude Code skills folder (global)
cp -r plugins/pstack/skills/interrogate ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

cursor/plugins

1,309 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT