Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

renatocaliari/skills/local/cali-testing

Name: skills/local/cali-testing
Author: renatocaliari

skills/local/cali-testing/SKILL.md

npx skillsauth add renatocaliari/agent-sync-public-skills skills/local/cali-testing

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Testing Protocol

After implementing any feature, run this protocol before marking complete.

| Phase | What | When to skip | |-------|------|--------------| | Phase 1: Unit Tests | Run test suite, block on failure | Never | | Phase 2: Code Review | Parallel subagent review | <3 files changed | | Phase 3: UI Quality | Accessibility + design audit | Non-visual features | | Phase 4: Browser Testing | Interactive QA | Non-interactive features | | Phase 5: Final Checklist | Pre-completion verification | Never |

Phase 1: Unit Tests

Run the project's test suite first:

# Go
go test ./...

# Node
npm test

# Python
pytest

Block until tests pass. Do not proceed with failing tests.

Phase 2: Parallel Code Review via Subagents

Launch fresh-context reviewers in parallel:

subagent({
  tasks: [
    {
      agent: "reviewer",
      task: "Review this diff for correctness, regressions, and edge cases. Focus on: logic errors, missing error handling, security issues, performance regressions. Provide specific line references.",
      output: false
    },
    {
      agent: "reviewer",
      task: "Review this diff for simplicity and code quality. Focus on: unnecessary complexity, dead code, naming clarity, adherence to project conventions. Remove slop and verbosity.",
      output: false
    }
  ],
  concurrency: 2,
  context: "fresh"
})

When to use subagents:

Diff touches 3+ files
Feature involves multiple components
Changes affect critical paths (auth, payments, data)

When to skip:

Single-file typo fix
Config-only change
Documentation update

Phase 3: UI Quality (if visual)

Only if the scope involves a visual interface.

Accessibility Audit

Load the audit skill for WCAG compliance:

/audit

Checks:

Color contrast ratios
Keyboard navigation
Screen reader compatibility
ARIA attributes
Focus management

Design Review

Load the critique skill for design quality:

/critique

Checks:

Cognitive load
Visual hierarchy
Consistency with existing patterns
AI slop detection (over-generated UI)

Phase 4: Browser Testing (if interactive)

Load agent-browser and dogfood skills for interactive testing:

/dogfood

Steps:

Open the feature in browser
Test happy path
Test error states
Test edge cases (empty states, loading, errors)
Capture screenshots for evidence

Phase 5: Final Checklist

Before marking feature complete:

[ ] Unit tests pass
[ ] Code review done (subagent or human)
[ ] No regressions detected
[ ] UI accessible (if applicable)
[ ] Documentation updated (if applicable)
[ ] AGENTS.md updated (if architecture changed)

Workflow Summary

Implement feature
    ↓
Run unit tests → FAIL? Fix first
    ↓
Parallel subagent review (if 3+ files)
    ↓
UI audit + critique (if visual)
    ↓
Browser testing (if interactive)
    ↓
Final checklist → All green? Mark complete

Examples

Example 1: Simple feature (1-2 files)

Input: "Just finished implementing the login form"

Steps:

Run: go test ./... (or npm test)
Skip subagent review (only 1 file)
Run: /audit for accessibility
Run: /dogfood to test in browser

Output: "All tests pass. Login form is accessible. Browser testing shows happy path works."

Example 2: Complex feature (5+ files)

Input: "Just finished the payment system — touches 6 files"

Steps:

Run: go test ./...
Launch subagent review (2 reviewers in parallel)
Skip UI audit (backend only)
Skip browser testing (no UI)
Complete checklist

Output: "Tests pass. Subagent review found 1 issue (missing error handling in payment_handler.go). Fixed."

Example 3: UI feature with accessibility

Input: "Finished the dashboard redesign — new charts and layout"

Steps:

Run: npm test
Launch subagent review (3+ files touched)
Run: /audit → found contrast issue on chart labels
Run: /critique → suggested reducing cognitive load
Run: /dogfood → all interactive elements work
Complete checklist

Output: "Tests pass. Review clean. Accessibility found 1 contrast issue (fixed). Design review suggests simpler chart layout."

Edge Cases

Tests fail

STOP. Do not proceed to review.
Fix tests first
Tell user: "Tests are failing — fix before review"

Subagent review finds critical issue

STOP. Do not mark complete.
Fix the issue
Re-run review on fixed code
Tell user: "Review found critical issue — fixed and re-reviewed"

UI audit finds WCAG violations

FIX before marking complete
Document what was fixed
If can't fix: document accepted risk with reason

Feature touches <3 files

Skip subagent review (not worth the overhead)
Do manual review instead
Still run tests and checklist

Test Cases

Should activate

"Test this feature"
"Run QA on my changes"
"Check if this is ready to ship"
"Dogfood the new feature"
"Review my code before merge"

Should NOT activate

"Write tests" (writing tests, not running QA)
"What testing framework do we use?" (question, not action)
"Fix the flaky test" (fixing, not reviewing)

References

references/subagent-patterns.md — Subagent task structure patterns

renatocaliari/skills/local/cali-testing

skills/local/cali-testing/SKILL.md

--- name: cali-testing description: Run post-implementation testing protocol. Triggers when: user says "test this", "run tests", "QA", "dogfood", "check quality", user finishes implementing a feature, or when a PR is ready for review. Also triggers on mentions of: test coverage, accessibility audit, WCAG, design review, code review, subagent review. Covers: parallel review via subagents, UI quality audit, accessibility check, and browser testing. --- # Testing Protocol After implementing any f

1 stars

development

Updated May 28, 2026

$ install --global

skillsauth

npx skillsauth add renatocaliari/agent-sync-public-skills skills/local/cali-testing

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: May 28, 2026, 4:07 AM13.3s2 files scanned

SKILL.md

name:: cali-testing
description:: Run post-implementation testing protocol. Triggers when: user says "test this", "run tests", "QA", "dogfood", "check quality", user finishes implementing a feature, or when a PR is ready for review. Also triggers on mentions of: test coverage, accessibility audit, WCAG, design review, code review, subagent review. Covers: parallel review via subagents, UI quality audit, accessibility check, and browser testing.

Testing Protocol

After implementing any feature, run this protocol before marking complete.

Phase 1: Unit Tests

Run the project's test suite first:

# Go
go test ./...

# Node
npm test

# Python
pytest

Block until tests pass. Do not proceed with failing tests.

Phase 2: Parallel Code Review via Subagents

Launch fresh-context reviewers in parallel:

subagent({
  tasks: [
    {
      agent: "reviewer",
      task: "Review this diff for correctness, regressions, and edge cases. Focus on: logic errors, missing error handling, security issues, performance regressions. Provide specific line references.",
      output: false
    },
    {
      agent: "reviewer",
      task: "Review this diff for simplicity and code quality. Focus on: unnecessary complexity, dead code, naming clarity, adherence to project conventions. Remove slop and verbosity.",
      output: false
    }
  ],
  concurrency: 2,
  context: "fresh"
})

When to use subagents:

Diff touches 3+ files
Feature involves multiple components
Changes affect critical paths (auth, payments, data)

When to skip:

Single-file typo fix
Config-only change
Documentation update

Phase 3: UI Quality (if visual)

Only if the scope involves a visual interface.

Accessibility Audit

Load the audit skill for WCAG compliance:

/audit

Checks:

Color contrast ratios
Keyboard navigation
Screen reader compatibility
ARIA attributes
Focus management

Design Review

Load the critique skill for design quality:

/critique

Checks:

Cognitive load
Visual hierarchy
Consistency with existing patterns
AI slop detection (over-generated UI)

Phase 4: Browser Testing (if interactive)

Load agent-browser and dogfood skills for interactive testing:

/dogfood

Steps:

Open the feature in browser
Test happy path
Test error states
Test edge cases (empty states, loading, errors)
Capture screenshots for evidence

Phase 5: Final Checklist

Before marking feature complete:

[ ] Unit tests pass
[ ] Code review done (subagent or human)
[ ] No regressions detected
[ ] UI accessible (if applicable)
[ ] Documentation updated (if applicable)
[ ] AGENTS.md updated (if architecture changed)

Workflow Summary

Implement feature
    ↓
Run unit tests → FAIL? Fix first
    ↓
Parallel subagent review (if 3+ files)
    ↓
UI audit + critique (if visual)
    ↓
Browser testing (if interactive)
    ↓
Final checklist → All green? Mark complete

Examples

Example 1: Simple feature (1-2 files)

Input: "Just finished implementing the login form"

Steps:

Run: go test ./... (or npm test)
Skip subagent review (only 1 file)
Run: /audit for accessibility
Run: /dogfood to test in browser

Output: "All tests pass. Login form is accessible. Browser testing shows happy path works."

Example 2: Complex feature (5+ files)

Input: "Just finished the payment system — touches 6 files"

Steps:

Run: go test ./...
Launch subagent review (2 reviewers in parallel)
Skip UI audit (backend only)
Skip browser testing (no UI)
Complete checklist

Output: "Tests pass. Subagent review found 1 issue (missing error handling in payment_handler.go). Fixed."

Example 3: UI feature with accessibility

Input: "Finished the dashboard redesign — new charts and layout"

Steps:

Run: npm test
Launch subagent review (3+ files touched)
Run: /audit → found contrast issue on chart labels
Run: /critique → suggested reducing cognitive load
Run: /dogfood → all interactive elements work
Complete checklist

Output: "Tests pass. Review clean. Accessibility found 1 contrast issue (fixed). Design review suggests simpler chart layout."

Edge Cases

Tests fail

STOP. Do not proceed to review.
Fix tests first
Tell user: "Tests are failing — fix before review"

Subagent review finds critical issue

STOP. Do not mark complete.
Fix the issue
Re-run review on fixed code
Tell user: "Review found critical issue — fixed and re-reviewed"

UI audit finds WCAG violations

FIX before marking complete
Document what was fixed
If can't fix: document accepted risk with reason

Feature touches <3 files

Skip subagent review (not worth the overhead)
Do manual review instead
Still run tests and checklist

Test Cases

Should activate

"Test this feature"
"Run QA on my changes"
"Check if this is ready to ship"
"Dogfood the new feature"
"Review my code before merge"

Should NOT activate

"Write tests" (writing tests, not running QA)
"What testing framework do we use?" (question, not action)
"Fix the flaky test" (fixing, not reviewing)

References

references/subagent-patterns.md — Subagent task structure patterns

Related Skills

renatocaliari/cali-degustia-metricas

tools

VerifiedTrustedCommunity

Extrai métricas estruturadas, cálculos e estimativas de transcripts de entrevistas com clientes do Sommelier de IA. Produz um JSON com dores, frequências, tempo gasto, pessoas envolvidas, economia potencial, ROI e recomendações financeiras. Projetado para alimentar o cali-degustia-diagnostico ou integrar com dashboards/planilhas.

2SKILL.mdUpdated Jul 25, 2026

renatocaliari/cali-degustia-metricas

renatocaliari/cali-degustia-depoimentos

tools

VerifiedTrustedCommunity

Guia a coleta de depoimentos de clientes do Sommelier de IA no momento certo do processo, usando a abordagem de Hormozi: pedir depois da primeira evidência de resultado, nunca na entrega. Gera depoimentos mais autênticos e reduz a sensação de que o cliente está sendo "solicitado".

2SKILL.mdUpdated Jul 25, 2026

renatocaliari/cali-degustia-depoimentos

renatocaliari/stelow-product-ux-critique

development

VerifiedTrustedCommunity

[stelow] Full UX critique for visual interfaces. Accepts a live URL, source code directory, or screenshot image. Evaluates accessibility (WCAG AA), Nielsen's 10 heuristics, visual hierarchy, cognitive load, consistency, mobile responsiveness, AI slop, emotional journey, and design personas — then generates a classified gap report. Standalone or integrated into stelow and stelow-product-testing-execution.

2SKILL.mdUpdated Jul 22, 2026

renatocaliari/stelow-product-ux-critique

renatocaliari/stelow-product-trust-building

development

VerifiedTrustedCommunity

Building trust through perception and guarantee mechanisms. Covers ten pillars to materialize trust, guarantee types from unconditional to anti-guarantees, and strategic approaches for different contexts.

2SKILL.mdUpdated Jul 22, 2026

renatocaliari/stelow-product-trust-building

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/renatocaliari/agent-sync-public-skills.git

# Copy into Claude Code skills folder (global)
cp -r agent-sync-public-skills/skills/local/cali-testing ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

renatocaliari/agent-sync-public-skills

1 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT

Adoption

renatocaliari/skills/local/cali-testing

$ install --global

Security Scan Results

SKILL.md

Testing Protocol

Contents

Phase 1: Unit Tests

Phase 2: Parallel Code Review via Subagents

Phase 3: UI Quality (if visual)

Accessibility Audit

Design Review

Phase 4: Browser Testing (if interactive)

Phase 5: Final Checklist

Workflow Summary

Examples

Example 1: Simple feature (1-2 files)

Example 2: Complex feature (5+ files)

Example 3: UI feature with accessibility

Edge Cases

Tests fail

Subagent review finds critical issue

UI audit finds WCAG violations

Feature touches <3 files

Test Cases

Should activate

Should NOT activate

References

Related Skills

renatocaliari/cali-degustia-metricas

renatocaliari/cali-degustia-depoimentos

renatocaliari/stelow-product-ux-critique

renatocaliari/stelow-product-trust-building

renatocaliari/skills/local/cali-testing

$ install --global

Security Scan Results

SKILL.md

Testing Protocol

Contents

Phase 1: Unit Tests

Phase 2: Parallel Code Review via Subagents

Phase 3: UI Quality (if visual)

Accessibility Audit

Design Review

Phase 4: Browser Testing (if interactive)

Phase 5: Final Checklist

Workflow Summary

Examples

Example 1: Simple feature (1-2 files)

Example 2: Complex feature (5+ files)

Example 3: UI feature with accessibility

Edge Cases

Tests fail

Subagent review finds critical issue

UI audit finds WCAG violations

Feature touches <3 files

Test Cases

Should activate

Should NOT activate

References

Related Skills

renatocaliari/cali-degustia-metricas

renatocaliari/cali-degustia-depoimentos

renatocaliari/stelow-product-ux-critique

renatocaliari/stelow-product-trust-building