.kiro/skills/qe-cicd-pipeline-qe-orchestrator/SKILL.md
Orchestrate quality engineering across CI/CD pipeline phases. Use when designing test strategies, planning quality gates, or implementing shift-left/shift-right testing.
npx skillsauth add proffesor-for-testing/agentic-qe qe-cicd-pipeline-qe-orchestrator

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
<default_to_action> When orchestrating quality across CI/CD pipeline:
Quick Phase Selection:
Critical Success Factors:
| Phase | Primary Agents | Key Skills |
|-------|---------------|------------|
| Commit | qe-test-generator, qe-requirements-validator | tdd-london-chicago, shift-left |
| Build | qe-test-executor, qe-coverage-analyzer, qe-flaky-test-hunter | test-automation, mutation-testing |
| Test | qe-api-contract-validator, qe-performance-tester, qe-security-scanner | api-testing, performance, security |
| Staging | qe-chaos-engineer, qe-visual-tester, qe-deployment-readiness | chaos-engineering, accessibility |
| Prod | qe-production-intelligence, qe-quality-analyzer | shift-right, compliance |
| Phase | Metric | Threshold | Blocking |
|-------|--------|-----------|----------|
| Commit | Unit coverage | > 80% | Yes |
| Build | All tests pass | 100% | Yes |
| Build | Mutation score | > 70% | No |
| Test | API contracts | No breaking changes | Yes |
| Test | p95 response | < 200ms | Yes |
| Test | Security critical | 0 | Yes |
| Staging | Deployment readiness | > 85% | Yes |
Goal: Catch defects early, ensure testability
Agents:
- qe-test-generator - Generate unit tests
- qe-requirements-validator - BDD scenarios, INVEST criteria

Skills: shift-left-testing, tdd-london-chicago, code-review-quality
// Parallel execution
Task("Generate Tests", "Create unit tests for new methods", "qe-test-generator")
Task("Validate Requirements", "Check BDD scenarios", "qe-requirements-validator")
Gates: Unit coverage > 80%, Static analysis clean, Code review approved
Goal: Validate integration, ensure coverage
Agents:
- qe-test-executor - Run test suites
- qe-coverage-analyzer - Coverage gaps (O(log n))
- qe-flaky-test-hunter - Detect/stabilize flaky tests
- qe-regression-risk-analyzer - Minimal regression suite

Skills: test-automation-strategy, mutation-testing, regression-testing
Task("Execute Tests", "Run full suite, store in aqe/test-results/*", "qe-test-executor")
Task("Coverage Analysis", "Identify gaps", "qe-coverage-analyzer")
Task("Flaky Detection", "Analyze test history", "qe-flaky-test-hunter")
Gates: All tests pass, Coverage > 90% on critical paths, No new flaky tests
Goal: Validate contracts, performance, security
Agents:
- qe-api-contract-validator - Breaking changes detection
- qe-performance-tester - Load test critical paths
- qe-security-scanner - SAST/DAST scans
- qe-test-data-architect - Realistic test data (10k+/sec)

Skills: api-testing-patterns, performance-testing, security-testing
// Parallel testing
Task("API Contracts", "Validate for breaking changes", "qe-api-contract-validator")
Task("Performance", "Load test 1000 users", "qe-performance-tester")
Task("Security", "SAST/DAST scan", "qe-security-scanner")
Gates: No breaking API changes, p95 < 200ms, No critical vulnerabilities
Goal: Validate production-like environment, resilience
Agents:
- qe-chaos-engineer - Fault injection
- qe-visual-tester - Visual regression
- qe-deployment-readiness - Risk assessment

Skills: chaos-engineering-resilience, accessibility-testing, visual-testing
Task("Chaos Testing", "Controlled failure injection", "qe-chaos-engineer")
Task("Visual Testing", "Visual regression", "qe-visual-tester")
Task("Deployment Check", "Risk assessment", "qe-deployment-readiness")
Gates: Chaos tests pass, No visual regressions, Readiness > 85%
Goal: Monitor real usage, validate compliance
Agents:
- qe-production-intelligence - Incident → test scenarios
- qe-quality-analyzer - Quality metrics and trends

Skills: shift-right-testing, compliance-testing
Task("Production Intelligence", "Convert incidents to tests", "qe-production-intelligence")
Task("Quality Analysis", "Production metrics", "qe-quality-analyzer")
Gates: Synthetic monitors pass, Error rate < 0.1%, Compliance validated
// Phase 1: Commit
Task("TDD Generation", "Generate tests for new features", "qe-test-generator")
Task("Requirements", "Validate BDD scenarios", "qe-requirements-validator")
// Phase 2: Build
Task("Execute Tests", "Full suite with coverage", "qe-test-executor")
Task("Coverage", "Analyze gaps", "qe-coverage-analyzer")
Task("Flaky Hunt", "Stabilize flaky tests", "qe-flaky-test-hunter")
// Phase 3: Integration
Task("API Contracts", "Check breaking changes", "qe-api-contract-validator")
Task("Performance", "1000 user load test", "qe-performance-tester")
Task("Security", "SAST/DAST scans", "qe-security-scanner")
// Phase 4: Staging
Task("Chaos", "Fault injection testing", "qe-chaos-engineer")
Task("Visual", "Visual regression", "qe-visual-tester")
Task("Readiness", "Deployment assessment", "qe-deployment-readiness")
// Phase 5: Production
Task("Intelligence", "Convert incidents", "qe-production-intelligence")
Task("Quality Gate", "Final validation", "qe-quality-gate")
| Risk | Strategy | Agents |
|------|----------|--------|
| Critical | All phases, manual gates | Full fleet |
| High | Automated gates, comprehensive | 10+ agents |
| Medium | Smart selection, risk-based | 5-8 agents |
| Low | Minimal regression, fast | 2-3 agents |
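
The risk table above can be read as a simple lookup from risk level to strategy. The following is a minimal sketch of that lookup, not part of the skill itself: the `RiskLevel` and `Strategy` types, the `manualGates` flag, and the `selectStrategy` function are hypothetical names introduced for illustration, and the strings are copied from the table.

```typescript
// Hypothetical types for illustration; the skill does not define them.
type RiskLevel = "critical" | "high" | "medium" | "low";

interface Strategy {
  approach: string;     // "Strategy" column from the table
  agents: string;       // "Agents" column from the table
  manualGates: boolean; // assumption: only the critical row mentions manual gates
}

const STRATEGIES: Record<RiskLevel, Strategy> = {
  critical: { approach: "All phases, manual gates", agents: "Full fleet", manualGates: true },
  high: { approach: "Automated gates, comprehensive", agents: "10+ agents", manualGates: false },
  medium: { approach: "Smart selection, risk-based", agents: "5-8 agents", manualGates: false },
  low: { approach: "Minimal regression, fast", agents: "2-3 agents", manualGates: false },
};

// Returns the table row that matches the assessed risk level.
function selectStrategy(risk: RiskLevel): Strategy {
  return STRATEGIES[risk];
}
```

Keeping the mapping as data rather than branching logic makes it easy to adjust agent counts without touching the selection code.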
| Type | Focus Skills | Primary Agents |
|------|-------------|----------------|
| API | api-testing, contract, performance | api-contract-validator, performance-tester |
| Web UI | visual-testing, accessibility | visual-tester, accessibility |
| Mobile | mobile-testing, compatibility | performance-tester, visual-tester |
| Backend | database-testing, security | security-scanner, performance-tester |
aqe/pipeline/
├── phase-results/* - Results from each phase
├── quality-gates/* - Gate validation results
├── orchestration-plan/* - Selected skills and agents
├── test-plan/generated - Test plans
├── coverage/gaps - Coverage analysis
├── security/findings - Security results
└── performance/results - Performance data
Task("Fleet Orchestration",
"Coordinate 10 agents across phases: commit (2), build (3), test (3), staging (2)",
"qe-fleet-commander")
| Event | Trigger | Subscribers |
|-------|---------|-------------|
| phase:commit:complete | Commit phase done | build agents |
| coverage:gap:detected | Gap found | test-generator |
| security:finding:critical | Vulnerability | quality-gate |
| quality:gate:evaluated | Gate decision | fleet-commander |
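
To make the event table concrete, here is a minimal in-process publish/subscribe sketch. It is an illustration only: the `EventBus` class is hypothetical and is not the mechanism the skill or its agents actually use, and the payload `"src/payment.ts"` is made up. Only the event names come from the table.

```typescript
type Handler = (payload: unknown) => void;

// Hypothetical in-memory bus; the real coordination layer may differ.
class EventBus {
  private handlers = new Map<string, Handler[]>();

  // Registers a handler for one event name.
  subscribe(event: string, handler: Handler): void {
    const list = this.handlers.get(event) ?? [];
    list.push(handler);
    this.handlers.set(event, list);
  }

  // Calls every handler for the event and returns how many were notified.
  publish(event: string, payload: unknown): number {
    const list = this.handlers.get(event) ?? [];
    for (const handler of list) handler(payload);
    return list.length;
  }
}

const bus = new EventBus();
const received: string[] = [];

// Subscribers mirror the "Subscribers" column of the table.
bus.subscribe("coverage:gap:detected", (p) => {
  received.push(`test-generator:${String(p)}`);
});
bus.subscribe("security:finding:critical", () => {
  received.push("quality-gate notified");
});

// Illustrative payload: a file with a coverage gap (made-up path).
bus.publish("coverage:gap:detected", "src/payment.ts");
```

Events with no subscribers are silently ignored in this sketch; a real setup might want to log them instead.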
{
  "commit": {
    "gates": [
      { "metric": "unit_coverage", "threshold": 80, "blocking": true },
      { "metric": "static_analysis_critical", "max": 0, "blocking": true }
    ]
  },
  "build": {
    "gates": [
      { "metric": "all_tests_passed", "threshold": 100, "blocking": true },
      { "metric": "mutation_score", "threshold": 70, "blocking": false }
    ]
  },
  "integration": {
    "gates": [
      { "metric": "api_breaking_changes", "max": 0, "blocking": true },
      { "metric": "performance_p95_ms", "threshold": 200, "blocking": true },
      { "metric": "security_critical", "max": 0, "blocking": true }
    ]
  }
}
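
One way to evaluate a gate configuration like the one above is sketched below. The configuration does not say how `threshold` should be compared, so this sketch makes several assumptions. `max` is an inclusive upper bound. `threshold` is an inclusive lower bound, except for metrics listed in the hypothetical `LOWER_IS_BETTER` set, where it is an inclusive upper bound. A missing metric counts as a failure. The function names are illustrative and not part of the skill.

```typescript
interface Gate {
  metric: string;
  threshold?: number;
  max?: number;
  blocking: boolean;
}

interface GateResult {
  metric: string;
  passed: boolean;
  blocking: boolean;
}

// Assumption: metrics where a smaller value is better when compared to `threshold`.
const LOWER_IS_BETTER = new Set<string>(["performance_p95_ms"]);

function evaluateGate(gate: Gate, value: number): boolean {
  if (gate.max !== undefined) return value <= gate.max;
  if (gate.threshold === undefined) return true; // nothing to compare against
  return LOWER_IS_BETTER.has(gate.metric)
    ? value <= gate.threshold
    : value >= gate.threshold;
}

function evaluatePhase(gates: Gate[], metrics: Record<string, number>) {
  const results: GateResult[] = gates.map((gate) => {
    const value = metrics[gate.metric];
    // Assumption: a metric that was never reported fails its gate.
    const passed = value !== undefined && evaluateGate(gate, value);
    return { metric: gate.metric, passed, blocking: gate.blocking };
  });
  // The phase is blocked only when a blocking gate fails.
  const blocked = results.some((r) => r.blocking && !r.passed);
  return { results, blocked };
}

// Build-phase gates copied from the configuration above.
const buildGates: Gate[] = [
  { metric: "all_tests_passed", threshold: 100, blocking: true },
  { metric: "mutation_score", threshold: 70, blocking: false },
];

// Made-up metric values: the mutation gate fails but does not block.
const outcome = evaluatePhase(buildGates, { all_tests_passed: 100, mutation_score: 64 });
```

The inclusive comparison is chosen so that `all_tests_passed` at 100 satisfies a threshold of 100; switch to strict comparisons if the pipeline should follow the "> 80%" wording of the gate table literally.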
| Issue | Cause | Solution |
|-------|-------|----------|
| OOM during tests | Running all tests in parallel | Use batched execution |
| Pipeline too slow | Comprehensive testing every commit | Smart test selection |
| Gates always failing | Thresholds too strict | Analyze trends, adjust |
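
The "batched execution" remedy for out-of-memory failures can be sketched as follows. This is a generic illustration, assuming each test file is run by a caller-supplied async function; `toBatches` and `runInBatches` are hypothetical helpers, not functions provided by the skill or its agents.

```typescript
// Splits a list into consecutive batches of at most `batchSize` items.
function toBatches<T>(items: T[], batchSize: number): T[][] {
  if (!Number.isInteger(batchSize) || batchSize < 1) {
    throw new RangeError("batchSize must be a positive integer");
  }
  const batches: T[][] = [];
  for (let i = 0; i < items.length; i += batchSize) {
    batches.push(items.slice(i, i + batchSize));
  }
  return batches;
}

// Runs one batch at a time; items inside a batch run concurrently.
// Peak concurrency (and roughly peak memory) is bounded by `batchSize`.
async function runInBatches<T, R>(
  items: T[],
  batchSize: number,
  run: (item: T) => Promise<R>,
): Promise<R[]> {
  const results: R[] = [];
  for (const batch of toBatches(items, batchSize)) {
    results.push(...(await Promise.all(batch.map(run))));
  }
  return results;
}
```

A caller would pass the list of test files, a batch size tuned to available memory, and a function that executes one file. Results come back in the original order.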
The CI/CD Pipeline QE Orchestrator provides:
With Agents: Use qe-fleet-commander for multi-agent orchestration. Coordinate through aqe/* memory namespace. Batch operations for efficiency.