skills/skills/staff/SKILL.md
# /staff — Staff Engineer Panel Analysis Convene a panel of 4 staff engineers from top tech companies + Will Larson as moderator to independently analyze a technical problem, debate options, and produce a consensus decision with implementation plan. > Like a real Staff Engineer round-table: each engineer brings their company's culture and battle scars. They disagree, challenge assumptions, find latent bugs, and converge on the smallest change that eliminates the actual risk. ## Trigger User
npx skillsauth add msifoss/ai-dlc skills/skills/staffInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Convene a panel of 4 staff engineers from top tech companies + Will Larson as moderator to independently analyze a technical problem, debate options, and produce a consensus decision with implementation plan.
Like a real Staff Engineer round-table: each engineer brings their company's culture and battle scars. They disagree, challenge assumptions, find latent bugs, and converge on the smallest change that eliminates the actual risk.
User invokes /staff <problem> with a description of the technical problem to analyze.
| Argument | Description |
|----------|-------------|
| <problem> | A technical problem, architectural question, or investigation finding to analyze. Can be a sentence, a file path, or a reference to a prior finding. |
Examples:
/staff "Should we extract shared code into a Lambda Layer?"/staff "Our enrichment pipeline has been silently disabled since v1.5.3"/staff lambdas/analytics_ingestion/analytics_handler.py — the db_exec whitelist is getting complex/staff F7 from the five-persona reviewWorktree isolation (recommended for code analysis): If the problem references specific code files, consider using an isolated worktree to avoid disrupting the main working directory:
EnterWorktree for isolationls .ai-dlc.local.yaml 2>/dev/null
If .ai-dlc.local.yaml exists, read it for project-specific context that should inform the panel (infrastructure constraints, team topology, deployment model, etc.).
Identify the work type to frame the panel's analysis appropriately:
| Work Type | Panel Focus | |-----------|-------------| | Feature | Architecture, approach selection, risk-reward trade-offs | | Enhancement | Impact on existing system, backward compatibility, scope control | | Bug Fix | Root cause analysis, fix vs workaround, regression prevention | | Refactoring | Blast radius, migration strategy, behavioral preservation | | Architectural | Long-term implications, team scaling, cost at scale |
Include the work type in the Problem Statement so panelists calibrate their analysis depth.
If the problem references specific code files, perform a lightweight impact scan BEFORE the panel convenes:
grep -rn "import.*[filename]" src/ --include="*.{ts,js,py,go,rs}" 2>/dev/null | head -20
Produce a brief Impact Summary (5-10 lines) to include in the panel briefing. This gives panelists concrete data instead of abstract risk discussion.
Before convening the panel, deeply understand the problem:
docs/solutions/ and docs/captains_log/ for relevant prior decisionsProduce a Problem Statement with:
Each panelist is a real archetype with a distinct engineering philosophy. They analyze the problem independently — do NOT let one panelist's analysis influence another.
Philosophy: "The best part is no part." Ruthless simplification. Thinks in failure modes and blast radius. Strengths: Risk quantification (Probability x Consequence matrices), eliminating unnecessary complexity, finding the minimal viable fix Signature move: Calculates risk scores for each option, kills the ones with bad ratios On abstractions: Deeply skeptical. Every new layer is a layer that can fail. Prefers explicit over implicit. Typical output: "Fix the 65 dangerous lines. Leave the 650 stable ones alone."
Philosophy: "Deduplicate based on the cost of divergence, not the existence of duplication." Thinks in developer cognitive load and real-world failure modes. Strengths: Finding latent bugs hiding in "harmless" duplication, developer experience, practical risk assessment Signature move: Discovers that what looks like a cosmetic issue is actually a live bug On patterns: Cares about what breaks when copies diverge, not whether copies exist Typical output: "You don't have a code duplication problem. You have a risk prioritization question."
Philosophy: "Move fast with stable infra." Finds the 80/20 split. Knows when copy-paste is fine and when it's a liability. Strengths: Pragmatic bucketing (dangerous vs aesthetic), CI/CD integration, team scaling patterns Signature move: Sorts everything into "fix yesterday" vs "don't care" with clear criteria On enforcement: "Pre-commit is a developer convenience; CI is the contract." Typical output: Two clean buckets with no ambiguity about which is which.
Philosophy: Deep platform knowledge. Has seen thousands of customer architectures. Knows the sharp edges of the services involved. Strengths: Platform-specific expertise, under-documented features, deployment coupling analysis, scaling concerns Signature move: Reframes the problem from a platform perspective ("this is a deployment coupling problem, not a code hygiene problem") On architecture: Thinks in terms of the right long-term pattern, but willing to concede when scope doesn't justify it Typical output: The architecturally correct approach + honest assessment of whether the project needs it yet.
Role: Does NOT analyze the problem independently. Instead:
Each panelist independently produces:
### [Name] — Staff Engineer, [Company]
**Risk assessment:**
[Their analysis of the problem — what's dangerous, what's not, what breaks]
**Options evaluated:**
[Which options they favor, which they reject, and why]
**Key quote:**
"[One memorable line that captures their position]"
**Recommendation:**
[Their preferred approach + effort estimate]
**On [key debate topic]:**
"[Their position on the main point of contention]"
**Unique contribution:**
[Something only this panelist noticed — a latent bug, a platform edge case, a simpler approach]
After all 4 panelists have spoken, produce a consensus matrix:
## Consensus Matrix
| Question | Tim (SpaceX) | Rob (Roblox) | Fran (Meta) | Al (AWS) |
|----------|-------------|-------------|-------------|----------|
| [Key decision 1] | YES/NO | YES/NO | YES/NO | YES/NO |
| [Key decision 2] | YES/NO | YES/NO | YES/NO | YES/NO |
| ...
| Guard mechanism | [their pref] | [their pref] | [their pref] | [their pref] |
| Total effort | [estimate] | [estimate] | [estimate] | [estimate] |
**Unanimous agreements:**
1. [Things all 4 agree on]
**Majority agreements (3-of-4):**
2. [Things 3 agree on, with the dissenter noted]
**Key disagreements:**
3. [Where they split, and on what dimension]
Before ruling, Will asks questions that expose hidden assumptions:
### Clarifying Questions & Answers
| Question | Answer | Impact on Decision |
|----------|--------|--------------------|
| [Question about the codebase/infra] | [Verified answer] | [How it changes the calculus] |
IMPORTANT: Actually investigate the answers. If Will asks "Is there a CI pipeline?", go check. If he asks "How often has this code been modified?", run git log. If he asks "Is password rotation enabled?", check the CloudFormation template. Wrong assumptions lead to wrong decisions.
If a clarifying question reveals that a panelist's assumption was wrong, reconvene the panel for a reassessment round (Phase 4b).
If Phase 4 reveals wrong assumptions:
This phase is what makes the panel valuable — it catches the "we assumed X but actually Y" errors that cause bad architectural decisions.
Will synthesizes the panel's input into a final ruling:
## Will Larson's Decision
**Scope:** [Which panelist's approach, with whose scope]
| Step | What | Why | Effort |
|------|------|-----|--------|
| 1 | [Specific action] | [Rationale tied to panel finding] | [Time] |
| 2 | ... | ... | ... |
**Total: [hours]. [One-line summary of the approach.]**
### What's explicitly deferred
| Item | Rationale | Revisit When |
|------|-----------|--------------|
| [Rejected option] | [Why, citing which panelist] | [Trigger condition] |
### Key takeaways
> "[Quotable insight from the panel]" — [Panelist]
[2-4 takeaways that generalize beyond this specific problem]
Save the full analysis to docs/key_findings/YYYYMMDD-[Topic-Slug]-Staff-Engineer-Panel.md with this structure:
# [Topic] — Staff Engineer Panel Analysis
**Date:** YYYY-MM-DD
**Panel:** Tim (SpaceX), Rob (Roblox), Fran (Meta), Al (AWS), Will Larson (Moderator)
**Trigger:** [What prompted this analysis]
---
## Problem Statement
[From Phase 0]
## Panel Analysis
[From Phase 2 — all 4 panelists]
## Consensus Matrix
[From Phase 3]
## Clarifying Questions
[From Phase 4]
## [Reassessment — if Phase 4b occurred]
## Will Larson's Decision
[From Phase 5]
## Key Takeaways
[Generalizable insights]
## Files Referenced
[Table of files discussed with their roles]
## Implementation Plan
[Numbered steps with file paths, line numbers, effort]
## Findings to Fix
[Numbered findings — each with File, Lines, Description, Fix]
development
Team sync for Astro website repos — checks git/GitHub/server state and tells you exactly what to do next
development
Simple team guide for website collaborators — checks your situation and tells you what to do in plain English
tools
--- name: ticky description: Full lifecycle ticket management — draft, submit, sync, and clean Azure DevOps work items across repos. user-invocable: true allowed-tools: Bash, Read, Write, Edit, Glob, Grep argument-hint: <mode> [args...] — modes: draft, submit, clean, update, get, create --- # Ticky — Full Lifecycle Ticket Management Manage Azure DevOps work items through their full lifecycle: draft locally, submit to ADO, sync status, and clean up cross-repo tickets. **CLI:** `${TICKY_HOME:-$
testing
# /prodstatus — CallHero Production Health Dashboard > **CONFIDENTIAL** — This skill contains internal infrastructure references (resource names, stack identifiers, queue names). Do not share outside the team or commit to public repositories. > Read-only diagnostic skill. No writes, no deploys, no doc updates. Pure observability. ## When to Use - Quick health check before a deploy - After a deploy to verify both stacks - Investigating an alarm or incident - Weekly status review ## Platform