
nwave-ai/nw-sd-framework

Name: nw-sd-framework
Author: nwave-ai

plugins/nw/skills/nw-sd-framework/SKILL.md

npx skillsauth add nwave-ai/nwave nw-sd-framework


System Design Framework

The 4-Step Process

Every system design follows this structure. Skipping steps is the top mistake.

Step 1: Understand the Problem and Establish Design Scope (3-10 min)

Narrow an impossibly broad question into a tractable problem.

Produce: functional requirements (3-5 bullets) | non-functional requirements (scale, latency, availability, consistency model) | capacity estimation (QPS, storage, bandwidth)

Red flags if skipped: designing a system nobody asked for | over-engineering for imaginary scale | missing critical constraints (GDPR, real-time)

Step 2: Propose High-Level Design and Get Buy-In (10-15 min)

Sketch the big picture. Validate before diving deep.

Do: draw architecture diagram (clients, servers, databases, caches, queues) | define API contract (REST/GraphQL/gRPC -- key endpoints) | design data model (entities, relationships, access patterns) | walk through 1-2 core use cases end-to-end | get buy-in: "Does this make sense before I go deeper?"

API patterns: RESTful for CRUD-heavy | GraphQL for flexible client queries | gRPC for internal service-to-service | WebSocket/SSE for real-time

Data model: SQL vs NoSQL based on access patterns, not hype | denormalization trade-offs | partitioning key selection (directly impacts scalability)

Step 3: Design Deep Dive (10-25 min)

Go deep on 2-3 components.

Choose: most technically challenging | most interesting trade-offs | bottleneck components (highest load, most failure-prone)

Depth means: specific algorithms (consistent hashing, Bloom filters) | failure modes and handling | scaling strategy per component | data flow with edge cases | monitoring and operational concerns
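As one illustration of the "specific algorithms" level of depth, here is a minimal sketch of consistent hashing with virtual nodes. The class name `ConsistentHashRing`, the `vnodes` parameter, and the use of MD5 as the hash are assumptions chosen for illustration, not part of the framework.

```python
import bisect
import hashlib


class ConsistentHashRing:
    """Toy consistent-hash ring with virtual nodes (illustrative only)."""

    def __init__(self, nodes=(), vnodes=100):
        self._vnodes = vnodes   # virtual nodes per physical node (assumed default)
        self._hashes = []       # sorted positions on the ring
        self._owner = {}        # ring position -> physical node
        for node in nodes:
            self.add_node(node)

    @staticmethod
    def _hash(key):
        # First 8 bytes of MD5 as an integer; any well-mixed hash would do.
        return int.from_bytes(hashlib.md5(key.encode("utf-8")).digest()[:8], "big")

    def add_node(self, node):
        for i in range(self._vnodes):
            position = self._hash(f"{node}#{i}")
            if position not in self._owner:
                bisect.insort(self._hashes, position)
            self._owner[position] = node

    def remove_node(self, node):
        self._hashes = [h for h in self._hashes if self._owner[h] != node]
        self._owner = {h: n for h, n in self._owner.items() if n != node}

    def get_node(self, key):
        if not self._hashes:
            raise LookupError("ring is empty")
        # First ring position clockwise from the key's hash, wrapping around.
        idx = bisect.bisect_left(self._hashes, self._hash(key)) % len(self._hashes)
        return self._owner[self._hashes[idx]]
```

The property worth stating in a deep dive is that removing a node only remaps the keys that node owned; keys owned by the remaining nodes stay where they were.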

Step 4: Wrap Up (3-5 min)

Cover: summarize design in 2-3 sentences | identify known bottlenecks | what you'd improve with more time | operational concerns (monitoring, alerting, deployment) | future enhancements

Avoid: introducing entirely new components at this stage | second-guessing your design

Back-of-Envelope Estimation

Powers of 2 Reference

| Power | Value | Meaning |
|-------|-------|---------|
| 10 | 1 Thousand | 1 KB |
| 20 | 1 Million | 1 MB |
| 30 | 1 Billion | 1 GB |
| 40 | 1 Trillion | 1 TB |
| 50 | 1 Quadrillion | 1 PB |

Latency Numbers Every Engineer Should Know

| Operation | Latency |
|-----------|---------|
| L1 cache reference | 0.5 ns |
| L2 cache reference | 7 ns |
| Main memory reference | 100 ns |
| Compress 1 KB (Zippy) | 10 us |
| Send 2 KB over 1 Gbps | 20 us |
| Read 1 MB from memory | 250 us |
| Datacenter round trip | 500 us |
| Disk seek | 10 ms |
| Read 1 MB from network | 10 ms |
| Read 1 MB from disk | 30 ms |
| CA to Netherlands round trip | 150 ms |

Key takeaways: memory fast, disk slow -- cache aggressively | compress before network send | inter-datacenter trips expensive -- minimize cross-region calls
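To make the "cache aggressively" takeaway concrete, the sketch below estimates the expected time to read 1 MB as a function of cache hit ratio, using the memory and disk figures from the table. It is a deliberately simplified model: the function name is hypothetical, and it ignores network hops, queuing, and cache lookup overhead.

```python
# Latencies in microseconds, taken from the latency table above.
READ_1MB_MEMORY_US = 250.0
READ_1MB_DISK_US = 30_000.0  # 30 ms


def expected_read_latency_us(hit_ratio):
    """Expected latency when a fraction `hit_ratio` of reads is served from memory."""
    if not 0.0 <= hit_ratio <= 1.0:
        raise ValueError("hit_ratio must be between 0 and 1")
    return hit_ratio * READ_1MB_MEMORY_US + (1.0 - hit_ratio) * READ_1MB_DISK_US


if __name__ == "__main__":
    for ratio in (0.0, 0.5, 0.9, 0.99):
        latency_ms = expected_read_latency_us(ratio) / 1000
        print(f"hit ratio {ratio:.2f}: {latency_ms:.2f} ms per 1 MB read")
```

Under this model the misses dominate: even at a 90% hit ratio, most of the expected latency still comes from the 10% of reads that go to disk.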

Common Estimation Patterns

DAU to QPS: QPS = DAU * actions_per_user / 86400 | Peak QPS = QPS * 2 (or *3 for spiky)

Storage: daily = DAU * actions * avg_size | yearly = daily * 365 | 5-year = yearly * 5

Bandwidth: QPS * average_response_size

Servers: Peak QPS / QPS_per_server where CPU-bound ~hundreds | IO-bound with cache ~thousands | static content ~tens of thousands
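The patterns above translate directly into a few helper functions. This is a minimal sketch; the function names and the default peak factor of 2 are illustrative choices, and the per-server capacity must come from your own measurements or the rough ranges listed above.

```python
import math

SECONDS_PER_DAY = 86_400


def average_qps(dau, actions_per_user):
    """QPS = DAU * actions_per_user / 86400."""
    return dau * actions_per_user / SECONDS_PER_DAY


def peak_qps(avg_qps, peak_factor=2.0):
    """Peak QPS = QPS * 2, or * 3 for spiky traffic."""
    return avg_qps * peak_factor


def daily_storage_bytes(dau, actions_per_user, avg_size_bytes):
    """daily = DAU * actions * avg_size."""
    return dau * actions_per_user * avg_size_bytes


def bandwidth_bytes_per_sec(qps, avg_response_bytes):
    """Bandwidth = QPS * average_response_size."""
    return qps * avg_response_bytes


def servers_needed(peak, qps_per_server):
    """Servers = Peak QPS / QPS_per_server, rounded up to whole machines."""
    return math.ceil(peak / qps_per_server)
```

Yearly and 5-year storage follow by multiplying the daily figure by 365 and then by 5.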

Estimation Example: Twitter-like Service

150M DAU, 2 tweets/day, 10 reads/day
Write QPS = 150M * 2 / 86400 ~ 3,500
Read QPS = 150M * 10 / 86400 ~ 17,000; Peak (3x, spiky) ~ 50,000
Storage: 300M tweets * 1KB + 30M media (10% of tweets) * 500KB ~ 15.3 TB/day
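The arithmetic in this example can be checked with a short script. It assumes decimal units (1 KB = 1,000 bytes), a 3x peak factor, and that 30M media items corresponds to 10% of the 300M daily tweets; those assumptions are inferred from the rounded figures, not stated in the example itself.

```python
SECONDS_PER_DAY = 86_400
KB = 1_000                # decimal units, to match the rounded figures
TB = 1_000_000_000_000

dau = 150_000_000
tweets_per_user = 2
reads_per_user = 10

write_qps = dau * tweets_per_user / SECONDS_PER_DAY
read_qps = dau * reads_per_user / SECONDS_PER_DAY
peak_read_qps = read_qps * 3                 # assumed "spiky" factor

tweets_per_day = dau * tweets_per_user       # 300M tweets
media_per_day = tweets_per_day // 10         # assumed: 10% of tweets carry media
storage_bytes = tweets_per_day * 1 * KB + media_per_day * 500 * KB
storage_tb_per_day = storage_bytes / TB

print(f"write QPS      ~ {write_qps:,.0f}")
print(f"read QPS       ~ {read_qps:,.0f}")
print(f"peak read QPS  ~ {peak_read_qps:,.0f}")
print(f"storage        ~ {storage_tb_per_day:.1f} TB/day")
```

The unrounded results are close to the figures quoted: roughly 3,472 write QPS, 17,361 read QPS, and 52,083 peak read QPS. Storage is dominated by media; the tweet text contributes only 0.3 TB of the 15.3 TB.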

Scaling Ladder

Each step solves a specific bottleneck. Never introduce a component without articulating which bottleneck it addresses.

Load balancer -- distribute traffic across web servers
Database replication -- master-slave for read scaling
Cache layer -- reduce database load (Redis/Memcached)
CDN -- serve static content from edge
Stateless web tier -- move session state to shared store
Database sharding -- horizontal partitioning for write scaling
Message queue -- decouple components, handle spikes
Logging, metrics, monitoring -- observability at scale
Multiple data centers -- geographic redundancy and latency reduction
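As an example of tying a rung to its bottleneck, the cache layer exists to take read load off the database. Below is a minimal cache-aside sketch using an in-process dict in place of Redis or Memcached. `CacheAside`, `load_from_db`, and the TTL default are hypothetical names and values for illustration only.

```python
import time


class CacheAside:
    """Read-through on miss, serve from cache on hit (illustrative only)."""

    def __init__(self, load_from_db, ttl_seconds=60.0, clock=time.monotonic):
        self._load = load_from_db   # hypothetical callable: key -> value
        self._ttl = ttl_seconds
        self._clock = clock
        self._cache = {}            # key -> (value, expires_at)
        self.hits = 0
        self.misses = 0

    def get(self, key):
        now = self._clock()
        entry = self._cache.get(key)
        if entry is not None and entry[1] > now:
            self.hits += 1
            return entry[0]
        # Miss or expired entry: go to the database and repopulate the cache.
        self.misses += 1
        value = self._load(key)
        self._cache[key] = (value, now + self._ttl)
        return value

    def invalidate(self, key):
        # Call after a write so the next read fetches fresh data.
        self._cache.pop(key, None)
```

The hit and miss counters are there because the cache only earns its place on the ladder if the measured hit ratio actually reduces database reads.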

Common Pitfalls

Jumping to solutions -- designing before understanding requirements
Over-engineering -- adding components for imaginary scale
Ignoring trade-offs -- every choice has a cost; name it
SPOF blindness -- always ask "what if this dies?"
Neglecting data -- the data model drives everything
Forgetting operations -- a system you can't monitor is one you can't run
Not doing math -- gut feelings are wrong; estimates keep you honest


4-step system design framework with back-of-envelope estimation, scaling ladder, and common pitfalls

352 stars

development

Updated Apr 16, 2026


npx skillsauth add nwave-ai/nwave nw-sd-framework

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Trivy -- Container and dependency vulnerability scanner -- Clean (95%)
Semgrep -- Static code analysis for vulnerabilities -- Clean (95%)
mcp-scan (Snyk) -- Model Context Protocol security validation -- Clean (95%)
Snyk (dep) -- Open source security scanning -- Skipped (50%)
Socket.dev -- Supply chain security analysis -- Skipped (50%)
VirusTotal -- Multi-engine malware detection -- Skipped (50%)
CrowdStrike -- Advanced threat intelligence -- Skipped (50%)
OSV-Scanner -- Open Source Vulnerability database check -- Skipped (50%)
OWASP Dep-Check -- Skipped (50%)

Last scanned: Apr 16, 2026, 5:33 AM | 199.1s | 1 file scanned

SKILL.md

name: nw-sd-framework
description: 4-step system design framework with back-of-envelope estimation, scaling ladder, and common pitfalls
user-invocable: false
disable-model-invocation: true




Install manually


# Clone the repo
git clone https://github.com/nwave-ai/nwave.git

# Copy into Claude Code skills folder (global)
cp -r nwave/plugins/nw/skills/nw-sd-framework ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

nwave-ai/nwave

352 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT