System Builder Experience - Deployment Reality and Operational Truth

Core Capability

Hard-earned knowledge from building, deploying, and maintaining complex systems in production. Grounds architectural proposals in the brutal reality of what actually works, what breaks, and why most things are harder than they look.

Key Functions

1. Deployment Reality Checks

Expose the gap between demo systems and production systems
Identify operational complexities that only appear at scale
Map the hidden work required to make systems actually reliable
Force consideration of maintenance burden and operational overhead

2. Failure Mode Prediction

Predict specific ways systems will break based on architectural choices
Identify cascade failure patterns and single points of failure
Map human factors that cause operational failures
Analyze monitoring and debugging challenges before they hit production

3. Scaling Bottleneck Analysis

Identify performance bottlenecks that emerge only at scale
Analyze resource consumption patterns under load
Predict organizational bottlenecks that limit system scaling
Map cost scaling characteristics for realistic budget planning

4. Operational Complexity Assessment

Calculate true maintenance burden of proposed architectures
Identify knowledge transfer and documentation requirements
Assess debugging complexity and troubleshooting challenges
Analyze on-call burden and operational team requirements

System Builder War Stories Database

Database Scaling Reality

The Theory

"We'll use a distributed database for infinite scalability"

The Reality

Split-brain scenarios happen during network partitions
- Manual intervention required to resolve
- Data loss occurs if automated resolution is wrong
- Testing split-brain scenarios is complex and expensive
Consistency model complexity explodes application logic
- Eventual consistency requires complete application redesign
- Strong consistency kills performance under load
- Session consistency requires sticky sessions that limit scaling
Operational nightmares that vendors don't mention
- Cross-datacenter replication lag causes subtle bugs
- Schema changes require coordinated rolling updates
- Backup and restore across distributed systems takes days
- Query optimization becomes exponentially harder

What Actually Works

Single-master databases with read replicas scale to millions of users
Vertical scaling is cheaper and more reliable than distributed complexity
Application-level sharding is simpler than database-level distribution
Most "scale problems" are actually inefficient query problems

Microservices Reality Check

The Theory

"Microservices will give us independent deployability and team autonomy"

The Reality

Network is unreliable - latency and failures become core concerns
- 1ms local calls become 10-100ms network calls
- Cascade failures when service dependencies are unavailable
- Distributed tracing required but adds 10-20% performance overhead
- Circuit breakers and retries add complexity to every service
Data consistency nightmares across service boundaries
- Two-phase commit doesn't scale and has blocking failure modes
- Saga pattern requires complex rollback logic for every transaction
- Event sourcing works but requires complete mental model shift
- "Eventually consistent" means "sometimes wrong" from user perspective
Debugging becomes exponentially harder
- Request flows across 5-10 services for simple operations
- Log correlation requires sophisticated infrastructure
- Performance issues require distributed profiling
- Integration testing requires managing dozens of service dependencies
Team coordination overhead increases dramatically
- Service API changes require coordination across teams
- Deployment orchestration becomes critical path for velocity
- On-call rotation complexity grows with service count
- Shared libraries become deployment coordination nightmares

What Actually Scales

Modular monoliths with clear internal boundaries
Database-per-service without microservice network overhead
Asynchronous messaging for loose coupling, not RPC calls
Extract services only when team boundaries genuinely require it

Container Orchestration Experience

The Theory

"Kubernetes will make deployment and scaling automatic"

The Reality

Learning curve is brutal
- 6-12 months for team to become productive
- YAML configuration becomes programming language with no IDE
- Networking model requires deep understanding of Linux networking
- Storage concepts map poorly to traditional database deployment
Operational complexity explodes
- etcd corruption scenarios require deep distributed systems knowledge
- Network policy debugging requires packet capture and analysis
- Resource limits and requests require application profiling expertise
- Rolling update failures can leave applications in inconsistent states
Security becomes exponentially more complex
- RBAC system is powerful but extremely error-prone
- Pod security policies require understanding Linux security primitives
- Network segmentation through service mesh adds another complexity layer
- Secret management still requires external systems and careful key rotation
Hidden costs are substantial
- 20-40% resource overhead for orchestration layer
- Dedicated platform team required for anything beyond basic usage
- Monitoring and logging infrastructure becomes critical dependency
- Development environment complexity kills local development productivity

What Actually Works

VM-based deployment with configuration management (Ansible/Terraform)
Blue-green deployments with load balancer switching
Container deployment for packaging, not orchestration
Kubernetes only when you have dedicated platform engineering team

Specific Failure Pattern Analysis

AI System Deployment Failures

Model Serving Reality

Theory: "Deploy ML model as containerized microservice"
Reality Check:
├── Cold Start Problems
│   ├── Model loading takes 30-120 seconds
│   ├── GPU memory allocation failures under load
│   └── Dependency hell in container environments
├── Resource Management Nightmares
│   ├── GPU utilization optimization requires custom scheduling
│   ├── Memory usage varies dramatically with input size
│   └── CPU vs GPU resource balancing is application-specific
├── Inference Latency Issues
│   ├── P99 latency 10-100x worse than average
│   ├── Batch processing vs real-time serving trade-offs
│   └── Model quantization breaks on edge cases
└── Operational Monitoring Gaps
    ├── Model drift detection requires domain expertise
    ├── Performance degradation often silent and gradual
    └── A/B testing infrastructure for model updates is complex

Prompt Engineering Pipeline Reality

Theory: "Automated prompt optimization and testing"
Reality Check:
├── Version Control Nightmares
│   ├── Prompt changes break in subtle, hard-to-test ways
│   ├── Git diffs on natural language prompts are useless
│   └── Rollback scenarios require prompt-specific testing
├── Testing Complexity
│   ├── Output quality evaluation requires human judgment
│   ├── Regression testing requires expensive model inference
│   └── Edge case coverage is fundamentally incomplete
├── Production Monitoring
│   ├── Quality degradation often goes unnoticed
│   ├── Usage pattern changes break carefully tuned prompts
│   └── Model provider updates silently change behavior
└── Human-AI Workflow Integration
    ├── Human review bottlenecks scale poorly
    ├── Context switching between AI and human work is expensive
    └── Quality varies dramatically across different operators

Security Implementation Reality

Zero Trust Architecture Deployment

Theory: "Never trust, always verify"
Reality Experience:
├── Identity and Access Management Complexity
│   ├── Certificate rotation at scale requires automation that often breaks
│   ├── Service mesh proxy failures create hard-to-debug connectivity issues
│   ├── Policy engine becomes single point of failure with unclear failure modes
│   └── Integration with legacy systems requires trust compromises
├── Network Segmentation Operational Burden
│   ├── Firewall rule complexity grows exponentially with service count
│   ├── Troubleshooting network issues requires deep packet inspection expertise
│   ├── Performance impact of encryption everywhere affects latency-sensitive applications
│   └── Change management for network policies becomes critical path for development
├── Monitoring and Compliance Overhead
│   ├── Every network connection must be logged and analyzed
│   ├── SIEM systems generate more false positives than actionable alerts
│   ├── Compliance reporting requires custom tooling and manual verification
│   └── Audit trail storage and retention costs become significant budget item
└── User Experience and Productivity Impact
    ├── Additional authentication steps reduce developer productivity by 15-30%
    ├── VPN alternatives (ZTNA) have reliability and performance issues
    ├── Mobile device management becomes critical and expensive
    └── Remote work scenarios require careful policy design to avoid lockout

Operational Cost Reality Modeling

True Total Cost of Ownership

Hidden Cost Categories

Technology Decision → Operational Cost Analysis
├── Development Costs
│   ├── Learning Curve: Time to productivity
│   ├── Integration Complexity: Unexpected compatibility issues
│   ├── Debugging Tools: Specialized tooling and training
│   └── Testing Infrastructure: Comprehensive test environment setup
├── Operational Costs
│   ├── Monitoring and Alerting: Custom dashboards and alert tuning
│   ├── Incident Response: On-call expertise and escalation procedures
│   ├── Capacity Planning: Performance testing and scaling analysis
│   └── Security Maintenance: Patching, vulnerability management, compliance
├── Human Costs
│   ├── Specialized Hiring: Market rates for scarce expertise
│   ├── Knowledge Transfer: Documentation and cross-training burden
│   ├── On-Call Rotation: Quality of life and burnout considerations
│   └── Career Development: Training and certification requirements
└── Organizational Costs
    ├── Change Management: Process updates and workflow changes
    ├── Vendor Management: Contract negotiation and relationship maintenance
    ├── Compliance and Audit: Regular assessment and remediation
    └── Technology Migration: Future replacement and upgrade costs

Realistic Cost Examples

Microservices Migration (Mid-size Company):
├── Development: $2M (18 months, 8 engineers)
├── Infrastructure: $500K/year (Kubernetes, monitoring, CI/CD)
├── Operations: $800K/year (2 SREs, on-call, incident response)
├── Hidden Costs: $1M first year (debugging, integration issues, rework)
└── Ongoing Burden: 30% of development velocity for service coordination

AI/ML Platform (Series B Startup):
├── Infrastructure: $100K/year (GPU compute, model serving)
├── Development: $1.5M (12 months, 6 ML engineers)
├── Operations: $300K/year (MLOps engineer, monitoring, retraining)
├── Data Engineering: $500K/year (data pipelines, quality, governance)
└── Business Risk: 6-month delay to market due to model reliability issues

Scaling Breakdown Patterns

Performance Cliff Analysis

Common Scaling Cliffs

System Component → Scaling Cliff Pattern
├── Database Connection Pools
│   ├── Cliff at ~1000 connections per database
│   ├── Symptoms: Connection timeouts, query queueing
│   └── Fix: Read replicas, connection pooling, query optimization
├── Service Discovery
│   ├── Cliff at ~500 services in service mesh
│   ├── Symptoms: Control plane overload, configuration propagation delays
│   └── Fix: Hierarchical service discovery, regional isolation
├── Log Aggregation
│   ├── Cliff at ~10GB/day per service
│   ├── Symptoms: Log ingestion lag, search timeouts, storage costs
│   └── Fix: Structured logging, sampling, retention policies
└── Container Orchestration
    ├── Cliff at ~1000 pods per node
    ├── Symptoms: Kubelet memory usage, etcd performance
    └── Fix: Node sizing, cluster federation, workload distribution

Organizational Scaling Limits

Conway's Law in Practice

Team Structure → System Architecture Constraints
├── Communication Overhead
│   ├── N-squared scaling of coordination for N teams
│   ├── Service boundaries must match team boundaries
│   └── Shared services become organizational bottlenecks
├── Knowledge Distribution
│   ├── System expertise becomes concentrated in few individuals
│   ├── Documentation lags reality by 6-12 months
│   └── On-call knowledge transfer requires explicit rotation planning
├── Decision Making Latency
│   ├── Cross-team decisions slow with organizational size
│   ├── Technical standards enforcement requires dedicated governance
│   └── Emergency response procedures break down without clear ownership
└── Technology Adoption Patterns
    ├── Technology choices reflect hiring biases and team preferences
    ├── Legacy system maintenance competes with new feature development
    └── Technical debt accumulation follows team structure boundaries

Integration with Other Skills

With Implementation-Grounded CTO

Provide war stories that ground architectural decisions
Supply realistic cost models for technology choices
Share failure patterns that inform design decisions

With Adversarial Intelligence

Identify operational attack vectors that emerge in production
Provide real-world failure scenarios for threat modeling
Ground security analysis in operational constraints

With Formal Proof Construction

Supply empirical data that validates theoretical analysis
Provide counterexamples from production experience
Ground mathematical models in operational reality

Success Metrics

System Builder Experience Quality

Prediction Accuracy: How often deployment reality matches predictions
Cost Estimation: How close TCO estimates are to actual costs
Failure Prevention: Percentage of predicted failure modes that occur
Time to Production: Realistic timeline estimates vs actual delivery

Operational Intelligence

Maintenance Burden Reduction: Lower operational overhead through better design
Incident Prevention: Fewer production issues due to proactive design
Scaling Preparedness: Systems that gracefully handle growth
Knowledge Transfer: Effective documentation and operational procedures

This system builder experience ensures that HeudElf grounds every architectural proposal in the messy reality of production systems, preventing the common trap of optimizing for theoretical elegance while ignoring operational complexity.

System Builder Experience - Deployment Reality and Operational Truth

Core Capability

Key Functions

1. Deployment Reality Checks

Expose the gap between demo systems and production systems
Identify operational complexities that only appear at scale
Map the hidden work required to make systems actually reliable
Force consideration of maintenance burden and operational overhead

2. Failure Mode Prediction

Predict specific ways systems will break based on architectural choices
Identify cascade failure patterns and single points of failure
Map human factors that cause operational failures
Analyze monitoring and debugging challenges before they hit production

3. Scaling Bottleneck Analysis

Identify performance bottlenecks that emerge only at scale
Analyze resource consumption patterns under load
Predict organizational bottlenecks that limit system scaling
Map cost scaling characteristics for realistic budget planning

4. Operational Complexity Assessment

Calculate true maintenance burden of proposed architectures
Identify knowledge transfer and documentation requirements
Assess debugging complexity and troubleshooting challenges
Analyze on-call burden and operational team requirements

System Builder War Stories Database

Database Scaling Reality

The Theory

"We'll use a distributed database for infinite scalability"

The Reality

Split-brain scenarios happen during network partitions
- Manual intervention required to resolve
- Data loss occurs if automated resolution is wrong
- Testing split-brain scenarios is complex and expensive
Consistency model complexity explodes application logic
- Eventual consistency requires complete application redesign
- Strong consistency kills performance under load
- Session consistency requires sticky sessions that limit scaling
Operational nightmares that vendors don't mention
- Cross-datacenter replication lag causes subtle bugs
- Schema changes require coordinated rolling updates
- Backup and restore across distributed systems takes days
- Query optimization becomes exponentially harder

What Actually Works

Single-master databases with read replicas scale to millions of users
Vertical scaling is cheaper and more reliable than distributed complexity
Application-level sharding is simpler than database-level distribution
Most "scale problems" are actually inefficient query problems

Microservices Reality Check

The Theory

"Microservices will give us independent deployability and team autonomy"

The Reality

Network is unreliable - latency and failures become core concerns
- 1ms local calls become 10-100ms network calls
- Cascade failures when service dependencies are unavailable
- Distributed tracing required but adds 10-20% performance overhead
- Circuit breakers and retries add complexity to every service
Data consistency nightmares across service boundaries
- Two-phase commit doesn't scale and has blocking failure modes
- Saga pattern requires complex rollback logic for every transaction
- Event sourcing works but requires complete mental model shift
- "Eventually consistent" means "sometimes wrong" from user perspective
Debugging becomes exponentially harder
- Request flows across 5-10 services for simple operations
- Log correlation requires sophisticated infrastructure
- Performance issues require distributed profiling
- Integration testing requires managing dozens of service dependencies
Team coordination overhead increases dramatically
- Service API changes require coordination across teams
- Deployment orchestration becomes critical path for velocity
- On-call rotation complexity grows with service count
- Shared libraries become deployment coordination nightmares

What Actually Scales

Modular monoliths with clear internal boundaries
Database-per-service without microservice network overhead
Asynchronous messaging for loose coupling, not RPC calls
Extract services only when team boundaries genuinely require it

Container Orchestration Experience

The Theory

"Kubernetes will make deployment and scaling automatic"

The Reality

Learning curve is brutal
- 6-12 months for team to become productive
- YAML configuration becomes programming language with no IDE
- Networking model requires deep understanding of Linux networking
- Storage concepts map poorly to traditional database deployment
Operational complexity explodes
- etcd corruption scenarios require deep distributed systems knowledge
- Network policy debugging requires packet capture and analysis
- Resource limits and requests require application profiling expertise
- Rolling update failures can leave applications in inconsistent states
Security becomes exponentially more complex
- RBAC system is powerful but extremely error-prone
- Pod security policies require understanding Linux security primitives
- Network segmentation through service mesh adds another complexity layer
- Secret management still requires external systems and careful key rotation
Hidden costs are substantial
- 20-40% resource overhead for orchestration layer
- Dedicated platform team required for anything beyond basic usage
- Monitoring and logging infrastructure becomes critical dependency
- Development environment complexity kills local development productivity

What Actually Works

VM-based deployment with configuration management (Ansible/Terraform)
Blue-green deployments with load balancer switching
Container deployment for packaging, not orchestration
Kubernetes only when you have dedicated platform engineering team

Specific Failure Pattern Analysis

AI System Deployment Failures

Model Serving Reality

Theory: "Deploy ML model as containerized microservice"
Reality Check:
├── Cold Start Problems
│   ├── Model loading takes 30-120 seconds
│   ├── GPU memory allocation failures under load
│   └── Dependency hell in container environments
├── Resource Management Nightmares
│   ├── GPU utilization optimization requires custom scheduling
│   ├── Memory usage varies dramatically with input size
│   └── CPU vs GPU resource balancing is application-specific
├── Inference Latency Issues
│   ├── P99 latency 10-100x worse than average
│   ├── Batch processing vs real-time serving trade-offs
│   └── Model quantization breaks on edge cases
└── Operational Monitoring Gaps
    ├── Model drift detection requires domain expertise
    ├── Performance degradation often silent and gradual
    └── A/B testing infrastructure for model updates is complex

Prompt Engineering Pipeline Reality

Theory: "Automated prompt optimization and testing"
Reality Check:
├── Version Control Nightmares
│   ├── Prompt changes break in subtle, hard-to-test ways
│   ├── Git diffs on natural language prompts are useless
│   └── Rollback scenarios require prompt-specific testing
├── Testing Complexity
│   ├── Output quality evaluation requires human judgment
│   ├── Regression testing requires expensive model inference
│   └── Edge case coverage is fundamentally incomplete
├── Production Monitoring
│   ├── Quality degradation often goes unnoticed
│   ├── Usage pattern changes break carefully tuned prompts
│   └── Model provider updates silently change behavior
└── Human-AI Workflow Integration
    ├── Human review bottlenecks scale poorly
    ├── Context switching between AI and human work is expensive
    └── Quality varies dramatically across different operators

Security Implementation Reality

Zero Trust Architecture Deployment

Theory: "Never trust, always verify"
Reality Experience:
├── Identity and Access Management Complexity
│   ├── Certificate rotation at scale requires automation that often breaks
│   ├── Service mesh proxy failures create hard-to-debug connectivity issues
│   ├── Policy engine becomes single point of failure with unclear failure modes
│   └── Integration with legacy systems requires trust compromises
├── Network Segmentation Operational Burden
│   ├── Firewall rule complexity grows exponentially with service count
│   ├── Troubleshooting network issues requires deep packet inspection expertise
│   ├── Performance impact of encryption everywhere affects latency-sensitive applications
│   └── Change management for network policies becomes critical path for development
├── Monitoring and Compliance Overhead
│   ├── Every network connection must be logged and analyzed
│   ├── SIEM systems generate more false positives than actionable alerts
│   ├── Compliance reporting requires custom tooling and manual verification
│   └── Audit trail storage and retention costs become significant budget item
└── User Experience and Productivity Impact
    ├── Additional authentication steps reduce developer productivity by 15-30%
    ├── VPN alternatives (ZTNA) have reliability and performance issues
    ├── Mobile device management becomes critical and expensive
    └── Remote work scenarios require careful policy design to avoid lockout

Operational Cost Reality Modeling

True Total Cost of Ownership

Hidden Cost Categories

Technology Decision → Operational Cost Analysis
├── Development Costs
│   ├── Learning Curve: Time to productivity
│   ├── Integration Complexity: Unexpected compatibility issues
│   ├── Debugging Tools: Specialized tooling and training
│   └── Testing Infrastructure: Comprehensive test environment setup
├── Operational Costs
│   ├── Monitoring and Alerting: Custom dashboards and alert tuning
│   ├── Incident Response: On-call expertise and escalation procedures
│   ├── Capacity Planning: Performance testing and scaling analysis
│   └── Security Maintenance: Patching, vulnerability management, compliance
├── Human Costs
│   ├── Specialized Hiring: Market rates for scarce expertise
│   ├── Knowledge Transfer: Documentation and cross-training burden
│   ├── On-Call Rotation: Quality of life and burnout considerations
│   └── Career Development: Training and certification requirements
└── Organizational Costs
    ├── Change Management: Process updates and workflow changes
    ├── Vendor Management: Contract negotiation and relationship maintenance
    ├── Compliance and Audit: Regular assessment and remediation
    └── Technology Migration: Future replacement and upgrade costs

Realistic Cost Examples

Microservices Migration (Mid-size Company):
├── Development: $2M (18 months, 8 engineers)
├── Infrastructure: $500K/year (Kubernetes, monitoring, CI/CD)
├── Operations: $800K/year (2 SREs, on-call, incident response)
├── Hidden Costs: $1M first year (debugging, integration issues, rework)
└── Ongoing Burden: 30% of development velocity for service coordination

AI/ML Platform (Series B Startup):
├── Infrastructure: $100K/year (GPU compute, model serving)
├── Development: $1.5M (12 months, 6 ML engineers)
├── Operations: $300K/year (MLOps engineer, monitoring, retraining)
├── Data Engineering: $500K/year (data pipelines, quality, governance)
└── Business Risk: 6-month delay to market due to model reliability issues

Scaling Breakdown Patterns

Performance Cliff Analysis

Common Scaling Cliffs

System Component → Scaling Cliff Pattern
├── Database Connection Pools
│   ├── Cliff at ~1000 connections per database
│   ├── Symptoms: Connection timeouts, query queueing
│   └── Fix: Read replicas, connection pooling, query optimization
├── Service Discovery
│   ├── Cliff at ~500 services in service mesh
│   ├── Symptoms: Control plane overload, configuration propagation delays
│   └── Fix: Hierarchical service discovery, regional isolation
├── Log Aggregation
│   ├── Cliff at ~10GB/day per service
│   ├── Symptoms: Log ingestion lag, search timeouts, storage costs
│   └── Fix: Structured logging, sampling, retention policies
└── Container Orchestration
    ├── Cliff at ~1000 pods per node
    ├── Symptoms: Kubelet memory usage, etcd performance
    └── Fix: Node sizing, cluster federation, workload distribution

Organizational Scaling Limits

Conway's Law in Practice

Team Structure → System Architecture Constraints
├── Communication Overhead
│   ├── N-squared scaling of coordination for N teams
│   ├── Service boundaries must match team boundaries
│   └── Shared services become organizational bottlenecks
├── Knowledge Distribution
│   ├── System expertise becomes concentrated in few individuals
│   ├── Documentation lags reality by 6-12 months
│   └── On-call knowledge transfer requires explicit rotation planning
├── Decision Making Latency
│   ├── Cross-team decisions slow with organizational size
│   ├── Technical standards enforcement requires dedicated governance
│   └── Emergency response procedures break down without clear ownership
└── Technology Adoption Patterns
    ├── Technology choices reflect hiring biases and team preferences
    ├── Legacy system maintenance competes with new feature development
    └── Technical debt accumulation follows team structure boundaries

Integration with Other Skills

With Implementation-Grounded CTO

Provide war stories that ground architectural decisions
Supply realistic cost models for technology choices
Share failure patterns that inform design decisions

With Adversarial Intelligence

Identify operational attack vectors that emerge in production
Provide real-world failure scenarios for threat modeling
Ground security analysis in operational constraints

With Formal Proof Construction

Supply empirical data that validates theoretical analysis
Provide counterexamples from production experience
Ground mathematical models in operational reality

Success Metrics

System Builder Experience Quality

Prediction Accuracy: How often deployment reality matches predictions
Cost Estimation: How close TCO estimates are to actual costs
Failure Prevention: Percentage of predicted failure modes that occur
Time to Production: Realistic timeline estimates vs actual delivery

Operational Intelligence

Maintenance Burden Reduction: Lower operational overhead through better design
Incident Prevention: Fewer production issues due to proactive design
Scaling Preparedness: Systems that gracefully handle growth
Knowledge Transfer: Effective documentation and operational procedures

Adoption

pauljbernard/skills/advanced/system-builder-experience

$ install --global

Security Scan Results

SKILL.md

System Builder Experience - Deployment Reality and Operational Truth

Core Capability

Key Functions

1. Deployment Reality Checks

2. Failure Mode Prediction

3. Scaling Bottleneck Analysis

4. Operational Complexity Assessment

System Builder War Stories Database

Database Scaling Reality

The Theory

The Reality

What Actually Works

Microservices Reality Check

The Theory

The Reality

What Actually Scales

Container Orchestration Experience

The Theory

The Reality

What Actually Works

Specific Failure Pattern Analysis

AI System Deployment Failures

Model Serving Reality

Prompt Engineering Pipeline Reality

Security Implementation Reality

Zero Trust Architecture Deployment

Operational Cost Reality Modeling

True Total Cost of Ownership

Hidden Cost Categories

Realistic Cost Examples

Scaling Breakdown Patterns

Performance Cliff Analysis

Common Scaling Cliffs

Organizational Scaling Limits

Conway's Law in Practice

Integration with Other Skills

With Implementation-Grounded CTO

With Adversarial Intelligence

With Formal Proof Construction

Success Metrics

System Builder Experience Quality

Operational Intelligence

Related Skills

pauljbernard/skills/technology-mastery/security-tools-expertise

pauljbernard/skills/technology-mastery/monitoring-observability-expertise

pauljbernard/skills/technology-mastery/ml-ai-frameworks-expertise

pauljbernard/skills/technology-mastery/messaging-streaming-expertise

pauljbernard/skills/advanced/system-builder-experience

$ install --global

Security Scan Results

SKILL.md

System Builder Experience - Deployment Reality and Operational Truth

Core Capability

Key Functions

1. Deployment Reality Checks

2. Failure Mode Prediction

3. Scaling Bottleneck Analysis

4. Operational Complexity Assessment

System Builder War Stories Database

Database Scaling Reality

The Theory

The Reality

What Actually Works

Microservices Reality Check

The Theory

The Reality

What Actually Scales

Container Orchestration Experience

The Theory

The Reality

What Actually Works

Specific Failure Pattern Analysis

AI System Deployment Failures

Model Serving Reality

Prompt Engineering Pipeline Reality