name: data-driven-feature
description: Build features guided by data insights, A/B testing, and continuous measurement using specialized agents for analysis, implementation, and experimentation.
Data-Driven Feature Development
Build features guided by data insights, A/B testing, and continuous measurement using specialized agents for analysis, implementation, and experimentation.
[Extended thinking: This workflow orchestrates a comprehensive data-driven development process from initial data analysis and hypothesis formulation through feature implementation with integrated analytics, A/B testing infrastructure, and post-launch analysis. Each phase leverages specialized agents to ensure features are built based on data insights, properly instrumented for measurement, and validated through controlled experiments. The workflow emphasizes modern product analytics practices, statistical rigor in testing, and continuous learning from user behavior.]
Phase 1: Data Analysis and Hypothesis Formation
1. Exploratory Data Analysis
- Do this step directly in Codex CLI (legacy playbook referenced subagent: machine-learning-ops::data-scientist).
- Instruction: "Perform exploratory data analysis for feature: (use the user's prompt). Analyze existing user behavior data, identify patterns and opportunities, segment users by behavior, and calculate baseline metrics. Use modern analytics tools (Amplitude, Mixpanel, Segment) to understand current user journeys, conversion funnels, and engagement patterns."
- Output: EDA report with visualizations, user segments, behavioral patterns, baseline metrics
2. Business Hypothesis Development
- Do this step directly in Codex CLI (legacy playbook referenced subagent: business-analytics::business-analyst).
- Context: Data scientist's EDA findings and behavioral patterns
- Instruction: "Formulate business hypotheses for feature: (use the user's prompt) based on data analysis. Define clear success metrics, expected impact on key business KPIs, target user segments, and minimum detectable effects. Create measurable hypotheses using frameworks like ICE scoring or RICE prioritization."
- Output: Hypothesis document, success metrics definition, expected ROI calculations
3. Statistical Experiment Design
- Do this step directly in Codex CLI (legacy playbook referenced subagent: machine-learning-ops::data-scientist).
- Context: Business hypotheses and success metrics
- Instruction: "Design statistical experiment for feature: (use the user's prompt). Calculate required sample size for statistical power, define control and treatment groups, specify randomization strategy, and plan for multiple testing corrections. Consider Bayesian A/B testing approaches for faster decision making. Design for both primary and guardrail metrics."
- Output: Experiment design document, power analysis, statistical test plan
Phase 2: Feature Architecture and Analytics Design
4. Feature Architecture Planning
- Do this step directly in Codex CLI (legacy playbook referenced subagent: data-engineering::backend-architect).
- Context: Business requirements and experiment design
- Instruction: "Design feature architecture for: (use the user's prompt) with A/B testing capability. Include feature flag integration (LaunchDarkly, Split.io, or Optimizely), gradual rollout strategy, circuit breakers for safety, and clean separation between control and treatment logic. Ensure architecture supports real-time configuration updates."
- Output: Architecture diagrams, feature flag schema, rollout strategy
5. Analytics Instrumentation Design
- Do this step directly in Codex CLI (legacy playbook referenced subagent: data-engineering::data-engineer).
- Context: Feature architecture and success metrics
- Instruction: "Design comprehensive analytics instrumentation for: (use the user's prompt). Define event schemas for user interactions, specify properties for segmentation and analysis, design funnel tracking and conversion events, plan cohort analysis capabilities. Implement using modern SDKs (Segment, Amplitude, Mixpanel) with proper event taxonomy."
- Output: Event tracking plan, analytics schema, instrumentation guide
6. Data Pipeline Architecture
- Do this step directly in Codex CLI (legacy playbook referenced subagent: data-engineering::data-engineer).
- Context: Analytics requirements and existing data infrastructure
- Instruction: "Design data pipelines for feature: (use the user's prompt). Include real-time streaming for live metrics (Kafka, Kinesis), batch processing for detailed analysis, data warehouse integration (Snowflake, BigQuery), and feature store for ML if applicable. Ensure proper data governance and GDPR compliance."
- Output: Pipeline architecture, ETL/ELT specifications, data flow diagrams
Phase 3: Implementation with Instrumentation
7. Backend Implementation
- Do this step directly in Codex CLI (legacy playbook referenced subagent: backend-development::backend-architect).
- Context: Architecture design and feature requirements
- Instruction: "Implement backend for feature: (use the user's prompt) with full instrumentation. Include feature flag checks at decision points, comprehensive event tracking for all user actions, performance metrics collection, error tracking and monitoring. Implement proper logging for experiment analysis."
- Output: Backend code with analytics, feature flag integration, monitoring setup
8. Frontend Implementation
- Do this step directly in Codex CLI (legacy playbook referenced subagent: frontend-mobile-development::frontend-developer).
- Context: Backend APIs and analytics requirements
- Instruction: "Build frontend for feature: (use the user's prompt) with analytics tracking. Implement event tracking for all user interactions, session recording integration if applicable, performance metrics (Core Web Vitals), and proper error boundaries. Ensure consistent experience between control and treatment groups."
- Output: Frontend code with analytics, A/B test variants, performance monitoring
9. ML Model Integration (if applicable)
- Do this step directly in Codex CLI (legacy playbook referenced subagent: machine-learning-ops::ml-engineer).
- Context: Feature requirements and data pipelines
- Instruction: "Integrate ML models for feature: (use the user's prompt) if needed. Implement online inference with low latency, A/B testing between model versions, model performance tracking, and automatic fallback mechanisms. Set up model monitoring for drift detection."
- Output: ML pipeline, model serving infrastructure, monitoring setup
Phase 4: Pre-Launch Validation
10. Analytics Validation
- Do this step directly in Codex CLI (legacy playbook referenced subagent: data-engineering::data-engineer).
- Context: Implemented tracking and event schemas
- Instruction: "Validate analytics implementation for: (use the user's prompt). Test all event tracking in staging, verify data quality and completeness, validate funnel definitions, ensure proper user identification and session tracking. Run end-to-end tests for data pipeline."
- Output: Validation report, data quality metrics, tracking coverage analysis
11. Experiment Setup
- Do this step directly in Codex CLI (legacy playbook referenced subagent: cloud-infrastructure::deployment-engineer).
- Context: Feature flags and experiment design
- Instruction: "Configure experiment infrastructure for: (use the user's prompt). Set up feature flags with proper targeting rules, configure traffic allocation (start with 5-10%), implement kill switches, set up monitoring alerts for key metrics. Test randomization and assignment logic."
- Output: Experiment configuration, monitoring dashboards, rollout plan
Phase 5: Launch and Experimentation
12. Gradual Rollout
- Do this step directly in Codex CLI (legacy playbook referenced subagent: cloud-infrastructure::deployment-engineer).
- Context: Experiment configuration and monitoring setup
- Instruction: "Execute gradual rollout for feature: (use the user's prompt). Start with internal dogfooding, then beta users (1-5%), gradually increase to target traffic. Monitor error rates, performance metrics, and early indicators. Implement automated rollback on anomalies."
- Output: Rollout execution, monitoring alerts, health metrics
13. Real-time Monitoring
- Do this step directly in Codex CLI (legacy playbook referenced subagent: observability-monitoring::observability-engineer).
- Context: Deployed feature and success metrics
- Instruction: "Set up comprehensive monitoring for: (use the user's prompt). Create real-time dashboards for experiment metrics, configure alerts for statistical significance, monitor guardrail metrics for negative impacts, track system performance and error rates. Use tools like Datadog, New Relic, or custom dashboards."
- Output: Monitoring dashboards, alert configurations, SLO definitions
Phase 6: Analysis and Decision Making
14. Statistical Analysis
- Do this step directly in Codex CLI (legacy playbook referenced subagent: machine-learning-ops::data-scientist).
- Context: Experiment data and original hypotheses
- Instruction: "Analyze A/B test results for: (use the user's prompt). Calculate statistical significance with confidence intervals, check for segment-level effects, analyze secondary metrics impact, investigate any unexpected patterns. Use both frequentist and Bayesian approaches. Account for multiple testing if applicable."
- Output: Statistical analysis report, significance tests, segment analysis
15. Business Impact Assessment
- Do this step directly in Codex CLI (legacy playbook referenced subagent: business-analytics::business-analyst).
- Context: Statistical analysis and business metrics
- Instruction: "Assess business impact of feature: (use the user's prompt). Calculate actual vs expected ROI, analyze impact on key business metrics, evaluate cost-benefit including operational overhead, project long-term value. Make recommendation on full rollout, iteration, or rollback."
- Output: Business impact report, ROI analysis, recommendation document
16. Post-Launch Optimization
- Do this step directly in Codex CLI (legacy playbook referenced subagent: machine-learning-ops::data-scientist).
- Context: Launch results and user feedback
- Instruction: "Identify optimization opportunities for: (use the user's prompt) based on data. Analyze user behavior patterns in treatment group, identify friction points in user journey, suggest improvements based on data, plan follow-up experiments. Use cohort analysis for long-term impact."
- Output: Optimization recommendations, follow-up experiment plans
Configuration Options
experiment_config:
min_sample_size: 10000
confidence_level: 0.95
runtime_days: 14
traffic_allocation: "gradual" # gradual, fixed, or adaptive
analytics_platforms:
- amplitude
- segment
- mixpanel
feature_flags:
provider: "launchdarkly" # launchdarkly, split, optimizely, unleash
statistical_methods:
- frequentist
- bayesian
monitoring:
- real_time_metrics: true
- anomaly_detection: true
- automatic_rollback: true
Success Criteria
- Data Coverage: 100% of user interactions tracked with proper event schema
- Experiment Validity: Proper randomization, sufficient statistical power, no sample ratio mismatch
- Statistical Rigor: Clear significance testing, proper confidence intervals, multiple testing corrections
- Business Impact: Measurable improvement in target metrics without degrading guardrail metrics
- Technical Performance: No degradation in p95 latency, error rates below 0.1%
- Decision Speed: Clear go/no-go decision within planned experiment runtime
- Learning Outcomes: Documented insights for future feature development
Coordination Notes
- Data scientists and business analysts collaborate on hypothesis formation
- Engineers implement with analytics as first-class requirement, not afterthought
- Feature flags enable safe experimentation without full deployments
- Real-time monitoring allows for quick iteration and rollback if needed
- Statistical rigor balanced with business practicality and speed to market
- Continuous learning loop feeds back into next feature development cycle
Feature to develop with data-driven approach: (use the user's prompt)
Output Format
<result>
<analysis>Brief analysis</analysis>
<solution>Implementation</solution>
<considerations>Trade-offs and notes</considerations>
</result>