.gemini/skills/MLOps Observability/SKILL.md
Guide to implement full stack observability including reproducibility, lineage, monitoring, alerting, and explainability.
npx skillsauth add fmind/mlops-python-package MLOps ObservabilityInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
To implement a "Glass Box" system where every result is Reproducible, every asset has Lineage, and system health is Monitored, Alerted on, and Explained.
Consistency is key. For instance:
random, numpy, torch, tensorflow.docker and locked dependencies (uv.lock).justfile with uv build --build-constraint for deterministic wheels.Know the origin of your data. For instance:
mlflow.data.from_pandas.mlflow.log_input.data/v1.csv) or use DVC.Watch for silent failures. For instance:
MLflow Evaluate to gate models against quality thresholds.evidently to compare reference (training) vs current (production) data.
log_system_metrics=True) for CPU/GPU.Don't stare at dashboards. For instance:
plyer for desktop notifications during long training runs.PagerDuty (critical) or Slack (warnings).Trust but verify. For instance:
SHAP values to explain individual predictions.Optimize resources. For instance:
project, env, user.run_time and instance type to estimate ROI.log_system_metrics enabled?development
Guide to implement rigorous validation layers including static analysis, automated testing, structured logging, and security scanning.
testing
Guide to create structured, reproducible Jupyter notebooks for MLOps prototyping, emphasizing configuration management and pipeline integrity.
tools
Guide to initialize a new MLOps project with standard tools (uv, git, VS Code) and best practices.
development
Guide to transform prototypes into robust, distributable Python packages using the src layout, hybrid paradigm, and strict configuration management.