.github/skills/synthetic-data/SKILL.md
Generate synthetic training data using NVIDIA Cosmos world foundation models for SDG pipelines
npx skillsauth add microsoft/physical-ai-toolchain synthetic-dataInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Generate photorealistic training data using NVIDIA Cosmos world foundation models — Cosmos Transfer, Cosmos Predict, and Cosmos Reason.
The Synthetic Data domain provides SDG pipelines that transform simulation-rendered frames into photorealistic training data, predict future environment states, and curate output for training quality.
| Stage | Model | Purpose | |-------|-------|---------| | Transfer | Cosmos Transfer 2.5 | Convert Isaac Sim renders to photorealistic images | | Predict | Cosmos Predict 2.5 | Generate future frame sequences from observations | | Reason | Cosmos Reason 2 | Assess data quality and filter training samples |
SDG workflows can be submitted via OSMO or AzureML:
synthetic-data/workflows/osmo/synthetic-data/workflows/azureml/| File | Purpose |
|------|---------|
| synthetic-data/README.md | Domain overview and directory structure |
| synthetic-data/workflows/osmo/sdg-pipeline.yaml | End-to-end OSMO SDG pipeline |
| synthetic-data/cosmos/configs/README.md | Model configuration reference |
| synthetic-data/specifications/synthetic-data.specification.md | SDG pipeline specification |
| synthetic-data/specifications/cosmos-integration.specification.md | Cosmos model integration specification |
All NVIDIA Cosmos containers require:
| Variable | Value |
|----------|-------|
| ACCEPT_EULA | Y |
| PRIVACY_CONSENT | Y |
| NVIDIA_DRIVER_CAPABILITIES | all |
Each Cosmos model stage requires a minimum of 1x A100 (40 GB) GPU. H100 (80 GB) recommended for production workloads.
tools
Submit, monitor, analyze, and evaluate LeRobot imitation learning training jobs on OSMO with Azure ML MLflow integration and inference evaluation - Brought to you by microsoft/physical-ai-toolchain
tools
Deploy and manage Azure infrastructure for the Physical AI Toolchain including Terraform IaC, Kubernetes setup, GPU configuration, and network topology
devops
Monitor robot fleet telemetry via Azure IoT Operations, drift detection, Grafana dashboards, and Fabric analytics
tools
Deploy trained robot policies to edge fleets via FluxCD GitOps, image automation, and deployment gating