Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

JosiahSiegel/ml-azureml-adf-automation

Name: ml-azureml-adf-automation
Author: JosiahSiegel

plugins/ml-master/skills/ml-azureml-adf-automation/SKILL.md

npx skillsauth add JosiahSiegel/claude-plugin-marketplace ml-azureml-adf-automation

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Azure ML and ADF Automation

Overview

Use this skill for Azure Machine Learning automation that registers code assets in CI and orchestrates training, scoring, registration, or deployment through Azure Data Factory. The main invariant is that runtime systems must consume the exact Azure ML asset versions that were actually registered, not the versions a pipeline attempted to request. Validate every recommendation against runtime behavior because Azure ML, ADF, storage networking, and SDK dependency behavior can diverge from static API documentation.

Core Invariants

CI owns Azure ML code asset registration and publishes the actual SDK-returned version.
ADF receives code versions through an explicit contract, usually a storage pointer blob, instead of discovering AML code versions at runtime.
The SDK result is the source of truth: requested version, build ID, branch name, or commit-derived strings are not authoritative.
Private storage requires both correct RBAC and proven data-plane reachability from the executing runtime.
ADF WebActivity networking must be tested through the intended integration runtime, not just validated as JSON.
Dependency constraints for Azure ML automation are pinned in CI environments.
Runtime evidence beats plausible ARM paths, documentation snippets, or successful template compilation.

Azure ML Code Asset Registration

Prefer the Python SDK for registering Azure ML code assets when automation must reliably capture the registered version. Use the Azure CLI only after confirming the target environment's az ml extension supports the needed code commands and returns enough information for downstream automation.

from azure.ai.ml import MLClient
from azure.ai.ml.entities._assets._artifacts.code import Code
from azure.identity import AzureCliCredential

ml_client = MLClient(
    AzureCliCredential(),
    subscription_id,
    resource_group,
    workspace_name,
)

result = ml_client._code.create_or_update(
    Code(name=code_name, version=requested_version, path=staged_code_path)
)
actual_version = result.version
print(actual_version)

Do not assume requested_version == result.version. Azure ML code assets can deduplicate uploads by content hash and return an existing version when the staged directory matches a prior asset. That is useful storage behavior but dangerous if CI publishes a requested build identifier instead of the SDK-returned version.

CI Output Variable Pattern

Publish the returned version as a pipeline output variable and wire downstream steps to that output.

print(
    "##vso[task.setvariable "
    f"variable=trainingCodeVersion;isOutput=true]{result.version}"
)

Prefer:

$registeredVersion = '$(RegisterTrainingCode.trainingCodeVersion)'

Avoid:

$registeredVersion = '$(Build.BuildId)'

If unique asset versions are operationally required even when code content repeats, stage the code directory and write a small marker file such as .aml-code-asset-version before registration. Treat this only as a dedup workaround. The real contract remains the SDK-returned result.version.

ADF to Azure ML Version Resolution

Avoid making ADF discover AML code versions through Azure ML ARM code-container endpoints unless that exact path has passed runtime validation. Some AML management endpoints can appear valid in REST documentation but fail from ADF WebActivity at execution time with unsupported-operation behavior. Treat that as service behavior until proven otherwise, not primarily an RBAC problem.

Use a CI-written pointer blob as the runtime contract between registration and orchestration:

https://<storage-account>.blob.core.windows.net/ml-globals/code-assets/training-code/latest.json

Example payload:

{
  "assetName": "training-code",
  "version": "<actual-sdk-returned-version>",
  "workspaceName": "<workspace-name>",
  "resourceGroup": "<resource-group>",
  "subscriptionId": "<subscription-id>",
  "buildId": "<build-or-run-id>",
  "sourceBranch": "<branch>",
  "sourceVersion": "<source-version>",
  "registeredAtUtc": "<timestamp>"
}

ADF reads version from this blob and passes it as a parameter to training, scoring, model registration, or deployment pipelines. The payload may include provenance fields, but downstream jobs should depend only on fields that are deliberately part of the contract.

ADF WebActivity for Pointer Blob Reads

Read the pointer blob with managed identity authentication against Azure Storage:

{
  "name": "ReadLatestTrainingCodeVersion",
  "type": "WebActivity",
  "typeProperties": {
    "method": "GET",
    "url": {
      "type": "Expression",
      "value": "@concat('https://', pipeline().globalParameters.StorageAccountName, '.blob.core.windows.net/ml-globals/code-assets/training-code/latest.json')"
    },
    "headers": {
      "x-ms-version": "2023-11-03",
      "Accept": "application/json"
    },
    "authentication": {
      "type": "MSI",
      "resource": "https://storage.azure.com/"
    },
    "connectVia": {
      "referenceName": "<managed-vnet-ir-name>",
      "type": "IntegrationRuntimeReference"
    }
  }
}

Critical placement rule: for ADF WebActivity, connectVia belongs inside typeProperties. If it is placed at the activity root, it can be ignored, causing traffic to leave over the public internet and fail against storage accounts with defaultAction: Deny.

Required access commonly includes:

ADF managed identity: Storage Blob Data Reader on the pointer container or account scope.
CI service connection identity: Storage Blob Data Contributor to write pointer blobs.
CI service connection identity: Storage Account Contributor when the pipeline manages storage firewall rules.

Private Storage and Hosted CI Agents

For storage accounts with private endpoints and defaultAction: Deny, Microsoft-hosted CI agents usually egress from public per-run IP addresses. Correct RBAC is not enough if the agent cannot reach the storage data plane. Before blaming the Azure ML SDK, ADF, or IAM, prove storage reachability from the agent.

Safe CI pattern:

Resolve the agent public IP.
Add a temporary storage network rule for that IP.
Wait for propagation.
Smoke-test storage data-plane access.
Register code assets and write pointer blobs.
Remove the temporary rule in an always() cleanup step.

$agentIp = (Invoke-RestMethod -Uri 'https://api.ipify.org' -TimeoutSec 20).Trim()

az storage account network-rule add `
  --resource-group $rg `
  --account-name $storageAccount `
  --action Allow `
  --ip-address $agentIp `
  --only-show-errors

Start-Sleep -Seconds 30

az storage container list `
  --account-name $storageAccount `
  --auth-mode login `
  --only-show-errors `
  -o none

Cleanup should run even when registration fails. In Azure DevOps YAML, put network-rule removal in a step with condition: always().

Python Dependency Pinning

Some azure-ai-ml versions import private marshmallow symbols that are unavailable in marshmallow 4.x. Hosted agents can install an incompatible transitive version and fail before any Azure ML API call runs. Pin the SDK and transitive dependency together when using affected versions.

python -m pip install --upgrade `
  "azure-ai-ml==1.24.0" `
  "azure-identity==1.19.0" `
  "marshmallow>=3.18,<4.0"

If using a newer SDK, verify the dependency behavior in CI rather than removing the pin based on local success.

ADF Development Workflow

Confirm which ADF execution mode reads unpublished Git-branch state and which mode runs the published factory definition. Debug runs may exercise branch state, while scheduled and production runs typically execute the last published factory. Manual trigger behavior depends on how the factory is configured and invoked. Pick the mode that actually exercises the change being validated.

Runtime Validation Standard

Accept runtime evidence, not structural plausibility. Validate:

Azure ML registration returned result.version and CI propagated that exact value.
Pointer blobs contain the version that Azure ML actually registered.
ADF reads the blob through the intended integration runtime and managed identity.
Storage is tested with the real firewall posture and CI agent egress path.
Downstream ADF parameters flow into the AML job definition used at runtime.
The AML training or scoring job starts with the expected code asset version.
SDK imports succeed in the same hosted-agent image used by CI.

Insufficient validation includes: docs showing an endpoint exists, JSON parsing, a plausible ARM URL, a successful deployment template, a requested version printed in logs without checking result.version, or review comments without runtime evidence.

Operational Checklist

[ ] Does CI own registration of Azure ML code assets?
[ ] Does CI capture and publish SDK-returned result.version?
[ ] Does ADF receive the exact code version through an explicit parameter or pointer blob?
[ ] Does the pointer payload separate contract fields from optional provenance metadata?
[ ] Are ADF WebActivities using connectVia inside typeProperties when private networking is required?
[ ] Is MSI blob access using the auth resource https://storage.azure.com/?
[ ] Does the ADF managed identity have Storage Blob Data Reader?
[ ] Does the CI service connection identity have Storage Blob Data Contributor?
[ ] If CI changes storage firewall rules, does it add, wait, smoke-test, and remove in cleanup?
[ ] Are known Azure ML SDK dependency constraints pinned?
[ ] Was validation performed against the runtime infrastructure path, not only compiled or reviewed?

Short Rules

Use the Azure ML Python SDK for code registration unless CLI behavior is verified in the target environment.
Never assume requested version equals registered version.
Treat SDK result.version as the source of truth.
Avoid AML ARM /codes/... discovery from ADF without runtime testing.
Use blob pointers for ADF-readable ML asset version contracts.
Put ADF WebActivity connectVia inside typeProperties.
Assume hosted CI agents need temporary storage firewall access for private storage.
Pin transitive dependencies when AML SDK imports have known constraints.
Require runtime validation for ML infrastructure changes.

Sources

Azure Machine Learning documentation: https://learn.microsoft.com/azure/machine-learning/
Azure Data Factory Web activity documentation: https://learn.microsoft.com/azure/data-factory/control-flow-web-activity
Azure Storage firewall and virtual network documentation: https://learn.microsoft.com/azure/storage/common/storage-network-security
Azure Machine Learning Python SDK documentation: https://learn.microsoft.com/python/api/overview/azure/ai-ml-readme

JosiahSiegel/ml-azureml-adf-automation

plugins/ml-master/skills/ml-azureml-adf-automation/SKILL.md

This skill should be used when the user asks to automate Azure ML and Azure Data Factory production workflows. PROACTIVELY activate for: (1) Azure ML code asset registration, azure-ai-ml SDK, AML code versions, `result.version`, requested-vs-actual versions, (2) ADF to Azure ML orchestration, ADF WebActivity, managed identity blob reads, `connectVia`, managed VNet IR, (3) code asset version pointer blobs, latest.json contracts, training/scoring code version propagation, (4) private storage firewalls, Microsoft-hosted CI agents, temporary network rules, storage data-plane smoke tests, (5) marshmallow<4 pinning, AML SDK import failures, runtime validation for Azure ML infrastructure. Provides: operationally safe Azure ML + ADF automation patterns.

38 stars

tools

Updated May 28, 2026

$ install --global

skillsauth

npx skillsauth add JosiahSiegel/claude-plugin-marketplace ml-azureml-adf-automation

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: May 28, 2026, 7:32 AM53.7s1 file scanned

SKILL.md

name:: ml-azureml-adf-automation
description:: |
This skill should be used when the user asks to automate Azure ML and Azure Data Factory production workflows. PROACTIVELY activate for:: (1) Azure ML code asset registration, azure-ai-ml SDK, AML code versions, `result.version`, requested-vs-actual versions, (2) ADF to Azure ML orchestration, ADF WebActivity, managed identity blob reads, `connectVia`, managed VNet IR, (3) code asset version pointer blobs, latest.json contracts, training/scoring code version propagation, (4) private storage firewalls, Microsoft-hosted CI agents, temporary network rules, storage data-plane smoke tests, (5) marshmallow<4 pinning, AML SDK import failures, runtime validation for Azure ML infrastructure. Provides: operationally safe Azure ML + ADF automation patterns.

Azure ML and ADF Automation

Overview

Core Invariants

CI owns Azure ML code asset registration and publishes the actual SDK-returned version.
ADF receives code versions through an explicit contract, usually a storage pointer blob, instead of discovering AML code versions at runtime.
The SDK result is the source of truth: requested version, build ID, branch name, or commit-derived strings are not authoritative.
Private storage requires both correct RBAC and proven data-plane reachability from the executing runtime.
ADF WebActivity networking must be tested through the intended integration runtime, not just validated as JSON.
Dependency constraints for Azure ML automation are pinned in CI environments.
Runtime evidence beats plausible ARM paths, documentation snippets, or successful template compilation.

Azure ML Code Asset Registration

from azure.ai.ml import MLClient
from azure.ai.ml.entities._assets._artifacts.code import Code
from azure.identity import AzureCliCredential

ml_client = MLClient(
    AzureCliCredential(),
    subscription_id,
    resource_group,
    workspace_name,
)

result = ml_client._code.create_or_update(
    Code(name=code_name, version=requested_version, path=staged_code_path)
)
actual_version = result.version
print(actual_version)

CI Output Variable Pattern

Publish the returned version as a pipeline output variable and wire downstream steps to that output.

print(
    "##vso[task.setvariable "
    f"variable=trainingCodeVersion;isOutput=true]{result.version}"
)

Prefer:

$registeredVersion = '$(RegisterTrainingCode.trainingCodeVersion)'

Avoid:

$registeredVersion = '$(Build.BuildId)'

ADF to Azure ML Version Resolution

Use a CI-written pointer blob as the runtime contract between registration and orchestration:

https://<storage-account>.blob.core.windows.net/ml-globals/code-assets/training-code/latest.json

Example payload:

{
  "assetName": "training-code",
  "version": "<actual-sdk-returned-version>",
  "workspaceName": "<workspace-name>",
  "resourceGroup": "<resource-group>",
  "subscriptionId": "<subscription-id>",
  "buildId": "<build-or-run-id>",
  "sourceBranch": "<branch>",
  "sourceVersion": "<source-version>",
  "registeredAtUtc": "<timestamp>"
}

ADF WebActivity for Pointer Blob Reads

Read the pointer blob with managed identity authentication against Azure Storage:

{
  "name": "ReadLatestTrainingCodeVersion",
  "type": "WebActivity",
  "typeProperties": {
    "method": "GET",
    "url": {
      "type": "Expression",
      "value": "@concat('https://', pipeline().globalParameters.StorageAccountName, '.blob.core.windows.net/ml-globals/code-assets/training-code/latest.json')"
    },
    "headers": {
      "x-ms-version": "2023-11-03",
      "Accept": "application/json"
    },
    "authentication": {
      "type": "MSI",
      "resource": "https://storage.azure.com/"
    },
    "connectVia": {
      "referenceName": "<managed-vnet-ir-name>",
      "type": "IntegrationRuntimeReference"
    }
  }
}

Required access commonly includes:

ADF managed identity: Storage Blob Data Reader on the pointer container or account scope.
CI service connection identity: Storage Blob Data Contributor to write pointer blobs.
CI service connection identity: Storage Account Contributor when the pipeline manages storage firewall rules.

Private Storage and Hosted CI Agents

Safe CI pattern:

Resolve the agent public IP.
Add a temporary storage network rule for that IP.
Wait for propagation.
Smoke-test storage data-plane access.
Register code assets and write pointer blobs.
Remove the temporary rule in an always() cleanup step.

$agentIp = (Invoke-RestMethod -Uri 'https://api.ipify.org' -TimeoutSec 20).Trim()

az storage account network-rule add `
  --resource-group $rg `
  --account-name $storageAccount `
  --action Allow `
  --ip-address $agentIp `
  --only-show-errors

Start-Sleep -Seconds 30

az storage container list `
  --account-name $storageAccount `
  --auth-mode login `
  --only-show-errors `
  -o none

Cleanup should run even when registration fails. In Azure DevOps YAML, put network-rule removal in a step with condition: always().

Python Dependency Pinning

python -m pip install --upgrade `
  "azure-ai-ml==1.24.0" `
  "azure-identity==1.19.0" `
  "marshmallow>=3.18,<4.0"

If using a newer SDK, verify the dependency behavior in CI rather than removing the pin based on local success.

ADF Development Workflow

Runtime Validation Standard

Accept runtime evidence, not structural plausibility. Validate:

Azure ML registration returned result.version and CI propagated that exact value.
Pointer blobs contain the version that Azure ML actually registered.
ADF reads the blob through the intended integration runtime and managed identity.
Storage is tested with the real firewall posture and CI agent egress path.
Downstream ADF parameters flow into the AML job definition used at runtime.
The AML training or scoring job starts with the expected code asset version.
SDK imports succeed in the same hosted-agent image used by CI.

Operational Checklist

[ ] Does CI own registration of Azure ML code assets?
[ ] Does CI capture and publish SDK-returned result.version?
[ ] Does ADF receive the exact code version through an explicit parameter or pointer blob?
[ ] Does the pointer payload separate contract fields from optional provenance metadata?
[ ] Are ADF WebActivities using connectVia inside typeProperties when private networking is required?
[ ] Is MSI blob access using the auth resource https://storage.azure.com/?
[ ] Does the ADF managed identity have Storage Blob Data Reader?
[ ] Does the CI service connection identity have Storage Blob Data Contributor?
[ ] If CI changes storage firewall rules, does it add, wait, smoke-test, and remove in cleanup?
[ ] Are known Azure ML SDK dependency constraints pinned?
[ ] Was validation performed against the runtime infrastructure path, not only compiled or reviewed?

Short Rules

Use the Azure ML Python SDK for code registration unless CLI behavior is verified in the target environment.
Never assume requested version equals registered version.
Treat SDK result.version as the source of truth.
Avoid AML ARM /codes/... discovery from ADF without runtime testing.
Use blob pointers for ADF-readable ML asset version contracts.
Put ADF WebActivity connectVia inside typeProperties.
Assume hosted CI agents need temporary storage firewall access for private storage.
Pin transitive dependencies when AML SDK imports have known constraints.
Require runtime validation for ML infrastructure changes.

Sources

Azure Machine Learning documentation: https://learn.microsoft.com/azure/machine-learning/
Azure Data Factory Web activity documentation: https://learn.microsoft.com/azure/data-factory/control-flow-web-activity
Azure Storage firewall and virtual network documentation: https://learn.microsoft.com/azure/storage/common/storage-network-security
Azure Machine Learning Python SDK documentation: https://learn.microsoft.com/python/api/overview/azure/ai-ml-readme

Related Skills

JosiahSiegel/clerk-sessions-webhooks-security

development

VerifiedTrustedCommunity

Use for Clerk sessions, tokens, webhooks, orgs, and security. PROACTIVELY activate for session tokens, JWT templates, getToken(), custom claims, pending sessions, multi-session UX, organizations, roles, permissions, system vs custom permissions, features/plans, MFA/passkeys/password policy/bot protection, Clerk webhooks, Svix signatures, verifyWebhook(), user/org sync, retries/replays, environment variables, custom domains, secret rotation, logs, and auth security reviews. Provides token semantics, webhook idempotency, authorization defaults, and hardening checklist.

45SKILL.mdUpdated Jun 19, 2026

JosiahSiegel/clerk-sessions-webhooks-security

JosiahSiegel/clerk-nextjs-auth

tools

VerifiedTrustedCommunity

Use for Clerk in Next.js. PROACTIVELY activate for @clerk/nextjs setup, App Router auth()/currentUser(), clerkMiddleware(), proxy.ts/middleware.ts, createRouteMatcher(), protected pages/layouts/Route Handlers/Server Actions/API routes/tRPC, auth.protect() role/permission/token checks, ClerkProvider placement, server-only clerkClient, Link prefetch, redirects, 401/404 auth failures, custom domains, __clerk proxy paths, and deployment gotchas. Provides file patterns, server/client boundary rules, matcher templates, and production checks.

45SKILL.mdUpdated Jun 19, 2026

JosiahSiegel/clerk-nextjs-auth

JosiahSiegel/clerk-frontend-sdks

development

VerifiedTrustedCommunity

Use for Clerk frontend auth flows. PROACTIVELY activate for React, JavaScript, Vue, Nuxt, Astro, Expo, React Router, TanStack React Start, or SPA setup; ClerkProvider and publishable-key wiring; SignIn/SignUp/UserButton/UserProfile/OrganizationSwitcher; custom useUser/useAuth/useClerk/useSignIn/useSignUp/useSession/useOrganization flows; multi-session UX; cross-origin getToken() fetches; loading states, redirects, routing, CORS/cookies, or hydration bugs. Provides SDK selection, UI patterns, token-fetch templates, and frontend gotchas.

45SKILL.mdUpdated Jun 19, 2026

JosiahSiegel/clerk-frontend-sdks

JosiahSiegel/clerk-environments-deployment

development

VerifiedTrustedCommunity

Use for Clerk dev/prod readiness, deployment, and multi-language implementation planning. PROACTIVELY activate for environment variables, pk_test/sk_test vs pk_live/sk_live, local dev, preview/staging/prod instances, domains/DNS, redirects, OAuth credentials, custom domains/proxy, authorizedParties, CSP, CORS/cookies, webhooks/tunnels, Vercel/Netlify/Cloudflare/API gateways, monitoring/troubleshooting, and backends in Node/Express/Fastify, Python/FastAPI/Django/Flask, Go, Ruby/Rails, Java/Spring, .NET, PHP/Laravel. Provides checklists, rollout plans, and language-portable patterns.

45SKILL.mdUpdated Jun 19, 2026

JosiahSiegel/clerk-environments-deployment

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/JosiahSiegel/claude-plugin-marketplace.git

# Copy into Claude Code skills folder (global)
cp -r claude-plugin-marketplace/plugins/ml-master/skills/ml-azureml-adf-automation ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

JosiahSiegel/claude-plugin-marketplace

38 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT