Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

arm-examples/llm-add-model-support

Name: llm-add-model-support
Author: arm-examples

skills/llm-add-model-support/SKILL.md

npx skillsauth add arm-examples/llm-runner llm-add-model-support

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Add model support

This repo’s “model support” typically means: a config in model_configuration_files/, model assets available under resources_downloaded/models/<backend>/..., and CTest coverage wired in test/CMakeLists.txt.

Windows note: if python3 isn’t available, use python (or py -3) for the scripts below.

If you only need “config + CTest wiring”

Use this path when you’re not onboarding new model assets or changing backend code, and only need to add/update a JSON config and ensure CI/test wiring covers it.

Add/update the config JSON under model_configuration_files/ (keep naming consistent for the target backend).
If you changed the config schema/keys, use skills/llm-config-schema-change/ for the parsing/tests/docs flow, then return here for config/test wiring.
Add the filename to the backend’s CONFIG_FILE_NAME list in test/CMakeLists.txt.
Verify it is listed:

python3 skills/llm-add-model-support/scripts/assert_config_listed.py <your-config.json>

If the config expects model assets to be downloaded automatically, add/update scripts/py/requirements.json (URLs + sha256sum) and keep downloads deterministic.

Workflow

1) Pick the target backend and model

Decide LLM_FRAMEWORK and a model identifier (folder name under resources_downloaded/models/<backend>/).
Check if the model is gated on Hugging Face; if yes, ensure HF_TOKEN or ~/.netrc is available for downloads.

2) Add/download the model assets (data-first)

Prefer extending the downloads manifest:

Add entries in scripts/py/requirements.json under models → <backend> → <model-name>.
Every entry must include url, destination, and sha256sum.
Use python3 skills/llm-add-model-support/scripts/sha256_file.py <file> to compute hashes for local files.

If downloads are not desirable (very large / licensing constraints), document how to supply the model manually and keep tests/configs from implicitly requiring the download. Never commit artifacts from resources_downloaded/ (keep downloads deterministic via scripts/py/requirements.json).

3) Add a model configuration JSON

Add model_configuration_files/<backend><Text|Vision>Config-<model>.json (follow existing naming patterns).
Ensure config fields have safe defaults and are parsed by src/cpp/config/ (LlmConfig.*).
If you introduce new config keys, update test/cpp/LlmConfigTest.cpp.

4) Handle backend-specific model quirks (code changes)

Use the minimal set of changes needed for the chosen backend:

Prompt template / chat formatting: update the backend’s query builder or chat adapter under src/cpp/frameworks/.
Tokenizer/model loading: ensure the backend can load the model file(s) referenced by the config.
Modalities: confirm SupportedInputModalities() matches the model (text-only vs vision).

5) Wire tests

Ensure CTest actually runs your config:

python3 skills/llm-add-model-support/scripts/assert_config_listed.py <your-config.json>

If it’s a new backend (or new modality), ensure test/CMakeLists.txt has the right branch and config list entries.

6) Validate outputs (build + run relevant tests)

cmake --preset=native -B build -DLLM_FRAMEWORK=<backend>
cmake --build ./build --parallel
ctest --test-dir ./build --output-on-failure

If JNI is not relevant to model support changes, isolate:

cmake --preset=native -B build -DLLM_FRAMEWORK=<backend> -DBUILD_JNI_LIB=OFF
cmake --build ./build --parallel
ctest --test-dir ./build --output-on-failure

If a new model causes output drift (tests fail but the model is “reasonable”), revise tests deliberately:

Prefer making prompts more constrained (e.g., “Answer with a single word”) over weakening assertions.
Prefer stable, high-signal checks (contains a specific entity like “Paris”) and allow a short allowlist of synonyms/casing when necessary.
Keep regressions meaningful: avoid assertions that only check “non-empty” unless that’s the contract being tested.
Validate both directions: a good test should fail if you intentionally break the prompt/model path.

See skills/llm-add-model-support/references/output-validation.md for concrete patterns used in this repo’s tests.

7) Update docs

Add the model to README.md “Supported Models” table (and link its license).
If there are new prerequisites or known runtime issues, update TROUBLESHOOTING.md.

References

Checklist: skills/llm-add-model-support/references/new-model-checklist.md
Backend notes: skills/llm-add-model-support/references/backend-notes.md
Output alignment: skills/llm-add-model-support/references/output-validation.md

arm-examples/llm-add-model-support

skills/llm-add-model-support/SKILL.md

Add or revise a model configuration JSON under model_configuration_files/ and ensure it is exercised by CTest (test/CMakeLists.txt) for the relevant backend; optionally onboard a new model end-to-end (downloads/sha256 entries, backend prompt/template quirks, tests, README updates). Use when onboarding a new model, updating supported models, or fixing model-related load/inference/test failures.

4 stars

development

Updated Apr 4, 2026

$ install --global

skillsauth

npx skillsauth add arm-examples/llm-runner llm-add-model-support

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 25, 2026, 9:42 PM1.8s1 file scanned

SKILL.md

name:: llm-add-model-support
description:: Add or revise a model configuration JSON under model_configuration_files/ and ensure it is exercised by CTest (test/CMakeLists.txt) for the relevant backend; optionally onboard a new model end-to-end (downloads/sha256 entries, backend prompt/template quirks, tests, README updates). Use when onboarding a new model, updating supported models, or fixing model-related load/inference/test failures.

Add model support

Windows note: if python3 isn’t available, use python (or py -3) for the scripts below.

If you only need “config + CTest wiring”

Use this path when you’re not onboarding new model assets or changing backend code, and only need to add/update a JSON config and ensure CI/test wiring covers it.

Add/update the config JSON under model_configuration_files/ (keep naming consistent for the target backend).
If you changed the config schema/keys, use skills/llm-config-schema-change/ for the parsing/tests/docs flow, then return here for config/test wiring.
Add the filename to the backend’s CONFIG_FILE_NAME list in test/CMakeLists.txt.
Verify it is listed:

python3 skills/llm-add-model-support/scripts/assert_config_listed.py <your-config.json>

If the config expects model assets to be downloaded automatically, add/update scripts/py/requirements.json (URLs + sha256sum) and keep downloads deterministic.

Workflow

1) Pick the target backend and model

Decide LLM_FRAMEWORK and a model identifier (folder name under resources_downloaded/models/<backend>/).
Check if the model is gated on Hugging Face; if yes, ensure HF_TOKEN or ~/.netrc is available for downloads.

2) Add/download the model assets (data-first)

Prefer extending the downloads manifest:

Add entries in scripts/py/requirements.json under models → <backend> → <model-name>.
Every entry must include url, destination, and sha256sum.
Use python3 skills/llm-add-model-support/scripts/sha256_file.py <file> to compute hashes for local files.

3) Add a model configuration JSON

Add model_configuration_files/<backend><Text|Vision>Config-<model>.json (follow existing naming patterns).
Ensure config fields have safe defaults and are parsed by src/cpp/config/ (LlmConfig.*).
If you introduce new config keys, update test/cpp/LlmConfigTest.cpp.

4) Handle backend-specific model quirks (code changes)

Use the minimal set of changes needed for the chosen backend:

Prompt template / chat formatting: update the backend’s query builder or chat adapter under src/cpp/frameworks/.
Tokenizer/model loading: ensure the backend can load the model file(s) referenced by the config.
Modalities: confirm SupportedInputModalities() matches the model (text-only vs vision).

5) Wire tests

Ensure CTest actually runs your config:

python3 skills/llm-add-model-support/scripts/assert_config_listed.py <your-config.json>

If it’s a new backend (or new modality), ensure test/CMakeLists.txt has the right branch and config list entries.

6) Validate outputs (build + run relevant tests)

cmake --preset=native -B build -DLLM_FRAMEWORK=<backend>
cmake --build ./build --parallel
ctest --test-dir ./build --output-on-failure

If JNI is not relevant to model support changes, isolate:

cmake --preset=native -B build -DLLM_FRAMEWORK=<backend> -DBUILD_JNI_LIB=OFF
cmake --build ./build --parallel
ctest --test-dir ./build --output-on-failure

If a new model causes output drift (tests fail but the model is “reasonable”), revise tests deliberately:

Prefer making prompts more constrained (e.g., “Answer with a single word”) over weakening assertions.
Prefer stable, high-signal checks (contains a specific entity like “Paris”) and allow a short allowlist of synonyms/casing when necessary.
Keep regressions meaningful: avoid assertions that only check “non-empty” unless that’s the contract being tested.
Validate both directions: a good test should fail if you intentionally break the prompt/model path.

See skills/llm-add-model-support/references/output-validation.md for concrete patterns used in this repo’s tests.

7) Update docs

Add the model to README.md “Supported Models” table (and link its license).
If there are new prerequisites or known runtime issues, update TROUBLESHOOTING.md.

References

Checklist: skills/llm-add-model-support/references/new-model-checklist.md
Backend notes: skills/llm-add-model-support/references/backend-notes.md
Output alignment: skills/llm-add-model-support/references/output-validation.md

Related Skills

arm-examples/llm-update-downloads

tools

VerifiedTrustedCommunity

Update scripts/py/requirements.json entries (URLs + sha256sum) for models/tools, validate hash changes, and keep downloads deterministic without committing artifacts. Use when adding or refreshing model/tool downloads.

4SKILL.mdUpdated Apr 4, 2026

arm-examples/llm-update-downloads

arm-examples/llm-session-start

tools

VerifiedTrustedCommunity

Run fast “session start / doctor” checks for this repository (toolchain + wiring sanity, framework version report, optional upstream update check), optionally generate a debug bundle, and when needed bump pinned backend framework versions with build+ctest verification. Use at session start or when upgrading llama.cpp/onnxruntime-genai/mediapipe/mnn pins.

4SKILL.mdUpdated Apr 4, 2026

arm-examples/llm-session-start

arm-examples/llm-jni-smoke

tools

VerifiedTrustedCommunity

Run a fast JNI-focused build/test smoke check (JNI on, minimal test run), and isolate JNI toolchain issues. Use when changing JNI/Java code or validating JNI setup.

4SKILL.mdUpdated Apr 4, 2026

arm-examples/llm-jni-smoke

arm-examples/llm-debug-test-failures

development

VerifiedTrustedCommunity

Debug failing LLM integration tests caused by model output drift, incorrect context/runtime parameters (contextSize, batchSize, threads), prompt/template mismatches, or backend/framework regressions. Use when tests fail and you need to see the model response, reproduce a single failing CTest, or trace issues into src/cpp/frameworks (llama.cpp, onnxruntime-genai, mediapipe, mnn).

4SKILL.mdUpdated Apr 4, 2026

arm-examples/llm-debug-test-failures

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/arm-examples/llm-runner.git

# Copy into Claude Code skills folder (global)
cp -r llm-runner/skills/llm-add-model-support ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

arm-examples/llm-runner

4 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT