Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

antinomyhq/test-reasoning

Name: test-reasoning
Author: antinomyhq

.forge/skills/test-reasoning/SKILL.md

npx skillsauth add antinomyhq/forge test-reasoning

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Test Reasoning Serialization

Validates that ReasoningConfig fields are correctly serialized into provider-specific JSON for OpenRouter, Anthropic, GitHub Copilot, and Codex.

Quick Start

Run all tests with the bundled script:

./scripts/test-reasoning.sh

The script builds forge in debug mode, runs each provider/model combination, captures the outgoing HTTP request body via FORGE_DEBUG_REQUESTS, and asserts the correct JSON fields.

Running a Single Test Manually

FORGE_DEBUG_REQUESTS="forge.request.json" \
FORGE_SESSION__PROVIDER_ID=<provider_id> \
FORGE_SESSION__MODEL_ID=<model_id> \
FORGE_REASONING__EFFORT=<effort> \
target/debug/forge -p "Hello!"

Then inspect .forge/forge.request.json for the expected fields.

Test Coverage

| Provider | Model | Config fields | Expected JSON field | | ---------------- | ---------------------------- | ------------------------------------------------- | --------------------------------- | | open_router | openai/o4-mini | effort: none\|minimal\|low\|medium\|high\|xhigh | reasoning.effort | | open_router | openai/o4-mini | max_tokens: 4000 | reasoning.max_tokens | | open_router | openai/o4-mini | effort: high + exclude: true | reasoning.effort + .exclude | | open_router | openai/o4-mini | enabled: true | reasoning.enabled | | open_router | anthropic/claude-opus-4-5 | max_tokens: 4000 | reasoning.max_tokens | | open_router | moonshotai/kimi-k2 | max_tokens: 4000 | reasoning.max_tokens | | open_router | moonshotai/kimi-k2 | effort: high | reasoning.effort | | open_router | minimax/minimax-m2 | max_tokens: 4000 | reasoning.max_tokens | | open_router | minimax/minimax-m2 | effort: high | reasoning.effort | | anthropic | claude-opus-4-6 | effort: low\|medium\|high\|max | output_config.effort | | anthropic | claude-3-7-sonnet-20250219 | enabled: true + max_tokens: 8000 | thinking.type + budget_tokens | | github_copilot | o4-mini | effort: none\|minimal\|low\|medium\|high\|xhigh | reasoning_effort (top-level) | | codex | gpt-5.1-codex | effort: none\|minimal\|low\|medium\|high\|xhigh | reasoning.effort + .summary | | codex | gpt-5.1-codex | effort: medium + exclude: true | reasoning.summary = "concise" | | all providers | one model each | effort: invalid | non-zero exit, no request written |

Tests for unconfigured providers are skipped automatically. Invalid-effort tests run regardless of credentials — the rejection happens at config parse time before any provider interaction.

References

OpenAI Reasoning guide
OpenAI Chat Completions API reference
Anthropic Extended Thinking
OpenRouter Reasoning Tokens

antinomyhq/test-reasoning

.forge/skills/test-reasoning/SKILL.md

Validate that reasoning parameters are correctly serialized and sent to provider APIs. Use when the user asks to test reasoning serialization, run reasoning tests, verify reasoning config fields, or check that ReasoningConfig maps correctly to provider-specific JSON (OpenRouter, Anthropic, GitHub Copilot, Codex).

5,784 stars

development

Updated Apr 4, 2026

$ install --global

skillsauth

npx skillsauth add antinomyhq/forge test-reasoning

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 23, 2026, 1:36 AM1.7s1 file scanned

SKILL.md

name:: test-reasoning
description:: Validate that reasoning parameters are correctly serialized and sent to provider APIs. Use when the user asks to test reasoning serialization, run reasoning tests, verify reasoning config fields, or check that ReasoningConfig maps correctly to provider-specific JSON (OpenRouter, Anthropic, GitHub Copilot, Codex).

Test Reasoning Serialization

Validates that ReasoningConfig fields are correctly serialized into provider-specific JSON for OpenRouter, Anthropic, GitHub Copilot, and Codex.

Quick Start

Run all tests with the bundled script:

./scripts/test-reasoning.sh

The script builds forge in debug mode, runs each provider/model combination, captures the outgoing HTTP request body via FORGE_DEBUG_REQUESTS, and asserts the correct JSON fields.

Running a Single Test Manually

FORGE_DEBUG_REQUESTS="forge.request.json" \
FORGE_SESSION__PROVIDER_ID=<provider_id> \
FORGE_SESSION__MODEL_ID=<model_id> \
FORGE_REASONING__EFFORT=<effort> \
target/debug/forge -p "Hello!"

Then inspect .forge/forge.request.json for the expected fields.

Test Coverage

Tests for unconfigured providers are skipped automatically. Invalid-effort tests run regardless of credentials — the rejection happens at config parse time before any provider interaction.

References

OpenAI Reasoning guide
OpenAI Chat Completions API reference
Anthropic Extended Thinking
OpenRouter Reasoning Tokens

Related Skills

antinomyhq/resolve-fixme

development

VerifiedTrustedCommunity

Find all FIXME comments across the codebase and fully implement the work they describe. Use when the user asks to fix, resolve, or address FIXME comments, or when running the "fixme" command. Runs a discovery script to find every FIXME, expands multiline comment blocks, groups related FIXMEs across files into a single implementation task, completes the full underlying code changes, removes the FIXME comments only after the work is done, and verifies that no FIXMEs remain.

6,553SKILL.mdUpdated Apr 15, 2026

antinomyhq/resolve-fixme

antinomyhq/write-release-notes

testing

VerifiedTrustedCommunity

Generate engaging, high-energy release notes for a given version tag. Fetches the release from GitHub, retrieves every linked PR's title and description, then synthesizes all changes into a polished, user-facing release note with an enthusiastic tone. Use when the user asks to write, generate, or create release notes for a version (e.g. "write release notes for v1.32.0", "generate release notes for the latest release", "create changelog for v2.0").

6,553SKILL.mdUpdated Apr 4, 2026

antinomyhq/write-release-notes

antinomyhq/github-pr-description

tools

VerifiedTrustedCommunity

Generate and create pull request descriptions automatically using GitHub CLI. Use when the user asks to create a PR, generate a PR description, make a pull request, or submit changes for review. Analyzes git diff and commit history to create comprehensive, meaningful PR descriptions that explain what changed, why it matters, and how to test it.

5,784SKILL.mdUpdated Apr 4, 2026

antinomyhq/github-pr-description

antinomyhq/execute-plan

development

VerifiedTrustedCommunity

Execute structured task plans with status tracking. Use when the user provides a plan file path in the format `plans/{current-date}-{task-name}-{version}.md` or explicitly asks you to execute a plan file.

5,784SKILL.mdUpdated Apr 4, 2026

antinomyhq/execute-plan

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/antinomyhq/forge.git

# Copy into Claude Code skills folder (global)
cp -r forge/.forge/skills/test-reasoning ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

antinomyhq/forge

5,784 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT