Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

sundial-org/tinker-training-cost

Name: tinker-training-cost
Author: sundial-org

skills/tinker-training-cost/SKILL.md

npx skillsauth add sundial-org/skills tinker-training-cost

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Tinker Training Cost Calculator

Calculate training costs for Tinker fine-tuning jobs by tokenizing your dataset with the correct model tokenizer and applying current pricing.

Quick Start

Use the bundled script to calculate training costs:

# List available models and pricing
python scripts/calculate_cost.py --list-models

# Calculate cost for a JSONL dataset
python scripts/calculate_cost.py training_data.jsonl --model Qwen3-8B --epochs 3

# Output as JSON
python scripts/calculate_cost.py training_data.jsonl --model Llama-3.1-70B --json

The script:

Loads the correct tokenizer for the selected model
Counts tokens in your JSONL file (supports chat, text, and instruction formats)
Calculates the estimated training cost

Cost Formula

Training Cost = (total_tokens × epochs × train_price_per_million) / 1_000_000

Where:

total_tokens = tokens in your training dataset (from tokenization)
epochs = number of training passes (default: 3)
train_price_per_million = model-specific training rate from pricing table

Tinker Pricing

All prices as of January 5, 2026 Source: https://thinkingmachines.ai/tinker/

All prices are in USD per million tokens.

| Category | Description | |----------|-------------| | Prefill | Processing input context (inference) | | Sample | Generating output tokens (inference) | | Train | Training/fine-tuning tokens |

Qwen Models

| Model | Prefill | Sample | Train | |-------|---------|--------|-------| | Qwen3-4B-Instruct-2507 | $0.07 | $0.22 | $0.22 | | Qwen3-8B | $0.13 | $0.40 | $0.40 | | Qwen3-30B-A3B | $0.12 | $0.30 | $0.36 | | Qwen3-VL-30B-A3B-Instruct | $0.18 | $0.44 | $0.53 | | Qwen3-32B | $0.49 | $1.47 | $1.47 | | Qwen3-235B-Instruct-2507 | $0.68 | $1.70 | $2.04 | | Qwen3-VL-235B-A22B-Instruct | $1.02 | $2.56 | $3.07 |

Llama Models

| Model | Prefill | Sample | Train | |-------|---------|--------|-------| | Llama-3.2-1B | $0.03 | $0.09 | $0.09 | | Llama-3.2-3B | $0.06 | $0.18 | $0.18 | | Llama-3.1-8B | $0.13 | $0.40 | $0.40 | | Llama-3.1-70B | $1.05 | $3.16 | $3.16 |

DeepSeek Models

| Model | Prefill | Sample | Train | |-------|---------|--------|-------| | DeepSeek-V3.1 | $1.13 | $2.81 | $3.38 |

GPT-OSS Models

| Model | Prefill | Sample | Train | |-------|---------|--------|-------| | GPT-OSS-120B | $0.18 | $0.44 | $0.52 | | GPT-OSS-20B | $0.12 | $0.30 | $0.36 |

Moonshot Models

| Model | Prefill | Sample | Train | |-------|---------|--------|-------| | Kimi-K2-Thinking | $0.98 | $2.44 | $2.93 |

Model-to-Tokenizer Mapping

Use the correct HuggingFace tokenizer for accurate token counting:

| Model | HuggingFace Tokenizer | |-------|----------------------| | Qwen3-4B-Instruct-2507 | Qwen/Qwen3-4B | | Qwen3-8B | Qwen/Qwen3-8B | | Qwen3-30B-A3B | Qwen/Qwen3-30B-A3B | | Qwen3-32B | Qwen/Qwen3-32B | | Qwen3-235B-Instruct-2507 | Qwen/Qwen3-235B-A22B-Instruct | | Qwen3-VL-* | Qwen/Qwen2.5-VL-7B-Instruct (shared VL tokenizer) | | Llama-3.2-1B | meta-llama/Llama-3.2-1B-Instruct | | Llama-3.2-3B | meta-llama/Llama-3.2-3B-Instruct | | Llama-3.1-8B | meta-llama/Llama-3.1-8B-Instruct | | Llama-3.1-70B | meta-llama/Llama-3.1-70B-Instruct | | DeepSeek-V3.1 | deepseek-ai/DeepSeek-V3 | | GPT-OSS-* | Qwen/Qwen3-8B (compatible tokenizer) | | Kimi-K2-Thinking | moonshotai/Kimi-K2-Instruct |

Tokenization

The bundled scripts/calculate_cost.py handles tokenization automatically. For custom use:

from transformers import AutoTokenizer

# Load the correct tokenizer for your model
tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen3-8B", trust_remote_code=True)

# Count tokens
token_count = len(tokenizer.encode("Your training text here"))

Supported JSONL Formats

The script handles these training data formats:

Chat format (recommended):

{"messages": [{"role": "user", "content": "..."}, {"role": "assistant", "content": "..."}]}

Text format:

{"text": "Your training text here"}

Instruction format (Alpaca-style):

{"instruction": "...", "input": "...", "output": "..."}

Quick Cost Examples

Example 1: Qwen3-8B on 1M tokens, 3 epochs

Dataset tokens: 1,000,000
Training tokens: 1,000,000 × 3 = 3,000,000
Cost: 3.0M × $0.40/M = $1.20

Example 2: Llama-3.1-70B on 5M tokens, 2 epochs

Dataset tokens: 5,000,000
Training tokens: 5,000,000 × 2 = 10,000,000
Cost: 10.0M × $3.16/M = $31.60

Example 3: Qwen3-235B on 2M tokens, 4 epochs

Dataset tokens: 2,000,000
Training tokens: 2,000,000 × 4 = 8,000,000
Cost: 8.0M × $2.04/M = $16.32

Important Notes

LoRA Fine-Tuning: Tinker uses Low-Rank Adaptation (LoRA), not full fine-tuning
Token Counting: Always use the model's native tokenizer for accurate counts - different tokenizers produce different token counts for the same text
Vision Models: VL models have higher costs due to image processing overhead
trust_remote_code: Required for some tokenizers (Qwen, DeepSeek)

sundial-org/tinker-training-cost

skills/tinker-training-cost/SKILL.md

Calculate training costs for Tinker fine-tuning jobs. Use when estimating costs for Tinker LLM training, counting tokens in datasets, or comparing Tinker model training prices. Tokenizes datasets using the correct model tokenizer and provides accurate cost estimates.

148 stars

data-ai

Updated Apr 15, 2026

$ install --global

skillsauth

npx skillsauth add sundial-org/skills tinker-training-cost

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 15, 2026, 5:10 AM26.6s1 file scanned

SKILL.md

name:: tinker-training-cost
description:: Calculate training costs for Tinker fine-tuning jobs. Use when estimating costs for Tinker LLM training, counting tokens in datasets, or comparing Tinker model training prices. Tokenizes datasets using the correct model tokenizer and provides accurate cost estimates.

Tinker Training Cost Calculator

Calculate training costs for Tinker fine-tuning jobs by tokenizing your dataset with the correct model tokenizer and applying current pricing.

Quick Start

Use the bundled script to calculate training costs:

# List available models and pricing
python scripts/calculate_cost.py --list-models

# Calculate cost for a JSONL dataset
python scripts/calculate_cost.py training_data.jsonl --model Qwen3-8B --epochs 3

# Output as JSON
python scripts/calculate_cost.py training_data.jsonl --model Llama-3.1-70B --json

The script:

Loads the correct tokenizer for the selected model
Counts tokens in your JSONL file (supports chat, text, and instruction formats)
Calculates the estimated training cost

Cost Formula

Training Cost = (total_tokens × epochs × train_price_per_million) / 1_000_000

Where:

total_tokens = tokens in your training dataset (from tokenization)
epochs = number of training passes (default: 3)
train_price_per_million = model-specific training rate from pricing table

Tinker Pricing

All prices as of January 5, 2026 Source: https://thinkingmachines.ai/tinker/

All prices are in USD per million tokens.

Qwen Models

Llama Models

DeepSeek Models

| Model | Prefill | Sample | Train | |-------|---------|--------|-------| | DeepSeek-V3.1 | $1.13 | $2.81 | $3.38 |

GPT-OSS Models

| Model | Prefill | Sample | Train | |-------|---------|--------|-------| | GPT-OSS-120B | $0.18 | $0.44 | $0.52 | | GPT-OSS-20B | $0.12 | $0.30 | $0.36 |

Moonshot Models

| Model | Prefill | Sample | Train | |-------|---------|--------|-------| | Kimi-K2-Thinking | $0.98 | $2.44 | $2.93 |

Model-to-Tokenizer Mapping

Use the correct HuggingFace tokenizer for accurate token counting:

Tokenization

The bundled scripts/calculate_cost.py handles tokenization automatically. For custom use:

from transformers import AutoTokenizer

# Load the correct tokenizer for your model
tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen3-8B", trust_remote_code=True)

# Count tokens
token_count = len(tokenizer.encode("Your training text here"))

Supported JSONL Formats

The script handles these training data formats:

Chat format (recommended):

{"messages": [{"role": "user", "content": "..."}, {"role": "assistant", "content": "..."}]}

Text format:

{"text": "Your training text here"}

Instruction format (Alpaca-style):

{"instruction": "...", "input": "...", "output": "..."}

Quick Cost Examples

Example 1: Qwen3-8B on 1M tokens, 3 epochs

Dataset tokens: 1,000,000
Training tokens: 1,000,000 × 3 = 3,000,000
Cost: 3.0M × $0.40/M = $1.20

Example 2: Llama-3.1-70B on 5M tokens, 2 epochs

Dataset tokens: 5,000,000
Training tokens: 5,000,000 × 2 = 10,000,000
Cost: 10.0M × $3.16/M = $31.60

Example 3: Qwen3-235B on 2M tokens, 4 epochs

Dataset tokens: 2,000,000
Training tokens: 2,000,000 × 4 = 8,000,000
Cost: 8.0M × $2.04/M = $16.32

Important Notes

LoRA Fine-Tuning: Tinker uses Low-Rank Adaptation (LoRA), not full fine-tuning
Token Counting: Always use the model's native tokenizer for accurate counts - different tokenizers produce different token counts for the same text
Vision Models: VL models have higher costs due to image processing overhead
trust_remote_code: Required for some tokenizers (Qwen, DeepSeek)

Related Skills

sundial-org/cs448b-visualization

development

VerifiedTrustedCommunity

Data visualization design based on Stanford CS448B. Use for: (1) choosing chart types, (2) selecting visual encodings, (3) critiquing visualizations, (4) building D3.js visualizations, (5) designing interactions/animations, (6) choosing colors, (7) visualizing networks, (8) visualizing text. Covers Bertin, Mackinlay, Cleveland & McGill.

148SKILL.mdUpdated Apr 15, 2026

sundial-org/cs448b-visualization

sundial-org/training-data-curation

testing

VerifiedTrustedCommunity

Guidelines for creating high-quality datasets for LLM post-training (SFT/DPO/RLHF). Use when preparing data for fine-tuning, evaluating data quality, or designing data collection strategies.

148SKILL.mdUpdated Apr 15, 2026

sundial-org/training-data-curation

sundial-org/tinker

development

VerifiedTrustedCommunity

Fine-tune LLMs using the Tinker API. Covers supervised fine-tuning, reinforcement learning, LoRA training, vision-language models, and both high-level Cookbook patterns and low-level API usage.

148SKILL.mdUpdated Apr 15, 2026

sundial-org/skill

data-ai

VerifiedTrustedCommunity

Find, install, create, improve, and publish AI agent skills through the Sundial ecosystem. Use when the user wants to find or search for skills, install a skill, create a new skill, improve or evaluate an existing skill, or publish a skill to Sundial Hub. Trigger phrases include "find a skill", "install skill", "create a skill", "make a skill", "improve this skill", "evaluate skill", "publish skill", "push skill", "search for skills".

148SKILL.mdUpdated Apr 15, 2026

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/sundial-org/skills.git

# Copy into Claude Code skills folder (global)
cp -r skills/skills/tinker-training-cost ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

sundial-org/skills

148 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT