Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

nsheaps/data-serialization

Name: data-serialization
Author: nsheaps

plugins/data-serialization/skills/data-serialization/SKILL.md

npx skillsauth add nsheaps/ai-mktpl data-serialization

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Data Serialization Skill

You are a data transformation specialist. Your job is to help convert between data formats and query structured data efficiently, with a focus on token efficiency for LLM contexts.

Supported Formats

| Format | Extension | Best For | Tools | | -------- | --------------- | ------------------------------------------- | -------------------- | | JSON | .json | Data interchange, APIs, tooling | jq | | YAML | .yaml, .yml | Human editing, config files | yq | | TOON | .toon | LLM prompts (30-60% fewer tokens) | Python toon_format | | XML | .xml | Legacy systems, SOAP, external requirements | xmllint | | HTML | .html | Web content conversion | Python xmltodict |

When to Use Each Format

JSON

Pros:

Universal support across all languages
Strict syntax reduces ambiguity
Best tooling ecosystem (jq)
Required by most APIs

Cons:

No comments allowed
Verbose for human editing
No multi-line strings (must escape)
~40% of tokens are formatting (braces, quotes, commas)

Use when: API responses, data interchange, tool input/output

YAML

Pros:

Human-readable and editable
Supports comments
Multi-line strings with | or >
Anchors and aliases for DRY configs

Cons:

Significant whitespace (indentation matters)
Security concerns with arbitrary YAML (code execution possible)
Multiple ways to represent same data

Use when: Config files that humans edit, CI/CD pipelines, Kubernetes

Security Note: Never load untrusted YAML with yaml.load() - use yaml.safe_load()

TOON (Token-Oriented Object Notation)

Pros:

30-60% token reduction vs JSON
Lossless conversion to/from JSON
YAML-like indentation (human-readable)
CSV-style tabular arrays (compact)
Explicit [N] lengths and {fields} headers help LLMs parse reliably
Improves LLM accuracy (73.9% vs JSON's 69.7% in benchmarks)

Cons:

Less tooling than JSON/YAML (newer format)
Less efficient for deeply nested, non-uniform data
Requires Python library for conversion

Use when:

Sending structured data to LLMs
Reducing API costs (fewer input tokens)
Maximizing context window usage
Tabular data with uniform structure

Best for: Uniform arrays of objects (same fields across items)

XML/HTML

Pros:

Schema validation (XSD)
Namespaces for complex documents
XPath for powerful querying
Required by many enterprise systems

Cons:

Verbose syntax
Complex to parse and generate
Falling out of favor for new projects

Use when: SOAP APIs, enterprise integrations, document formats (Office, SVG)

TOON Format Guide

Basic Syntax

Objects use key-value pairs with colon-space separation:

name: Alice
age: 30
active: true

Nested objects use indentation:

user:
  id: 123
  profile:
    role: admin

Primitive arrays (inline):

tags[3]: admin,ops,dev

Tabular arrays (uniform objects - TOON's sweet spot):

users[2]{id,name,role}:
 1,Alice,admin
 2,Bob,user

Expanded lists (mixed types):

tasks[2]:
 - Complete report
 - Review code

TOON Token Savings Example

JSON (257 tokens):

{
  "users": [
    { "id": 1, "name": "Alice", "role": "admin" },
    { "id": 2, "name": "Bob", "role": "user" },
    { "id": 3, "name": "Carol", "role": "guest" }
  ]
}

TOON (166 tokens - 35% reduction):

users[3]{id,name,role}:
 1,Alice,admin
 2,Bob,user
 3,Carol,guest

When NOT to Use TOON

Deeply nested hierarchies: JSON may be more compact
Non-uniform data: Mixed object shapes reduce efficiency
Flat tabular data: CSV is more compact (no TOON metadata)

Conversion Guide

Using the convert.sh Script

The plugin provides a convert.sh script for format conversion:

# Basic usage
/path/to/plugins/data-serialization/scripts/convert.sh <input-file> <output-format>

# Examples
convert.sh data.json yaml      # JSON to YAML
convert.sh config.yaml toon    # YAML to TOON
convert.sh data.toon json      # TOON to JSON
convert.sh data.xml json       # XML to JSON
convert.sh page.html yaml      # HTML to YAML

# With explicit source format (if auto-detect fails)
convert.sh data.txt json --from yaml

# Playwright accessibility snapshot conversion
convert.sh playwright-snapshot.md json --playwright
convert.sh playwright-snapshot.md toon --playwright

Manual Conversion Commands

JSON to YAML:

yq -P '.' input.json > output.yaml

YAML to JSON:

yq -o=json '.' input.yaml > output.json

JSON to TOON:

python3 -c "
from toon_format import encode
import json
with open('input.json') as f:
    data = json.load(f)
print(encode(data))
" > output.toon

TOON to JSON:

python3 -c "
from toon_format import decode
import json
with open('input.toon') as f:
    data = decode(f.read())
print(json.dumps(data, indent=2))
" > output.json

XML to JSON:

python3 -c "
import json, xmltodict
with open('input.xml') as f:
    data = xmltodict.parse(f.read())
print(json.dumps(data, indent=2))
" > output.json

HTML to JSON:

python3 -c "
import json, xmltodict
from html.parser import HTMLParser
with open('input.html') as f:
    # Parse HTML as XML-like structure
    data = xmltodict.parse(f.read())
print(json.dumps(data, indent=2))
" > output.json

Querying Data

jq for JSON

Basic selection:

# Get a field
jq '.fieldName' data.json

# Get nested field
jq '.parent.child' data.json

# Get array element
jq '.[0]' data.json
jq '.items[0]' data.json

Filtering:

# Select objects matching condition
jq '.[] | select(.status == "active")' data.json

# Multiple conditions
jq '.[] | select(.age > 18 and .country == "US")' data.json

# Null-safe filtering
jq '.[] | select(.optional // empty)' data.json

Transformation:

# Extract specific fields
jq '.[] | {name, email}' data.json

# Rename fields
jq '.[] | {userName: .name, userEmail: .email}' data.json

# Create arrays
jq '[.[] | .name]' data.json

Aggregation:

# Count items
jq 'length' data.json
jq '[.[] | select(.active)] | length' data.json

# Sum values
jq '[.[] | .price] | add' data.json

# Group by field
jq 'group_by(.category)' data.json

yq for YAML

yq uses the same syntax as jq:

# Basic queries work the same
yq '.fieldName' data.yaml
yq '.[] | select(.status == "active")' data.yaml

# Output as JSON
yq -o=json '.' data.yaml

# Edit in place
yq -i '.version = "2.0"' data.yaml

XPath for XML

Using xmllint:

# Get element text
xmllint --xpath '//element/text()' data.xml

# Get attribute
xmllint --xpath 'string(//element/@attr)' data.xml

# Count elements
xmllint --xpath 'count(//item)' data.xml

# Get multiple elements
xmllint --xpath '//item/name/text()' data.xml

Querying TOON

For TOON, convert to JSON first, then use jq:

python3 -c "
from toon_format import decode
import json
with open('data.toon') as f:
    print(json.dumps(decode(f.read())))
" | jq '.users[] | select(.role == "admin")'

Playwright Accessibility Snapshots

Playwright MCP returns accessibility snapshots in a YAML-like format. These can be converted for querying.

Snapshot Format

Playwright snapshots look like:

- button "Submit" [ref=s1e2]
- link "Home" [ref=s1e3]
  - text "Home"
- textbox "Email" [ref=s1e4]

Converting Snapshots

Use the convert script with --playwright flag:

convert.sh snapshot.md json --playwright
convert.sh snapshot.md toon --playwright  # For token efficiency

This produces queryable JSON:

[
  { "role": "button", "name": "Submit", "ref": "s1e2" },
  {
    "role": "link",
    "name": "Home",
    "ref": "s1e3",
    "children": [{ "role": "text", "name": "Home" }]
  },
  { "role": "textbox", "name": "Email", "ref": "s1e4" }
]

Querying Playwright Data

Find all buttons:

jq '[.. | objects | select(.role == "button")]' snapshot.json

Find element by name:

jq '.. | objects | select(.name == "Submit")' snapshot.json

Get all refs:

jq '[.. | objects | .ref // empty]' snapshot.json

Find clickable elements:

jq '[.. | objects | select(.role == "button" or .role == "link")]' snapshot.json

Best Practices

Use TOON for LLM input - 30-60% token savings on structured data
Save API/tool output to files first before querying
Use JSON as intermediate format when converting between formats
Validate before converting - malformed input causes cryptic errors
Preserve original files during conversion experiments
For deeply nested data, stick with JSON (TOON overhead increases)

Tool Installation

Ensure these tools are available:

# macOS
brew install jq yq

# Python dependencies (for TOON and XML)
pip install git+https://github.com/toon-format/toon-python.git xmltodict dicttoxml

# Verify
jq --version
yq --version
python3 -c "from toon_format import encode; print('toon_format OK')"

References

TOON Specification - Official format spec
TOON Python Library - Python implementation
jq Manual - JSON query language
yq Documentation - YAML processor
XPath Reference - XML query language
Playwright Accessibility - Accessibility snapshots

Plugin Location

This skill is part of the data-serialization plugin.

Sources:

GitHub: https://github.com/nsheaps/ai-mktpl
Local Path: ~/src/nsheaps/ai/plugins/data-serialization

nsheaps/data-serialization

plugins/data-serialization/skills/data-serialization/SKILL.md

Data format conversion and querying utilities for YAML, JSON, TOON, and XML/HTML. Includes special handling for Playwright accessibility snapshots and comprehensive querying guidance using jq, yq, and native tools. TOON provides 30-60% token reduction for LLM contexts.

1 stars

tools

Updated Apr 22, 2026

$ install --global

skillsauth

npx skillsauth add nsheaps/ai-mktpl data-serialization

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 22, 2026, 4:08 PM33.7s1 file scanned

SKILL.md

name:: data-serialization
description:: >
allowed-tools:: Read, Edit, Write, Bash, Glob, Grep

Data Serialization Skill

You are a data transformation specialist. Your job is to help convert between data formats and query structured data efficiently, with a focus on token efficiency for LLM contexts.

Supported Formats

When to Use Each Format

JSON

Pros:

Universal support across all languages
Strict syntax reduces ambiguity
Best tooling ecosystem (jq)
Required by most APIs

Cons:

No comments allowed
Verbose for human editing
No multi-line strings (must escape)
~40% of tokens are formatting (braces, quotes, commas)

Use when: API responses, data interchange, tool input/output

YAML

Pros:

Human-readable and editable
Supports comments
Multi-line strings with | or >
Anchors and aliases for DRY configs

Cons:

Significant whitespace (indentation matters)
Security concerns with arbitrary YAML (code execution possible)
Multiple ways to represent same data

Use when: Config files that humans edit, CI/CD pipelines, Kubernetes

Security Note: Never load untrusted YAML with yaml.load() - use yaml.safe_load()

TOON (Token-Oriented Object Notation)

Pros:

30-60% token reduction vs JSON
Lossless conversion to/from JSON
YAML-like indentation (human-readable)
CSV-style tabular arrays (compact)
Explicit [N] lengths and {fields} headers help LLMs parse reliably
Improves LLM accuracy (73.9% vs JSON's 69.7% in benchmarks)

Cons:

Less tooling than JSON/YAML (newer format)
Less efficient for deeply nested, non-uniform data
Requires Python library for conversion

Use when:

Sending structured data to LLMs
Reducing API costs (fewer input tokens)
Maximizing context window usage
Tabular data with uniform structure

Best for: Uniform arrays of objects (same fields across items)

XML/HTML

Pros:

Schema validation (XSD)
Namespaces for complex documents
XPath for powerful querying
Required by many enterprise systems

Cons:

Verbose syntax
Complex to parse and generate
Falling out of favor for new projects

Use when: SOAP APIs, enterprise integrations, document formats (Office, SVG)

TOON Format Guide

Basic Syntax

Objects use key-value pairs with colon-space separation:

name: Alice
age: 30
active: true

Nested objects use indentation:

user:
  id: 123
  profile:
    role: admin

Primitive arrays (inline):

tags[3]: admin,ops,dev

Tabular arrays (uniform objects - TOON's sweet spot):

users[2]{id,name,role}:
 1,Alice,admin
 2,Bob,user

Expanded lists (mixed types):

tasks[2]:
 - Complete report
 - Review code

TOON Token Savings Example

JSON (257 tokens):

{
  "users": [
    { "id": 1, "name": "Alice", "role": "admin" },
    { "id": 2, "name": "Bob", "role": "user" },
    { "id": 3, "name": "Carol", "role": "guest" }
  ]
}

TOON (166 tokens - 35% reduction):

users[3]{id,name,role}:
 1,Alice,admin
 2,Bob,user
 3,Carol,guest

When NOT to Use TOON

Deeply nested hierarchies: JSON may be more compact
Non-uniform data: Mixed object shapes reduce efficiency
Flat tabular data: CSV is more compact (no TOON metadata)

Conversion Guide

Using the convert.sh Script

The plugin provides a convert.sh script for format conversion:

# Basic usage
/path/to/plugins/data-serialization/scripts/convert.sh <input-file> <output-format>

# Examples
convert.sh data.json yaml      # JSON to YAML
convert.sh config.yaml toon    # YAML to TOON
convert.sh data.toon json      # TOON to JSON
convert.sh data.xml json       # XML to JSON
convert.sh page.html yaml      # HTML to YAML

# With explicit source format (if auto-detect fails)
convert.sh data.txt json --from yaml

# Playwright accessibility snapshot conversion
convert.sh playwright-snapshot.md json --playwright
convert.sh playwright-snapshot.md toon --playwright

Manual Conversion Commands

JSON to YAML:

yq -P '.' input.json > output.yaml

YAML to JSON:

yq -o=json '.' input.yaml > output.json

JSON to TOON:

python3 -c "
from toon_format import encode
import json
with open('input.json') as f:
    data = json.load(f)
print(encode(data))
" > output.toon

TOON to JSON:

python3 -c "
from toon_format import decode
import json
with open('input.toon') as f:
    data = decode(f.read())
print(json.dumps(data, indent=2))
" > output.json

XML to JSON:

python3 -c "
import json, xmltodict
with open('input.xml') as f:
    data = xmltodict.parse(f.read())
print(json.dumps(data, indent=2))
" > output.json

HTML to JSON:

python3 -c "
import json, xmltodict
from html.parser import HTMLParser
with open('input.html') as f:
    # Parse HTML as XML-like structure
    data = xmltodict.parse(f.read())
print(json.dumps(data, indent=2))
" > output.json

Querying Data

jq for JSON

Basic selection:

# Get a field
jq '.fieldName' data.json

# Get nested field
jq '.parent.child' data.json

# Get array element
jq '.[0]' data.json
jq '.items[0]' data.json

Filtering:

# Select objects matching condition
jq '.[] | select(.status == "active")' data.json

# Multiple conditions
jq '.[] | select(.age > 18 and .country == "US")' data.json

# Null-safe filtering
jq '.[] | select(.optional // empty)' data.json

Transformation:

# Extract specific fields
jq '.[] | {name, email}' data.json

# Rename fields
jq '.[] | {userName: .name, userEmail: .email}' data.json

# Create arrays
jq '[.[] | .name]' data.json

Aggregation:

# Count items
jq 'length' data.json
jq '[.[] | select(.active)] | length' data.json

# Sum values
jq '[.[] | .price] | add' data.json

# Group by field
jq 'group_by(.category)' data.json

yq for YAML

yq uses the same syntax as jq:

# Basic queries work the same
yq '.fieldName' data.yaml
yq '.[] | select(.status == "active")' data.yaml

# Output as JSON
yq -o=json '.' data.yaml

# Edit in place
yq -i '.version = "2.0"' data.yaml

XPath for XML

Using xmllint:

# Get element text
xmllint --xpath '//element/text()' data.xml

# Get attribute
xmllint --xpath 'string(//element/@attr)' data.xml

# Count elements
xmllint --xpath 'count(//item)' data.xml

# Get multiple elements
xmllint --xpath '//item/name/text()' data.xml

Querying TOON

For TOON, convert to JSON first, then use jq:

python3 -c "
from toon_format import decode
import json
with open('data.toon') as f:
    print(json.dumps(decode(f.read())))
" | jq '.users[] | select(.role == "admin")'

Playwright Accessibility Snapshots

Playwright MCP returns accessibility snapshots in a YAML-like format. These can be converted for querying.

Snapshot Format

Playwright snapshots look like:

- button "Submit" [ref=s1e2]
- link "Home" [ref=s1e3]
  - text "Home"
- textbox "Email" [ref=s1e4]

Converting Snapshots

Use the convert script with --playwright flag:

convert.sh snapshot.md json --playwright
convert.sh snapshot.md toon --playwright  # For token efficiency

This produces queryable JSON:

[
  { "role": "button", "name": "Submit", "ref": "s1e2" },
  {
    "role": "link",
    "name": "Home",
    "ref": "s1e3",
    "children": [{ "role": "text", "name": "Home" }]
  },
  { "role": "textbox", "name": "Email", "ref": "s1e4" }
]

Querying Playwright Data

Find all buttons:

jq '[.. | objects | select(.role == "button")]' snapshot.json

Find element by name:

jq '.. | objects | select(.name == "Submit")' snapshot.json

Get all refs:

jq '[.. | objects | .ref // empty]' snapshot.json

Find clickable elements:

jq '[.. | objects | select(.role == "button" or .role == "link")]' snapshot.json

Best Practices

Use TOON for LLM input - 30-60% token savings on structured data
Save API/tool output to files first before querying
Use JSON as intermediate format when converting between formats
Validate before converting - malformed input causes cryptic errors
Preserve original files during conversion experiments
For deeply nested data, stick with JSON (TOON overhead increases)

Tool Installation

Ensure these tools are available:

# macOS
brew install jq yq

# Python dependencies (for TOON and XML)
pip install git+https://github.com/toon-format/toon-python.git xmltodict dicttoxml

# Verify
jq --version
yq --version
python3 -c "from toon_format import encode; print('toon_format OK')"

References

TOON Specification - Official format spec
TOON Python Library - Python implementation
jq Manual - JSON query language
yq Documentation - YAML processor
XPath Reference - XML query language
Playwright Accessibility - Accessibility snapshots

Plugin Location

This skill is part of the data-serialization plugin.

Sources:

GitHub: https://github.com/nsheaps/ai-mktpl
Local Path: ~/src/nsheaps/ai/plugins/data-serialization

Related Skills

nsheaps/github-app-session-env

tools

VerifiedTrustedCommunity

Manually reproduce what the github-app plugin's SessionStart hook does to make a GitHub App installation token usable in the current session — materialize the PEM, generate the token, isolate GH_CONFIG_DIR, write the runtime env file, and wire CLAUDE_ENV_FILE so every Bash call sees GH_TOKEN/GITHUB_TOKEN. Use when the hook did not run, the token is missing from the environment, or a shell/teammate needs the token wired up by hand. <example>GH_TOKEN isn't set even though github-app is configured</example> <example>the github-app SessionStart hook didn't run, set up the token manually</example> <example>wire the github app token into CLAUDE_ENV_FILE</example> <example>gh keeps falling back to the wrong account, isolate GH_CONFIG_DIR</example>

3SKILL.mdUpdated Jun 9, 2026

nsheaps/github-app-session-env

nsheaps/github-app-git-identity

tools

VerifiedTrustedCommunity

Manually configure the GitHub App bot git identity the way the github-app plugin's SessionStart hook does — resolve the app slug and bot user ID, build the <slug>[bot] name and noreply email, set GIT_AUTHOR_*/GIT_COMMITTER_* env vars, and write an isolated GIT_CONFIG_GLOBAL with the gh auth git-credential helper. Use when commits are attributed to the wrong account, "Author identity unknown" appears, or git identity must be set up by hand. <example>my commits are showing up as the handler, not the bot</example> <example>git says Author identity unknown after the github-app hook ran</example> <example>configure the github app bot git identity manually</example> <example>set up the gh credential helper for git push</example>

3SKILL.mdUpdated Jun 9, 2026

nsheaps/github-app-git-identity

nsheaps/spec-management

tools

VerifiedTrustedCommunity

Manages spec files for requirements capture and validation

3SKILL.mdUpdated Jun 7, 2026

nsheaps/spec-management

nsheaps/plugins/bash-command-rejection/skills/bash-chaining-alternatives

tools

VerifiedTrustedCommunity

# Bash Chaining Alternatives This skill teaches you how to work around the bash command chaining restriction enforced by this plugin. ## Why Chaining is Blocked The `bash-command-rejection` plugin blocks these operators: | Operator | Name | Why Blocked | | -------- | ---------- | ----------------------------------------------------------------------------------- | | `&&` | AND chain | Runs cmd2 only if cmd1 su

3SKILL.mdUpdated Jun 7, 2026

nsheaps/plugins/bash-command-rejection/skills/bash-chaining-alternatives

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/nsheaps/ai-mktpl.git

# Copy into Claude Code skills folder (global)
cp -r ai-mktpl/plugins/data-serialization/skills/data-serialization ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

nsheaps/ai-mktpl

1 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT