Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

lyndonkl/transaction-categorizer

Name: transaction-categorizer
Author: lyndonkl

skills/transaction-categorizer/SKILL.md

npx skillsauth add lyndonkl/claude transaction-categorizer

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Transaction Categorizer

Overview
Input contract
Workflow
Taxonomy
Rule format
Confidence and audit
Output contract
Guardrails

Overview

A transaction is just a description_raw string and a signed amount_cents. This skill turns that into a clean merchant, a category + subcategory, an is_recurring boolean candidate, and a confidence. It applies rules first (cheap, deterministic) and only falls back to LLM inference for the residual.

It also produces learned rules — when a confident classification matches a clear merchant pattern, propose a new rule for the rule table so future identical transactions match for free.

Input contract

The caller provides:

transactions — array of {id, description_raw, amount_cents, account_id, date, account_type}.
taxonomy — the categories.json taxonomy block (top-level → subcategory list).
rules — array of existing rules (see Rule format).
account_type_hints (optional) — when known, helps disambiguate (e.g., a deposit on a brokerage account is more likely dividends than salary).

Workflow

Categorization Progress:
- [ ] Step 1: Normalize description_raw
- [ ] Step 2: Apply rules in priority order
- [ ] Step 3: Classify residual via LLM with taxonomy guard
- [ ] Step 4: Detect recurring candidates
- [ ] Step 5: Score confidence
- [ ] Step 6: Propose new rules from high-confidence matches

Step 1 — Normalize

Build a description_normalized for matching only — never overwrite description_raw.

Uppercase.
Strip leading vendor codes (SQ *, TST*, PAYPAL *, CKCD, POS DEBIT, ACH DEBIT).
Strip trailing geo ( PORTLAND OR, 800-555-1234 CA, #1234).
Collapse multiple spaces.
Drop date numerics inside the description.

SQ *TRADER JOES #123 PORTLAND OR → TRADER JOES.

Step 2 — Apply rules

For each transaction, walk rules in priority order. First rule whose match substring (case-insensitive) is a substring of description_normalized wins. Apply its merchant, category, subcategory, and is_recurring (if set).

If multiple rules match, the most specific (longest match) wins.

If a rule matches, set source: "rule" and confidence: 1.0.

Step 3 — Classify residual

For unmatched transactions, classify via LLM:

Constrain output to the supplied taxonomy — never invent a category.
Pick category.subcategory (e.g., food.groceries).
Propose a clean merchant name.
If sign and account_type imply income, prefer the income.* branch.
If account_type is brokerage or 401k and amount is positive, prefer income.dividends, income.interest_earned, or savings_investment.*.
If description_raw looks like an internal transfer between two of the user's accounts, classify as financial.transfers_internal.

Set source: "llm" and confidence: 0.6–0.9 based on signal strength.

Step 4 — Detect recurring candidates

Set is_recurring: true candidate if:

The merchant has been seen ≥ 3 times in the last 90 days on the same account, with amount within ±10%, at a regular cadence (weekly, biweekly, monthly, quarterly).
OR a matched rule explicitly set is_recurring: true.

This is a candidate — promotion to recurring.json is the recurring-charge-detector skill's job.

Step 5 — Confidence

| Source | Default confidence | |---|---| | Rule match (substring length ≥ 8) | 1.00 | | Rule match (substring length 4–7) | 0.92 | | LLM with strong taxonomic signal (e.g., "NETFLIX" → entertainment.streaming) | 0.85 | | LLM with weak signal | 0.65 | | Cannot classify above uncategorized | 0.30 |

If confidence < 0.5, mark category: "uncategorized.unknown" and flag for review.

Step 6 — Propose new rules

After classification, scan high-confidence LLM matches (confidence ≥ 0.85) where the same description_normalized substring covers ≥ 3 transactions in the input set. For each, propose a new rule and append to rules.proposed[] in the output. The bookkeeper agent confirms these before they merge into categories.json.

Taxonomy

The skill respects the taxonomy supplied by the caller. The default taxonomy used by the household-finance team is:

housing → mortgage, rent, property_tax, hoa, home_insurance, home_maintenance,
          utilities_electric, utilities_gas, utilities_water, utilities_internet
food → groceries, restaurants, coffee, alcohol
transportation → gas, auto_insurance, auto_maintenance, public_transit, rideshare,
                 parking, tolls
health → medical_copay, prescriptions, dental, vision, mental_health, gym
personal → clothing, haircare, subscriptions_personal
kids → childcare, school, activities, kids_clothing
entertainment → streaming, events, hobbies, books
travel → flights, lodging, travel_food, travel_other
financial → fees, interest_paid, transfers_internal
income → salary, bonus, interest_earned, dividends, capital_gains, refund, other_income
savings_investment → 401k_contribution, ira_contribution, hsa_contribution,
                     brokerage_deposit, savings_deposit
uncategorized → unknown

Never invent a category. If a transaction does not fit, use uncategorized.unknown and emit a taxonomy_gap warning.

Rule format

{
  "match": "TRADER JOE",
  "merchant": "Trader Joe's",
  "category": "food",
  "subcategory": "groceries",
  "is_recurring": false,
  "priority": 100,
  "added_on": "2026-01-20",
  "source": "user_confirmed | learned"
}

Higher priority values win ties. Rules added by humans default to priority 200; rules learned by this skill default to 100.

Confidence and audit

Every output transaction carries:

category and subcategory — must be in taxonomy.
merchant — clean display name.
confidence — [0.0, 1.0].
source — rule | llm | uncategorized.
matched_rule_id (if source: rule).

Never overwrite description_raw; always preserve it for re-classification.

Output contract

{
  "categorized": [
    {
      "id": "tx_20260115_001",
      "merchant": "Trader Joe's",
      "category": "food",
      "subcategory": "groceries",
      "is_recurring_candidate": false,
      "confidence": 1.0,
      "source": "rule",
      "matched_rule_id": "rule_trader_joes"
    }
  ],
  "rules_proposed": [
    {
      "match": "BLUE BOTTLE",
      "merchant": "Blue Bottle Coffee",
      "category": "food",
      "subcategory": "coffee",
      "evidence_count": 4,
      "evidence_tx_ids": ["tx_20260103_004", "tx_20260110_002", "tx_20260117_007", "tx_20260124_001"]
    }
  ],
  "warnings": [
    { "tx_id": "tx_20260118_009", "type": "taxonomy_gap", "description_raw": "ZELLE TO M COPPENS" }
  ],
  "summary": {
    "total": 142,
    "rule_matched": 118,
    "llm_classified": 22,
    "uncategorized": 2,
    "uncategorized_pct": 1.4
  }
}

Guardrails

Preserve description_raw byte-for-byte. Normalization is for matching only.
Never invent categories. Stay within the supplied taxonomy.
Account-type aware. A "deposit" on a brokerage account is not salary; a "withdrawal" on a savings account is likely a transfer, not spending.
Internal transfers must net to zero across accounts. If a financial.transfers_internal is classified on one side, the matching opposite-sign transaction on the other account should also be transfers_internal — flag if not.
Uncategorized rate above 5% of new transactions in a batch is a quality signal; surface it in the summary.
No PII in proposed rules. A rule like match: "ZELLE TO JOHN SMITH" exposes a name; redact or skip such proposals.

lyndonkl/transaction-categorizer

skills/transaction-categorizer/SKILL.md

Assigns a category and subcategory to a financial transaction by matching its raw description against a configurable taxonomy and rules table, falling back to LLM inference when no rule matches. Emits a normalized merchant name, category path, recurring flag, and confidence score, and proposes new rules from confirmed classifications. Use when categorizing bank, credit-card, or brokerage transactions, building or refining a category taxonomy, or when user mentions transaction categorization, merchant normalization, expense classification, or category rules.

85 stars

development

Updated Apr 26, 2026

$ install --global

skillsauth

npx skillsauth add lyndonkl/claude transaction-categorizer

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 26, 2026, 1:10 PM41.7s1 file scanned

SKILL.md

name:: transaction-categorizer
description:: Assigns a category and subcategory to a financial transaction by matching its raw description against a configurable taxonomy and rules table, falling back to LLM inference when no rule matches. Emits a normalized merchant name, category path, recurring flag, and confidence score, and proposes new rules from confirmed classifications. Use when categorizing bank, credit-card, or brokerage transactions, building or refining a category taxonomy, or when user mentions transaction categorization, merchant normalization, expense classification, or category rules.

Transaction Categorizer

Overview
Input contract
Workflow
Taxonomy
Rule format
Confidence and audit
Output contract
Guardrails

Overview

It also produces learned rules — when a confident classification matches a clear merchant pattern, propose a new rule for the rule table so future identical transactions match for free.

Input contract

The caller provides:

transactions — array of {id, description_raw, amount_cents, account_id, date, account_type}.
taxonomy — the categories.json taxonomy block (top-level → subcategory list).
rules — array of existing rules (see Rule format).
account_type_hints (optional) — when known, helps disambiguate (e.g., a deposit on a brokerage account is more likely dividends than salary).

Workflow

Categorization Progress:
- [ ] Step 1: Normalize description_raw
- [ ] Step 2: Apply rules in priority order
- [ ] Step 3: Classify residual via LLM with taxonomy guard
- [ ] Step 4: Detect recurring candidates
- [ ] Step 5: Score confidence
- [ ] Step 6: Propose new rules from high-confidence matches

Step 1 — Normalize

Build a description_normalized for matching only — never overwrite description_raw.

Uppercase.
Strip leading vendor codes (SQ *, TST*, PAYPAL *, CKCD, POS DEBIT, ACH DEBIT).
Strip trailing geo ( PORTLAND OR, 800-555-1234 CA, #1234).
Collapse multiple spaces.
Drop date numerics inside the description.

SQ *TRADER JOES #123 PORTLAND OR → TRADER JOES.

Step 2 — Apply rules

If multiple rules match, the most specific (longest match) wins.

If a rule matches, set source: "rule" and confidence: 1.0.

Step 3 — Classify residual

For unmatched transactions, classify via LLM:

Constrain output to the supplied taxonomy — never invent a category.
Pick category.subcategory (e.g., food.groceries).
Propose a clean merchant name.
If sign and account_type imply income, prefer the income.* branch.
If account_type is brokerage or 401k and amount is positive, prefer income.dividends, income.interest_earned, or savings_investment.*.
If description_raw looks like an internal transfer between two of the user's accounts, classify as financial.transfers_internal.

Set source: "llm" and confidence: 0.6–0.9 based on signal strength.

Step 4 — Detect recurring candidates

Set is_recurring: true candidate if:

The merchant has been seen ≥ 3 times in the last 90 days on the same account, with amount within ±10%, at a regular cadence (weekly, biweekly, monthly, quarterly).
OR a matched rule explicitly set is_recurring: true.

This is a candidate — promotion to recurring.json is the recurring-charge-detector skill's job.

Step 5 — Confidence

If confidence < 0.5, mark category: "uncategorized.unknown" and flag for review.

Step 6 — Propose new rules

Taxonomy

The skill respects the taxonomy supplied by the caller. The default taxonomy used by the household-finance team is:

housing → mortgage, rent, property_tax, hoa, home_insurance, home_maintenance,
          utilities_electric, utilities_gas, utilities_water, utilities_internet
food → groceries, restaurants, coffee, alcohol
transportation → gas, auto_insurance, auto_maintenance, public_transit, rideshare,
                 parking, tolls
health → medical_copay, prescriptions, dental, vision, mental_health, gym
personal → clothing, haircare, subscriptions_personal
kids → childcare, school, activities, kids_clothing
entertainment → streaming, events, hobbies, books
travel → flights, lodging, travel_food, travel_other
financial → fees, interest_paid, transfers_internal
income → salary, bonus, interest_earned, dividends, capital_gains, refund, other_income
savings_investment → 401k_contribution, ira_contribution, hsa_contribution,
                     brokerage_deposit, savings_deposit
uncategorized → unknown

Never invent a category. If a transaction does not fit, use uncategorized.unknown and emit a taxonomy_gap warning.

Rule format

{
  "match": "TRADER JOE",
  "merchant": "Trader Joe's",
  "category": "food",
  "subcategory": "groceries",
  "is_recurring": false,
  "priority": 100,
  "added_on": "2026-01-20",
  "source": "user_confirmed | learned"
}

Higher priority values win ties. Rules added by humans default to priority 200; rules learned by this skill default to 100.

Confidence and audit

Every output transaction carries:

category and subcategory — must be in taxonomy.
merchant — clean display name.
confidence — [0.0, 1.0].
source — rule | llm | uncategorized.
matched_rule_id (if source: rule).

Never overwrite description_raw; always preserve it for re-classification.

Output contract

{
  "categorized": [
    {
      "id": "tx_20260115_001",
      "merchant": "Trader Joe's",
      "category": "food",
      "subcategory": "groceries",
      "is_recurring_candidate": false,
      "confidence": 1.0,
      "source": "rule",
      "matched_rule_id": "rule_trader_joes"
    }
  ],
  "rules_proposed": [
    {
      "match": "BLUE BOTTLE",
      "merchant": "Blue Bottle Coffee",
      "category": "food",
      "subcategory": "coffee",
      "evidence_count": 4,
      "evidence_tx_ids": ["tx_20260103_004", "tx_20260110_002", "tx_20260117_007", "tx_20260124_001"]
    }
  ],
  "warnings": [
    { "tx_id": "tx_20260118_009", "type": "taxonomy_gap", "description_raw": "ZELLE TO M COPPENS" }
  ],
  "summary": {
    "total": 142,
    "rule_matched": 118,
    "llm_classified": 22,
    "uncategorized": 2,
    "uncategorized_pct": 1.4
  }
}

Guardrails

Preserve description_raw byte-for-byte. Normalization is for matching only.
Never invent categories. Stay within the supplied taxonomy.
Account-type aware. A "deposit" on a brokerage account is not salary; a "withdrawal" on a savings account is likely a transfer, not spending.
Internal transfers must net to zero across accounts. If a financial.transfers_internal is classified on one side, the matching opposite-sign transaction on the other account should also be transfers_internal — flag if not.
Uncategorized rate above 5% of new transactions in a batch is a quality signal; surface it in the summary.
No PII in proposed rules. A rule like match: "ZELLE TO JOHN SMITH" exposes a name; redact or skip such proposals.

Related Skills

lyndonkl/conf-theme-clustering

testing

VerifiedTrustedCommunity

Cluster a conference's event records into a small set of coarse themes with finer sub-clusters, an explicit outlier bucket, and soft (multi-membership) affinities — using the hybrid embed-then-label pipeline (embed abstracts, reduce, density-cluster, then LLM-label the clusters) when embedding libraries are available, and an LLM-reasoned hierarchical fallback when they are not. Embeddings do the grouping; the LLM only names the groups. Conference-agnostic. Use when turning structured event records into a navigable theme map for preference elicitation and scheduling, when you need 6-8 reasonable themes rather than 20 muddy ones, or when overlapping talks must belong to more than one theme. Trigger keywords - theme clustering, cluster talks, embed then label, soft membership, outlier talks, conference themes, topic map.

127SKILL.mdUpdated Jun 28, 2026

lyndonkl/conf-theme-clustering

lyndonkl/conf-schedule-optimization

development

VerifiedTrustedCommunity

Build a personal conference schedule as a constraint-optimization problem — hard constraints (no time overlap, room-to-room travel time, capacity/registration, the attendee's own must-attends and blackouts) plus a user-owned weighted objective trading interest against breadth, pacing (maximize contiguous free time), and serendipity. Surfaces unbreakable conflicts (two high-value overlapping talks the model cannot rank) as decisions for the human rather than silently picking, and reports what each choice traded away. Conference-agnostic. Use to turn a preference profile plus a theme map into a day-by-day plan, to resolve overlapping sessions, or to balance a packed vs paced schedule. Trigger keywords - schedule optimization, conference schedule, constraint optimization, overlapping talks, contiguous free time, conflict surfacing, packed vs paced.

127SKILL.mdUpdated Jun 28, 2026

lyndonkl/conf-schedule-optimization

lyndonkl/conf-program-extraction

development

VerifiedTrustedCommunity

Parse a heterogeneous conference program (markdown, HTML, PDF-derived text, or JSON) into normalized event records with per-field confidence scores and independent classification axes (topic, depth, format, prerequisites, recorded, capacity). Detects the program's format before extracting, treats every inferred field as uncertain (present vs inferred vs missing), and flags thin or missing abstracts so downstream enrichment can target them. Conference-agnostic. Use when ingesting a conference or event schedule into a structured store, normalizing a talk/session list, or extracting per-session metadata with calibrated confidence. Trigger keywords - program ingestion, parse schedule, session extraction, event records, conference program, talk metadata, per-field confidence.

127SKILL.mdUpdated Jun 28, 2026

lyndonkl/conf-program-extraction

lyndonkl/conf-preference-elicitation

development

VerifiedTrustedCommunity

Build a personalized preference profile from a small number of well-chosen, cluster-grounded questions instead of a long survey. Represents the person's interests as an uncertainty region over the theme map, picks the single highest-information-gain choice-based question (contrasting real talks from different clusters), balances exploiting known interests against exploring uncertain ones, deliberately injects outlier probes to fight selection bias, and stops as soon as the schedule would be stable. Also elicits the user-owned objective weights and hard constraints. Interactive — runs where it can actually ask the person. Conference-agnostic. Use to turn a theme map into a preference profile, to decide what to ask a conference attendee, or to elicit scheduling priorities. Trigger keywords - preference elicitation, ask few questions, information gain, choice-based questions, selection bias probe, objective weights, attendee preferences.

127SKILL.mdUpdated Jun 28, 2026

lyndonkl/conf-preference-elicitation

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/lyndonkl/claude.git

# Copy into Claude Code skills folder (global)
cp -r claude/skills/transaction-categorizer ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

lyndonkl/claude

85 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT

Adoption

lyndonkl/transaction-categorizer

$ install --global

Security Scan Results

SKILL.md

Transaction Categorizer

Table of Contents

Overview

Input contract

Workflow

Step 1 — Normalize

Step 2 — Apply rules

Step 3 — Classify residual

Step 4 — Detect recurring candidates

Step 5 — Confidence

Step 6 — Propose new rules

Taxonomy

Rule format

Confidence and audit

Output contract

Guardrails

Related Skills

lyndonkl/conf-theme-clustering

lyndonkl/conf-schedule-optimization

lyndonkl/conf-program-extraction

lyndonkl/conf-preference-elicitation

lyndonkl/transaction-categorizer

$ install --global

Security Scan Results

SKILL.md

Transaction Categorizer

Table of Contents

Overview

Input contract

Workflow

Step 1 — Normalize

Step 2 — Apply rules

Step 3 — Classify residual

Step 4 — Detect recurring candidates

Step 5 — Confidence

Step 6 — Propose new rules

Taxonomy

Rule format

Confidence and audit

Output contract

Guardrails

Related Skills

lyndonkl/conf-theme-clustering

lyndonkl/conf-schedule-optimization

lyndonkl/conf-program-extraction

lyndonkl/conf-preference-elicitation