resources/skills/csv-pipeline/SKILL.md
Process, transform, analyze, and report on CSV and JSON data files. Use when the user needs to filter rows, join datasets, compute aggregates, convert formats, deduplicate, or generate summary reports from tabular data. Works with any CSV, TSV, or JSON Lines file.
npx skillsauth add aidotnet/opencowork csv-pipelineInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Process tabular data (CSV, TSV, JSON, JSON Lines) using standard command-line tools and Python. No external dependencies required beyond Python 3.
| Script | Purpose | Dependencies |
| ------------- | ----------------------------------- | ---------------------- |
| csv_tool.py | All-in-one CSV/JSON processing tool | Python 3 (stdlib only) |
python --version
CRITICAL — Error Recovery: If any script below fails with an error, check that Python 3 is available and re-run the exact same command.
python scripts/csv_tool.py inspect "DATA_FILE"
Shows row count, column names, and non-empty value counts per column.
python scripts/csv_tool.py filter "DATA_FILE" --column COLUMN_NAME --op OPERATOR --value VALUE --output "OUTPUT_FILE"
Operators: eq, neq, gt, gte, lt, lte, contains, startswith, endswith
python scripts/csv_tool.py sort "DATA_FILE" --column COLUMN_NAME --order asc --output "OUTPUT_FILE"
Options: --numeric for numeric sorting, --order desc for descending.
python scripts/csv_tool.py dedup "DATA_FILE" --columns "col1,col2" --output "OUTPUT_FILE"
Remove duplicates by specified columns (or all columns if omitted).
python scripts/csv_tool.py aggregate "DATA_FILE" --group-by COLUMN --agg-column VALUE_COL --func sum --output "OUTPUT_FILE"
Functions: sum, avg, count, min, max
python scripts/csv_tool.py join "LEFT_FILE" "RIGHT_FILE" --on KEY_COLUMN --how inner --output "OUTPUT_FILE"
Join types: inner, left
python scripts/csv_tool.py convert "DATA_FILE" --to json --output "OUTPUT_FILE"
Supported conversions: csv, json, jsonl (JSON Lines), tsv
python scripts/csv_tool.py report "DATA_FILE" --group-by CATEGORY_COL --value-column VALUE_COL --output "report.md"
Generates a Markdown summary table with count, sum, avg, min, max per group.
python scripts/csv_tool.py clean "DATA_FILE" --output "CLEAN_FILE"
Strips whitespace, normalizes empty values (N/A, null, None → empty), normalizes booleans.
inspect to understand the dataaggregate for raw data, report for Markdown outputjoin two datasets on a shared keyconvert between CSV/JSON/TSVtools
Product Design workflow plugin for OpenCowork. Use when the user mentions Product Design or asks to design, redesign, audit, research, prototype, clone a URL, turn an image or screenshot into code, generate UI directions, manage saved product design context, or share a runnable prototype.
tools
Post tweets to X.com (Twitter) using the system browser's login state
development
Comprehensive document creation, editing, and analysis with support for tracked changes, comments, formatting preservation, and text extraction. When GLM needs to work with professional documents (.docx files) for: (1) Creating new documents, (2) Modifying or editing content, (3) Working with tracked changes, (4) Adding comments, or any other document tasks
development
Comprehensive spreadsheet creation, editing, and analysis with support for formulas, formatting, data analysis, and visualization. When GLM needs to work with spreadsheets (.xlsx, .xlsm, .csv, .tsv, etc) for: (1) Creating new spreadsheets with formulas and formatting, (2) Reading or analyzing data, (3) Modify existing spreadsheets while preserving formulas, (4) Data analysis and visualization in spreadsheets, or (5) Recalculating formulas