.claude/skills/data-exploration/SKILL.md
Performs exploratory data analysis (EDA) on datasets from CKAN portals and CSV files. Use when analyzing datasets, checking data quality, exploring CSV files, or when the user asks to examine, analyze, or validate data.
npx skillsauth add ondata/ckan-mcp-server data-explorationInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
This skill provides comprehensive capabilities for Exploratory Data Analysis (EDA) on datasets from CKAN portals and direct CSV files. It focuses on understanding data structure, assessing quality, identifying patterns, and generating insights.
For advanced CSV analysis using SQL:
# Schema analysis
duckdb -jsonlines -c "DESCRIBE SELECT * FROM read_csv('url')"
# Statistical summarization
duckdb -jsonlines -c "SUMMARIZE SELECT * FROM read_csv('url')"
# Data sampling
duckdb -jsonlines -c "SELECT * FROM read_csv('url') USING SAMPLE N"
# Custom queries
duckdb -jsonlines -c "SELECT column_name, COUNT(*), AVG(value) FROM read_csv('url') GROUP BY column_name"
ckan_package_show - Detailed dataset metadatackan_organization_show - Publisher informationckan_package_search - Dataset discoveryckan_datastore_search - DataStore query capabilitiesckan_find_relevant_datasets - Semantic search1. Metadata Validation
- Check title, description, license
- Verify publisher information
- Assess tag relevance
2. Structural Analysis
- Examine schema and data types
- Check column naming conventions
- Validate data formats
3. Content Analysis
- Assess completeness (null values)
- Verify consistency (internal logic)
- Check accuracy (calculated fields)
4. Statistical Profiling
- Generate descriptive statistics
- Analyze distributions
- Identify outliers
5. Insight Generation
- Detect patterns and trends
- Generate quality score
- Provide recommendations
User: "Analyze this dataset from dati.gov.it"
1. Retrieve dataset metadata using ckan_package_show
2. Download and examine CSV structure
3. Generate statistical summary
4. Check data quality metrics
5. Identify key insights and patterns
6. Produce comprehensive report
User: "Check the quality of this CSV file"
1. Examine file structure and schema
2. Check for missing values
3. Validate data types and formats
4. Verify internal consistency
5. Generate quality score (0-10)
6. Provide improvement recommendations
User: "Compare these two datasets"
1. Retrieve both datasets
2. Analyze schemas and structures
3. Compare statistical profiles
4. Identify similarities and differences
5. Highlight quality differences
6. Generate comparison report
# Dataset Analysis Report
## Metadata Overview
- Title, Publisher, License
- Creation date, modification date
- Resource count and formats
## Structural Analysis
- Schema table (columns, types)
- Data format assessment
- Naming convention evaluation
## Quality Assessment
- Completeness score (0-10)
- Consistency evaluation
- Accuracy verification
- Overall quality score
## Statistical Profile
- Key statistics table
- Distribution analysis
- Outlier detection
- Trend analysis
## Key Insights
- Important patterns discovered
- Notable anomalies found
- Quality improvement recommendations
## Technical Details
- Analysis methods used
- Tools and queries executed
- Limitations and assumptions
| Criterion | Weight | Description |
|--------------------|--------|--------------------------------------|
| Completeness | 30% | Percentage of non-null values |
| Consistency | 25% | Internal logical coherence |
| Accuracy | 20% | Correctness of calculations |
| Metadata Quality | 15% | Completeness of documentation |
| Format Standards | 10% | Compliance with best practices |
USING SAMPLE N) for initial explorationThis skill can work alongside:
code-analysis: For examining data processing scriptsdocumentation: For improving dataset documentationvisualization: For creating data visualizationsreporting: For generating comprehensive reports.claude/commands/openspec/ - OpenSpec command templatesAGENTS.md - Agent guidelines and rulesdocs/skills/skills.md - General skills documentationdocs/europe/openapi.yaml - API specificationstools
MCP server for exploring CKAN-based open data portals (dati.gov.it, data.gov, data.gov.uk, open.canada.ca, demo.ckan.org, and any other CKAN instance). Also covers data.europa.eu via its REST API (not CKAN). Use this skill whenever the user: asks about open data, public datasets, or data portals; mentions a country, region, or city in relation to data or statistics; asks about government transparency, public records, or official publications; asks "where can I find data on X", "are there datasets about Y", or "what data does organization Z publish"; needs to search, filter, explore, or analyze any open data catalog; or mentions a known portal by name or URL.
testing
Create, edit, improve, or audit AgentSkills. Use when creating a new skill from scratch or when asked to improve, review, audit, tidy up, or clean up an existing skill or SKILL.md file. Also use when editing or restructuring a skill directory (moving files to references/ or scripts/, removing stale content, validating against the AgentSkills spec). Triggers on phrases like "create a skill", "author a skill", "tidy up a skill", "improve this skill", "review the skill", "clean up the skill", "audit the skill".
testing
Host security hardening and risk-tolerance configuration for OpenClaw deployments. Use when a user asks for security audits, firewall/SSH/update hardening, risk posture, exposure review, OpenClaw cron scheduling for periodic checks, or version status checks on a machine running OpenClaw (laptop, workstation, Pi, VPS).
testing
Create, edit, improve, or audit AgentSkills. Use when creating a new skill from scratch or when asked to improve, review, audit, tidy up, or clean up an existing skill or SKILL.md file. Also use when editing or restructuring a skill directory (moving files to references/ or scripts/, removing stale content, validating against the AgentSkills spec). Triggers on phrases like "create a skill", "author a skill", "tidy up a skill", "improve this skill", "review the skill", "clean up the skill", "audit the skill".