src/datapro/data/skills/data-viz/SKILL.md
Generates professional statistical charts (Bar, Pie, Grouped) using Matplotlib and Seaborn. Use this skill to visualize survey data, trends, and distributions for reports.
npx skillsauth add pablodiegoo/data-pro-skill data-vizInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
This skill provides a standardized way to generate high-quality statistical charts for reports. It handles styling (using Sebrae-compatible colors), layout, and saving to files.
plot_bar)Best for comparing categories or counts. Supports vertical and horizontal orientation.
plot_pie)Best for showing composition (shares) of a whole. Limit to Top 5-7 categories for readability.
plot_grouped_bar)Best for comparing distributions across segments.
Best for comparing means of domains across points in time (e.g., Start vs End). Use plot_evolution_line.
For qualitative text analysis visualization. Use in conjunction with survey-qual-analyzer frequencies.
principal_component_plotting, correlation_ellipse_plot, multivariate_normal_contours)performance_curve_builder)import pandas as pd
# Import from skill scripts directory
from scripts.plotter import plot_bar, plot_pie, plot_grouped_bar
from scripts.evolution_plotter import plot_evolution_line
from scripts.visuals import * # Additional visualization utilities
from scripts.advanced_plots import * # Advanced chart types
from scripts.principal_component_plotting import plot_feature_importance, plot_biplot
from scripts.correlation_ellipse_plot import plot_corr_ellipses
from scripts.multivariate_normal_contours import plot_contours
from scripts.performance_curve_builder import calculate_drawdown, extend_series_to_date
# 1. Simple Bar Chart (Top 10 Cities)
plot_bar(df, x_col="City", title="Respondents by City", filename="output/city_dist.png", orientation='h')
# 2. Evolution of Domains (Survey Pre vs Post)
# Expected columns: 'Cycle', 'Domain', 'Mean'
plot_evolution_line(df_evo, x="Cycle", y="Mean", hue="Domain", title="Evolution of Domains", filename="output/evolution.png")
Requires matplotlib, seaborn, and pandas.
testing
Comprehensive time-series validation and analysis suite. Handles backtesting of trading and non-trading strategies with support for walk-forward validation (training vs testing windows), performance metric calculation (Sharpe, Drawdown, Win Rate), and event-driven resource allocation simulation. Use for: (1) Validating sequential logic on time-series data, (2) Calculating risk-adjusted performance, (3) Simulating constraints in resource distribution, (4) Detecting look-ahead bias through walk-forward testing.
tools
Core statistical analysis and pipeline automation for survey datasets. Use for: (1) Running standard Crosstabs, NPS, Top-Box calculations, (2) Generating complete EDA or Analytics notebooks, (3) Quantitative and qualitative processing of questionnaire data.
development
Business-level frameworks and actionable reporting for executives. Use for: (1) Plotting Priority Matrices, (2) Generating Pain Curves, (3) Conversion Funnels, (4) Removing Halo Effects to uncover true sentiment.
testing
Tactical and highly interpretable Machine Learning. Use for: (1) Extracting Feature Importance via Random Forest, (2) Running Permutation Tests, (3) Handling Imbalanced Data (SMOTE).