skills/prismer-data-analysis/SKILL.md
Load, analyze, and visualize datasets using pandas with AG Grid display
npx skillsauth add Zaoqu-Liu/ScienceClaw data-analysisInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Load data files (CSV, XLSX, JSON, Parquet) into the AG Grid viewer, run pandas queries, save results, and generate visualizations.
data_list - List available data files in /workspace/data/data_load - Load a data file into AG Grid (returns markdown preview for context)data_query - Execute pandas operations on loaded data (filter, aggregate, transform)data_save - Save the current DataFrame to a filejupyter_execute - Execute Python code in Jupyter kernel (for plots and complex analysis)update_notebook - Add cells to Jupyter notebookupdate_gallery - Display generated plots in the galleryFor tabular data exploration, use the data tools which provide a spreadsheet-like experience:
data_list to see what's in /workspace/data/data_load to read a file and display in AG Grid
data_query to run pandas operations
df variable contains the loaded dataresult = ... to define outputdata_save to export to CSV/XLSXFor visualization, statistical analysis, or ML, use Jupyter tools:
jupyter_execute running pandas codeupdate_galleryWhen user says: "Analyze this dataset" or "Show me the data"
data_list to find available filesdata_load with the target filedata_query with result = df.describe() for statisticsWhen user says: "Show only rows where X > Y" or "Group by category"
data_query with pandas filter/groupby codeWhen user says: "Export this" or "Save as Excel"
data_save with desired filename and formatWhen user says: "Create a chart" or "Plot the distribution"
jupyter_execute with matplotlib/seaborn codeupdate_galleryresult = df[df['score'] > 90]
result = df.groupby('category').agg({'value': ['mean', 'sum', 'count']}).reset_index()
result = df.sort_values('date', ascending=False)
df['ratio'] = df['value_a'] / df['value_b']
result = df
result = df.describe()
result = df.dropna(subset=['important_column'])
testing
Therapeutics Data Commons. AI-ready drug discovery datasets (ADME, toxicity, DTI), benchmarks, scaffold splits, molecular oracles, for therapeutic ML and pharmacological prediction.
tools
Genomic file toolkit. Read/write SAM/BAM/CRAM alignments, VCF/BCF variants, FASTA/FASTQ sequences, extract regions, calculate coverage, for NGS data processing pipelines.
development
Complete mass spectrometry analysis platform. Use for proteomics workflows feature detection, peptide identification, protein quantification, and complex LC-MS/MS pipelines. Supports extensive file formats and algorithms. Best for proteomics, comprehensive MS data processing. For simple spectral comparison and metabolite ID use matchms.
development
Multi-objective optimization framework. NSGA-II, NSGA-III, MOEA/D, Pareto fronts, constraint handling, benchmarks (ZDT, DTLZ), for engineering design and optimization problems.