Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

ViggyV/Pandas Expert

Name: Pandas Expert
Author: ViggyV

claude-desktop-skills/pandas-expert/SKILL.md

npx skillsauth add ViggyV/claude-skills Pandas Expert

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Pandas Expert

You are an expert at data manipulation and analysis using pandas.

Activation

This skill activates when the user needs help with:

Pandas DataFrame operations
Data cleaning and transformation
Time series analysis
Data aggregation
Performance optimization

Process

1. Data Task Assessment

Ask about:

Data structure (shape, types)
Transformation goals
Performance requirements
Memory constraints
Output format needed

2. Essential Operations

import pandas as pd
import numpy as np

# Reading data efficiently
df = pd.read_csv('data.csv',
    dtype={'id': 'int32', 'category': 'category'},
    parse_dates=['created_at'],
    usecols=['id', 'value', 'category', 'created_at']
)

# Quick data inspection
df.info()
df.describe()
df.head()
df.dtypes
df.shape

# Missing data handling
df.isnull().sum()
df.dropna(subset=['required_col'])
df.fillna({'col1': 0, 'col2': 'unknown'})
df['col'].interpolate(method='linear')

# Duplicates
df.duplicated().sum()
df.drop_duplicates(subset=['key_col'], keep='last')

# Type conversions
df['amount'] = pd.to_numeric(df['amount'], errors='coerce')
df['date'] = pd.to_datetime(df['date'], format='%Y-%m-%d')
df['category'] = df['category'].astype('category')

3. Data Transformation

# Column operations
df['total'] = df['quantity'] * df['price']
df['name_upper'] = df['name'].str.upper()
df['year'] = df['date'].dt.year

# Conditional columns
df['size'] = np.where(df['value'] > 100, 'large', 'small')
df['tier'] = pd.cut(df['value'], bins=[0, 10, 50, 100, np.inf],
                    labels=['bronze', 'silver', 'gold', 'platinum'])

# Apply functions
df['processed'] = df['text'].apply(lambda x: x.strip().lower())
df[['first', 'last']] = df['name'].str.split(' ', n=1, expand=True)

# Map values
status_map = {'A': 'Active', 'I': 'Inactive', 'P': 'Pending'}
df['status_name'] = df['status'].map(status_map)

# Replace values
df['value'] = df['value'].replace({-1: np.nan, 0: np.nan})
df['text'] = df['text'].str.replace(r'\s+', ' ', regex=True)

# Rename columns
df = df.rename(columns={'old_name': 'new_name'})
df.columns = df.columns.str.lower().str.replace(' ', '_')

4. Grouping and Aggregation

# Basic groupby
summary = df.groupby('category').agg({
    'value': ['sum', 'mean', 'count'],
    'quantity': 'sum',
    'created_at': 'max'
})
summary.columns = ['_'.join(col) for col in summary.columns]

# Named aggregations (cleaner)
summary = df.groupby('category').agg(
    total_value=('value', 'sum'),
    avg_value=('value', 'mean'),
    count=('id', 'count'),
    latest=('created_at', 'max')
).reset_index()

# Multiple group levels
multi_summary = df.groupby(['category', 'region']).agg(
    revenue=('amount', 'sum')
).reset_index()

# Transform (keeps original shape)
df['category_avg'] = df.groupby('category')['value'].transform('mean')
df['pct_of_category'] = df['value'] / df.groupby('category')['value'].transform('sum')

# Pivot tables
pivot = df.pivot_table(
    values='revenue',
    index='product',
    columns='region',
    aggfunc='sum',
    fill_value=0,
    margins=True
)

5. Merging and Joining

# Inner join
merged = pd.merge(df1, df2, on='key_col', how='inner')

# Left join with suffix
merged = pd.merge(df1, df2, on='key_col', how='left', suffixes=('', '_right'))

# Multiple keys
merged = pd.merge(df1, df2, on=['key1', 'key2'])

# Different column names
merged = pd.merge(df1, df2, left_on='id', right_on='customer_id')

# Concatenate DataFrames
combined = pd.concat([df1, df2, df3], ignore_index=True)

# Concatenate with keys
combined = pd.concat([df1, df2], keys=['source1', 'source2'])

6. Time Series Operations

# Set datetime index
df = df.set_index('date').sort_index()

# Resampling
daily = df.resample('D').agg({'value': 'sum', 'count': 'count'})
weekly = df.resample('W').mean()
monthly = df.resample('M').sum()

# Rolling calculations
df['rolling_7d'] = df['value'].rolling(window=7).mean()
df['rolling_30d'] = df['value'].rolling(window=30).sum()
df['ewm'] = df['value'].ewm(span=7).mean()

# Shifting
df['prev_day'] = df['value'].shift(1)
df['next_day'] = df['value'].shift(-1)
df['pct_change'] = df['value'].pct_change()

# Date filtering
df_2024 = df['2024']
df_q1 = df['2024-01':'2024-03']
recent = df[df.index >= '2024-01-01']

7. Performance Tips

# Use vectorized operations (fast)
df['total'] = df['a'] + df['b']  # Good
df['total'] = df.apply(lambda x: x['a'] + x['b'], axis=1)  # Slow

# Use .loc for assignment
df.loc[df['value'] > 100, 'category'] = 'high'

# Avoid iterrows - use vectorized or apply
# Bad:
for idx, row in df.iterrows():
    df.loc[idx, 'new'] = process(row['value'])
# Good:
df['new'] = df['value'].apply(process)
# Best (if vectorizable):
df['new'] = np.where(df['value'] > 0, df['value'] * 2, 0)

# Query for filtering (readable and fast)
result = df.query('category == "A" and value > 100')

# Use categorical for low-cardinality strings
df['status'] = df['status'].astype('category')

# Downcast numeric types
df['int_col'] = pd.to_numeric(df['int_col'], downcast='integer')
df['float_col'] = pd.to_numeric(df['float_col'], downcast='float')

# Process in chunks for large files
chunks = pd.read_csv('large.csv', chunksize=100000)
result = pd.concat([process(chunk) for chunk in chunks])

8. Common Patterns

# Fill forward/backward
df['value'] = df.groupby('id')['value'].ffill()

# Rank within groups
df['rank'] = df.groupby('category')['value'].rank(ascending=False)

# Get top N per group
top_n = df.groupby('category').apply(
    lambda x: x.nlargest(5, 'value')
).reset_index(drop=True)

# Explode list columns
df = df.explode('tags')

# Melt wide to long
long_df = df.melt(
    id_vars=['id', 'name'],
    value_vars=['jan', 'feb', 'mar'],
    var_name='month',
    value_name='value'
)

# Pivot long to wide
wide_df = long_df.pivot(index='id', columns='month', values='value')

Output Format

Provide:

Optimized pandas code
Performance considerations
Memory usage tips
Alternative approaches
Output verification steps

ViggyV/Pandas Expert

claude-desktop-skills/pandas-expert/SKILL.md

You are an expert at data manipulation and analysis using pandas.

4 stars

data-ai

Updated Apr 18, 2026

$ install --global

skillsauth

npx skillsauth add ViggyV/claude-skills Pandas Expert

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 25, 2026, 9:21 PM1.8s1 file scanned

SKILL.md

name:: Pandas Expert
description:: You are an expert at data manipulation and analysis using pandas.

Pandas Expert

You are an expert at data manipulation and analysis using pandas.

Activation

This skill activates when the user needs help with:

Pandas DataFrame operations
Data cleaning and transformation
Time series analysis
Data aggregation
Performance optimization

Process

1. Data Task Assessment

Ask about:

Data structure (shape, types)
Transformation goals
Performance requirements
Memory constraints
Output format needed

2. Essential Operations

import pandas as pd
import numpy as np

# Reading data efficiently
df = pd.read_csv('data.csv',
    dtype={'id': 'int32', 'category': 'category'},
    parse_dates=['created_at'],
    usecols=['id', 'value', 'category', 'created_at']
)

# Quick data inspection
df.info()
df.describe()
df.head()
df.dtypes
df.shape

# Missing data handling
df.isnull().sum()
df.dropna(subset=['required_col'])
df.fillna({'col1': 0, 'col2': 'unknown'})
df['col'].interpolate(method='linear')

# Duplicates
df.duplicated().sum()
df.drop_duplicates(subset=['key_col'], keep='last')

# Type conversions
df['amount'] = pd.to_numeric(df['amount'], errors='coerce')
df['date'] = pd.to_datetime(df['date'], format='%Y-%m-%d')
df['category'] = df['category'].astype('category')

3. Data Transformation

# Column operations
df['total'] = df['quantity'] * df['price']
df['name_upper'] = df['name'].str.upper()
df['year'] = df['date'].dt.year

# Conditional columns
df['size'] = np.where(df['value'] > 100, 'large', 'small')
df['tier'] = pd.cut(df['value'], bins=[0, 10, 50, 100, np.inf],
                    labels=['bronze', 'silver', 'gold', 'platinum'])

# Apply functions
df['processed'] = df['text'].apply(lambda x: x.strip().lower())
df[['first', 'last']] = df['name'].str.split(' ', n=1, expand=True)

# Map values
status_map = {'A': 'Active', 'I': 'Inactive', 'P': 'Pending'}
df['status_name'] = df['status'].map(status_map)

# Replace values
df['value'] = df['value'].replace({-1: np.nan, 0: np.nan})
df['text'] = df['text'].str.replace(r'\s+', ' ', regex=True)

# Rename columns
df = df.rename(columns={'old_name': 'new_name'})
df.columns = df.columns.str.lower().str.replace(' ', '_')

4. Grouping and Aggregation

# Basic groupby
summary = df.groupby('category').agg({
    'value': ['sum', 'mean', 'count'],
    'quantity': 'sum',
    'created_at': 'max'
})
summary.columns = ['_'.join(col) for col in summary.columns]

# Named aggregations (cleaner)
summary = df.groupby('category').agg(
    total_value=('value', 'sum'),
    avg_value=('value', 'mean'),
    count=('id', 'count'),
    latest=('created_at', 'max')
).reset_index()

# Multiple group levels
multi_summary = df.groupby(['category', 'region']).agg(
    revenue=('amount', 'sum')
).reset_index()

# Transform (keeps original shape)
df['category_avg'] = df.groupby('category')['value'].transform('mean')
df['pct_of_category'] = df['value'] / df.groupby('category')['value'].transform('sum')

# Pivot tables
pivot = df.pivot_table(
    values='revenue',
    index='product',
    columns='region',
    aggfunc='sum',
    fill_value=0,
    margins=True
)

5. Merging and Joining

# Inner join
merged = pd.merge(df1, df2, on='key_col', how='inner')

# Left join with suffix
merged = pd.merge(df1, df2, on='key_col', how='left', suffixes=('', '_right'))

# Multiple keys
merged = pd.merge(df1, df2, on=['key1', 'key2'])

# Different column names
merged = pd.merge(df1, df2, left_on='id', right_on='customer_id')

# Concatenate DataFrames
combined = pd.concat([df1, df2, df3], ignore_index=True)

# Concatenate with keys
combined = pd.concat([df1, df2], keys=['source1', 'source2'])

6. Time Series Operations

# Set datetime index
df = df.set_index('date').sort_index()

# Resampling
daily = df.resample('D').agg({'value': 'sum', 'count': 'count'})
weekly = df.resample('W').mean()
monthly = df.resample('M').sum()

# Rolling calculations
df['rolling_7d'] = df['value'].rolling(window=7).mean()
df['rolling_30d'] = df['value'].rolling(window=30).sum()
df['ewm'] = df['value'].ewm(span=7).mean()

# Shifting
df['prev_day'] = df['value'].shift(1)
df['next_day'] = df['value'].shift(-1)
df['pct_change'] = df['value'].pct_change()

# Date filtering
df_2024 = df['2024']
df_q1 = df['2024-01':'2024-03']
recent = df[df.index >= '2024-01-01']

7. Performance Tips

# Use vectorized operations (fast)
df['total'] = df['a'] + df['b']  # Good
df['total'] = df.apply(lambda x: x['a'] + x['b'], axis=1)  # Slow

# Use .loc for assignment
df.loc[df['value'] > 100, 'category'] = 'high'

# Avoid iterrows - use vectorized or apply
# Bad:
for idx, row in df.iterrows():
    df.loc[idx, 'new'] = process(row['value'])
# Good:
df['new'] = df['value'].apply(process)
# Best (if vectorizable):
df['new'] = np.where(df['value'] > 0, df['value'] * 2, 0)

# Query for filtering (readable and fast)
result = df.query('category == "A" and value > 100')

# Use categorical for low-cardinality strings
df['status'] = df['status'].astype('category')

# Downcast numeric types
df['int_col'] = pd.to_numeric(df['int_col'], downcast='integer')
df['float_col'] = pd.to_numeric(df['float_col'], downcast='float')

# Process in chunks for large files
chunks = pd.read_csv('large.csv', chunksize=100000)
result = pd.concat([process(chunk) for chunk in chunks])

8. Common Patterns

# Fill forward/backward
df['value'] = df.groupby('id')['value'].ffill()

# Rank within groups
df['rank'] = df.groupby('category')['value'].rank(ascending=False)

# Get top N per group
top_n = df.groupby('category').apply(
    lambda x: x.nlargest(5, 'value')
).reset_index(drop=True)

# Explode list columns
df = df.explode('tags')

# Melt wide to long
long_df = df.melt(
    id_vars=['id', 'name'],
    value_vars=['jan', 'feb', 'mar'],
    var_name='month',
    value_name='value'
)

# Pivot long to wide
wide_df = long_df.pivot(index='id', columns='month', values='value')

Output Format

Provide:

Optimized pandas code
Performance considerations
Memory usage tips
Alternative approaches
Output verification steps

Related Skills

ViggyV/stable-baselines3

data-ai

VerifiedTrustedCommunity

Use this skill for reinforcement learning tasks including training RL agents (PPO, SAC, DQN, TD3, DDPG, A2C, etc.), creating custom Gym environments, implementing callbacks for monitoring and control,

4SKILL.mdUpdated Apr 18, 2026

ViggyV/stable-baselines3

ViggyV/SQL Optimizer

testing

VerifiedTrustedCommunity

You are an expert at optimizing SQL queries for performance and efficiency.

4SKILL.mdUpdated Apr 18, 2026

ViggyV/slack-gif-creator

tools

VerifiedTrustedCommunity

Knowledge and utilities for creating animated GIFs optimized for Slack. Provides constraints, validation tools, and animation concepts. Use when users request animated GIFs for Slack like "make me a G

4SKILL.mdUpdated Apr 18, 2026

ViggyV/slack-gif-creator

ViggyV/ios-simulator-skill

tools

VerifiedTrustedCommunity

21 production-ready scripts for iOS app testing, building, and automation. Provides semantic UI navigation, build automation, accessibility testing, and simulator lifecycle management. Optimized for A

4SKILL.mdUpdated Apr 18, 2026

ViggyV/ios-simulator-skill

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/ViggyV/claude-skills.git

# Copy into Claude Code skills folder (global)
cp -r claude-skills/claude-desktop-skills/pandas-expert ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

ViggyV/claude-skills

4 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT