Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

praveenmaiya/compare-versions

Name: compare-versions
Author: praveenmaiya

.claude/skills/compare-versions/SKILL.md

npx skillsauth add praveenmaiya/holley-rec compare-versions

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Version Comparison Skill

Compares recommendation outputs between pipeline versions.

When to Use

After implementing a new version
Before deploying to production
Debugging unexpected changes
Validating bug fixes

Process

Step 1: Get Baseline Stats (Production)

bq query --use_legacy_sql=false "
SELECT
  'production' as version,
  COUNT(*) as total_users,
  COUNT(DISTINCT email_lower) as unique_emails,
  ROUND(AVG(rec1_score), 2) as avg_score,
  ROUND(MIN(rec1_price), 2) as min_price,
  ROUND(MAX(rec1_price), 2) as max_price,
  COUNTIF(rec1_image LIKE 'https://%') * 100.0 / COUNT(*) as https_pct
FROM \`auxia-reporting.company_1950_jp.final_vehicle_recommendations\`
"

Step 2: Get New Version Stats

bq query --use_legacy_sql=false "
SELECT
  'staging' as version,
  COUNT(*) as total_users,
  COUNT(DISTINCT email_lower) as unique_emails,
  ROUND(AVG(rec1_score), 2) as avg_score,
  ROUND(MIN(rec1_price), 2) as min_price,
  ROUND(MAX(rec1_price), 2) as max_price,
  COUNTIF(rec1_image LIKE 'https://%') * 100.0 / COUNT(*) as https_pct
FROM \`auxia-reporting.temp_holley_v5_17.final_vehicle_recommendations\`
"

Step 3: Compare Recommendations (User-Level)

bq query --use_legacy_sql=false "
WITH old AS (
  SELECT email_lower, rec_part_1, rec_part_2, rec_part_3, rec_part_4,
         rec1_score, rec2_score, rec3_score, rec4_score
  FROM \`auxia-reporting.company_1950_jp.final_vehicle_recommendations\`
),
new AS (
  SELECT email_lower, rec_part_1, rec_part_2, rec_part_3, rec_part_4,
         rec1_score, rec2_score, rec3_score, rec4_score
  FROM \`auxia-reporting.temp_holley_v5_17.final_vehicle_recommendations\`
)
SELECT
  COUNT(*) as users_in_both,
  COUNTIF(o.rec_part_1 = n.rec_part_1) as same_rec1,
  COUNTIF(o.rec_part_1 != n.rec_part_1) as diff_rec1,
  COUNTIF(o.rec_part_2 != n.rec_part_2) as diff_rec2,
  COUNTIF(o.rec_part_3 != n.rec_part_3) as diff_rec3,
  COUNTIF(o.rec_part_4 != n.rec_part_4) as diff_rec4,
  ROUND(100.0 * COUNTIF(o.rec_part_1 = n.rec_part_1) / COUNT(*), 2) as pct_same_rec1
FROM old o
JOIN new n ON o.email_lower = n.email_lower
"

Step 4: Investigate Differences

If differences found, dig deeper:

# Sample users with different rec1
bq query --use_legacy_sql=false "
WITH old AS (
  SELECT email_lower, rec_part_1 as old_rec1, rec1_score as old_score
  FROM \`auxia-reporting.company_1950_jp.final_vehicle_recommendations\`
),
new AS (
  SELECT email_lower, rec_part_1 as new_rec1, rec1_score as new_score
  FROM \`auxia-reporting.temp_holley_v5_17.final_vehicle_recommendations\`
)
SELECT
  o.email_lower,
  o.old_rec1, o.old_score,
  n.new_rec1, n.new_score
FROM old o
JOIN new n ON o.email_lower = n.email_lower
WHERE o.old_rec1 != n.new_rec1
LIMIT 20
"

Output Format

| Metric | Old | New | Change | |--------|-----|-----|--------| | Users | X | Y | +/- N | | Same rec1 | - | - | X% | | Diff rec1 | - | - | N users | | Avg score | X | Y | +/- Z |

Interpretation Guide

>99% same rec1: Minor change, likely edge cases
95-99% same: Significant but expected for bug fixes
<95% same: Major change, investigate thoroughly
User count diff: Check audience filtering logic

Related Files

docs/pipeline_run_stats.md - Historical comparisons
docs/release_notes.md - Version change documentation

praveenmaiya/compare-versions

.claude/skills/compare-versions/SKILL.md

Compare two pipeline versions. Use when validating a new version against baseline.

devops

Updated Apr 17, 2026

$ install --global

skillsauth

npx skillsauth add praveenmaiya/holley-rec compare-versions

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 17, 2026, 7:51 AM6.0s1 file scanned

SKILL.md

name:: compare-versions
description:: Compare two pipeline versions. Use when validating a new version against baseline.
allowed-tools:: Bash, Read, Glob

Version Comparison Skill

Compares recommendation outputs between pipeline versions.

When to Use

After implementing a new version
Before deploying to production
Debugging unexpected changes
Validating bug fixes

Process

Step 1: Get Baseline Stats (Production)

bq query --use_legacy_sql=false "
SELECT
  'production' as version,
  COUNT(*) as total_users,
  COUNT(DISTINCT email_lower) as unique_emails,
  ROUND(AVG(rec1_score), 2) as avg_score,
  ROUND(MIN(rec1_price), 2) as min_price,
  ROUND(MAX(rec1_price), 2) as max_price,
  COUNTIF(rec1_image LIKE 'https://%') * 100.0 / COUNT(*) as https_pct
FROM \`auxia-reporting.company_1950_jp.final_vehicle_recommendations\`
"

Step 2: Get New Version Stats

bq query --use_legacy_sql=false "
SELECT
  'staging' as version,
  COUNT(*) as total_users,
  COUNT(DISTINCT email_lower) as unique_emails,
  ROUND(AVG(rec1_score), 2) as avg_score,
  ROUND(MIN(rec1_price), 2) as min_price,
  ROUND(MAX(rec1_price), 2) as max_price,
  COUNTIF(rec1_image LIKE 'https://%') * 100.0 / COUNT(*) as https_pct
FROM \`auxia-reporting.temp_holley_v5_17.final_vehicle_recommendations\`
"

Step 3: Compare Recommendations (User-Level)

bq query --use_legacy_sql=false "
WITH old AS (
  SELECT email_lower, rec_part_1, rec_part_2, rec_part_3, rec_part_4,
         rec1_score, rec2_score, rec3_score, rec4_score
  FROM \`auxia-reporting.company_1950_jp.final_vehicle_recommendations\`
),
new AS (
  SELECT email_lower, rec_part_1, rec_part_2, rec_part_3, rec_part_4,
         rec1_score, rec2_score, rec3_score, rec4_score
  FROM \`auxia-reporting.temp_holley_v5_17.final_vehicle_recommendations\`
)
SELECT
  COUNT(*) as users_in_both,
  COUNTIF(o.rec_part_1 = n.rec_part_1) as same_rec1,
  COUNTIF(o.rec_part_1 != n.rec_part_1) as diff_rec1,
  COUNTIF(o.rec_part_2 != n.rec_part_2) as diff_rec2,
  COUNTIF(o.rec_part_3 != n.rec_part_3) as diff_rec3,
  COUNTIF(o.rec_part_4 != n.rec_part_4) as diff_rec4,
  ROUND(100.0 * COUNTIF(o.rec_part_1 = n.rec_part_1) / COUNT(*), 2) as pct_same_rec1
FROM old o
JOIN new n ON o.email_lower = n.email_lower
"

Step 4: Investigate Differences

If differences found, dig deeper:

# Sample users with different rec1
bq query --use_legacy_sql=false "
WITH old AS (
  SELECT email_lower, rec_part_1 as old_rec1, rec1_score as old_score
  FROM \`auxia-reporting.company_1950_jp.final_vehicle_recommendations\`
),
new AS (
  SELECT email_lower, rec_part_1 as new_rec1, rec1_score as new_score
  FROM \`auxia-reporting.temp_holley_v5_17.final_vehicle_recommendations\`
)
SELECT
  o.email_lower,
  o.old_rec1, o.old_score,
  n.new_rec1, n.new_score
FROM old o
JOIN new n ON o.email_lower = n.email_lower
WHERE o.old_rec1 != n.new_rec1
LIMIT 20
"

Output Format

| Metric | Old | New | Change | |--------|-----|-----|--------| | Users | X | Y | +/- N | | Same rec1 | - | - | X% | | Diff rec1 | - | - | N users | | Avg score | X | Y | +/- Z |

Interpretation Guide

>99% same rec1: Minor change, likely edge cases
95-99% same: Significant but expected for bug fixes
<95% same: Major change, investigate thoroughly
User count diff: Check audience filtering logic

Related Files

docs/pipeline_run_stats.md - Historical comparisons
docs/release_notes.md - Version change documentation

Related Skills

praveenmaiya/weekly-update

testing

VerifiedTrustedCommunity

Generate a team-facing weekly status update from STATUS_LOG.md and git history.

SKILL.mdUpdated Apr 17, 2026

praveenmaiya/weekly-update

praveenmaiya/validate

testing

VerifiedTrustedCommunity

Run QA validation checks on the recommendation pipeline output. Use after pipeline runs to verify data quality.

SKILL.mdUpdated Apr 17, 2026

praveenmaiya/validate

praveenmaiya/uplift

research

VerifiedTrustedCommunity

Compare Personalized vs Static treatment performance with unbiased methodology. Use for A/B analysis and treatment comparison.

SKILL.mdUpdated Apr 17, 2026

praveenmaiya/status

testing

VerifiedTrustedCommunity

Show current pipeline and deployment status. Use for quick health check.

SKILL.mdUpdated Apr 17, 2026

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/praveenmaiya/holley-rec.git

# Copy into Claude Code skills folder (global)
cp -r holley-rec/.claude/skills/compare-versions ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

praveenmaiya/holley-rec

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT