AMindToThink/bayesian-stats

Name: bayesian-stats
Author: AMindToThink

skills/bayesian-stats/SKILL.md

npx skillsauth add AMindToThink/claude-code-settings bayesian-stats

When the user invokes /bayesian-stats, help them convert frequentist statistical tests into Bayesian equivalents.

If an argument is provided (e.g., /bayesian-stats t-test), look up that specific test. Otherwise, ask which frequentist test they want to convert.

Mappings

| Frequentist Test | Bayesian Equivalent | Python Library |
|------------------|---------------------|----------------|
| Paired t-test | Bayesian paired t-test with JZS prior → BF₁₀ | pingouin: pg.ttest(x, y, paired=True) reports a BF10 column |
| Independent t-test | Bayesian independent t-test with JZS prior → BF₁₀ | pingouin: pg.ttest(x, y, paired=False) reports a BF10 column |
| Wilcoxon signed-rank | Bayesian paired t-test (approximate alternative) or Bayesian sign test via PyMC | pingouin for approximate BF, pymc for full model |
| Fisher's exact / Chi-squared | Beta-Binomial model with Beta(1,1) priors on each group's success rate | Analytical or pymc: pm.Beta("p", 1, 1) per group |
| Mixed-effects logistic regression | Bayesian mixed-effects model | bambi: bmb.Model("y ~ condition + (1\|question)", data, family="bernoulli") |
| ANOVA / F-test | Bayesian ANOVA | bambi: bmb.Model("dv ~ between", data); pingouin has no Bayesian ANOVA function |
| Pearson correlation | Bayesian correlation | pingouin: pg.corr(x, y) reports a BF10 column, or pg.bayesfactor_pearson(r, n) |
| Bootstrap CI | Posterior credible interval from MCMC | pymc model → arviz.summary() for HDI |

Code Snippets

Bayesian Paired t-test (replaces Wilcoxon / paired t-test)

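import pingouin as pg

# x, y: paired samples of equal length. pg.ttest reports the JZS Bayes factor in its "BF10" column; r is the Cauchy prior scale (√2/2 ≈ 0.707).
res = pg.ttest(x, y, paired=True, r=0.707)
bf10 = float(res["BF10"].iloc[0])
print(f"BF₁₀ = {bf10:.3f}")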

Beta-Binomial (replaces Fisher's exact)

import pymc as pm
import arviz as az

# k_a, k_b: observed success counts out of n_a, n_b trials in each condition.
with pm.Model():
    p_a = pm.Beta("p_a", 1, 1)  # Condition A success rate
    p_b = pm.Beta("p_b", 1, 1)  # Condition B success rate
    pm.Binomial("obs_a", n=n_a, p=p_a, observed=k_a)
    pm.Binomial("obs_b", n=n_b, p=p_b, observed=k_b)
    delta = pm.Deterministic("delta", p_b - p_a)
    trace = pm.sample(4000)

az.summary(trace, var_names=["delta"], hdi_prob=0.95)
az.plot_posterior(trace, var_names=["delta"], ref_val=0)
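
The mappings table also lists an analytical route for this model. The following is a minimal sketch of that route using only the Python standard library, under the assumption of independent Beta(1, 1) priors: with k successes in n trials, each posterior is Beta(1 + k, 1 + n - k), so the difference can be summarized by direct sampling without MCMC. The function name `beta_binomial_delta` and the example counts are hypothetical, and the interval reported here is equal-tailed, not the HDI that arviz computes.

```python
import random


def beta_binomial_delta(k_a, n_a, k_b, n_b, draws=20000, seed=0):
    """Summarize delta = p_b - p_a under independent Beta(1, 1) priors.

    Each posterior is Beta(1 + k, 1 + n - k), so we can sample it directly.
    """
    rng = random.Random(seed)
    deltas = sorted(
        rng.betavariate(1 + k_b, 1 + n_b - k_b)
        - rng.betavariate(1 + k_a, 1 + n_a - k_a)
        for _ in range(draws)
    )
    mean = sum(deltas) / draws
    # Equal-tailed 95% credible interval (an assumption; not an HDI).
    lo = deltas[int(0.025 * draws)]
    hi = deltas[int(0.975 * draws) - 1]
    # Posterior probability that condition B has the higher success rate.
    prob_b_better = sum(d > 0 for d in deltas) / draws
    return {"mean": mean, "ci95": (lo, hi), "p_b_gt_a": prob_b_better}


# Hypothetical counts: 12/40 successes in condition A, 25/40 in condition B.
summary = beta_binomial_delta(k_a=12, n_a=40, k_b=25, n_b=40)
print(summary)
```

The posterior probability `p_b_gt_a` answers the question "how likely is it that B outperforms A?" directly, which is often easier to communicate than an interval on `delta`.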

Bayesian Mixed-Effects (replaces frequentist mixed-effects)

import bambi as bmb
import arviz as az

model = bmb.Model("correct ~ advice_source * question_category + (1|question_id)", data, family="bernoulli")
results = model.fit(draws=4000)
az.summary(results, var_names=["advice_source", "question_category", "advice_source:question_category"])

Interpreting Bayes Factors (BF₁₀)

| BF₁₀ | Evidence |
|------|----------|
| > 100 | Extreme evidence for H₁ |
| 30–100 | Very strong evidence for H₁ |
| 10–30 | Strong evidence for H₁ |
| 3–10 | Moderate evidence for H₁ |
| 1–3 | Anecdotal evidence for H₁ |
| 1/3–1 | Anecdotal evidence for H₀ |
| 1/10–1/3 | Moderate evidence for H₀ |
| 1/30–1/10 | Strong evidence for H₀ |
| < 1/30 | Very strong evidence for H₀ |

Key advantage: BF₁₀ < 1/3 provides evidence for the null, not just "failure to reject." A non-significant p-value alone cannot quantify support for H₀.
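
The interpretation table can be turned into a small lookup helper. This is a sketch only: the function name `interpret_bf10` is hypothetical, and the table does not say which category a value exactly on a boundary belongs to, so the handling of boundary values (and of BF₁₀ = 1) here is an assumption.

```python
def interpret_bf10(bf10):
    """Return the evidence label for a Bayes factor BF10, per the table above."""
    if bf10 <= 0:
        raise ValueError("BF10 must be positive")
    if bf10 == 1:
        # Assumption: a Bayes factor of exactly 1 favors neither hypothesis.
        return "No evidence either way"
    # (exclusive lower bound, label), checked from the largest threshold down.
    thresholds = [
        (100, "Extreme evidence for H1"),
        (30, "Very strong evidence for H1"),
        (10, "Strong evidence for H1"),
        (3, "Moderate evidence for H1"),
        (1, "Anecdotal evidence for H1"),
        (1 / 3, "Anecdotal evidence for H0"),
        (1 / 10, "Moderate evidence for H0"),
        (1 / 30, "Strong evidence for H0"),
    ]
    for lower, label in thresholds:
        if bf10 > lower:
            return label
    return "Very strong evidence for H0"


for value in (150, 5, 0.2, 0.01):
    print(value, "->", interpret_bf10(value))
```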

Best Practices

Always report both frequentist (p-values, CIs) and Bayesian (BF₁₀, posterior credible intervals) results. Reviewers expect p-values; Bayesian results add a direct measure of the evidence for each hypothesis.
Sensitivity analysis: Rerun Bayes factors with different prior scales (e.g., Cauchy r = 0.5, √2/2, 1.0). If the conclusions hold across priors, the result is more credible.
Small samples: Bayesian methods handle small n more gracefully. Posteriors are appropriately wide when data are limited, although the choice of prior then has more influence on the result.
Combining experiments: The posterior from one experiment can serve as the prior for the next, provided both experiments measure the same quantity. This is how evidence accumulates across studies.
Multiple comparisons: Hierarchical models with partial pooling shrink group estimates toward each other and often remove the need for explicit corrections. This protection is not automatic, so Bayes factors computed separately for many comparisons still require care.
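
The point about combining experiments can be illustrated with a conjugate Beta-Binomial model. The sketch below assumes both experiments estimate the same success rate; the helper `update_beta` and the counts are hypothetical. It shows that updating on study 1 and then on study 2 gives the same posterior as analyzing the pooled data once.

```python
def update_beta(prior, successes, failures):
    """Conjugate update: Beta(a, b) prior plus binomial data gives Beta(a + s, b + f)."""
    a, b = prior
    return (a + successes, b + failures)


prior = (1, 1)  # Beta(1, 1), the flat prior used elsewhere in this skill

# Hypothetical study 1: 7 successes, 13 failures.
after_exp1 = update_beta(prior, successes=7, failures=13)
# Hypothetical study 2, using the study 1 posterior as its prior.
after_exp2 = update_beta(after_exp1, successes=11, failures=9)

# Analyzing the pooled data in one step gives the same posterior.
pooled = update_beta(prior, successes=18, failures=22)

posterior_mean = after_exp2[0] / sum(after_exp2)
print(after_exp2, pooled, round(posterior_mean, 3))  # (19, 23) (19, 23) 0.452
```

For non-conjugate models the same idea applies, but the posterior usually has to be approximated (for example, by fitting a distribution to MCMC draws) before it can be reused as a prior.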

Required Packages

uv add pingouin pymc bambi arviz

Convert frequentist statistical tests into their Bayesian equivalents. Provides mappings, code snippets, interpretation guides, and best practices.

development

Updated Apr 25, 2026

npx skillsauth add AMindToThink/claude-code-settings bayesian-stats

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean.

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

| Scanner | Description | Status |
|---------|-------------|--------|
| Trivy | Container and dependency vulnerability scanner | Clean (95%) |
| Semgrep | Static code analysis for vulnerabilities | Clean (95%) |
| mcp-scan (Snyk) | Model Context Protocol security validation | Clean (95%) |
| Snyk (dep) | Open source security scanning | Skipped (50%) |
| Socket.dev | Supply chain security analysis | Skipped (50%) |
| VirusTotal | Multi-engine malware detection | Skipped (50%) |
| CrowdStrike | Advanced threat intelligence | Skipped (50%) |
| OSV-Scanner | Open Source Vulnerability database check | Skipped (50%) |
| OWASP Dep-Check | | Skipped (50%) |

Last scanned: Apr 25, 2026, 7:06 PM (146.3s, 1 file scanned)

SKILL.md

name: bayesian-stats
description: Convert frequentist statistical tests into their Bayesian equivalents. Provides mappings, code snippets, interpretation guides, and best practices.
user_invocable: true

Install manually

# Clone the repo
git clone https://github.com/AMindToThink/claude-code-settings.git

# Copy into Claude Code skills folder (global)
cp -r claude-code-settings/skills/bayesian-stats ~/.claude/skills/

Repository: AMindToThink/claude-code-settings

Compatible with: Claude Code, OpenAI Codex CLI, ChatGPT