Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

seaworld008/andrej-karpathy-skills

Name: andrej-karpathy-skills
Author: seaworld008

openclaw-skills/andrej-karpathy-skills/SKILL.md

npx skillsauth add seaworld008/commonly-used-high-value-skills andrej-karpathy-skills

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Andrej Karpathy Skills

Use this skill to keep AI-assisted engineering grounded, simple, and verifiable. It is especially useful for backend, architecture, refactoring, debugging, migrations, and any task where an agent might overbuild, guess, or change unrelated code.

The core idea is simple: do not let the model drift. Make assumptions explicit, prefer minimal changes, touch only what the request requires, and verify with concrete checks before calling the work done.

When to Use

Writing backend code, API changes, migrations, data access logic, or architecture-sensitive code.
Refactoring code where accidental behavior changes are risky.
Debugging a production issue or a failing test.
Reviewing AI-generated code for overengineering, hidden assumptions, or missing verification.
Implementing a user request that is ambiguous, cross-cutting, or easy to over-scope.
Making changes in an unfamiliar codebase.
Working under constraints such as dirty git state, existing style, generated files, or ownership boundaries.
Producing a plan before code or a concise review after code.

Skip When

The user explicitly asks for brainstorming without implementation.
The task is pure content writing, design exploration, or research without code changes.
A different skill with stricter domain procedure applies, such as a security audit, incident response, or regulated compliance workflow.

Core Principles

Think before coding.
Simplicity first.
Surgical changes.
Goal-driven execution.
Verification before completion.
Respect the existing system.
Surface uncertainty early.
Prefer reversible, observable steps.

Principle 1: Think Before Coding

Before implementation, establish what is known and what is assumed.

Read the relevant files before changing them.
Identify the exact behavior the user wants changed.
Name any ambiguity that changes the implementation path.
If there are multiple viable approaches, pick the smallest one that fits the repo.
If an approach creates a new abstraction, justify why it reduces real complexity.
If the task crosses module boundaries, map those boundaries first.
If the codebase already has a local pattern, use it.

Do not silently guess when the guess would affect data integrity, security, public API behavior, billing, auth, or destructive operations.

Principle 2: Simplicity First

Write the minimum code that fully solves the actual problem.

Do not add configuration for cases the user did not ask for.
Do not introduce a framework, service, queue, cache, or abstraction just because it may help later.
Do not broaden a bug fix into a style rewrite.
Do not add generic helpers for one call site unless the helper removes real complexity.
Keep names direct and local.
Prefer boring data structures and clear control flow.
Make failure modes explicit, but do not handle impossible states with noisy code.

Ask this before committing to an approach:

Can the same behavior be achieved with fewer concepts, fewer files, and less surface area?

If yes, simplify.

Principle 3: Surgical Changes

Every changed line should trace to the request.

Touch only the files required by the task.
Keep unrelated formatting, naming, and comments intact.
Remove only unused code made unused by your own change.
Leave unrelated dead code alone unless the user asked for cleanup.
Preserve generated files unless the repository pipeline regenerates them.
Do not rewrite public APIs when an adapter or narrow fix is enough.
In a dirty worktree, distinguish your changes from existing user changes.

Use this edit filter:

If I had to explain this line in the PR, could I tie it directly to the requested outcome?

If not, do not change it.

Principle 4: Goal-Driven Execution

Turn vague tasks into verifiable goals.

"Fix login" becomes: reproduce failing login, identify cause, patch, run auth tests.
"Add validation" becomes: define invalid cases, test them, implement, verify messages.
"Improve architecture" becomes: name the current pain, change one boundary, prove behavior holds.
"Clean up" becomes: list safe cleanup targets, remove only confirmed unused or duplicated assets.

For multi-step work, use a compact plan:

1. Inspect current behavior -> verify with failing test or local reproduction.
2. Make minimal change -> verify targeted test passes.
3. Run broader checks -> verify no nearby regression.

Principle 5: Verification Before Completion

Do not finish on intuition.

Run the smallest relevant test first.
Then run the broader suite required by the blast radius.
For UI work, inspect the rendered result when feasible.
For generated files, rerun the generator and confirm no unexpected drift.
For migrations, inspect schema and rollback assumptions.
For security-sensitive work, run static checks and manual review.
If a check cannot run, say exactly why and what risk remains.

Good final status includes changed files, validation performed, and known residual risk.

Backend Discipline

For backend and architecture work:

Preserve request/response contracts unless the user asked to change them.
Keep validation at trust boundaries.
Keep database writes transactional where partial writes would corrupt state.
Make idempotency explicit for retries, webhooks, jobs, and sync flows.
Avoid hidden global state and time-dependent behavior in core logic.
Separate pure decision logic from I/O when it makes tests easier.
Prefer existing repository error handling and logging conventions.
Avoid swallowing exceptions without traceable context.

Architecture Discipline

Use architecture changes only when they reduce current pain.

Add a module boundary when multiple call sites need the same behavior.
Add an interface when there are multiple real implementations or a test boundary demands it.
Add a service only if it owns a coherent workflow.
Add a background job only if synchronous execution is genuinely unsafe or too slow.
Add a cache only when there is measured or obvious repeated expensive work.
Add a migration path for public or persisted contracts.

If the change cannot be validated in this turn, keep it smaller.

Review Checklist

Scope: Does the diff solve only the user request?
Assumptions: Are assumptions named or encoded in tests?
Simplicity: Is there speculative abstraction?
Behavior: Are important edge cases covered?
Data: Are writes safe, atomic, and idempotent where needed?
Security: Are inputs validated and secrets protected?
Compatibility: Are public contracts preserved or intentionally migrated?
Observability: Are errors diagnosable without leaking sensitive data?
Verification: Did relevant tests or checks run?

Anti-Patterns

Rewriting a working subsystem to fix a one-line bug.
Adding a new dependency for a small utility.
Changing formatting across a file while making a behavior fix.
Returning "done" without running any verification.
Hiding uncertainty behind confident prose.
Creating TODOs instead of finishing the requested path.
Making the code more configurable than the product needs.
Updating generated files by hand.

Output Format

For implementation planning:

## Assumptions
- ...

## Minimal Approach
- ...

## Verification
- Targeted:
- Broader:

For code review:

## Findings
- [Severity] file:line - issue and impact

## Verification Gaps
- ...

Boundaries

This skill is a discipline layer, not a replacement for domain expertise. Pair it with security, testing, DevOps, database, or frontend skills when the task needs specialized checks.

seaworld008/andrej-karpathy-skills

openclaw-skills/andrej-karpathy-skills/SKILL.md

Karpathy-inspired coding discipline for AI agents: think before coding, keep changes simple, edit surgically, and define verifiable success criteria before implementation.

48 stars

testing

Updated Jun 29, 2026

$ install --global

skillsauth

npx skillsauth add seaworld008/commonly-used-high-value-skills andrej-karpathy-skills

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Jun 29, 2026, 5:00 AM176.2s1 file scanned

SKILL.md

name:: andrej-karpathy-skills
description:: Karpathy-inspired coding discipline for AI agents: think before coding, keep changes simple, edit surgically, and define verifiable success criteria before implementation.
zh_description:: 用于应用 Andrej Karpathy 风格的 AI 学习、构建和研究实践。
version:: 1.0.0
author:: seaworld008
source:: in-house
source_url:: https://github.com/swarmclawai/andrej-karpathy-skills/blob/main/skills/karpathy-guidelines/SKILL.md
license:: MIT
tags:: [coding-discipline, architecture, backend, refactoring, code-review, ai-workflow, verification]
created_at:: 2026-06-03
updated_at:: 2026-06-03
quality:: 4
complexity:: intermediate

Andrej Karpathy Skills

When to Use

Writing backend code, API changes, migrations, data access logic, or architecture-sensitive code.
Refactoring code where accidental behavior changes are risky.
Debugging a production issue or a failing test.
Reviewing AI-generated code for overengineering, hidden assumptions, or missing verification.
Implementing a user request that is ambiguous, cross-cutting, or easy to over-scope.
Making changes in an unfamiliar codebase.
Working under constraints such as dirty git state, existing style, generated files, or ownership boundaries.
Producing a plan before code or a concise review after code.

Skip When

The user explicitly asks for brainstorming without implementation.
The task is pure content writing, design exploration, or research without code changes.
A different skill with stricter domain procedure applies, such as a security audit, incident response, or regulated compliance workflow.

Core Principles

Think before coding.
Simplicity first.
Surgical changes.
Goal-driven execution.
Verification before completion.
Respect the existing system.
Surface uncertainty early.
Prefer reversible, observable steps.

Principle 1: Think Before Coding

Before implementation, establish what is known and what is assumed.

Read the relevant files before changing them.
Identify the exact behavior the user wants changed.
Name any ambiguity that changes the implementation path.
If there are multiple viable approaches, pick the smallest one that fits the repo.
If an approach creates a new abstraction, justify why it reduces real complexity.
If the task crosses module boundaries, map those boundaries first.
If the codebase already has a local pattern, use it.

Do not silently guess when the guess would affect data integrity, security, public API behavior, billing, auth, or destructive operations.

Principle 2: Simplicity First

Write the minimum code that fully solves the actual problem.

Do not add configuration for cases the user did not ask for.
Do not introduce a framework, service, queue, cache, or abstraction just because it may help later.
Do not broaden a bug fix into a style rewrite.
Do not add generic helpers for one call site unless the helper removes real complexity.
Keep names direct and local.
Prefer boring data structures and clear control flow.
Make failure modes explicit, but do not handle impossible states with noisy code.

Ask this before committing to an approach:

Can the same behavior be achieved with fewer concepts, fewer files, and less surface area?

If yes, simplify.

Principle 3: Surgical Changes

Every changed line should trace to the request.

Touch only the files required by the task.
Keep unrelated formatting, naming, and comments intact.
Remove only unused code made unused by your own change.
Leave unrelated dead code alone unless the user asked for cleanup.
Preserve generated files unless the repository pipeline regenerates them.
Do not rewrite public APIs when an adapter or narrow fix is enough.
In a dirty worktree, distinguish your changes from existing user changes.

Use this edit filter:

If I had to explain this line in the PR, could I tie it directly to the requested outcome?

If not, do not change it.

Principle 4: Goal-Driven Execution

Turn vague tasks into verifiable goals.

"Fix login" becomes: reproduce failing login, identify cause, patch, run auth tests.
"Add validation" becomes: define invalid cases, test them, implement, verify messages.
"Improve architecture" becomes: name the current pain, change one boundary, prove behavior holds.
"Clean up" becomes: list safe cleanup targets, remove only confirmed unused or duplicated assets.

For multi-step work, use a compact plan:

1. Inspect current behavior -> verify with failing test or local reproduction.
2. Make minimal change -> verify targeted test passes.
3. Run broader checks -> verify no nearby regression.

Principle 5: Verification Before Completion

Do not finish on intuition.

Run the smallest relevant test first.
Then run the broader suite required by the blast radius.
For UI work, inspect the rendered result when feasible.
For generated files, rerun the generator and confirm no unexpected drift.
For migrations, inspect schema and rollback assumptions.
For security-sensitive work, run static checks and manual review.
If a check cannot run, say exactly why and what risk remains.

Good final status includes changed files, validation performed, and known residual risk.

Backend Discipline

For backend and architecture work:

Preserve request/response contracts unless the user asked to change them.
Keep validation at trust boundaries.
Keep database writes transactional where partial writes would corrupt state.
Make idempotency explicit for retries, webhooks, jobs, and sync flows.
Avoid hidden global state and time-dependent behavior in core logic.
Separate pure decision logic from I/O when it makes tests easier.
Prefer existing repository error handling and logging conventions.
Avoid swallowing exceptions without traceable context.

Architecture Discipline

Use architecture changes only when they reduce current pain.

Add a module boundary when multiple call sites need the same behavior.
Add an interface when there are multiple real implementations or a test boundary demands it.
Add a service only if it owns a coherent workflow.
Add a background job only if synchronous execution is genuinely unsafe or too slow.
Add a cache only when there is measured or obvious repeated expensive work.
Add a migration path for public or persisted contracts.

If the change cannot be validated in this turn, keep it smaller.

Review Checklist

Scope: Does the diff solve only the user request?
Assumptions: Are assumptions named or encoded in tests?
Simplicity: Is there speculative abstraction?
Behavior: Are important edge cases covered?
Data: Are writes safe, atomic, and idempotent where needed?
Security: Are inputs validated and secrets protected?
Compatibility: Are public contracts preserved or intentionally migrated?
Observability: Are errors diagnosable without leaking sensitive data?
Verification: Did relevant tests or checks run?

Anti-Patterns

Rewriting a working subsystem to fix a one-line bug.
Adding a new dependency for a small utility.
Changing formatting across a file while making a behavior fix.
Returning "done" without running any verification.
Hiding uncertainty behind confident prose.
Creating TODOs instead of finishing the requested path.
Making the code more configurable than the product needs.
Updating generated files by hand.

Output Format

For implementation planning:

## Assumptions
- ...

## Minimal Approach
- ...

## Verification
- Targeted:
- Broader:

For code review:

## Findings
- [Severity] file:line - issue and impact

## Verification Gaps
- ...

Boundaries

This skill is a discipline layer, not a replacement for domain expertise. Pair it with security, testing, DevOps, database, or frontend skills when the task needs specialized checks.

Related Skills

seaworld008/omen

development

VerifiedTrustedCommunity

Enumerating failure modes via pre-mortem analysis. Systematically identifies failure scenarios for plans, designs, and features, scoring them with RPN/AP. Does not write code.

62SKILL.mdUpdated Jun 16, 2026

seaworld008/nexus

testing

VerifiedTrustedCommunity

Orchestrating specialist AI agent teams as a meta-coordinator. Decomposes requests into minimum viable chains, spawns each as an independent session in AUTORUN modes, and drives to final output. Use when a task spans multiple specialist domains, requires parallel agent execution, or needs hub-and-spoke routing across the skill ecosystem.

62SKILL.mdUpdated Jun 11, 2026

seaworld008/morph

development

VerifiedTrustedCommunity

Converting document formats (Markdown/Word/Excel/PDF/HTML). Converts specs from Scribe and reports from Harvest into distributable formats; generates reusable conversion scripts. Use when converting documents, building accessibility-compliant PDFs, or creating Pandoc/LibreOffice pipelines.

62SKILL.mdUpdated Jun 11, 2026

seaworld008/lore

testing

VerifiedTrustedCommunity

Curating cross-agent knowledge and guarding institutional memory. Extracts patterns from agent journals into METAPATTERNS.md, detects knowledge decay, propagates best practices, prevents organizational forgetting. Use when consolidating cross-agent insights, curating memory, or auditing knowledge decay.

62SKILL.mdUpdated Jun 11, 2026

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/seaworld008/commonly-used-high-value-skills.git

# Copy into Claude Code skills folder (global)
cp -r commonly-used-high-value-skills/openclaw-skills/andrej-karpathy-skills ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

seaworld008/commonly-used-high-value-skills

48 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT