public/SKILLS/Data & Analysis/kaggle-skill/SKILL.md
Unified Kaggle skill. Use when the user mentions kaggle, kaggle.com, Kaggle competitions, datasets, models, notebooks, GPUs, TPUs, badges, or anything Kaggle-related. Handles account setup, competition reports, dataset/model downloads, notebook execution, competition submissions, badge collection, and general Kaggle questions.
npx skillsauth add eric861129/skills_all-in-one kaggleInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Complete Kaggle integration for any LLM or agentic coding system (Claude Code, gemini-cli, Cursor, etc.): account setup, competition reports, dataset/model downloads, notebook execution, competition submissions, badge collection, and general Kaggle questions. Four integrated modules working together.
Overlap guard: For hackathon grading evaluation and alignment analysis, use the kaggle-hackathon-grading skill instead.
Network requirements: outbound HTTPS to api.kaggle.com, www.kaggle.com,
and storage.googleapis.com.
| Module | Purpose | |--------|---------| | registration | Account creation, API key generation, credential storage | | comp-report | Competition landscape reports with Playwright scraping | | kllm | Core Kaggle interaction (kagglehub, CLI, MCP, UI) | | badge-collector | Systematic badge earning across 5 phases |
Always run the credential checker first:
python3 skills/kaggle/shared/check_all_credentials.py
Three credential types are needed for full compatibility:
| Variable | Format | Purpose |
|----------|--------|---------|
| KAGGLE_USERNAME | Kaggle handle | Identity for all tools |
| KAGGLE_KEY | 32-char hex | Legacy key (CLI, kagglehub, most MCP) |
| KAGGLE_API_TOKEN | KGAT_-prefixed | Scoped token (some MCP endpoints) |
If any are missing, follow the registration walkthrough:
Read modules/registration/README.md for the full step-by-step guide.
Security: Never echo, log, or commit actual credential values.
Walks users through creating a Kaggle account and generating all three API
credentials. Saves to .env and ~/.kaggle/kaggle.json.
Key commands:
python3 skills/kaggle/modules/registration/scripts/check_registration.py
bash skills/kaggle/modules/registration/scripts/setup_env.sh
Read modules/registration/README.md for the complete walkthrough.
Generates comprehensive landscape reports of recent Kaggle competition activity. Uses Python API for metadata + Playwright MCP tools for SPA content.
6-step workflow:
python3 skills/kaggle/modules/comp-report/scripts/list_competitions.py --lookback-days 30 --output json
python3 skills/kaggle/modules/comp-report/scripts/competition_details.py --slug SLUG
Read modules/comp-report/README.md for full details including hackathon handling.
Four methods to interact with kaggle.com:
| Method | Best For | |--------|----------| | kagglehub | Quick dataset/model download in Python | | kaggle-cli | Full workflow scripting | | MCP Server | AI agent integration | | Kaggle UI | Account setup, verification |
Capability matrix:
| Task | kagglehub | kaggle-cli | MCP | UI |
|------|-----------|------------|-----|-----|
| Download dataset | dataset_download() | datasets download | Yes | Yes |
| Download model | model_download() | models instances versions download | Yes | Yes |
| Execute notebook | — | kernels push/status/output | Yes | Yes |
| Submit to competition | — | competitions submit | Yes | Yes |
| Publish dataset | dataset_upload() | datasets create | Yes | Yes |
| Publish model | model_upload() | models create | Yes | Yes |
Known issues:
dataset_load() broken in kagglehub v0.4.3 — use dataset_download() + pd.read_csv()competitions download has no --unzip in CLI >= 1.8Read modules/kllm/README.md for full details and all task workflows.
Systematically earns ~38 automatable Kaggle badges across 5 phases:
| Phase | Name | Badges | Time | |-------|------|--------|------| | 1 | Instant API | ~16 | 5-10 min | | 2 | Competition | ~7 | 10-15 min | | 3 | Pipeline | ~3 | 15-30 min | | 4 | Browser | ~8 | 5-10 min | | 5 | Streaks | ~4 | Setup only |
python3 skills/kaggle/modules/badge-collector/scripts/orchestrator.py --dry-run
python3 skills/kaggle/modules/badge-collector/scripts/orchestrator.py --phase 1
python3 skills/kaggle/modules/badge-collector/scripts/orchestrator.py --status
Read modules/badge-collector/README.md for full details.
This skill is primarily a reference — use the modules and scripts as needed based on the user's request. When explicitly asked to run the full Kaggle workflow, follow these steps:
python3 skills/kaggle/shared/check_all_credentials.py
If any credentials are missing, walk through the registration module. Never echo or log actual credential values.
Run the comp-report workflow: list competitions, get details, scrape with Playwright, compose report. Output inline.
Present a concise summary of the four ways to interact with Kaggle (kagglehub, kaggle-cli, MCP Server, UI) with the capability matrix from the kllm module.
Ask the user what they'd like to do next:
Handle the user's choice using the appropriate module, then loop back to offer more options.
.env, kaggle.json, or any credential files.gitignore excludes .env, kaggle.json, and related fileschmod 600 .env ~/.kaggle/kaggle.jsonThis skill performs both read-only and write operations on kaggle.com.
Read-only operations (no account side-effects):
Write operations (create or modify resources on your account):
Phase 5 (Streaks) generates a local shell script for daily execution but does not auto-install cron jobs or launchd plists. Users must manually configure scheduling if desired.
Shared:
shared/check_all_credentials.py — Unified credential checker (all 3 types)Registration:
modules/registration/scripts/check_registration.py — Check all 3 credentialsmodules/registration/scripts/setup_env.sh — Auto-configure credentials from env/dotenvCompetition Reports:
modules/comp-report/scripts/utils.py — Credential check, API init, rate limitingmodules/comp-report/scripts/list_competitions.py — Fetch competitions across categoriesmodules/comp-report/scripts/competition_details.py — Files, leaderboard, kernels per competitionKaggle Interaction (kllm):
modules/kllm/scripts/setup_env.sh — Auto-configure credentials (with .env loading)modules/kllm/scripts/check_credentials.py — Verify and auto-map credentialsmodules/kllm/scripts/network_check.sh — Check Kaggle API reachabilitymodules/kllm/scripts/cli_download.sh — Download datasets/models via CLImodules/kllm/scripts/cli_execute.sh — Execute notebook on KKBmodules/kllm/scripts/cli_competition.sh — Competition workflow (list/download/submit)modules/kllm/scripts/cli_publish.sh — Publish datasets/notebooks/modelsmodules/kllm/scripts/poll_kernel.sh — Poll kernel status and download outputmodules/kllm/scripts/kagglehub_download.py — Download via kagglehubmodules/kllm/scripts/kagglehub_publish.py — Publish via kagglehubBadge Collector:
modules/badge-collector/scripts/orchestrator.py — Main entry pointmodules/badge-collector/scripts/badge_registry.py — 59 badge definitionsmodules/badge-collector/scripts/badge_tracker.py — Progress persistencemodules/badge-collector/scripts/utils.py — Shared utilitiesmodules/badge-collector/scripts/phase_1_instant_api.py — Instant API badgesmodules/badge-collector/scripts/phase_2_competition.py — Competition badgesmodules/badge-collector/scripts/phase_3_pipeline.py — Pipeline badgesmodules/badge-collector/scripts/phase_4_browser.py — Browser badgesmodules/badge-collector/scripts/phase_5_streaks.py — Streak automationmodules/registration/references/kaggle-setup.md — Full credential setup guide with troubleshootingmodules/comp-report/references/competition-categories.md — Competition types and API mappingmodules/kllm/references/kaggle-knowledge.md — Comprehensive Kaggle platform knowledgemodules/kllm/references/kagglehub-reference.md — Full kagglehub Python API referencemodules/kllm/references/cli-reference.md — Complete kaggle-cli command referencemodules/kllm/references/mcp-reference.md — Kaggle MCP server referencemodules/badge-collector/references/badge-catalog.md — Complete 59-badge catalogdevelopment
Run structured What-If scenario analysis with multi-branch possibility exploration. Use this skill when the user asks speculative questions like "what if...", "what would happen if...", "what are the possibilities", "explore scenarios", "scenario analysis", "possibility space", "what could go wrong", "best case / worst case", "risk analysis", "contingency planning", "strategic options", or any question about uncertain futures. Also trigger when the user faces a fork-in-the-road decision, wants to stress-test an idea, or needs to think through consequences before committing.
development
Access comprehensive LaTeX templates, formatting requirements, and submission guidelines for major scientific publication venues (Nature, Science, PLOS, IEEE, ACM), academic conferences (NeurIPS, ICML, CVPR, CHI), research posters, and grant proposals (NSF, NIH, DOE, DARPA). This skill should be used when preparing manuscripts for journal submission, conference papers, research posters, or grant proposals and need venue-specific formatting requirements and templates.
development
Use when challenging ideas, plans, decisions, or proposals using structured critical reasoning. Invoke to play devil's advocate, run a pre-mortem, red team, or audit evidence and assumptions.
tools
Core skill for the deep research and writing tool. Write scientific manuscripts in full paragraphs (never bullet points). Use two-stage process with (1) section outlines with key points using research-lookup then (2) convert to flowing prose. IMRAD structure, citations (APA/AMA/Vancouver), figures/tables, reporting guidelines (CONSORT/STROBE/PRISMA), for research papers and journal submissions.