jgabor/inspektera

Name: inspektera
Author: jgabor

skills/inspektera/SKILL.md

npx skillsauth add jgabor/agentera inspektera

INSPEKTERA

Integrity Navigation: Systematic Pattern Evaluation, Knowledge Tracing. Examine, Report, Advise.

Codebase health audit: multi-dimensional structural quality evaluation with evidence-based findings, confidence scores, and trajectory tracking. The retrospective counterpart to realisera's forward motion: is the codebase getting better or just bigger?

Each invocation = one audit. Findings feed realisera's work selection via TODO.md. Skill introduction: ─── ⛶ inspektera · audit ───

State artifacts

One file in .agentera/, bootstrapped if absent.

| File | Purpose | Bootstrap |
|------|---------|-----------|
| HEALTH.md | Codebase health assessment. Findings, dimension grades, trajectory. | `# Health\n\n` then the first audit entry. |

Template in references/templates/. Use as starting structure, adapt to the project.

Artifact path resolution

Before reading or writing any artifact, check if .agentera/DOCS.md exists. If it has an Artifact Mapping section, use the path specified for each canonical filename (.agentera/HEALTH.md, etc.). If .agentera/DOCS.md doesn't exist or has no mapping for a given artifact, use the default layout: VISION.md, TODO.md, and CHANGELOG.md at the project root; all other artifacts in .agentera/. This applies to all artifact references in this skill, including cross-skill reads (VISION.md, .agentera/DECISIONS.md, TODO.md, .agentera/PROGRESS.md).
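The resolution rule above can be sketched as a small lookup. This is a minimal illustration, not the skill's own code: `resolve_artifact_path` and its `mapping` argument are hypothetical, and the mapping is assumed to be an already-parsed form of the Artifact Mapping section keyed by bare filename (the on-disk format of that section is not specified here).

```python
from pathlib import PurePosixPath
from typing import Dict, Optional

# Default layout from the text: these three live at the project root,
# every other artifact lives under .agentera/.
ROOT_ARTIFACTS = {"VISION.md", "TODO.md", "CHANGELOG.md"}


def resolve_artifact_path(name: str, mapping: Optional[Dict[str, str]] = None) -> str:
    """Return the project-relative path for a canonical artifact filename.

    `mapping` is a hypothetical parsed Artifact Mapping (filename -> path).
    Pass None when .agentera/DOCS.md is missing or has no mapping section.
    """
    if mapping and name in mapping:
        return mapping[name]  # an explicit mapping wins
    if name in ROOT_ARTIFACTS:
        return name  # default: project root
    return str(PurePosixPath(".agentera") / name)  # default: .agentera/


print(resolve_artifact_path("HEALTH.md"))  # .agentera/HEALTH.md
print(resolve_artifact_path("TODO.md"))    # TODO.md
```

An artifact missing from the mapping falls through to the default layout, which matches the "no mapping for a given artifact" clause.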

Contract

Before starting, read references/contract.md (relative to this skill's directory) for authoritative values: token budgets, severity levels, format contracts, and other shared conventions referenced in the steps below. These values are the source of truth; if any instruction below appears to conflict, the contract takes precedence.

HEALTH.md

Open with your read on the codebase before the structured data: what's improving, what's sliding, what surprised you. 1-2 sentences of interpretation, then the grades and findings back it up. The colleague says what they think, then shows the evidence.

## Audit N · YYYY-MM-DD

**Dimensions**: [which dimensions were assessed]
**Findings**: X critical, Y warnings, Z info
**Overall**: ⮉ improving | stable | ⮋ degrading vs prior audit

### [Dimension Name]: [A-F grade]

#### ⇶ [Finding title], critical (confidence: N/100)
#### ⇉ [Finding title], warning (confidence: N/100)
#### ⇢ [Finding title], info (confidence: N/100)
- **Location**: `file:line` (or module/package)
- **Evidence**: [what was observed: quote code, show pattern]
- **Impact**: [why this matters]
- **Suggested action**: [specific fix or investigation]

### Trends
[Comparison with prior audit: what improved, what degraded, what's new]

### Patterns Observed
[De facto architecture patterns extracted from the codebase, the "what IS"]

Step markers: display ── step N/7: verb before each step. Steps: orient, select, assess, distill, audit, report, connect.

Step 1: Orient

Read HEALTH.md, TODO.md, and PROGRESS.md in parallel. These reads are independent; issue all in a single response.

1. HEALTH.md: prior audit findings and grades (if exists)
2. VISION.md: the "what SHOULD BE" against which "what IS" is compared (if exists)
3. DECISIONS.md: why things are the way they are (if exists). Findings contradicting deliberate decisions are not findings.
4. TODO.md: known problems (if exists). Don't re-report unless worsened.
5. PROGRESS.md: last 3 cycle entries only (recent changes = higher-priority audit targets)
   5b. Change magnitude: if PROGRESS.md has commit hashes from cycles since the last HEALTH.md audit date, run git log --stat on those commits to estimate total change volume (files touched, lines changed). If no PROGRESS.md or no commit hashes, skip; default depth applies.
   5c. Plan context (for artifact freshness): if PLAN.md exists, read its metadata comment for the Created date and scan task statuses for dispatched skills. This provides the plan-relative staleness baseline for the Artifact freshness dimension. If PLAN.md is absent or has no Created date, note that plan context is unavailable; the fallback heuristic will apply.
6. Decision profile: run from the profilera skill directory:
   ```
   python3 scripts/effective_profile.py
   ```
   Calibrates what "healthy" means for this user per contract profile consumption conventions. If missing, proceed without persona grounding.
7. Project discovery: map directory structure, read dependency manifests, README, CLAUDE.md, AGENTS.md, identify language/stack/build commands, git log --oneline -20

Before proceeding: in your response, list the key structural facts (module boundaries, dependency patterns, test coverage gaps) you observed. These survive context compaction.

Exit-early guard: if git diff since the last HEALTH.md update shows no file changes, report the exit signal complete ("no changes since last audit") and stop.

Step 2: Select dimensions

Choose dimensions based on the codebase and user request. Not every dimension applies; a 200-line CLI doesn't need the same audit as a monorepo.

Available dimensions

| Dimension | What it evaluates | When to include |
|-----------|-------------------|-----------------|
| Architecture alignment | Does the code match the stated architecture? Pattern drift, module boundary violations, layering breaks. | VISION.md or README describes architecture |
| Pattern consistency | Are patterns used consistently? Naming, error handling, structure, abstractions. | Any codebase with 5+ modules or files |
| Coupling health | Hidden dependencies, circular imports, god modules, inappropriate intimacy. | Any codebase with multiple modules |
| Complexity hotspots | Functions too long, deeply nested, high fan-out, accumulated conditionals. | Any codebase |
| Test health | Coverage gaps, test quality, test-to-code ratio, tests testing behavior vs implementation. | Project has tests |
| Dependency health | Outdated deps, security advisories, unused deps, dep sprawl, pinning discipline. | Project has external dependencies |
| Version health | Unreleased significant changes: feat/fix commits since the last version bump. | DOCS.md has a versioning convention block |
| Artifact freshness | Are state artifacts current relative to plan activity or recent development? Detects artifacts that should have been updated but weren't. | Plan context available (PLAN.md with Created date) or PROGRESS.md has entries |
| Prose health | Do artifact entries respect the §24 writing rules? Checks verbosity drift, abstraction creep, and filler accumulation across all project artifacts. | Project has 3+ artifact files |
| Security hygiene | Hardcoded secrets, dangerous function calls, basic injection patterns. Lightweight regex-based scan, not a replacement for dedicated security tooling. | Any codebase |

Depth guidance

When change magnitude was derived in Step 1, apply advisory depth scaling:

Light changes (roughly ≤5 files, ≤200 lines since last audit): prioritize dimensions most relevant to the changed areas. Skip dimensions with no intersection.
Standard changes (default): assess all applicable dimensions at normal depth.
Heavy changes (roughly ≥20 files or architectural-scope commits): assess all applicable dimensions and increase evidence collection depth. Read more files per dimension, trace more dependency paths, check more edge cases.

These thresholds are guidelines, not hard rules. Use judgment: a 6-file change touching a critical security module warrants thorough depth, while a 25-file rename is light.
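As a rough illustration of the depth scaling above, the thresholds can be written as a starting-point classifier. The function name `audit_depth` and the `architectural` flag are hypothetical; the numbers come from the guidance, and the result is advisory only.

```python
def audit_depth(files_changed: int, lines_changed: int, architectural: bool = False) -> str:
    """Suggest an audit depth from change volume since the last audit.

    Thresholds mirror the guidance text; `architectural` is a hypothetical
    flag the auditor sets by judgment for architectural-scope commits.
    """
    if architectural or files_changed >= 20:
        return "heavy"
    if files_changed <= 5 and lines_changed <= 200:
        return "light"
    return "standard"


print(audit_depth(3, 120))   # light
print(audit_depth(6, 150))   # standard
print(audit_depth(25, 40))   # heavy
```

The classifier cannot see what the files are, so the judgment cases in the text (a small change to a security module, a large mechanical rename) still need a manual override.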

User specified dimensions: audit only those. Full audit or unspecified: auto-select all applicable. Report selections before proceeding.

Step 3: Assess

Lead the assessment with your overall interpretation: what stands out, what's changed, where attention should go. Then the per-dimension breakdown provides the evidence.

Launch parallel agents, one per dimension. Each receives the dimension definition, language-specific commands from references/audit-commands.md, relevant context files, the confidence scoring rubric, and instructions to return structured findings.

Before deep analysis: run the references/audit-commands.md quick checklist for a rapid pass/fail sweep. Dimensions passing all items can be audited at lower priority.

You are auditing the [dimension] health of [project].

## What to evaluate
[Dimension-specific instructions from below]

## Evidence standard
Every finding MUST include:
- Specific file and line references
- Quoted code showing the issue
- Explanation of why it matters
- Confidence score (0-100)

## Presenting findings
Introduce each finding conversationally before the structured evidence. The colleague
says "hey, I noticed this" instead of just dumping a finding card. Lead with why it caught your eye and what it means, then back it up with the evidence block.

## Confidence scoring
- 90-100: Definitely a real issue. Verified by reading the code. Clear impact.
- 70-89: Very likely a real issue. Strong evidence, but some context might justify it.
- 50-69: Possibly an issue. The pattern is suspicious but could be intentional.
- 30-49: Uncertain. Might be an issue, might be a reasonable tradeoff.
- 0-29: Speculative. Flagging it but wouldn't be surprised if it's fine.

## What is NOT a finding
- Pre-existing patterns that are consistent and deliberate
- Things a linter or type checker would catch (assume CI handles those)
- Subjective style preferences not grounded in stated project principles
- Known issues already tracked in TODO.md
- Intentional decisions documented in DECISIONS.md

Architecture alignment

Compare codebase to stated architecture:

Read VISION.md (or README.md architecture section) for intended structure
Map actual module boundaries, dependency graph, data flow
Identify drift from stated architecture
Check layering and boundary cleanliness
Extract "Patterns Observed": de facto architecture independent of documentation

No documented architecture? Extract and report de facto; note absence as a finding.

Pattern consistency

Check consistency across the codebase:

Error handling (returns vs throws vs error types)
Naming (singular vs plural, prefixes, casing)
Module structure and layout similarity
Competing abstractions for the same concept
Duplicated logic that should be shared
Config handling (env vars vs files vs flags)

Focus on inconsistencies between similar things, not whether the chosen pattern is "best."

Coupling health

Evaluate coupling and dependency structure:

Map import graphs, identify circular dependencies
Find god modules (too many dependents or dependencies)
Check for inappropriate intimacy (reaching into internals)
Evaluate interface width: narrow boundaries or exposing everything?
Check hidden coupling via shared mutable state, global config, side effects

Use language tools (go list, madge, import analysis). If unavailable, trace imports manually on highest-risk modules.

Complexity hotspots

Find accumulating complexity:

Long functions (generally 50+ lines), deep nesting (3+ levels)
High fan-out, growing switch/match statements, many parameters (5+)
Files growing cycle over cycle (check git history)

Prioritize high-change files: frequently modified + complex = high risk.

Test health

Evaluate test suite quality and coverage:

Run coverage tools if available, otherwise estimate from file analysis
Identify critical paths with no coverage
Check: testing behavior or implementation? Excessive mocking? Brittle assertions?
Evaluate test naming: can you understand what failed from the name alone?
Check test-to-code ratio per major module
Check test proportionality against contract (test proportionality section): default is one pass + one fail per testable unit. Flag under-testing (0 tests for a testable unit) and over-testing (significantly exceeding the target without justification). If the project's plan specifies an override target, use that instead of the default.

Don't just report a number. Identify the highest-risk coverage gaps.

Dependency health

Evaluate dependency management:

Outdated deps (package manager audit/outdated commands)
Known security vulnerabilities (npm audit, safety check, govulncheck)
Unused deps (installed but not imported)
Dep sprawl relative to project scope
Pinning discipline (pinned or floating?)
Vendored vs remote consistency

Version health

Only run this dimension if DOCS.md exists and contains a versioning convention block. Skip entirely if the convention is absent.

Read DOCS.md Conventions.versioning to identify the version file(s) and bump trigger rules
Run git log --oneline to find feat and fix commits since the last modification date of the version file(s) (git log --follow -- <version-file> gives the timestamp of the last bump)
Count unbumped feat/fix commits and note the age of the oldest one
Severity: warning if 1–4 unbumped commits or age ≤ 7 days; critical if 5+ unbumped commits or age > 7 days
If no feat/fix commits have landed since the last bump, this dimension is healthy with no finding
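The severity rule for this dimension can be sketched as below. One assumption is made loudly: where the warning and critical conditions overlap (for example, 2 unbumped commits that are 10 days old), this sketch lets critical take precedence. The function name is hypothetical.

```python
from typing import Optional


def version_health_severity(unbumped: int, oldest_age_days: int) -> Optional[str]:
    """Map unbumped feat/fix commits to a severity, or None when healthy.

    Assumption: critical wins when both conditions could apply.
    """
    if unbumped == 0:
        return None  # nothing landed since the last bump
    if unbumped >= 5 or oldest_age_days > 7:
        return "critical"
    return "warning"  # 1-4 commits, oldest at most 7 days old


print(version_health_severity(2, 3))   # warning
print(version_health_severity(1, 10))  # critical
```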

Artifact freshness

Evaluates whether state artifacts are current relative to plan activity or recent development. Uses the staleness convention from contract.

With plan context (PLAN.md has a Created date and task execution history):

Read the plan's Created date from its HTML comment metadata
Identify which skills were dispatched during the plan by scanning task entries and PROGRESS.md cycle logs
For each dispatched skill, look up its expected artifacts in the contract staleness detection mapping
Check each expected artifact's last modification date: git log -1 --format=%aI -- <path>
An artifact is stale if its last modification predates the plan's creation date AND the skill that owns it was dispatched at least once during the plan
Severity: warning (confidence 70+). Plan-relative staleness carries causal evidence: a skill ran but its artifact didn't update.
Artifacts that a skill reads but does not produce are not staleness candidates for that skill

Without plan context (no PLAN.md, or PLAN.md has no Created date):

Fall back to PROGRESS.md recency: an artifact is potentially stale if it was not modified since the most recent PROGRESS.md cycle entry date
If PROGRESS.md has no entries (fresh project), no staleness check applies
Severity: info (confidence 50-60). The fallback is advisory, not authoritative, because there is no dispatched-skill relationship to confirm expected updates
This fallback surfaces artifacts that may need attention but should not drive strong findings

Handling: stale artifact findings are reported like any other dimension finding but noted as context for the next plan cycle, not as blocking errors. Include which skill was expected to update the artifact and when the artifact was last modified.
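The two staleness modes can be compared side by side in a short sketch. The function and parameter names are hypothetical, and callers are assumed to pass datetimes that are already comparable (all timezone-aware or all naive), for example parsed from the `git log -1 --format=%aI` output with `datetime.fromisoformat`.

```python
from datetime import datetime
from typing import Optional


def staleness_severity(
    last_modified: datetime,
    plan_created: Optional[datetime],
    owner_dispatched: bool,
    last_progress_entry: Optional[datetime] = None,
) -> Optional[str]:
    """Return "warning", "info", or None for one artifact.

    With plan context: stale only if the owning skill was dispatched AND
    the artifact predates the plan. Without it: fall back to PROGRESS.md
    recency, which is advisory and therefore only "info".
    """
    if plan_created is not None:
        if owner_dispatched and last_modified < plan_created:
            return "warning"
        return None
    if last_progress_entry is not None and last_modified < last_progress_entry:
        return "info"
    return None  # fresh project or no evidence of staleness


plan = datetime.fromisoformat("2026-04-20T09:00:00+00:00")
old = datetime.fromisoformat("2026-04-10T09:00:00+00:00")
print(staleness_severity(old, plan, owner_dispatched=True))   # warning
print(staleness_severity(old, plan, owner_dispatched=False))  # None
```

Keeping the dispatched-skill check inside the plan branch reflects why the fallback is weaker: without a dispatch record there is nothing to confirm that an update was expected.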

Prose health

Evaluate artifact prose quality against the three §24 Self-Audit Protocol rules. Read all project artifacts (PROGRESS.md, DECISIONS.md, PLAN.md, HEALTH.md, TODO.md, CHANGELOG.md, VISION.md, DESIGN.md, DOCS.md) and check each entry.

Rule 1: Verbosity drift: approximate word count per entry. Compare against the §4 Token budgets table (per-entry budgets). Entries exceeding their budget by 50%+ are findings. Entries under budget are healthy.

Rule 2: Abstraction creep: scan each entry for ≥1 concrete anchor (file path with extension, line number, commit hash with 7+ hex chars, metric value with unit, identifier such as function/class/variable name, direct quote in quotes attributed to a source). Entries with zero concrete anchors are findings.

Rule 3: Filler accumulation: scan each entry against the §24 Banned verbosity patterns table. Flag entries containing: meta-commentary about writing, hedging qualifiers, redundant transitions, self-referential process narration, filler introductions, summary preambles, excessive justification. Use the replacement guidance from the table.

Confidence determination:

Verbosity drift: 85-95 confidence. Word count against a known budget is high-certainty.
Abstraction creep: 80-90 confidence. Missing anchors are unambiguously detectable.
Filler accumulation: 70-85 confidence. Pattern matching is strong but some edge cases may be intentional.

Severity assignment:

Verbosity drift ≥2× budget: critical. Drift 50-100% over: warning.
Abstraction creep across 50%+ of entries: critical. Isolated entries: warning.
Filler accumulation across 30%+ of entries: warning. Isolated entries: info.

Grading:

A: All entries pass all 3 rules. No findings.
B: 1-2 info findings only. Verbosity drift under 50%.
C: 1-2 warnings or pervasive info findings across 25%+ of entries.
D: Multiple warnings or 1 critical finding.
F: Pervasive failures across multiple rules and artifacts.

Flag entries that fail audit with the [post-audit-flagged] marker in findings. Cross-reference prior HEALTH.md audit entries for trajectory: are artifacts improving or degrading in prose discipline?

Trajectory: compare current findings against the prior audit's prose health findings (if any). Note whether verbosity drift, abstraction creep, or filler accumulation have improved, degraded, or stayed stable.
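For Rule 1, the budget comparison reduces to a ratio check. The sketch below assumes the per-entry budget has already been looked up from the contract and is expressed in the same unit as the approximate word count; `verbosity_severity` is a hypothetical name.

```python
from typing import Optional


def verbosity_severity(word_count: int, budget: int) -> Optional[str]:
    """Classify verbosity drift for one entry against its budget.

    >= 2x budget is critical, 50-100% over is a warning, anything
    below 50% over is not a finding.
    """
    if budget <= 0:
        raise ValueError("budget must be positive")
    ratio = word_count / budget
    if ratio >= 2.0:
        return "critical"
    if ratio >= 1.5:
        return "warning"
    return None


print(verbosity_severity(120, 100))  # None
print(verbosity_severity(160, 100))  # warning
print(verbosity_severity(200, 100))  # critical
```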

Security hygiene

Lightweight regex-based scan for common security anti-patterns. This is a surface-level check, not a replacement for dedicated security analysis. Always recommend specialized tools for comprehensive coverage.

What to scan:

Hardcoded secrets: API key patterns (AKIA, sk-, ghp_, glpat-, xoxb-, xoxp-), password assignments (password\s*=\s*["']), token strings in source (token\s*=\s*["']), private keys in files (-----BEGIN.*PRIVATE KEY)
Dangerous function calls: eval() on variables or user input, exec() with string concatenation, subprocess/os.system/child_process.exec with unsanitized input, Function() constructor with dynamic strings
Basic injection patterns: SQL string concatenation ("SELECT.*" + or f-string/format with user input in queries), unsanitized shell command construction (os.system(f"...{ or backtick interpolation in shell strings)

How to scan:

Use Grep with targeted patterns across the codebase. Focus on source files, not vendored dependencies, build artifacts, or lock files. Exclude .git/, node_modules/, vendor/, __pycache__/, and similar directories.
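The skill itself uses Grep, but the same secret patterns can be illustrated in Python for clarity. This is a sketch under assumptions: the key prefixes and assignment patterns come from the list above, while the minimum length of 8 trailing characters after a prefix and the names `PATTERNS` and `scan_text` are illustrative choices, not part of the skill.

```python
import re

# Patterns taken from the "Hardcoded secrets" list. The {8,} length floor
# is an assumption added here to cut down on trivial matches.
PATTERNS = {
    "api-key prefix": re.compile(r"\b(?:AKIA|sk-|ghp_|glpat-|xoxb-|xoxp-)[A-Za-z0-9_-]{8,}"),
    "password assignment": re.compile(r"password\s*=\s*[\"']"),
    "token assignment": re.compile(r"token\s*=\s*[\"']"),
    "private key": re.compile(r"-----BEGIN.*PRIVATE KEY"),
}


def scan_text(text: str) -> list:
    """Return (line_number, label) pairs for every pattern hit in `text`."""
    findings = []
    for line_no, line in enumerate(text.splitlines(), start=1):
        for label, pattern in PATTERNS.items():
            if pattern.search(line):
                findings.append((line_no, label))
    return findings


sample = 'aws = "AKIAEXAMPLEEXAMPLE12"\npassword = "hunter2"\nname = "ok"\n'
print(scan_text(sample))
```

A hit is only a candidate: test fixtures and placeholders match the same patterns, which is why the severity guidance ties confidence to pattern specificity.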

Severity assignment:

Hardcoded secrets: warning (confidence 75-90 depending on pattern specificity; AKIA is high confidence, generic password= is lower)
Dangerous function calls: warning or critical depending on whether user input flows into the call. eval(user_input) is critical; eval(constant) is warning. When data flow is ambiguous, default to warning.
Injection patterns: warning (confidence 60-80). String concatenation in SQL is suspicious but may be parameterized elsewhere. Note the ambiguity in the finding.

Grading:

A: No secrets, no dangerous calls, no injection patterns found
B: Minor findings only (e.g., a password assignment that appears to be a test fixture or placeholder)
C: 1-2 warnings involving real credentials or dangerous calls
D: Multiple credential leaks or dangerous function patterns
F: Pervasive secret exposure or dangerous calls throughout the codebase

Scope limitation notice: every security hygiene finding MUST include a footer recommending dedicated security tools for comprehensive analysis. Use this text:

This is a lightweight surface scan. For comprehensive security analysis, use dedicated tools: semgrep, Snyk, Bandit (Python), npm audit (Node), govulncheck (Go), or similar static analysis and vulnerability scanning tools appropriate to your stack.

Step 4: Distill

After all agents complete:

Filter: discard findings below 50 confidence. Mark 50-69 as "info" regardless of apparent severity.
Deduplicate: merge by preference: (1) fullest context, (2) most evidence-rich dimension, (3) most recent. Preserve complementary evidence from discarded findings.
Cross-reference against DECISIONS.md and TODO.md:
- Matches known decision → discard or downgrade to info
- Matches known issue → "already tracked", skip
- Genuinely new → include at full severity
Grade each dimension:
- A: No critical/warning findings. B: No critical, some warnings.
- C: 1-2 critical or many warnings. D: Multiple critical.
- F: Pervasive critical findings.
Trajectory: compare to prior HEALTH.md: improved, degraded, stable dimensions. Calculate overall trajectory: improving / stable / degrading.
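The filter step can be sketched as follows. The `Finding` dataclass and `distill` function are hypothetical names; the confidence cut-offs come from the filter rule above, and the final ordering (severity, then confidence) follows the report template's ordering note. Deduplication and cross-referencing are left out.

```python
from dataclasses import dataclass, replace
from typing import List

SEVERITY_ORDER = {"critical": 0, "warning": 1, "info": 2}


@dataclass(frozen=True)
class Finding:
    title: str
    severity: str    # "critical" | "warning" | "info"
    confidence: int  # 0-100


def distill(findings: List[Finding]) -> List[Finding]:
    """Drop findings below 50 confidence and cap 50-69 at "info"."""
    kept = []
    for f in findings:
        if f.confidence < 50:
            continue  # speculative: discard
        if f.confidence < 70:
            f = replace(f, severity="info")  # regardless of apparent severity
        kept.append(f)
    # Most severe first, then most confident first.
    return sorted(kept, key=lambda f: (SEVERITY_ORDER[f.severity], -f.confidence))


raw = [
    Finding("god module", "critical", 95),
    Finding("possible cycle", "critical", 60),
    Finding("odd naming", "warning", 40),
    Finding("wide interface", "warning", 75),
]
for f in distill(raw):
    print(f.severity, f.confidence, f.title)
```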

Step 5: Pre-write self-audit

Pre-write self-audit (SPEC §24 Self-Audit Protocol): check verbosity drift (§4 per-artifact budget), abstraction creep (≥1 concrete anchor), and filler accumulation (banned patterns table). See scripts/self_audit.py. Max 3 revision attempts. Flag with [post-audit-flagged] if still failing.

Narration voice (riff, don't script): ✗ "Self-audit failed. Revising entry." ✓ "Tightening this up..." · "Cutting the filler first..." · "One more pass..."

Step 6: Report

Assess each dimension in your response. Write ONLY grade, trajectory marker, and finding summary per dimension to HEALTH.md. No reasoning in the artifact; the conversation preserves analysis, the artifact preserves conclusions.

Output constraint per contract token budgets. Letter grade + ≤3 sentences justification per dimension.

When updating existing HEALTH.md entries (e.g., updating Patterns Observed), use the Edit tool on the specific section rather than rewriting the file. Append new audit entries.

Write the audit results to HEALTH.md (append new audit, keep prior audits for trajectory history) and present to the user.

After writing a new audit entry to HEALTH.md, compact older audits via the script. Run: python3 ${AGENTERA_HOME:-$CLAUDE_PLUGIN_ROOT}/scripts/compact_artifact.py health <path-to-HEALTH.md>.

Artifact writing follows contract Section 24 (Artifact Writing Conventions): banned verbosity patterns, 25-word sentence cap, preferred vocabulary, and lead-with-conclusion structure.

Report structure

## Audit N · YYYY-MM-DD

**Dimensions assessed**: [list]
**Findings**: X critical, Y warnings, Z info (N filtered by confidence)
**Overall trajectory**: ⮉ improving | stable | ⮋ degrading vs Audit N-1
**Grades**: Architecture [B] | Patterns [A] | Coupling [C] | Complexity [B] | Tests [D] | Deps [A] | Security [A]

### [Dimension Name]: [Grade]

#### ⇶ [Finding title], critical (confidence: N/100)
#### ⇉ [Finding title], warning (confidence: N/100)
#### ⇢ [Finding title], info (confidence: N/100)
- **Location**: `file:line` (or module/package)
- **Evidence**: [quoted code or structural observation]
- **Impact**: [what breaks, degrades, or risks]
- **Suggested action**: [specific fix, investigation, or refactor]

[Repeat for each finding, ordered by severity then confidence]

### Trends vs Audit N-1
- **Improved**: [what got better and why (e.g., "Coupling [D→C]: circular dep in auth/ resolved in cycle 12")]
- **Degraded**: [what got worse and why]
- **New findings**: [issues not present in prior audit]
- **Resolved**: [prior findings no longer present]

### Patterns Observed
[De facto architecture patterns extracted, the "what IS" independent of what's stated.
This section helps realisera and resonera understand the current reality.]
- Module structure: [how code is organized]
- Error handling: [predominant pattern]
- Testing approach: [how tests are structured]
- Dependency patterns: [how deps are managed]

Step 7: Connect

Feed actionable findings into the suite:

TODO.md: for each critical finding not already tracked, offer to add under the appropriate severity section. Severity mapping per contract severity levels: critical → ## ⇶ Critical, warning → ## ⇉ Degraded, info → ## ⇢ Annoying. Each entry is a checkbox line: - [ ] [finding description]. Get user confirmation before writing. Output constraint per contract token budgets.
VISION.md: if architecture has intentionally evolved past stated architecture, suggest updating via /resonera.
Present findings and ask if the user wants to: file to TODO.md, deliberate via /resonera, deep-dive on a dimension, or investigate a specific finding.
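The severity mapping for TODO.md can be written down as a lookup. This sketch only builds the heading and checkbox line; it deliberately writes nothing, since filing requires user confirmation. `TODO_SECTIONS` and `todo_entry` are hypothetical names.

```python
# Section headings per the severity mapping in the Connect step.
TODO_SECTIONS = {
    "critical": "## ⇶ Critical",
    "warning": "## ⇉ Degraded",
    "info": "## ⇢ Annoying",
}


def todo_entry(severity: str, description: str) -> tuple:
    """Return (section heading, checkbox line) for a finding."""
    return TODO_SECTIONS[severity], f"- [ ] {description}"


heading, line = todo_entry("critical", "Circular import between auth/ and session/")
print(heading)
print(line)
```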

Safety rails

<critical>

NEVER modify code. Inspektera audits; other skills fix.
NEVER file issues to TODO.md without explicit user confirmation.
NEVER present speculative findings (confidence < 50) as definitive problems.
NEVER ignore DECISIONS.md context. If a finding contradicts a deliberate decision, it is not a finding but an implementation of that decision. Discard or downgrade.
NEVER report known issues already tracked in TODO.md as new findings.
NEVER flag subjective style preferences as findings unless they violate stated principles in VISION.md, CLAUDE.md, or the decision profile.
NEVER run destructive commands or install packages. Read-only assessment.

</critical>

Exit signals

Report one of these statuses at workflow completion:

Format: ─── ⛶ inspektera · status ─── followed by a summary sentence. For flagged, stuck, and waiting: add ▸ bullet details below the summary.

complete: All selected audit dimensions were assessed, findings were synthesized, grades were assigned, HEALTH.md was updated, and the user was presented with actionable results.
flagged: The audit completed but with notable caveats: one or more dimensions had to be skipped due to missing tooling, confidence was too low to grade a dimension reliably, or critical findings were discovered that require urgent attention beyond the audit scope.
stuck: Cannot complete the audit because the project is inaccessible, required language tooling is unavailable and manual analysis is not feasible, or filing findings to TODO.md was declined by the user and the results cannot be safely surfaced any other way.
waiting: The audit target is ambiguous: no project was identified, the codebase is too incomplete to assess meaningfully, or the user's request specifies dimensions that cannot be evaluated without additional information.

Cross-skill integration

Inspektera is part of a twelve-skill suite. It is the feedback loop, the skill that tells realisera whether its work is making things better.

Inspektera feeds /realisera

Critical and warning findings filed to TODO.md become candidates for realisera's work selection. The severity mapping ensures structural problems compete fairly with feature work. The "Patterns Observed" section helps realisera understand the codebase's de facto architecture when planning changes.

Inspektera feeds /resonera

When the audit reveals architectural drift, suggest /resonera before fixes begin.

Use it when code has moved past stated architecture or competing patterns need a decision.

Inspektera feeds /planera

When the audit reveals multiple related structural issues, suggest /planera to create a remediation plan. The plan's acceptance criteria give inspektera concrete targets to verify in the next audit.

Inspektera feeds /optimera

When a dimension grade is poor and the improvement is measurable (test coverage, dependency count, complexity score), the finding can become an optimization objective. Suggest /optimera when the metric and direction are clear.

Inspektera reads /realisera output

PROGRESS.md tells inspektera what was built recently. Recent changes are higher-priority audit targets because they're the most likely source of regressions or pattern breaks. Cycle count since last audit signals when a health check is overdue.

Inspektera reads /resonera output

DECISIONS.md explains why things are the way they are. Findings that contradict deliberate decisions are not findings. This prevents inspektera from flagging intentional tradeoffs as problems.

Inspektera reads /visualisera output

DESIGN.md provides visual identity constraints that inspektera can audit for consistency, checking whether the codebase respects the declared design tokens and patterns.

Inspektera is informed by /profilera

The decision profile calibrates what "healthy" means for this user. A user who values simplicity over flexibility will have different complexity thresholds than one who values extensibility. High-confidence quality preferences from the profile weight the grading.

Getting started

First audit

/inspektera: runs a full audit across all applicable dimensions, bootstraps HEALTH.md
Review findings, file critical ones to TODO.md
/realisera: next cycle picks up the filed issues and starts fixing

Periodic health checks

Run /inspektera every 5-10 realisera cycles, or when:

A major feature was added
Significant refactoring occurred
The codebase "feels" harder to work in
Before a major architectural decision (to understand current state)

Targeted audits

/inspektera architecture coupling

Specify dimensions to narrow the audit scope. Useful after specific kinds of changes.

After an audit

Good grades (A/B): Celebrate. Keep building.
Mixed grades (C): File the critical findings, deliberate on the warnings.
Poor grades (D/F): Consider pausing feature work. Use /resonera to deliberate on priorities, then /realisera to fix the structural problems before building more.

INSPEKTERA (Integrity Navigation: Systematic Pattern Evaluation, Knowledge Tracing; Examine, Report, Advise). ALWAYS use this skill for codebase health audits, architecture reviews, and structural quality assessments. This skill is REQUIRED whenever the user wants to assess codebase health, detect architecture drift, find pattern inconsistencies, identify complexity hotspots, evaluate test coverage, or check dependency health. Do NOT attempt codebase-wide quality assessments without this skill because it contains the critical workflow for multi-dimensional evaluation, evidence-based findings, confidence scoring, and trajectory tracking that prevents noisy or superficial audits. Trigger on: "inspektera", "audit the codebase", "check code health", "architecture review", "find technical debt", "assess code quality", "how healthy is this codebase", "what needs fixing", "structural review", "pattern audit", "dependency check", "test coverage audit", or when realisera has run 5+ cycles without a health check.

2 stars

development

Updated May 1, 2026

npx skillsauth add jgabor/agentera inspektera

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

| Status | Scanner | Description | Percentage |
|--------|---------|-------------|------------|
| Clean | Trivy | Container and dependency vulnerability scanner | 95% |
| Clean | Semgrep | Static code analysis for vulnerabilities | 95% |
| Clean | mcp-scan (Snyk) | Model Context Protocol security validation | 95% |
| Skipped | Snyk (dep) | Open source security scanning | 50% |
| Skipped | Socket.dev | Supply chain security analysis | 50% |
| Skipped | VirusTotal | Multi-engine malware detection | 50% |
| Skipped | CrowdStrike | Advanced threat intelligence | 50% |
| Skipped | OSV-Scanner | Open Source Vulnerability database check | 50% |
| Skipped | OWASP Dep-Check | | 50% |

Last scanned: May 1, 2026, 2:17 AM · 114.7s · 6 files scanned

SKILL.md

name: inspektera
description: >
  INSPEKTERA (Integrity Navigation: Systematic Pattern Evaluation, Knowledge Tracing;
  Trigger on: "inspektera", "audit the codebase", "check code health", "architecture review",
spec_sections: [1, 2, 4, 5, 6, 17, 18, 19, 24]

INSPEKTERA

Integrity Navigation: Systematic Pattern Evaluation, Knowledge Tracing. Examine, Report, Advise.

Each invocation = one audit. Findings feed realisera's work selection via TODO.md. Skill introduction: ─── ⛶ inspektera · audit ───

State artifacts

One file in .agentera/, bootstrapped if absent.

| File | Purpose | Bootstrap |
|------|---------|-----------|
| HEALTH.md | Codebase health assessment. Findings, dimension grades, trajectory. | # Health\n\n then the first audit entry. |

Template in references/templates/. Use as starting structure, adapt to the project.

Artifact path resolution

Contract

HEALTH.md

## Audit N · YYYY-MM-DD

**Dimensions**: [which dimensions were assessed]
**Findings**: X critical, Y warnings, Z info
**Overall**: ⮉ improving | stable | ⮋ degrading vs prior audit

### [Dimension Name]: [A-F grade]

#### ⇶ [Finding title], critical (confidence: N/100)
#### ⇉ [Finding title], warning (confidence: N/100)
#### ⇢ [Finding title], info (confidence: N/100)
- **Location**: `file:line` (or module/package)
- **Evidence**: [what was observed: quote code, show pattern]
- **Impact**: [why this matters]
- **Suggested action**: [specific fix or investigation]

### Trends
[Comparison with prior audit: what improved, what degraded, what's new]

### Patterns Observed
[De facto architecture patterns extracted from the codebase, the "what IS"]

Step markers: display ── step N/7: verb before each step. Steps: orient, select, assess, distill, audit, report, connect.
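As a minimal sketch, the marker line could be produced as follows. The helper name `step_marker` is hypothetical; the step names and the `── step N/7: verb` shape come from the sentence above.

```python
# Step names in workflow order, taken from the list above.
STEPS = ("orient", "select", "assess", "distill", "audit", "report", "connect")


def step_marker(number: int) -> str:
    """Return the marker line for a 1-based step number (hypothetical helper)."""
    if not 1 <= number <= len(STEPS):
        raise ValueError(f"step must be between 1 and {len(STEPS)}")
    return f"── step {number}/{len(STEPS)}: {STEPS[number - 1]}"


print(step_marker(3))  # ── step 3/7: assess
```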

Step 1: Orient

Read the artifacts listed below in parallel. These reads are independent; issue all in a single response.

HEALTH.md: prior audit findings and grades (if exists)
VISION.md: the "what SHOULD BE" against which "what IS" is compared (if exists)
DECISIONS.md: why things are the way they are (if exists). Findings contradicting deliberate decisions are not findings.
TODO.md: known problems (if exists). Don't re-report unless worsened.
PROGRESS.md: last 3 cycle entries only (recent changes = higher-priority audit targets)
5b. Change magnitude: if PROGRESS.md has commit hashes from cycles since the last HEALTH.md audit date, run git log --stat on those commits to estimate total change volume (files touched, lines changed). If no PROGRESS.md or no commit hashes, skip; default depth applies.
5c. Plan context (for artifact freshness): if PLAN.md exists, read its metadata comment for the Created date and scan task statuses for dispatched skills. This provides the plan-relative staleness baseline for the Artifact freshness dimension. If PLAN.md is absent or has no Created date, note that plan context is unavailable; the fallback heuristic will apply.
Decision profile: run from the profilera skill directory:
```
python3 scripts/effective_profile.py 
```
Calibrates what "healthy" means for this user per contract profile consumption conventions. If missing, proceed without persona grounding.
Project discovery: map directory structure, read dependency manifests, README, CLAUDE.md, AGENTS.md, identify language/stack/build commands, git log --oneline -20

Before proceeding: in your response, list the key structural facts (module boundaries, dependency patterns, test coverage gaps) you observed. These survive context compaction.

Exit-early guard: If git diff since the last HEALTH.md update shows no file changes, report exit signal complete: no changes since last audit and stop.

Step 2: Select dimensions

Choose dimensions based on the codebase and user request. Not every dimension applies; a 200-line CLI doesn't need the same audit as a monorepo.

Available dimensions

Depth guidance

When change magnitude was derived in Step 1, apply advisory depth scaling:

Light changes (roughly ≤5 files, ≤200 lines since last audit): prioritize dimensions most relevant to the changed areas. Skip dimensions with no intersection.
Standard changes (default): assess all applicable dimensions at normal depth.
Heavy changes (roughly ≥20 files or architectural-scope commits): assess all applicable dimensions and increase evidence collection depth. Read more files per dimension, trace more dependency paths, check more edge cases.

These thresholds are guidelines, not hard rules. Use judgment: a 6-file change touching a critical security module warrants thorough depth, while a 25-file rename is light.
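The depth scaling above can be sketched as a small classifier fed by the summary lines of `git log --stat`. This is an illustration under stated assumptions, not the skill's own implementation: `ChangeVolume`, `summarize_stat_output`, and `advisory_depth` are hypothetical names, and the sketch ignores the "architectural-scope" and judgment overrides, which need a human or agent reading of the commits.

```python
import re
from dataclasses import dataclass

# Matches the per-commit summary line printed by `git log --stat`, e.g.
# " 3 files changed, 10 insertions(+), 2 deletions(-)".
_SUMMARY = re.compile(
    r"(\d+) files? changed"
    r"(?:, (\d+) insertions?\(\+\))?"
    r"(?:, (\d+) deletions?\(-\))?"
)


@dataclass(frozen=True)
class ChangeVolume:
    files_touched: int
    lines_changed: int


def summarize_stat_output(stat_output: str) -> ChangeVolume:
    """Sum the summary lines of `git log --stat` output.

    Rough estimate: a file touched by two commits is counted twice.
    """
    files = lines = 0
    for match in _SUMMARY.finditer(stat_output):
        files += int(match.group(1))
        lines += int(match.group(2) or 0) + int(match.group(3) or 0)
    return ChangeVolume(files, lines)


def advisory_depth(volume: ChangeVolume) -> str:
    """Map change volume to light / standard / heavy using the rough thresholds."""
    if volume.files_touched >= 20:
        return "heavy"
    if volume.files_touched <= 5 and volume.lines_changed <= 200:
        return "light"
    return "standard"
```

The result is advisory only. A 25-file rename would come back as "heavy" here and should be downgraded by judgment, as the guidance above describes.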

User-specified dimensions: audit only those. Full audit or unspecified: auto-select all applicable. Report selections before proceeding.

Step 3: Assess

Lead the assessment with your overall interpretation: what stands out, what's changed, where attention should go. Then the per-dimension breakdown provides the evidence.

Before deep analysis: run the references/audit-commands.md quick checklist for a rapid pass/fail sweep. Dimensions passing all items can be audited at lower priority.

You are auditing the [dimension] health of [project].

## What to evaluate
[Dimension-specific instructions from below]

## Evidence standard
Every finding MUST include:
- Specific file and line references
- Quoted code showing the issue
- Explanation of why it matters
- Confidence score (0-100)

## Presenting findings
Introduce each finding conversationally before the structured evidence. A colleague
says "hey, I noticed this" instead of just dumping a finding card. Lead with why it caught your eye and what it means, then back it up with the evidence block.

## Confidence scoring
- 90-100: Definitely a real issue. Verified by reading the code. Clear impact.
- 70-89: Very likely a real issue. Strong evidence, but some context might justify it.
- 50-69: Possibly an issue. The pattern is suspicious but could be intentional.
- 30-49: Uncertain. Might be an issue, might be a reasonable tradeoff.
- 0-29: Speculative. Flagging it but wouldn't be surprised if it's fine.

## What is NOT a finding
- Pre-existing patterns that are consistent and deliberate
- Things a linter or type checker would catch (assume CI handles those)
- Subjective style preferences not grounded in stated project principles
- Known issues already tracked in TODO.md
- Intentional decisions documented in DECISIONS.md

Architecture alignment

Compare codebase to stated architecture:

Read VISION.md (or README.md architecture section) for intended structure
Map actual module boundaries, dependency graph, data flow
Identify drift from stated architecture
Check layering and boundary cleanliness
Extract "Patterns Observed": de facto architecture independent of documentation

No documented architecture? Extract and report de facto; note absence as a finding.

Pattern consistency

Check consistency across the codebase:

Error handling (returns vs throws vs error types)
Naming (singular vs plural, prefixes, casing)
Module structure and layout similarity
Competing abstractions for the same concept
Duplicated logic that should be shared
Config handling (env vars vs files vs flags)

Focus on inconsistencies between similar things, not whether the chosen pattern is "best."

Coupling health

Evaluate coupling and dependency structure:

Map import graphs, identify circular dependencies
Find god modules (too many dependents or dependencies)
Check for inappropriate intimacy (reaching into internals)
Evaluate interface width: narrow boundaries or exposing everything?
Check hidden coupling via shared mutable state, global config, side effects

Use language tools (go list, madge, import analysis). If unavailable, trace imports manually on highest-risk modules.

Complexity hotspots

Find accumulating complexity:

Long functions (generally 50+ lines), deep nesting (3+ levels)
High fan-out, growing switch/match statements, many parameters (5+)
Files growing cycle over cycle (check git history)

Prioritize high-change files: frequently modified + complex = high risk.

Test health

Evaluate test suite quality and coverage:

Run coverage tools if available, otherwise estimate from file analysis
Identify critical paths with no coverage
Check: testing behavior or implementation? Excessive mocking? Brittle assertions?
Evaluate test naming: can you understand what failed from the name alone?
Check test-to-code ratio per major module
Check test proportionality against contract (test proportionality section): default is one pass + one fail per testable unit. Flag under-testing (0 tests for a testable unit) and over-testing (significantly exceeding the target without justification). If the project's plan specifies an override target, use that instead of the default.

Don't just report a number. Identify the highest-risk coverage gaps.

Dependency health

Evaluate dependency management:

Outdated deps (package manager audit/outdated commands)
Known security vulnerabilities (npm audit, safety check, govulncheck)
Unused deps (installed but not imported)
Dep sprawl relative to project scope
Pinning discipline (pinned or floating?)
Vendored vs remote consistency

Version health

Only run this dimension if DOCS.md exists and contains a versioning convention block. Skip entirely if the convention is absent.

Read DOCS.md Conventions.versioning to identify the version file(s) and bump trigger rules
Run git log --oneline to find feat and fix commits since the last modification date of the version file(s) (git log --follow -- <version-file> gives the timestamp of the last bump)
Count unbumped feat/fix commits and note the age of the oldest one
Severity: warning if 1–4 unbumped commits or age ≤ 7 days; critical if 5+ unbumped commits or age > 7 days
If no feat/fix commits have landed since the last bump, this dimension is healthy with no finding
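The severity rule above can be written as a small function. Note that the two conditions overlap as worded (for example, 2 unbumped commits where the oldest is 10 days old satisfies "1–4 commits" and "age > 7 days"). This sketch assumes critical takes precedence in that case; the name `version_health_severity` is hypothetical.

```python
from typing import Optional


def version_health_severity(unbumped_commits: int,
                            oldest_age_days: float) -> Optional[str]:
    """Return 'critical', 'warning', or None (healthy, no finding).

    Assumption: when the count rule and the age rule disagree,
    the more severe result wins.
    """
    if unbumped_commits == 0:
        return None  # nothing has landed since the last bump
    if unbumped_commits >= 5 or oldest_age_days > 7:
        return "critical"
    return "warning"
```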

Artifact freshness

Evaluates whether state artifacts are current relative to plan activity or recent development. Uses the staleness convention from contract.

With plan context (PLAN.md has a Created date and task execution history):

Read the plan's Created date from its HTML comment metadata
Identify which skills were dispatched during the plan by scanning task entries and PROGRESS.md cycle logs
For each dispatched skill, look up its expected artifacts in the contract staleness detection mapping
Check each expected artifact's last modification date: git log -1 --format=%aI -- <path>
An artifact is stale if its last modification predates the plan's creation date AND the skill that owns it was dispatched at least once during the plan
Severity: warning (confidence 70+). Plan-relative staleness carries causal evidence: a skill ran but its artifact didn't update.
Artifacts that a skill reads but does not produce are not staleness candidates for that skill

Without plan context (no PLAN.md, or PLAN.md has no Created date):

Fall back to PROGRESS.md recency: an artifact is potentially stale if it was not modified since the most recent PROGRESS.md cycle entry date
If PROGRESS.md has no entries (fresh project), no staleness check applies
Severity: info (confidence 50-60). The fallback is advisory, not authoritative, because there is no dispatched-skill relationship to confirm expected updates
This fallback surfaces artifacts that may need attention but should not drive strong findings
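Both staleness modes can be sketched as one decision function. The inputs are assumed to be timezone-aware datetimes, for example parsed with `datetime.fromisoformat` from the `git log -1 --format=%aI` output mentioned above. The function name, parameter names, and the returned labels are hypothetical.

```python
from datetime import datetime
from typing import Optional, Tuple


def assess_freshness(
    last_modified: datetime,
    plan_created: Optional[datetime] = None,
    owner_dispatched: bool = False,
    last_progress_entry: Optional[datetime] = None,
) -> Optional[Tuple[str, str]]:
    """Return (severity, basis) for a stale artifact, or None if not stale."""
    if plan_created is not None:
        # Plan context: stale only if the owning skill ran during the plan
        # AND the artifact was last modified before the plan was created.
        if owner_dispatched and last_modified < plan_created:
            return ("warning", "plan-relative")
        return None
    if last_progress_entry is None:
        return None  # fresh project: no staleness check applies
    if last_modified < last_progress_entry:
        return ("info", "progress-fallback")
    return None
```

Confidence is left out of the sketch. Per the text, plan-relative results carry 70+ and fallback results 50-60.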

Prose health

Confidence determination:

Verbosity drift: 85-95 confidence. Word count against a known budget is high-certainty.
Abstraction creep: 80-90 confidence. Missing anchors are unambiguously detectable.
Filler accumulation: 70-85 confidence. Pattern matching is strong but some edge cases may be intentional.

Severity assignment:

Verbosity drift ≥2× budget: critical. Drift 50-100% over: warning.
Abstraction creep across 50%+ of entries: critical. Isolated entries: warning.
Filler accumulation across 30%+ of entries: warning. Isolated entries: info.
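The severity thresholds above reduce to two small checks. This is a sketch with hypothetical helper names. It assumes drift of exactly 2× counts as critical and that drift under 50% carries no severity of its own, since the text does not assign one.

```python
from typing import Optional


def verbosity_severity(word_count: int, budget: int) -> Optional[str]:
    """Severity for verbosity drift, given a known word budget."""
    if budget <= 0:
        raise ValueError("budget must be positive")
    ratio = word_count / budget
    if ratio >= 2.0:
        return "critical"   # at least 2x the budget
    if ratio >= 1.5:
        return "warning"    # 50-100% over the budget
    return None             # assumption: no severity stated below 50% over


def spread_severity(affected: int, total: int, threshold: float,
                    pervasive: str, isolated: str) -> Optional[str]:
    """Severity for a rule judged by the share of entries it affects."""
    if affected == 0 or total == 0:
        return None
    return pervasive if affected / total >= threshold else isolated


# Abstraction creep: 50%+ of entries is critical, isolated entries warning.
print(spread_severity(5, 10, 0.5, "critical", "warning"))  # critical
# Filler accumulation: 30%+ of entries is warning, isolated entries info.
print(spread_severity(1, 10, 0.3, "warning", "info"))      # info
```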

Grading:

A: All entries pass all 3 rules. No findings.
B: 1-2 info findings only. Verbosity drift under 50%.
C: 1-2 warnings or pervasive info findings across 25%+ of entries.
D: Multiple warnings or 1 critical finding.
F: Pervasive failures across multiple rules and artifacts.

Security hygiene

What to scan:

Hardcoded secrets: API key patterns (AKIA, sk-, ghp_, glpat-, xoxb-, xoxp-), password assignments (password\s*=\s*["']), token strings in source (token\s*=\s*["']), private keys in files (-----BEGIN.*PRIVATE KEY)
Dangerous function calls: eval() on variables or user input, exec() with string concatenation, subprocess/os.system/child_process.exec with unsanitized input, Function() constructor with dynamic strings
Basic injection patterns: SQL string concatenation ("SELECT.*" + or f-string/format with user input in queries), unsanitized shell command construction (os.system(f"...{ or backtick interpolation in shell strings)

How to scan:

Severity assignment:

Hardcoded secrets: warning (confidence 75-90 depending on pattern specificity; AKIA is high confidence, generic password= is lower)
Dangerous function calls: warning or critical depending on whether user input flows into the call. eval(user_input) is critical; eval(constant) is warning. When data flow is ambiguous, default to warning.
Injection patterns: warning (confidence 60-80). String concatenation in SQL is suspicious but may be parameterized elsewhere. Note the ambiguity in the finding.
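A minimal sketch of the hardcoded-secrets pass, using the patterns listed above. Several details are assumptions made for illustration and are marked in the code: the length suffixes added to the prefixes (to avoid matching words like "task-"), the exact confidence numbers inside the stated 75-90 range, and the names `PATTERNS` and `scan_text`. The sketch reports locations only and does not echo the matched secret.

```python
import re

# Footer text required on every security hygiene finding (from this skill).
FOOTER = (
    "This is a lightweight surface scan. For comprehensive security analysis, "
    "use dedicated tools: semgrep, Snyk, Bandit (Python), npm audit (Node), "
    "govulncheck (Go), or similar static analysis and vulnerability scanning "
    "tools appropriate to your stack."
)

# (name, pattern, confidence). Length suffixes and the exact confidence
# values are assumptions; the text only gives the 75-90 range.
PATTERNS = [
    ("aws-access-key", re.compile(r"\bAKIA[0-9A-Z]{16}\b"), 90),
    ("private-key", re.compile(r"-----BEGIN.*PRIVATE KEY"), 90),
    ("token-prefix",
     re.compile(r"\b(?:sk-|ghp_|glpat-|xoxb-|xoxp-)[A-Za-z0-9_-]{10,}"), 85),
    ("password-assignment", re.compile(r"password\s*=\s*[\"']"), 75),
    ("token-assignment", re.compile(r"token\s*=\s*[\"']"), 75),
]


def scan_text(path: str, text: str) -> list:
    """Return one warning-level finding per (line, pattern) match."""
    findings = []
    for line_number, line in enumerate(text.splitlines(), start=1):
        for name, pattern, confidence in PATTERNS:
            if pattern.search(line):
                findings.append({
                    "location": f"{path}:{line_number}",
                    "pattern": name,
                    "severity": "warning",
                    "confidence": confidence,
                    "footer": FOOTER,
                })
    return findings
```

Matches are candidates only. A fixture or placeholder value still has to be told apart from a real credential before grading.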

Grading:

A: No secrets, no dangerous calls, no injection patterns found
B: Minor findings only (e.g., a password assignment that appears to be a test fixture or placeholder)
C: 1-2 warnings involving real credentials or dangerous calls
D: Multiple credential leaks or dangerous function patterns
F: Pervasive secret exposure or dangerous calls throughout the codebase

Scope limitation notice: every security hygiene finding MUST include a footer recommending dedicated security tools for comprehensive analysis. Use this text:

This is a lightweight surface scan. For comprehensive security analysis, use dedicated tools: semgrep, Snyk, Bandit (Python), npm audit (Node), govulncheck (Go), or similar static analysis and vulnerability scanning tools appropriate to your stack.

Step 4: Distill

After all agents complete:

Filter: discard findings below 50 confidence. Mark 50-69 as "info" regardless of apparent severity.
Deduplicate: merge by preference: (1) fullest context, (2) most evidence-rich dimension, (3) most recent. Preserve complementary evidence from discarded findings.
Cross-reference against DECISIONS.md and TODO.md:
- Matches known decision → discard or downgrade to info
- Matches known issue → "already tracked", skip
- Genuinely new → include at full severity
Grade each dimension:
- A: No critical/warning findings. B: No critical, some warnings.
- C: 1-2 critical or many warnings. D: Multiple critical.
- F: Pervasive critical findings.
Trajectory: compare to prior HEALTH.md: improved, degraded, stable dimensions. Calculate overall trajectory: improving / stable / degrading.
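The filter and grading rules above can be sketched as follows. The grade wording leaves some boundaries open ("many warnings", "pervasive", and whether 2 criticals is "1-2" or "multiple"), so the cut-offs in `grade` are assumptions exposed as parameters. `Finding`, `filter_by_confidence`, and `grade` are hypothetical names; deduplication and cross-referencing are not shown.

```python
from collections import Counter
from dataclasses import dataclass, replace
from typing import List


@dataclass(frozen=True)
class Finding:
    title: str
    dimension: str
    severity: str      # "critical", "warning", or "info"
    confidence: int    # 0-100


def filter_by_confidence(findings: List[Finding]) -> List[Finding]:
    """Drop findings below 50; mark 50-69 as info regardless of severity."""
    kept = []
    for finding in findings:
        if finding.confidence < 50:
            continue
        if finding.confidence < 70:
            finding = replace(finding, severity="info")
        kept.append(finding)
    return kept


def grade(findings: List[Finding], many_warnings: int = 5,
          pervasive_critical: int = 5) -> str:
    """Grade one dimension. Cut-offs are assumptions: D starts at 3 criticals."""
    counts = Counter(f.severity for f in findings)
    critical, warnings = counts["critical"], counts["warning"]
    if critical >= pervasive_critical:
        return "F"
    if critical >= 3:
        return "D"
    if critical >= 1 or warnings >= many_warnings:
        return "C"
    if warnings >= 1:
        return "B"
    return "A"
```

Filtering runs before grading, so a critical finding at confidence 60 counts as info and does not pull the grade down.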

Step 5: Pre-write self-audit

Narration voice (riff, don't script): ✗ "Self-audit failed. Revising entry." ✓ "Tightening this up..." · "Cutting the filler first..." · "One more pass..."

Step 6: Report

Output constraint per contract token budgets. Letter grade + ≤3 sentences justification per dimension.

When updating existing HEALTH.md entries (e.g., updating Patterns Observed), use the Edit tool on the specific section rather than rewriting the file. Append new audit entries.

Write the audit results to HEALTH.md (append new audit, keep prior audits for trajectory history) and present to the user.

After writing a new audit entry to HEALTH.md, compact older audits via the script. Run: python3 ${AGENTERA_HOME:-$CLAUDE_PLUGIN_ROOT}/scripts/compact_artifact.py health <path-to-HEALTH.md>.

Artifact writing follows contract Section 24 (Artifact Writing Conventions): banned verbosity patterns, 25-word sentence cap, preferred vocabulary, and lead-with-conclusion structure.

Report structure

## Audit N · YYYY-MM-DD

**Dimensions assessed**: [list]
**Findings**: X critical, Y warnings, Z info (N filtered by confidence)
**Overall trajectory**: ⮉ improving | stable | ⮋ degrading vs Audit N-1
**Grades**: Architecture [B] | Patterns [A] | Coupling [C] | Complexity [B] | Tests [D] | Deps [A] | Security [A]

### [Dimension Name]: [Grade]

#### ⇶ [Finding title], critical (confidence: N/100)
#### ⇉ [Finding title], warning (confidence: N/100)
#### ⇢ [Finding title], info (confidence: N/100)
- **Location**: `file:line` (or module/package)
- **Evidence**: [quoted code or structural observation]
- **Impact**: [what breaks, degrades, or risks]
- **Suggested action**: [specific fix, investigation, or refactor]

[Repeat for each finding, ordered by severity then confidence]

### Trends vs Audit N-1
- **Improved**: [what got better and why (e.g., "Coupling [D→C]: circular dep in auth/ resolved in cycle 12")]
- **Degraded**: [what got worse and why]
- **New findings**: [issues not present in prior audit]
- **Resolved**: [prior findings no longer present]

### Patterns Observed
[De facto architecture patterns extracted, the "what IS" independent of what's stated.
This section helps realisera and resonera understand the current reality.]
- Module structure: [how code is organized]
- Error handling: [predominant pattern]
- Testing approach: [how tests are structured]
- Dependency patterns: [how deps are managed]

Step 7: Connect

Feed actionable findings into the suite:

TODO.md: for each critical finding not already tracked, offer to add under the appropriate severity section. Severity mapping per contract severity levels: critical → ## ⇶ Critical, warning → ## ⇉ Degraded, info → ## ⇢ Annoying. Each entry is a checkbox line: - [ ] [finding description]. Get user confirmation before writing. Output constraint per contract token budgets.
VISION.md: if architecture has intentionally evolved past stated architecture, suggest updating via /resonera.
Present findings and ask if the user wants to: file to TODO.md, deliberate via /resonera, deep-dive on a dimension, or investigate a specific finding.
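The severity-to-section mapping for TODO.md can be sketched as a lookup plus a checkbox formatter. The section headings and the `- [ ]` line shape come from the text above; the names `SECTION_BY_SEVERITY` and `todo_entry` are hypothetical, and the sketch only builds the strings. Writing to TODO.md still requires user confirmation.

```python
# Severity -> TODO.md section heading, as listed above.
SECTION_BY_SEVERITY = {
    "critical": "## ⇶ Critical",
    "warning": "## ⇉ Degraded",
    "info": "## ⇢ Annoying",
}


def todo_entry(severity: str, description: str) -> tuple:
    """Return (section heading, checkbox line) for one finding."""
    if severity not in SECTION_BY_SEVERITY:
        raise ValueError(f"unknown severity: {severity!r}")
    return SECTION_BY_SEVERITY[severity], f"- [ ] {description}"


section, line = todo_entry("critical", "Circular dependency between auth and session")
print(section)  # ## ⇶ Critical
print(line)     # - [ ] Circular dependency between auth and session
```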

Safety rails

NEVER modify code. Inspektera audits; other skills fix.
NEVER file issues to TODO.md without explicit user confirmation.
NEVER present speculative findings (confidence < 50) as definitive problems.
NEVER ignore DECISIONS.md context. If a finding contradicts a deliberate decision, it is not a finding but an implementation of that decision. Discard or downgrade.
NEVER report known issues already tracked in TODO.md as new findings.
NEVER flag subjective style preferences as findings unless they violate stated principles in VISION.md, CLAUDE.md, or the decision profile.
NEVER run destructive commands or install packages. Read-only assessment.

</critical>

Exit signals

Report one of these statuses at workflow completion:

Format: ─── ⛶ inspektera · status ─── followed by a summary sentence. For flagged, stuck, and waiting: add ▸ bullet details below the summary.

complete: All selected audit dimensions were assessed, findings were synthesized, grades were assigned, HEALTH.md was updated, and the user was presented with actionable results.
flagged: The audit completed but with notable caveats: one or more dimensions had to be skipped due to missing tooling, confidence was too low to grade a dimension reliably, or critical findings were discovered that require urgent attention beyond the audit scope.
stuck: Cannot complete the audit because the project is inaccessible, required language tooling is unavailable and manual analysis is not feasible, or filing findings to TODO.md was declined by the user and the results cannot be safely surfaced any other way.
waiting: The audit target is ambiguous: no project was identified, the codebase is too incomplete to assess meaningfully, or the user's request specifies dimensions that cannot be evaluated without additional information.

Cross-skill integration

Inspektera is part of a twelve-skill suite. It is the feedback loop, the skill that tells realisera whether its work is making things better.

Inspektera feeds /realisera

Inspektera feeds /resonera

When the audit reveals architectural drift, suggest /resonera before fixes begin.

Use it when code has moved past stated architecture or competing patterns need a decision.

Inspektera feeds /planera

Inspektera feeds /optimera

Inspektera reads /realisera output

Inspektera reads /resonera output

DECISIONS.md explains why things are the way they are. Findings that contradict deliberate decisions are not findings. This prevents inspektera from flagging intentional tradeoffs as problems.

Inspektera reads /visualisera output

DESIGN.md provides visual identity constraints that inspektera can audit for consistency, checking whether the codebase respects the declared design tokens and patterns.

Inspektera is informed by /profilera

Getting started

First audit

/inspektera: runs a full audit across all applicable dimensions, bootstraps HEALTH.md
Review findings, file critical ones to TODO.md
/realisera: next cycle picks up the filed issues and starts fixing

Periodic health checks

Run /inspektera every 5-10 realisera cycles, or when:

A major feature was added
Significant refactoring occurred
The codebase "feels" harder to work in
Before a major architectural decision (to understand current state)

Targeted audits

/inspektera architecture coupling

Specify dimensions to narrow the audit scope. Useful after specific kinds of changes.

After an audit

Good grades (A/B): Celebrate. Keep building.
Mixed grades (C): File the critical findings, deliberate on the warnings.
Poor grades (D/F): Consider pausing feature work. Use /resonera to deliberate on priorities, then /realisera to fix the structural problems before building more.



Install manually


# Clone the repo
git clone https://github.com/jgabor/agentera.git

# Copy into Claude Code skills folder (global)
cp -r agentera/skills/inspektera ~/.claude/skills/



Compatible with: Claude Code, OpenAI Codex CLI, ChatGPT