HeshamFS/simulation-failure-triage

Name: simulation-failure-triage
Author: HeshamFS

skills/robustness/simulation-failure-triage/SKILL.md

npx skillsauth add HeshamFS/materials-simulation-skills simulation-failure-triage

Simulation Failure Triage

Goal

Classify common simulation failure signatures and return immediate actions, retry ladders, and stop conditions.

Requirements

Python 3.10+
No external dependencies
Works on Linux, macOS, and Windows

Inputs to Gather

| Input | Description | Example |
|-------|-------------|---------|
| Code | Simulation code | LAMMPS, VASP, MOOSE, QE |
| Stage | Setup, runtime, postprocess | runtime |
| Symptoms | Failure signs | nan,pressure-blowup |
| Log text or file | Error evidence | Lost atoms, ZBRENT |
| Recent change | Last modified setting | larger timestep |

Decision Guidance

First preserve evidence: logs, inputs, executable version, and scheduler output.
Separate setup errors from numerical instability and physical model issues.
Retry with a single controlled change.
Stop retrying when the result becomes scientifically meaningless or a required model input is missing.
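The "single controlled change" rule can be illustrated with a minimal sketch. Everything here (`Rung`, `climb_ladder`, the `run` and `still_meaningful` callbacks, and the parameter names) is hypothetical and is not taken from the bundled script; it only shows one way to make each retry differ from the baseline by exactly one setting and to stop once a stop condition is reached.

```python
from __future__ import annotations

from dataclasses import dataclass
from typing import Callable


@dataclass(frozen=True)
class Rung:
    """One rung of a hypothetical retry ladder: exactly one setting changes."""
    parameter: str
    value: float


def climb_ladder(
    baseline: dict[str, float],
    ladder: list[Rung],
    run: Callable[[dict[str, float]], bool],
    still_meaningful: Callable[[dict[str, float]], bool],
) -> dict[str, float] | None:
    """Try rungs in order; return the first settings that succeed, else None."""
    for rung in ladder:
        # Each trial differs from the baseline by a single setting.
        trial = {**baseline, rung.parameter: rung.value}
        if not still_meaningful(trial):
            # Stop condition: continuing would no longer answer the science question.
            return None
        if run(trial):
            return trial
    return None


# Illustrative use with made-up settings and a stand-in for the real solver.
baseline = {"timestep_fs": 2.0, "thermostat_damping_fs": 100.0}
ladder = [Rung("timestep_fs", 1.0), Rung("timestep_fs", 0.5)]
result = climb_ladder(
    baseline,
    ladder,
    run=lambda settings: settings["timestep_fs"] <= 0.5,
    still_meaningful=lambda settings: settings["timestep_fs"] >= 0.25,
)
```

Building every trial from the untouched baseline, rather than stacking changes, keeps each failure attributable to one setting.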

Script Outputs

scripts/failure_triage.py emits:

likely_causes
immediate_actions
retry_ladder
stop_conditions
evidence
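A caller may want to confirm that a report contains the five fields listed above before acting on it. The sketch below assumes the fields appear as top-level keys of a single JSON object, which the page does not state; `missing_report_keys` is a hypothetical helper, not part of the skill.

```python
from __future__ import annotations

import json

# Field names as listed in the Script Outputs section.
EXPECTED_KEYS = (
    "likely_causes",
    "immediate_actions",
    "retry_ladder",
    "stop_conditions",
    "evidence",
)


def missing_report_keys(report_text: str) -> list[str]:
    """Return the expected fields that are absent from a JSON report string."""
    report = json.loads(report_text)
    if not isinstance(report, dict):
        # Assumption: a valid report is a JSON object, so anything else lacks every field.
        return list(EXPECTED_KEYS)
    return [key for key in EXPECTED_KEYS if key not in report]
```

An empty list from `missing_report_keys` means all five fields are present; the value types inside each field are left unchecked because the source does not specify them.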

Workflow

python3 skills/robustness/simulation-failure-triage/scripts/failure_triage.py \
  --code LAMMPS \
  --stage runtime \
  --symptoms nan,pressure-blowup \
  --recent-change "increased timestep" \
  --json
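When the triage step is driven from Python rather than a shell, the same invocation can be assembled as an argument list. This is a sketch under stated assumptions: it uses only the flags shown in the command above, treats `--recent-change` as optional, and assumes that `--json` makes the script print one JSON document to stdout. `build_triage_command` and `run_triage` are hypothetical wrapper names.

```python
from __future__ import annotations

import json
import subprocess
import sys

# Script path as shown in the Workflow command, relative to the repository root.
SCRIPT = "skills/robustness/simulation-failure-triage/scripts/failure_triage.py"


def build_triage_command(
    code: str,
    stage: str,
    symptoms: list[str],
    recent_change: str | None = None,
    script: str = SCRIPT,
) -> list[str]:
    """Assemble the argument list for the triage script without using a shell."""
    cmd = [
        sys.executable, script,
        "--code", code,
        "--stage", stage,
        "--symptoms", ",".join(symptoms),
    ]
    if recent_change:
        # Assumption: the flag may be omitted when there is no recent change.
        cmd += ["--recent-change", recent_change]
    cmd.append("--json")
    return cmd


def run_triage(cmd: list[str]) -> dict:
    """Run the command and parse stdout as JSON (assumed output format)."""
    completed = subprocess.run(cmd, capture_output=True, text=True, check=False)
    if completed.returncode != 0:
        raise RuntimeError(
            f"triage exited with code {completed.returncode}: {completed.stderr.strip()}"
        )
    return json.loads(completed.stdout)
```

Passing a list to `subprocess.run` avoids shell quoting issues with values such as `"increased timestep"`, and `sys.executable` sidesteps the `python3` versus `python` naming difference across platforms.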

Error Handling

Invalid stages or oversized log files stop with exit code 2. Unknown symptoms are retained as custom evidence.
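The behaviour described above can be sketched with the standard library. The code below is an illustration, not the bundled script's implementation: it relies on the fact that `argparse` exits with status 2 when an argument fails its `choices` check, takes the stage names from the inputs table, and uses a two-item `KNOWN_SYMPTOMS` set that is only an assumed subset of whatever the real script recognizes.

```python
from __future__ import annotations

import argparse

STAGES = ("setup", "runtime", "postprocess")   # stage names from the inputs table
KNOWN_SYMPTOMS = {"nan", "pressure-blowup"}    # illustrative subset (assumption)


def parse_args(argv: list[str]) -> argparse.Namespace:
    """Parse arguments; an invalid --stage makes argparse exit with status 2."""
    parser = argparse.ArgumentParser(prog="failure_triage_sketch")
    parser.add_argument("--stage", choices=STAGES, required=True)
    parser.add_argument("--symptoms", default="")
    return parser.parse_args(argv)


def split_symptoms(raw: str) -> tuple[list[str], list[str]]:
    """Split a comma-separated symptom string into (known, custom) lists."""
    items = [item.strip().lower() for item in raw.split(",") if item.strip()]
    known = [item for item in items if item in KNOWN_SYMPTOMS]
    # Unknown symptoms are kept rather than rejected, mirroring the text above.
    custom = [item for item in items if item not in KNOWN_SYMPTOMS]
    return known, custom
```

Keeping unrecognized symptoms in a separate list preserves evidence that a fixed vocabulary would otherwise discard.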

Limitations

This skill gives first-response triage. It does not guarantee that a failed simulation can be repaired.

Security

Log files are read with a 10 MB size cap.
Log text is truncated and never executed.
The script does not run external solvers.
The skill uses Bash only to run its bundled script.
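A size-capped, truncating log reader along the lines of the first two points might look like the following sketch. The exact byte value of "10 MB", the excerpt length, and the names `read_log_excerpt` and `LogTooLargeError` are assumptions made for illustration; the real script may differ.

```python
from __future__ import annotations

MAX_LOG_BYTES = 10 * 1024 * 1024   # "10 MB" read as 10 MiB (assumption)
MAX_EXCERPT_CHARS = 2000           # hypothetical truncation length


class LogTooLargeError(ValueError):
    """Raised when a log file exceeds the size cap."""


def read_log_excerpt(
    path: str,
    max_bytes: int = MAX_LOG_BYTES,
    max_chars: int = MAX_EXCERPT_CHARS,
) -> str:
    """Return a truncated text excerpt of a log file, refusing oversized files."""
    with open(path, "rb") as handle:
        # Read at most one byte past the cap so oversized files are never fully loaded.
        data = handle.read(max_bytes + 1)
    if len(data) > max_bytes:
        raise LogTooLargeError(f"{path} exceeds {max_bytes} bytes")
    # The content is only decoded and sliced; it is never evaluated or executed.
    text = data.decode("utf-8", errors="replace")
    return text[:max_chars]
```

Reading a bounded number of bytes, instead of checking the file size first, avoids a gap between the check and the read if the log is still being written.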

References

See references/failure_patterns.md for common failure signatures and retry ladders.

Version History

1.0.0: Initial cross-code simulation failure triage skill.

Triage cross-code simulation failures and propose safe retry ladders for nonconvergence, NaN/Inf, exploding energies, unstable timesteps, pressure blow-up, missing potentials, bad pseudopotentials, corrupted output, and incomplete runs. Use when an agent sees a failed or suspicious materials simulation and needs a defensible first response.

39 stars

development

Updated May 19, 2026

npx skillsauth add HeshamFS/materials-simulation-skills simulation-failure-triage

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

| Scanner | Description | Status | Reported percentage |
|---------|-------------|--------|---------------------|
| Trivy | Container and dependency vulnerability scanner | Clean | 95% |
| Semgrep | Static code analysis for vulnerabilities | Clean | 95% |
| mcp-scan (Snyk) | Model Context Protocol security validation | Clean | 95% |
| Snyk (dep) | Open source security scanning | Skipped | 50% |
| Socket.dev | Supply chain security analysis | Skipped | 50% |
| VirusTotal | Multi-engine malware detection | Skipped | 50% |
| CrowdStrike | Advanced threat intelligence | Skipped | 50% |
| OSV-Scanner | Open Source Vulnerability database check | Skipped | 50% |
| OWASP Dep-Check | | Skipped | 50% |

Last scanned: May 19, 2026, 5:34 AM. Scan duration: 343.2 s. 5 files scanned.

SKILL.md

name: simulation-failure-triage
description: >
allowed-tools: Read, Bash, Write, Grep, Glob
author: HeshamFS
version: 1.0.0
security_tier: high
security_reviewed: true
eval_cases: 3
last_reviewed: 2026-05-18

Related Skills

HeshamFS/benchmark-and-mms-planner

development

Plan verification and validation campaigns for simulation codes using manufactured solutions, canonical benchmark problems, grid/time refinement, uncertainty propagation, and pass/fail acceptance criteria. Use when an agent needs to prove a solver, model, or result is trustworthy rather than only plausible.

39 stars, SKILL.md, updated May 19, 2026

HeshamFS/workflow-engine-mapper

testing

Map computational materials tasks onto workflow engines such as atomate2, jobflow, AiiDA, pyiron, or a simple one-off script. Use when deciding how to structure a reproducible campaign, DAG, restart strategy, provenance record, storage layout, or migration path from ad hoc scripts to managed workflows.

39 stars, SKILL.md, updated May 19, 2026

HeshamFS/md-analysis-planner

development

Plan molecular dynamics post-processing for materials simulations, including RDF, MSD and diffusion, VACF/VDOS, coordination numbers, bond-angle distributions, stress-strain curves, equilibration detection, PBC unwrapping, and trajectory format choices. Use before writing MD analysis scripts or trusting trajectory-derived results.

39 stars, SKILL.md, updated May 19, 2026

HeshamFS/hpc-runtime-doctor

documentation

Diagnose HPC runtime and scheduler problems for materials simulations, including MPI/OpenMP/GPU layout, modules, CUDA/Kokkos hints, scratch paths, walltime, job arrays, restart strategy, scheduler portability, and resource mismatch. Use when jobs fail, run slowly, get killed, or behave differently on a cluster than on a workstation.

39 stars, SKILL.md, updated May 19, 2026

Install manually

# Clone the repo
git clone https://github.com/HeshamFS/materials-simulation-skills.git

# Copy into Claude Code skills folder (global)
cp -r materials-simulation-skills/skills/robustness/simulation-failure-triage ~/.claude/skills/

Repository

HeshamFS/materials-simulation-skills

39 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT