.claude/skills/evaluate-diagram/SKILL.md
--- user-invocable: true allowed-tools: - mcp__paperbanana__evaluate_diagram - Read - "Bash(paperbanana *)" --- # Evaluate Diagram Evaluate a generated diagram against a human reference using PaperBanana's VLM-as-Judge scoring. ## Instructions 1. `$ARGUMENTS[0]` is the path to the generated image. 2. `$ARGUMENTS[1]` is the path to the human reference image. 3. Ask the user for: - **Source context**: the methodology text (or a file path to read it from). If the user provides a file p
npx skillsauth add llmsresearch/paperbanana .claude/skills/evaluate-diagramInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Evaluate a generated diagram against a human reference using PaperBanana's VLM-as-Judge scoring.
$ARGUMENTS[0] is the path to the generated image.$ARGUMENTS[1] is the path to the human reference image.evaluate_diagram with:
generated_path: the generated image pathreference_path: the reference image pathcontext: the methodology text contentcaption: the figure captionIf the MCP tool is not available, fall back to the CLI:
paperbanana evaluate --generated <generated-img> --reference <reference-img> --context <context-file> --caption "<caption>"
/evaluate-diagram output.png reference.png
tools
--- user-invocable: true allowed-tools: - mcp__paperbanana__generate_diagram - Read - "Bash(paperbanana *)" --- # Generate Diagram Generate a publication-quality methodology diagram from a text file using PaperBanana. ## Instructions 1. Read the file at `$ARGUMENTS[0]` to get the methodology text content. 2. If `$ARGUMENTS[1]` is provided, use it as the figure caption. Otherwise, ask the user for a caption describing what the diagram should communicate. 3. Call the MCP tool `generate_d
tools
Use when work should span one or more detached tasks but still behave like one job with a single owner context. TaskFlow is the durable flow substrate under authoring layers like Lobster, ACPX, plugins, or plain code. Keep conditional logic in the caller; use TaskFlow for flow identity, child-task linkage, waiting state, revision-checked mutations, and user-facing emergence.
tools
# Lobster Lobster executes multi-step workflows with approval checkpoints. Use it when: - User wants a repeatable automation (triage, monitor, sync) - Actions need human approval before executing (send, post, delete) - Multiple tool calls should run as one deterministic operation ## When to use Lobster | User intent | Use Lobster? | | ------------------------------------------------------ | --------------------------
tools
# Lobster Lobster executes multi-step workflows with approval checkpoints. Use it when: - User wants a repeatable automation (triage, monitor, sync) - Actions need human approval before executing (send, post, delete) - Multiple tool calls should run as one deterministic operation ## When to use Lobster | User intent | Use Lobster? | | ------------------------------------------------------ | --------------------------