studio/services/erotic-image-captioner/SKILL.md
# Erotic Image Captioner Pipeline (Unified Caption Engine) ## Overview The Erotic Image Captioner (EC) has been refactored from a multi-agent manual prompting sequence and complex MCP tool structure into a **unified, high-performance local Caption Engine** (`studio/core/caption_engine/caption_engine.py`). Instead of prompting multiple text-only agents (Kana, Luna, Suki) in-session with large system files or invoking fragmented MCP tools, LND Studio now leverages a single optimized multimodal
npx skillsauth add datdang-dev/light-novel-development studio/services/erotic-image-captionerInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
The Erotic Image Captioner (EC) has been refactored from a multi-agent manual prompting sequence and complex MCP tool structure into a unified, high-performance local Caption Engine (studio/core/caption_engine/caption_engine.py).
Instead of prompting multiple text-only agents (Kana, Luna, Suki) in-session with large system files or invoking fragmented MCP tools, LND Studio now leverages a single optimized multimodal execution pass using the Qwen2-VL local vision engine. Prompts reside as independent templates in studio/prompts/ and are loaded dynamically on-demand based on the specified --type.
This architecture ensures:
_lnd-output/caption_engine.log) suitable for tail -f.--no-stream) to switch between instant single-shot generation and real-time chunk streaming.Unified Architecture:
[User Image] ──> [CaptionEngine (caption_engine.py)]
│
├──► Loads: studio/prompts/{type}.txt (e.g., long_thoughts_v2, manga, etc.)
├──► Resolves base64/local files
└──► Executes local InProcessQwen2VLAdapter
The engine can be invoked directly from the CLI or via automated workflows using the following syntax:
python3 studio/core/caption_engine/caption_engine.py \
--image "/path/to/image.png" \
--type "long_thoughts_v2" \
--mood "AUTO" \
--temp 0.5 \
--max-tokens 4096
--image: Absolute path to the target image file (mandatory).--type: Prompt style from studio/prompts/ (e.g., long_thoughts_v2, manga, json, md_comic).--mood: Mood setting seeds (e.g., AUTO, MANIC, COLD, BRATTY, BROKEN, MASO).--context: Optional backstory or descriptive user context to inject.--temp: Generation temperature (default: 0.5).--max-tokens: Max output generation length (default: 4096).--no-stream: Disables real-time streaming, running in clean single-shot mode.When executing the pipeline, the active agent (typically Nova) or the orchestrator invokes the Caption Engine directly:
mood_seed parameter (default: AUTO).user_context (backstory/constraints).StudioPromptLoader from the target template in studio/prompts/{type}.txt.InProcessQwen2VLAdapter) referencing models/ToriiGate-0.5_Q4_K_L.gguf and mmproj_Q8_0.gguf.--no-stream, tokens are either streamed back in real-time with an ANSI colored dashboard or processed in a single fast block._lnd-output/_captions/ as well as color-logged in _lnd-output/caption_engine.log.For each processed image, the pipeline writes:
_lnd-output/_captions/
└── {image_name}_{timestamp}_{type}.md # Output markdown/JSON formatted report
And appends live execution monitoring data to:
_lnd-output/caption_engine.log # Structured color logs (viewable with tail -f)
studio/prompts/ and calling --type {filename}.development
SillyTavern framework engineering expert — Prompt Manager architecture, character card V3 optimization, context template engineering, lorebook design, and Vietnamese R18 roleplay configuration. Bridges LND Studio character bibles to high-fidelity ST deployments.
documentation
Kuro's R18 Research & Downloader Engine. Harvests fetish references, downloads R18 assets (novels, manga, illustrations), and extracts natural Japanese R18 prose style guides from Syosetu.
development
Ultimate Novel Development Pipeline — A slow-paced, deeply contextualized narrative engine. Orchestrated flow: Kana (Visuals) -> Luna (World/Pacing) -> Suki (Long-form Prose).
data-ai
Luna's Scene Prelude engine — generates an erotic narrative context (micro-scenario) from Kana's forensic data, bridging raw visual analysis to Suki's caption writing. Outputs a structured scenario seed that maximizes erotic tension and narrative coherence.