plugins/khuym/skills/book-sft-pipeline/SKILL.md
Use when the user asks to fine-tune on books, create an SFT dataset from books, train a style-transfer or author-voice model, extract ePub text, segment long-form book content, or prepare literary data for LoRA or small-model training.
npx skillsauth add hoangnb24/skills book-sft-pipelineInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Convert long-form books into supervised fine-tuning data for literary style transfer. Keep the entrypoint lightweight: use this file to route the work, then open only the references needed for the current phase.
| Setting | Default | |---------|---------| | Chunk size | 150-400 words | | Prompt diversity | 15+ templates, 5+ system prompts | | Variants | 2 per chunk | | Model | Qwen/Qwen3-8B-Base or another base 8B-class model | | LoRA rank | 32 | | Epochs | 3 | | Test set | 50 examples minimum |
Open these only when the task reaches that layer:
Use the sample script when the user wants executable scaffolding:
python plugins/khuym/skills/book-sft-pipeline/scripts/pipeline_example.py
The script demonstrates the same pipeline semantics as this skill: segmentation, diverse prompt construction, Tinker datum construction, and originality checks.
Internal references:
External resources:
Created: 2025-12-26 Last Updated: 2025-12-28 Author: Muratcan Koylan Version: 2.0.0 Standalone: Yes
tools
Use when the user asks to commit everything, smart commit, group commits, organize staged or unstaged changes, create logical commits, push current work, or preserve a clean commit stack from an existing working tree.
development
Use when starting or resuming any Khuym project session, choosing the next Khuym skill, running go mode, checking onboarding/scout state, or enforcing workflow gates. Bootstrap meta-skill for routing across the Khuym agentic development ecosystem.
development
Use when the user wants to turn a rough idea, vague task, feature wish, bug-fix intent, or Codex /goal objective into a clear verifiable goal prompt. Use when the user mentions goal mode, /goal, set_goal, long-running autonomous Codex work, or asks to be interviewed/grilled before creating a goal.
tools
Use when Codex should turn a concept, topic, tutorial, lesson, algorithm, system, workflow, or abstract idea into a playable browser game that teaches through interaction. Use for planning or implementing educational web games with the Game Studio plugin, including concept-to-mechanic translation, gameplay loop design, learning progression, and browser-game QA.