skills/skillxiv-v0.0.2-claude-opus-4.6/editthinker-iterative-reasoning/SKILL.md
Enable image editors to handle complex instructions through iterative critique and refinement cycles. A multimodal LLM critiques editing results, reasons about improvements, and refines instructions until satisfactory output—ideal for instruction-following challenges in visual editing.
npx skillsauth add ADu2021/skillXiv editthinker-iterative-reasoningInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
EditThinker implements a deliberative "Think-while-Edit" cycle where a multimodal language model critiques results, generates reasoning explanations, and produces refined instructions that guide the next iteration. This addresses inherent stochasticity in generation models through multi-turn reasoning rather than attempting perfection in single outputs.
The EditThinker model jointly produces critique scores, reasoning processes, and enhanced instructions:
# Iterative editing framework
class EditThinker:
def __init__(self, vllm_model, editor):
self.vllm = vllm_model # Vision-language model
self.editor = editor # Any compatible image editor
def refine_edit(self, image, instruction, max_iterations=3):
current_image = image
for iteration in range(max_iterations):
# Generate one-step edit
current_image = self.editor.apply(current_image, instruction)
# Get critique and reasoning
critique = self.vllm.critique(image, current_image)
reasoning = self.vllm.explain_improvements(critique)
# Generate enhanced instruction
instruction = self.vllm.refine_instruction(
instruction, reasoning, critique
)
if critique.satisfactory:
break
return current_image, instruction
Training uses reinforcement learning to align the model's reasoning with actual editing improvements, generating more targeted instruction modifications.
testing
Uses flow maps as look-ahead operators to enable principled reward-guided diffusion by predicting trajectory endpoints at any denoising step. Deploy when applying rewards or preferences to diffusion trajectories with meaningful gradients throughout generation.
testing
Train language models where each expert learns independently on closed datasets, enabling flexible inference with selective data inclusion or exclusion. 41% performance improvement while allowing users to opt out of specific data sources without retraining.
data-ai
Understand how token generation flexibility in diffusion LMs paradoxically constrains reasoning, as models exploit ordering flexibility to avoid uncertain tokens, and apply simplified approaches that preserve parallel decoding benefits. Use when optimizing diffusion-based language models for reasoning tasks.
devops
Enable LLM agents to improve continuously during deployment by constructing structured experience libraries through self-reflection on successes and failures—achieving 23% improvement on reasoning without gradient-based parameter updates or external training.