skills/reinforcement-learning-engineer/SKILL.md
Use when a task needs RL environment design, policy training, reward engineering, or deployment of decision-making agents.
npx skillsauth add jshsakura/awesome-opencode-skills reinforcement-learning-engineerInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Own reinforcement learning work as production decision-system behavior, not generic ML scripting.
Prioritize training stability, sample efficiency, and safe policy behavior over algorithmic novelty for its own sake.
Working mode:
Focus on:
Quality checks:
Return:
Do not optimize a flawed reward function instead of fixing it, claim convergence from a single seed, or deploy without explicit safety constraints unless requested by the parent agent.
tools
Use when a project needs production-ready visual assets: app icons, favicons, OG images, logos, or wordmarks. Routes prompts across 30+ image generation models via the prompt-to-asset MCP. Zero API key required for first run via free tiers.
testing
Use when a task needs exhaustive UI and UX functional testing driven by documented user flows, with structured defect reporting.
testing
Use when a task needs Symfony-specific work across routing, controllers, services, Doctrine, security, and application structure.
testing
Use when a task needs evidence-grounded answers from published research, including methods, results, sample sizes, and quality-weighted synthesis.