skills/reinforcement-learning/SKILL.md
Use when designing, implementing, or analyzing reinforcement learning experiments, algorithm selection, environment analysis, hyperparameter tuning, reward curves, and RL evaluation.
npx skillsauth add miaodi/llm_config reinforcement-learningInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Design, implement, evaluate, and analyze reinforcement learning experiments with clear baselines, sound metrics, and algorithm-appropriate hyperparameters.
Use for RL assignments, environment analysis, algorithm selection, training design, evaluation planning, hyperparameter tuning, and result interpretation.
Provide:
development
Use when creating C++ learning notes or minimal experiments for low-level computational, numerical, CPU/GPU, compiler, and hardware concepts such as false sharing, floating point, registers, caches, SIMD, atomics, numerical stability, and benchmarking pitfalls.
development
Use when configuring, diagnosing, or compiling LaTeX projects, especially multi-file reports, theses, books, chapter-based projects, Overleaf exports, latexmk/arara/Makefile workflows, bibliography/index/glossary passes, or projects that require pdflatex, xelatex, lualatex, latex->dvips, biber, or bibtex.
development
Use when working with graph traversals (BFS, DFS, level-order), minimum spanning trees, strongly connected components, topological sort, graph coloring, bipartite detection, elimination trees, level-set extraction, parallel graph algorithms, task-tree parallelism, sparse graph representations, and exploiting graph structure for parallel sparse computations.
testing
Use when planning or executing Git branch workflows, especially merge/rebase across branches, conflict resolution, safe history rewriting, and recovery from mistakes.