
Generate EvalView test cases — either from a SKILL.md file using LLM-powered generation, or by capturing real agent interactions through a proxy.
Start EvalView watch mode to automatically re-run regression checks whenever project files change.
Run EvalView regression checks against golden baselines to detect regressions in AI agent behavior after code, prompt, or model changes.
Performs comprehensive code reviews with security, quality, and best practice checks
A simple skill that creates a greeting file
A skill that helps review code for best practices, bugs, and security issues
Beat procrastination with task breakdown, 2-minute starts, and accountability tracking