.agents/skills/report-generation/SKILL.md
JSON/HTML/Markdown reports, template system
npx skillsauth add ronniegeraghty/hyoka report-generationInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Hyoka generates multi-format evaluation reports (JSON, HTML, Markdown) from a unified internal representation. Reports capture the entire evaluation timeline: generation, build, grading, and review phases.
The canonical format — contains all raw data and is used to generate other formats:
{
"metadata": {
"run_id": "eval-2024-04-06-123456",
"timestamp": "2024-04-06T12:34:56Z",
"prompt": {
"id": "identity-dp-python-default-credential",
"service": "identity",
"language": "python"
},
"config": {
"name": "baseline/claude-opus-4.6"
}
},
"generation": {
"status": "success",
"duration_ms": 45000,
"code": "# Generated Python code...",
"action_timeline": {
"events": [...],
"summary": {...}
}
},
"build": {
"status": "success",
"duration_ms": 5000,
"stdout": "Compilation successful",
"stderr": ""
},
"graders": [
{
"kind": "behavior",
"name": "tool_compliance",
"pass": true,
"score": 1.0,
"message": "All required tools used"
}
],
"review": {
"status": "success",
"reviewers": [
{
"model": "claude-opus-4.6",
"score": 0.9,
"findings": "Code is mostly correct..."
}
],
"consensus_score": 0.85
}
}
Browser-viewable report with interactive elements:
<!DOCTYPE html>
<html>
<head>
<title>Evaluation Report: identity-dp-python</title>
<style>...</style>
</head>
<body>
<div class="report-container">
<h1>Evaluation Report</h1>
<section class="generation">
<h2>Generation Phase</h2>
<code>{{ .Code }}</code>
<div class="timeline">...</div>
</section>
<section class="graders">
<h2>Grading Results</h2>
...
</section>
</div>
<script>
// Interactive features (expand/collapse, filtering)
</script>
</body>
</html>
Human-readable text format for documentation and sharing:
# Evaluation Report: identity-dp-python
## Generation
- Status: Success
- Duration: 45s
## Code
\`\`\`python
# Generated code...
\`\`\`
## Graders
| Grader | Status | Score | Message |
|--------|--------|-------|---------|
| behavior | Pass | 1.0 | All required tools used |
| lint | Pass | 0.95 | 1 warning |
## Review Panel
- Claude Opus 4.6: 0.9
- GPT-5.4: 0.85
- Consensus: 0.875
EvaluationResult with all phase outputsreports/{run_id}/report.jsonReports use Go's text/template package:
// report.go
type ReportTemplate struct {
HTML *template.Template
Markdown *template.Template
}
func (rt *ReportTemplate) RenderHTML(data EvaluationResult) (string, error) {
var buf bytes.Buffer
if err := rt.HTML.Execute(&buf, data); err != nil {
return "", err
}
return buf.String(), nil
}
Templates can access all fields from EvaluationResult:
{{- .Metadata.RunID }}
{{- .Generation.Status }}
{{- range .Graders }}
{{- .Kind }}: {{- .Score }}
{{- end }}
Reports are written to:
reports/
{run_id}/
report.json # Canonical data
report.html # Browser view
report.md # Text format
assets/
style.css
script.js
For batch runs (multiple prompts × configs):
{
"batch_metadata": {
"run_date": "2024-04-06",
"prompts": 5,
"configs": 2,
"total_evals": 10
},
"summary": {
"avg_score": 0.82,
"pass_count": 8,
"fail_count": 2
},
"reports": [
{ "prompt_id": "...", "config": "...", "result": {...} }
]
}
The rerender command regenerates HTML/Markdown from saved JSON:
go run ./hyoka rerender --report-id eval-2024-04-06-123456 --format html
This allows updating templates without re-running evaluations.
hyoka/internal/report/report.gohyoka/internal/report/json.gohyoka/internal/report/render.gohyoka/templates/development
Identifies Azure SDK packages in generated code and checks whether they are the latest available versions. Use during code review to catch outdated dependencies.
development
Sets up build environments for generated Azure SDK code samples and attempts to compile/build without modifying generated files. Use during review to verify code compiles correctly.
development
# Java SDK Validation Skill You are a **Java Azure SDK validation reviewer** for generated code samples. Your job is to check whether generated Java code follows modern Azure SDK for Java conventions and flag violations of common anti-patterns that LLMs frequently produce. ## Rules 1. **NEVER modify generated code.** You are evaluating, not fixing. 2. Report all findings honestly — pass or fail with specific evidence. 3. Check every rule below. A single violation in a category means that cate
development
Reads generated Azure SDK code files and adds inline review comments without changing any actual code. Use during code review to annotate quality issues, best practices, and suggestions.