Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

sentenz/go-benchmark-testing

Name: go-benchmark-testing
Author: sentenz

skills/go-benchmark-testing/SKILL.md

npx skillsauth add sentenz/skills go-benchmark-testing

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Benchmark Testing

Instructions for AI coding agents on automating benchmark test creation using consistent software testing patterns in this Go project.

1. Benefits
2. Principles
- 2.1. FIRST
3. Patterns
- 3.1. Microbenchmarking
- 3.2. Comparative Benchmarking
- 3.3. Memory Profiling
- 3.4. Statistical Benchmarking
- 3.5. Sub-benchmarks
- 3.6. Table-Driven Testing
4. Workflow
5. Commands
6. Style Guide
7. Template
- 7.1. Multi-Scenario Benchmarks
- 7.2. Simple Benchmarks
- 7.3. Benchmarks with Validation
8. References

1. Benefits

Performance Measurement

Benchmark tests measure the execution time and memory allocation of functions, providing quantifiable metrics for performance analysis.
Regression Detection

Continuous benchmarking helps identify performance regressions early in the development cycle before they reach production.
Optimization Guidance

Benchmark results guide optimization efforts by identifying bottlenecks and quantifying the impact of performance improvements.
Comparative Analysis

Benchmarks enable comparison of different implementations or algorithms to make informed decisions about performance trade-offs.
Resource Profiling

Memory allocation tracking helps identify unnecessary allocations and optimize memory usage patterns.

2. Principles

2.1. FIRST

The FIRST principles for benchmark testing focus on creating reliable and meaningful measurements.

Fast

Benchmark setup and teardown should be minimal and excluded from timing to ensure accurate measurement of the function under test.
Independent

Each benchmark should be self-contained and not depend on shared state or results from other benchmarks to ensure isolated performance measurements.
Repeatable

Benchmarks should produce consistent, comparable results across runs and environments by controlling inputs and avoiding non-deterministic operations.
Self-Validating

Benchmarks should optionally validate results to prevent the compiler from optimizing away the code under measurement.
Timely

Benchmarks should be established before optimization work begins to provide a performance baseline and measure the impact of changes.

3. Patterns

3.1. Microbenchmarking

Microbenchmarking is a software testing technique that measures the performance of small, isolated code units to identify performance characteristics and bottlenecks.

3.2. Comparative Benchmarking

Comparative Benchmarking is a testing approach that compares the performance of different implementations or algorithms side-by-side using consistent workloads.

3.3. Memory Profiling

Memory Profiling is the process of measuring memory allocations and usage patterns during benchmark execution using -benchmem flag.

3.4. Statistical Benchmarking

Statistical Benchmarking uses multiple iterations to calculate statistical measures (mean, variance) to ensure reliable and reproducible results.

3.5. Sub-benchmarks

Sub-benchmarks organize related benchmark cases using b.Run() to group variations of the same function with different input scenarios.

3.6. Table-Driven Testing

Table-Driven Testing is a software testing technique in which benchmark cases are organized in a tabular format to systematically cover different input scenarios.

4. Workflow

Identify

Identify performance-critical functions in pkg/ or internal/ that benefit from performance tracking (e.g., pkg/<package>/<file>.go).
Add/Create

Create benchmark tests in the same package (e.g., pkg/<package>/<file>_test.go).
Benchmark Test Coverage Requirements

Focus on functions that:
- Are called frequently in hot paths
- Perform mathematical operations or calculations
- Process data structures or collections
- Have multiple implementation approaches to compare
- Are candidates for optimization
Apply Templates

Structure all benchmark tests using the template pattern.
Baseline Measurements

Establish performance baselines by running benchmarks on stable code before making changes.

5. Commands

| Command | Description | | --------------------------------------------------------------- | -------------------------------------------------- | | make go-test-bench | Execute all benchmarks with memory statistics | | go test -bench=BenchmarkPercent -benchmem ./pkg/percent | Execute a specific benchmark function | | go test -bench=. -benchmem -cpuprofile=cpu.prof ./pkg/percent | Generate CPU profile for performance analysis | | go test -bench=. -benchmem -memprofile=mem.prof ./pkg/percent | Generate memory profile for allocation analysis | | go test -bench=. -benchtime=10s ./pkg/percent | Run benchmarks for a specific duration | | benchstat old.txt new.txt | Compare benchmark results before and after changes |

6. Style Guide

Test Framework

Use the standard Go testing package with testing.B for benchmark tests.
Include Imports

Include testing and any packages needed for the function under test.
Benchmark Function Naming

Name benchmark functions with the Benchmark prefix followed by the function name (e.g., BenchmarkPercent for testing Percent()).
Benchmark Loop

Use b.Loop() to control the number of iterations. The testing framework automatically adjusts the loop iterations to get reliable timing measurements. b.Loop() is preferred over b.N as it provides better integration with the testing framework and more accurate measurements. Unlike b.N-style benchmarks, b.Loop() integrates timer management, it automatically handles b.ResetTimer() at the loop's start and b.StopTimer() at its end, eliminating the need to manually manage the benchmark timer for setup and cleanup code.
Timer Control

When using b.Loop(), timer management is automatic and no manual b.ResetTimer(), b.StopTimer(), or b.StartTimer() calls are needed for typical benchmarks. For advanced scenarios not using b.Loop(), use b.ResetTimer() to exclude setup time from measurements and b.StopTimer()/b.StartTimer() to exclude specific operations.
Sub-benchmarks

Use b.Run() to organize related benchmark cases with different input scenarios. Each sub-benchmark runs independently with its own b.N iterations.
Memory Reporting

Use b.ReportAllocs() to report memory allocations per operation when not using -benchmem flag.
Result Validation

Optionally validate results in benchmarks to prevent compiler optimizations from eliminating dead code.

7. Template

Use this template for new benchmark test functions. Replace placeholders with actual values and adjust as needed for the use case.

7.1. Multi-Scenario Benchmarks

For benchmarking multiple scenarios or input variations, use sub-benchmarks with table-driven approach.

func Benchmark<FunctionName>(b *testing.B) {
	// Define benchmark cases with different scenarios
	benchmarks := []struct {
		name   string
		param1 <type>
		param2 <type>
		// Add more parameters as needed
	}{
		{
			name:   "scenario description 1",
			param1: <value1>,
			param2: <value2>,
		},
		{
			name:   "scenario description 2",
			param1: <value1>,
			param2: <value2>,
		},
		// Add more benchmark cases
	}

	for _, bm := range benchmarks {
		b.Run(bm.name, func(b *testing.B) {
			// Arrange
			// Setup code here (automatically excluded from timing by b.Loop)

			// Act
			for b.Loop() {
				_, _ = <Function>(bm.param1, bm.param2)
			}
		})
	}
}

7.2. Simple Benchmarks

For benchmarking a single scenario, use a simple loop without sub-benchmarks.

func Benchmark<FunctionName>(b *testing.B) {
	// Arrange
	// Setup code here (automatically excluded from timing by b.Loop)
	param1 := <value1>
	param2 := <value2>

	// Act
	for b.Loop() {
		_, _ = <Function>(param1, param2)
	}
}

7.3. Benchmarks with Validation

For benchmarks that need to prevent compiler optimizations, store results in package-level variables.

var (
	benchResult <type>
	benchError  error
)

func Benchmark<FunctionName>(b *testing.B) {
	// Arrange
	// Setup code here (automatically excluded from timing by b.Loop)
	param1 := <value1>
	param2 := <value2>

	// Act
	for b.Loop() {
		benchResult, benchError = <Function>(param1, param2)
	}
}

8. References

Go Benchmarks documentation.
Go testing.B package documentation.
Go benchstat tool documentation.

sentenz/go-benchmark-testing

skills/go-benchmark-testing/SKILL.md

Automates benchmark test creation for Go projects using the standard testing package with consistent software testing patterns. Use when creating performance benchmarks, profiling tests, or when the user mentions benchmarking, performance testing, or optimization.

1 stars

development

Updated Apr 16, 2026

$ install --global

skillsauth

npx skillsauth add sentenz/skills go-benchmark-testing

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 16, 2026, 3:37 AM7.7s1 file scanned

SKILL.md

name:: go-benchmark-testing
description:: Automates benchmark test creation for Go projects using the standard testing package with consistent software testing patterns. Use when creating performance benchmarks, profiling tests, or when the user mentions benchmarking, performance testing, or optimization.
version:: 1.0.0
implicit:: true
priority:: 2
languages:: ["go", "golang"]
paths:: ["pkg/**/*_test.go", "internal/**/*_test.go"]
prompt_regex:: (?i)(benchmark|benchmarking|performance test|profiling|microbenchmark|optimization|performance)
load_on_prompt:: true
autodispatch:: true

Benchmark Testing

Instructions for AI coding agents on automating benchmark test creation using consistent software testing patterns in this Go project.

1. Benefits
2. Principles
- 2.1. FIRST
3. Patterns
- 3.1. Microbenchmarking
- 3.2. Comparative Benchmarking
- 3.3. Memory Profiling
- 3.4. Statistical Benchmarking
- 3.5. Sub-benchmarks
- 3.6. Table-Driven Testing
4. Workflow
5. Commands
6. Style Guide
7. Template
- 7.1. Multi-Scenario Benchmarks
- 7.2. Simple Benchmarks
- 7.3. Benchmarks with Validation
8. References

1. Benefits

Performance Measurement

Benchmark tests measure the execution time and memory allocation of functions, providing quantifiable metrics for performance analysis.
Regression Detection

Continuous benchmarking helps identify performance regressions early in the development cycle before they reach production.
Optimization Guidance

Benchmark results guide optimization efforts by identifying bottlenecks and quantifying the impact of performance improvements.
Comparative Analysis

Benchmarks enable comparison of different implementations or algorithms to make informed decisions about performance trade-offs.
Resource Profiling

Memory allocation tracking helps identify unnecessary allocations and optimize memory usage patterns.

2. Principles

2.1. FIRST

The FIRST principles for benchmark testing focus on creating reliable and meaningful measurements.

Fast

Benchmark setup and teardown should be minimal and excluded from timing to ensure accurate measurement of the function under test.
Independent

Each benchmark should be self-contained and not depend on shared state or results from other benchmarks to ensure isolated performance measurements.
Repeatable

Benchmarks should produce consistent, comparable results across runs and environments by controlling inputs and avoiding non-deterministic operations.
Self-Validating

Benchmarks should optionally validate results to prevent the compiler from optimizing away the code under measurement.
Timely

Benchmarks should be established before optimization work begins to provide a performance baseline and measure the impact of changes.

3. Patterns

3.1. Microbenchmarking

Microbenchmarking is a software testing technique that measures the performance of small, isolated code units to identify performance characteristics and bottlenecks.

3.2. Comparative Benchmarking

Comparative Benchmarking is a testing approach that compares the performance of different implementations or algorithms side-by-side using consistent workloads.

3.3. Memory Profiling

Memory Profiling is the process of measuring memory allocations and usage patterns during benchmark execution using -benchmem flag.

3.4. Statistical Benchmarking

Statistical Benchmarking uses multiple iterations to calculate statistical measures (mean, variance) to ensure reliable and reproducible results.

3.5. Sub-benchmarks

Sub-benchmarks organize related benchmark cases using b.Run() to group variations of the same function with different input scenarios.

3.6. Table-Driven Testing

Table-Driven Testing is a software testing technique in which benchmark cases are organized in a tabular format to systematically cover different input scenarios.

4. Workflow

Identify

Identify performance-critical functions in pkg/ or internal/ that benefit from performance tracking (e.g., pkg/<package>/<file>.go).
Add/Create

Create benchmark tests in the same package (e.g., pkg/<package>/<file>_test.go).
Benchmark Test Coverage Requirements

Focus on functions that:
- Are called frequently in hot paths
- Perform mathematical operations or calculations
- Process data structures or collections
- Have multiple implementation approaches to compare
- Are candidates for optimization
Apply Templates

Structure all benchmark tests using the template pattern.
Baseline Measurements

Establish performance baselines by running benchmarks on stable code before making changes.

5. Commands

6. Style Guide

Test Framework

Use the standard Go testing package with testing.B for benchmark tests.
Include Imports

Include testing and any packages needed for the function under test.
Benchmark Function Naming

Name benchmark functions with the Benchmark prefix followed by the function name (e.g., BenchmarkPercent for testing Percent()).
Benchmark Loop

Use b.Loop() to control the number of iterations. The testing framework automatically adjusts the loop iterations to get reliable timing measurements. b.Loop() is preferred over b.N as it provides better integration with the testing framework and more accurate measurements. Unlike b.N-style benchmarks, b.Loop() integrates timer management, it automatically handles b.ResetTimer() at the loop's start and b.StopTimer() at its end, eliminating the need to manually manage the benchmark timer for setup and cleanup code.
Timer Control

When using b.Loop(), timer management is automatic and no manual b.ResetTimer(), b.StopTimer(), or b.StartTimer() calls are needed for typical benchmarks. For advanced scenarios not using b.Loop(), use b.ResetTimer() to exclude setup time from measurements and b.StopTimer()/b.StartTimer() to exclude specific operations.
Sub-benchmarks

Use b.Run() to organize related benchmark cases with different input scenarios. Each sub-benchmark runs independently with its own b.N iterations.
Memory Reporting

Use b.ReportAllocs() to report memory allocations per operation when not using -benchmem flag.
Result Validation

Optionally validate results in benchmarks to prevent compiler optimizations from eliminating dead code.

7. Template

Use this template for new benchmark test functions. Replace placeholders with actual values and adjust as needed for the use case.

7.1. Multi-Scenario Benchmarks

For benchmarking multiple scenarios or input variations, use sub-benchmarks with table-driven approach.

func Benchmark<FunctionName>(b *testing.B) {
	// Define benchmark cases with different scenarios
	benchmarks := []struct {
		name   string
		param1 <type>
		param2 <type>
		// Add more parameters as needed
	}{
		{
			name:   "scenario description 1",
			param1: <value1>,
			param2: <value2>,
		},
		{
			name:   "scenario description 2",
			param1: <value1>,
			param2: <value2>,
		},
		// Add more benchmark cases
	}

	for _, bm := range benchmarks {
		b.Run(bm.name, func(b *testing.B) {
			// Arrange
			// Setup code here (automatically excluded from timing by b.Loop)

			// Act
			for b.Loop() {
				_, _ = <Function>(bm.param1, bm.param2)
			}
		})
	}
}

7.2. Simple Benchmarks

For benchmarking a single scenario, use a simple loop without sub-benchmarks.

func Benchmark<FunctionName>(b *testing.B) {
	// Arrange
	// Setup code here (automatically excluded from timing by b.Loop)
	param1 := <value1>
	param2 := <value2>

	// Act
	for b.Loop() {
		_, _ = <Function>(param1, param2)
	}
}

7.3. Benchmarks with Validation

For benchmarks that need to prevent compiler optimizations, store results in package-level variables.

var (
	benchResult <type>
	benchError  error
)

func Benchmark<FunctionName>(b *testing.B) {
	// Arrange
	// Setup code here (automatically excluded from timing by b.Loop)
	param1 := <value1>
	param2 := <value2>

	// Act
	for b.Loop() {
		benchResult, benchError = <Function>(param1, param2)
	}
}

8. References

Go Benchmarks documentation.
Go testing.B package documentation.
Go benchstat tool documentation.

Related Skills

sentenz/threat-modeling-ics

tools

VerifiedTrustedCommunity

Performs end-to-end threat modeling for OT/ICS systems from Microsoft Threat Modeling Tool (TMT) threat-list exports (`*.csv`) and model files (`*.tm7`). Uses TMT and STRIDE for initial threat enumeration, then enriches each threat with OT/ICS context, MITRE ATT&CK for ICS mappings, MITRE EMB3D device-property threat enrichment for embedded field devices, CWE weakness classification, CVSS v4.0 scoring, Likelihood of Exploit, Risk-based Prioritization via a Risk Matrix, minimum-capable Threat Actor assignment, Risk Treatment decisions, and OT impact categories ranging from Denial of View to Physical Damage to Property.

2SKILL.mdUpdated Apr 16, 2026

sentenz/threat-modeling-ics

sentenz/adr

development

VerifiedTrustedCommunity

Creates and maintains Architecture Decision Records (ADRs) following a structured format with State, Context, Decision, Considered, Consequences, Implementation, and References sections. Supports single-option decisions, multi-option decisions within one decision scope, multiple complementary decisions, and deferred decisions. Use when creating, updating, or reviewing architectural decisions, or when the user mentions ADR, architecture decisions, technical decisions, or design records.

1SKILL.mdUpdated Apr 19, 2026

sentenz/go-unit-testing

development

VerifiedTrustedCommunity

Automates unit test creation for Go projects using the standard testing package with consistent software testing patterns including In-Got-Want, Table-Driven Testing, and AAA patterns. Use when creating, modifying, or reviewing unit tests, or when the user mentions unit tests, test coverage, or Go testing.

1SKILL.mdUpdated Apr 16, 2026

sentenz/go-unit-testing

sentenz/go-fuzz-testing

development

VerifiedTrustedCommunity

Automates fuzz test creation for Go projects using Go's native fuzzing engine with consistent software testing patterns. Use when creating fuzz tests, mutation testing, or when the user mentions fuzzing, coverage-guided testing, or property-based testing.

1SKILL.mdUpdated Apr 16, 2026

sentenz/go-fuzz-testing

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/sentenz/skills.git

# Copy into Claude Code skills folder (global)
cp -r skills/skills/go-benchmark-testing ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

sentenz/skills

1 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT