
a5c-ai/assessment-item-development

Name: assessment-item-development
Author: a5c-ai

library/specializations/domains/social-sciences-humanities/education/skills/assessment-item-development/SKILL.md

npx skillsauth add a5c-ai/babysitter assessment-item-development


Assessment Item Development

Create valid, reliable assessment items across formats including multiple choice, constructed response, and performance tasks following psychometric best practices.

Overview

This skill enables the development of high-quality assessment items that accurately measure learning outcomes. It encompasses item writing across formats, alignment with objectives, and application of psychometric principles to create valid and reliable assessments.

Capabilities

Multiple Choice Items

Write clear, unambiguous stems
Develop plausible distractors
Avoid item-writing flaws
Address various cognitive levels
Apply item analysis principles

Constructed Response

Design short-answer items
Create essay prompts
Develop case-based questions
Write open-ended problems
Create scoring guidelines

Performance Tasks

Design authentic tasks
Develop task specifications
Create rubrics and scoring guides
Plan administration conditions
Document task requirements

Quality Assurance

Review for bias and sensitivity
Verify content alignment
Apply item statistics
Conduct item review
Document item metadata
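
As an illustration of "Apply item statistics", the following is a minimal sketch of two classical statistics commonly used in item review: the difficulty index (proportion correct) and the point-biserial discrimination. The skill itself does not prescribe these formulas or any code; the function names and the pilot data are hypothetical, and the total score here is assumed to include the item being analyzed.

```python
from statistics import mean, pstdev


def item_difficulty(item_scores):
    """Classical difficulty index: proportion of examinees who answered correctly."""
    return mean(item_scores)


def point_biserial(item_scores, total_scores):
    """Point-biserial correlation between a 0/1 item score and the total test score.

    Returns None when the statistic is undefined (everyone or no one answered
    correctly, or the total scores have no spread).
    """
    p = mean(item_scores)
    sd = pstdev(total_scores)  # population standard deviation of the totals
    if p == 0 or p == 1 or sd == 0:
        return None
    correct = [t for s, t in zip(item_scores, total_scores) if s == 1]
    incorrect = [t for s, t in zip(item_scores, total_scores) if s == 0]
    return (mean(correct) - mean(incorrect)) / sd * (p * (1 - p)) ** 0.5


# Hypothetical pilot data: one item scored 0/1 and each examinee's total score.
item = [1, 1, 1, 0, 0]
totals = [9, 8, 7, 4, 2]
print(f"difficulty = {item_difficulty(item):.2f}")
print(f"discrimination = {point_biserial(item, totals):.2f}")
```

A very high or very low difficulty value, or a discrimination near zero or negative, would typically flag the item for another round of review.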

Usage Guidelines

Item Development Process

Review learning objectives
Select appropriate item format
Draft items following guidelines
Review and revise items
Pilot test when possible
Analyze and refine
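
The process above can be sketched as a simple stage tracker. This is only an illustration under stated assumptions: the Stage names and the next_stage helper are hypothetical, not part of the skill, and "Pilot test when possible" is modeled as an optional step that is skipped when no pilot is available.

```python
from enum import Enum


class Stage(Enum):
    """Hypothetical stages mirroring the six process steps."""
    OBJECTIVES_REVIEWED = 1
    FORMAT_SELECTED = 2
    DRAFTED = 3
    REVIEWED = 4
    PILOTED = 5
    ANALYZED = 6


def next_stage(current, pilot_available=True):
    """Return the stage that follows `current`, skipping the pilot if unavailable."""
    if current is Stage.ANALYZED:
        raise ValueError("item has already completed the process")
    if current is Stage.REVIEWED and not pilot_available:
        return Stage.ANALYZED
    return Stage(current.value + 1)


# Walk one item through the process without a pilot test.
stage = Stage.OBJECTIVES_REVIEWED
path = [stage]
while stage is not Stage.ANALYZED:
    stage = next_stage(stage, pilot_available=False)
    path.append(stage)
print(" -> ".join(s.name for s in path))
```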

Multiple Choice Guidelines

Provide a single correct answer
Keep answer choices parallel in structure
Avoid "all of the above"
Randomize the position of the correct answer
Keep options similar in length
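
Some of these guidelines lend themselves to automated checks. The sketch below is one possible approach, not the skill's own tooling: the MultipleChoiceItem class, the check_item and shuffle_options helpers, and the max_length_ratio threshold are all hypothetical. Parallel structure is a judgment call and is left to human review.

```python
import random
from dataclasses import dataclass
from typing import List, Set


@dataclass
class MultipleChoiceItem:
    stem: str
    options: List[str]
    correct: Set[int]  # indexes of the keyed (correct) options


def check_item(item, max_length_ratio=2.0):
    """Return a list of guideline violations found in the item."""
    problems = []
    if len(item.correct) != 1:
        problems.append("item must have exactly one correct answer")
    if any(o.strip().rstrip(".").lower() == "all of the above" for o in item.options):
        problems.append('avoid "all of the above"')
    lengths = [len(o) for o in item.options]
    if lengths and min(lengths) > 0 and max(lengths) / min(lengths) > max_length_ratio:
        problems.append("options differ too much in length")
    return problems


def shuffle_options(item, rng):
    """Return a copy of the item with options in random order and the key remapped."""
    order = list(range(len(item.options)))
    rng.shuffle(order)
    new_options = [item.options[i] for i in order]
    new_correct = {order.index(i) for i in item.correct}
    return MultipleChoiceItem(item.stem, new_options, new_correct)


good = MultipleChoiceItem(
    "Which city is the capital of France?",
    ["Paris", "Lyon", "Nice", "Lille"],
    {0},
)
bad = MultipleChoiceItem("Pick one.", ["A", "All of the above"], {0, 1})
print(check_item(good))
print(check_item(bad))
shuffled = shuffle_options(good, random.Random(0))
print(shuffled.options, shuffled.correct)
```

Passing an explicit random.Random instance keeps the shuffle reproducible, which helps when the same form has to be regenerated later.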

Constructed Response Guidelines

State task requirements clearly
Define specific scoring criteria
Keep the scope appropriate
Provide sufficient context
Make model responses available
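
To make "specific scoring criteria" concrete, here is a minimal sketch of an analytic rubric represented as data, with a helper that validates and totals a rater's scores. The Criterion class, the score_response function, and the example criteria are hypothetical illustrations, not part of the skill.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class Criterion:
    name: str
    max_points: int
    descriptor: str  # what a full-credit response looks like


def score_response(rubric: List[Criterion], awarded: Dict[str, int]) -> int:
    """Sum the awarded points after checking every criterion was scored in range."""
    total = 0
    for criterion in rubric:
        if criterion.name not in awarded:
            raise ValueError(f"missing score for criterion: {criterion.name}")
        points = awarded[criterion.name]
        if not 0 <= points <= criterion.max_points:
            raise ValueError(f"score out of range for criterion: {criterion.name}")
        total += points
    return total


# Hypothetical rubric for a short essay prompt.
rubric = [
    Criterion("claim", 2, "States a clear, defensible claim"),
    Criterion("evidence", 3, "Supports the claim with relevant evidence"),
    Criterion("reasoning", 3, "Explains how the evidence supports the claim"),
]
total = score_response(rubric, {"claim": 2, "evidence": 2, "reasoning": 1})
print(f"{total} / {sum(c.max_points for c in rubric)}")
```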

Integration Points

Related Processes

Formative Assessment Design
Summative Assessment Development
Item Writing and Test Development

Collaborating Skills

learning-objectives-writing
rubric-design-validation
learning-analytics-interpretation

References

Item writing guidelines (Haladyna)
ETS item development standards
Psychometric principles
Assessment best practices


514 stars

development

Updated Apr 2, 2026

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

npx skillsauth add a5c-ai/babysitter assessment-item-development

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Trivy: Container and dependency vulnerability scanner. Status: Clean (95%)
Semgrep: Static code analysis for vulnerabilities. Status: Clean (95%)
mcp-scan (Snyk): Model Context Protocol security validation. Status: Clean (95%)
Snyk (dep): Open source security scanning. Status: Skipped (50%)
Socket.dev: Supply chain security analysis. Status: Skipped (50%)
VirusTotal: Multi-engine malware detection. Status: Skipped (50%)
CrowdStrike: Advanced threat intelligence. Status: Skipped (50%)
OSV-Scanner: Open Source Vulnerability database check. Status: Skipped (50%)
OWASP Dep-Check. Status: Skipped (50%)

Last scanned: Apr 2, 2026, 9:42 AM (58.1s, 1 file scanned)

SKILL.md

name: assessment-item-development
description: Create valid, reliable assessment items across formats (multiple choice, constructed response, performance tasks) following psychometric best practices
allowed-tools: Read, Grep, Write, Edit, Glob


Related Skills

a5c-ai/model-card-generator

development

Verified · Trusted · Community

Model documentation skill for generating model cards following Google's model card framework.

680 · SKILL.md · Updated Apr 28, 2026

a5c-ai/mlflow-experiment-tracker

development

Verified · Trusted · Community

MLflow integration skill for experiment tracking, model registry, and artifact management. Enables LLMs to log experiments, compare runs, manage model lifecycle, and retrieve artifacts through the MLflow API.

680 · SKILL.md · Updated Apr 28, 2026

a5c-ai/lime-explainer

data-ai

Verified · Trusted · Community

LIME-based local explanation skill for individual predictions across tabular, text, and image data.

680 · SKILL.md · Updated Apr 28, 2026

a5c-ai/kubeflow-pipeline-executor

devops

Verified · Trusted · Community

Kubeflow Pipelines skill for ML workflow orchestration, component management, and Kubernetes-native ML.

680 · SKILL.md · Updated Apr 28, 2026


Install manually

# Clone the repo
git clone https://github.com/a5c-ai/babysitter.git

# Create the Claude Code skills folder (global) if it does not exist yet
mkdir -p ~/.claude/skills

# Copy the skill into the Claude Code skills folder
cp -r babysitter/library/specializations/domains/social-sciences-humanities/education/skills/assessment-item-development ~/.claude/skills/

See the official Claude Code Skills documentation for details on the skills path.

Repository

a5c-ai/babysitter

514 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT