.cursor/skills/examining-discovery-spaces/SKILL.md
End-to-end workflow to examine and summarise an ado discoveryspace — fetch space YAML, describe entity and measurement space structure, assess sampling coverage, export measurement data, and find related resources. Use when the user asks to inspect, summarise, debug, or analyse a discoveryspace; wants to understand dimensions, experiments, or sampling coverage; provides a space ID or asks to use --use-latest for the current space.
npx skillsauth add ibm/ado examining-discovery-spacesInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Structured workflow for understanding what a discoveryspace contains, how covered its entity space is, and what data has been collected.
uv run.reports/<ado_context_name>/ (create the
directory if needed)
ado_context_name is the
active ado metastore context (uv run ado context)<SPACEID>_<YYYY-MM-DD>_report.mdRelated skills:
When working with the data from a discoveryspace the following distinctions are important.
The samplestore used by a discovery space is shared. This means there may be relevant measurement data in the samplestore for entities in the space but that measurement has not been performed by an operation on the space (it was performed on another).
Why is it useful to work with matching data?
To apply this skill you need either:
(a) a space id; (b) explicit instruction to examine “the latest” space
In the case of (b) get the actual identifier:
uv run ado show related space --use-latest
ado get-o yaml flag outputs YAML to console. It's often useful to redirect
this to a temporary file and work with that to avoid multiple ado get calls
for same YAML.
The output produced by '-o/--output' can be very large e.g. from "show entities".
Use the --output-file flag with the name of the file where to save the output
and, when inspecting these files:
Run Step 2 and 3 first. Then steps 4,5 and 6 can be run in parallel.
uv run ado get space SPACE_ID -o yaml
Extract and summarise:
Execute
uv run ado show details space SPACE_ID
This outputs two sections:
DETAILS — sampling coverage:
Compare measured vs total to understand exploration progress. Compare measured vs matching to understand memoization opportunities. Also, a signal that other overlapping spaces exist.
RELATED RESOURCES — all operations and stores linked to this space.
Performance note:
ado show details spaceis slow as it fetches and aggregates entity data. Use only when sampling coverage is needed.
reports/<ado_context_name>/ado get space --matching-space-id SPACE_ID --details finds spaces with the
same entity structure. Use this to understand research progression and why this
space was created.
uv run ado get space --matching-space-id SPACE_ID --details
Note: Keep in mind the guidelines on large output files for the following.
uv run ado show entities space SPACE_ID \
--include measured \
--property-format target \
-o csv --output-file SPACE_ID_entities.csv
This writes the data to SPACE_ID_entities.csv. If you find SPACE_ID_entities.csv
already exists do not use it, as data may be stale
You can also get lists of all unmeasured or missing entities, though this is not typically required unless you want to analyse the unsampled portion.
Perform an analysis of the measurements, checking e.g. distributions of metrics, metric outliers, correlations between metrics. Take into account the domain of the experiment and meaning of metrics when looking for patterns.
For each related operation (output in step 2), use the examining-ado-operations skill to understand what each operation did and what it produced.
Note: Do not analyze the data in the operations, or do detailed diagnoses. Just enough for summary.
Structure the report as:
development
Builds a picture of work in an ado project: activity volume, spaces and operations created over time, experiments and operation configs used etc. Use to create a project/context overview report, summarize what the team has been doing in an ado project, report trends across spaces/operations, or to onboard onto an ado project.
tools
Guidelines for using ado CLI commands and documenting them correctly. Use when writing documentation that includes ado commands, verifying CLI syntax, or explaining ado CLI usage patterns to users.
testing
Guidance for creating ado resource YAML files (discoveryspace, operation, actuatorconfiguration, samplestore). Covers metadata conventions, dynamic reference resolution with --use-latest/--with/--set, space design principles, avoiding duplicate resources, and validation. Use when creating or editing any ado resource YAML file.
tools
Run ado operations on remote Ray clusters using --remote execution context files. Use when the user wants to create an operation, asks about remote clusters, wants to ship local plugins or data files to a cluster, or asks about execution context YAML files. Also applies proactively when creating an operation if execution context files are present in the workspace.