Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

ersilia-os/ersilia-metadata

Name: ersilia-metadata
Author: ersilia-os

skills/ersilia-metadata/SKILL.md

npx skillsauth add ersilia-os/claude-ersilia-skills ersilia-metadata

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Ersilia Model Metadata Filler

Your job is to fill in specific fields of an Ersilia model's metadata.yml file using information extracted from the original publication (PDF) and the original source code repository.

What you receive

The user will provide:

A path to the metadata.yml file (already partially filled from the model request)
A path or URL to the original publication (PDF)
A URL to the original source code repository (GitHub or similar)

What you must fill in

Fill only these fields — leave everything else exactly as it is:

| Field | Type | Accepted values | |-------|------|-----------------| | Deployment | list | Local, Online | | Source | string | Local, Online | | Source Type | string | External, Internal, Replicated | | Task | string | Annotation, Representation, Sampling | | Subtask | string | Property calculation or prediction, Activity prediction, Featurization, Projection, Similarity search, Generation | | Output | list | Score, Value, Compound, Text | | Output Dimension | integer | number of output values per input compound | | Output Consistency | string | Fixed, Variable | | Interpretation | string | free text | | Biomedical Area | list | free text (disease areas, application areas) | | Target Organism | list | scientific names or Any | | Publication Type | string | Peer reviewed, Preprint, Other | | Publication Year | integer | year |

Never modify: Identifier, Slug, Status, Title, Description, Tag, Publication, Source Code, License, Contributor, and any auto-populated fields (Incorporation Date, S3, DockerHub, Model Size, etc.)

Step-by-step workflow

1. Read the existing metadata.yml

Note the Source Code URL and Publication URL — you will need them. Confirm which fields still need filling (they may have template placeholder text like "Biomedical Area 1" or multiple values comma-separated).

2. Read the publication

Use the PDF reading tools to extract:

What the model predicts (property, activity, representation, etc.)
Number of endpoints/outputs (crucial for Output Dimension)
Training dataset and target organisms
Disease or application area
Whether outputs are probabilities, measured values, or generated structures
Year of publication and publication venue (journal vs preprint)

3. Fetch the source code repository

Use WebFetch on the repository URL (and its README, and key code files if needed) to understand:

Number of model endpoints / output columns (cross-check with paper)
Whether the model calls an external API or runs locally
Whether the license/code is from external authors, the Ersilia team, or re-trained

4. Fill in each field

Work through the fields systematically:

Deployment + Source

Deployment: [Local] and Source: Local for the vast majority of models — the model runs in Ersilia's infrastructure.
Use Online only if the model posts predictions to an external third-party server/API that Ersilia does not control.
Deployment is a list; Source is a single string.

Source Type

External: the model was developed by third-party authors (most models incorporated from published papers).
Internal: developed by the Ersilia team themselves.
Replicated: Ersilia re-trained the model following the original authors' methodology.

Task + Subtask Choose Task first, then the corresponding Subtask:

Annotation → model assigns a label or score to a molecule
- Activity prediction if the output is biological/pharmacological activity
- Property calculation or prediction if the output is a physicochemical or ADMET property
Representation → model encodes a molecule into a numerical vector or projection
- Featurization if it produces a fixed-length embedding/descriptor vector
- Projection if it projects molecules into 2D/3D space
Sampling → model generates or retrieves molecules
- Generation if it generates new molecules
- Similarity search if it retrieves similar molecules from a database

Output

Score: a probability or likelihood (0–1 range, binary classification output)
Value: a numerical measurement (IC50, logP, pKa, molecular weight, descriptors, embeddings…)
Compound: a generated or retrieved molecule (SMILES or InChI)
Text: natural language output
Can be a list if the model has mixed output types. Example: a model returning both a probability and an associated value would be [Score, Value].

Output Dimension The number of output values produced per input compound. Only count continuous numeric outputs (scores, values) — do not count binary class labels separately. So a model with 6 endpoints each returning one probability score has Output Dimension 6, not 12.

This is often explicit in the paper ("6 endpoints", "512-dimensional vector"). If not stated directly:

Count the number of prediction tasks/endpoints described
Check the repository: look at model output shapes, column headers in example outputs, or README tables
For embedding models, look for the vector size (e.g., 512, 1024, 2048)
If the vector size is configurable (e.g., n_components is a user parameter), ask the user — do not guess a default from examples in the docs

Output Consistency

Fixed: the model always returns the same output for the same input (most QSAR models, classifiers, regression models)
Variable: the model is stochastic and may return different outputs on repeated runs (generative models, models with dropout at inference, sampling-based methods)

Interpretation Write one short sentence describing what the output means and how to read it. Keep it under ~20 words.

Good examples:

Higher score indicates greater predicted probability of anti-malarial activity.
100 features encoding molecular structure from a pretrained MACAW autoencoder.
Predicted probability of AMES mutagenicity; values closer to 1 indicate higher risk.

Biomedical Area List the relevant therapeutic or research areas. Use specific disease names or application areas rather than generic terms. Examples: Malaria, Tuberculosis, ADMET, COVID-19, Solubility, Toxicity. Use Any only if the model is truly domain-agnostic (e.g., a general-purpose molecular featurizer with no disease focus).

Target Organism Use full scientific names where applicable:

Pathogens: Plasmodium falciparum, Mycobacterium tuberculosis, SARS-CoV-2, etc.
Human studies: Homo sapiens
Animal models: Mus musculus, Rattus norvegicus
Use Any if the model is not organism-specific

Publication Type

Check the Publication URL in the metadata: journal DOIs → Peer reviewed; bioRxiv/ChemRxiv/arXiv links → Preprint
Use Other only in exceptional cases (thesis, technical report)

Publication Year Extract the year of publication from the paper or publication URL.

5. Write the updated metadata.yml

Edit the file in place, replacing only the fields listed above. Keep the YAML formatting consistent with the rest of the file (use list syntax for list fields, plain string for string fields, integer for integer fields). Do not add quotes unless the original file uses them for that field.

6. Show the user what you changed

Print a brief summary of the fields you filled in and the values chosen. If any field required a judgment call or the evidence was ambiguous, say so clearly and invite the user to verify.

When you cannot determine a value

If you've read the paper and the repository and still cannot confidently determine a value (especially Output Dimension), state clearly what you found and what is unclear, and ask the user to clarify. Do not guess or leave placeholder text.

ersilia-os/ersilia-metadata

skills/ersilia-metadata/SKILL.md

Fills in the required metadata fields for an Ersilia Model Hub model, given the original publication as a PDF and the link to the original code repository. Use this skill whenever a user wants to populate, complete, or update a metadata.yml file for an Ersilia model, mentions an Ersilia model contribution, or is working on model metadata for the Ersilia Model Hub. Trigger even if the user just says "fill in the metadata" or "help me with the metadata.yml" in any Ersilia context.

2 stars

development

Updated May 26, 2026

$ install --global

skillsauth

npx skillsauth add ersilia-os/claude-ersilia-skills ersilia-metadata

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: May 26, 2026, 3:22 AM163.4s2 files scanned

SKILL.md

name:: ersilia-metadata
description:: Fills in the required metadata fields for an Ersilia Model Hub model, given the original publication as a PDF and the link to the original code repository. Use this skill whenever a user wants to populate, complete, or update a metadata.yml file for an Ersilia model, mentions an Ersilia model contribution, or is working on model metadata for the Ersilia Model Hub. Trigger even if the user just says "fill in the metadata" or "help me with the metadata.yml" in any Ersilia context.

Ersilia Model Metadata Filler

Your job is to fill in specific fields of an Ersilia model's metadata.yml file using information extracted from the original publication (PDF) and the original source code repository.

What you receive

The user will provide:

A path to the metadata.yml file (already partially filled from the model request)
A path or URL to the original publication (PDF)
A URL to the original source code repository (GitHub or similar)

What you must fill in

Fill only these fields — leave everything else exactly as it is:

Step-by-step workflow

1. Read the existing metadata.yml

2. Read the publication

Use the PDF reading tools to extract:

What the model predicts (property, activity, representation, etc.)
Number of endpoints/outputs (crucial for Output Dimension)
Training dataset and target organisms
Disease or application area
Whether outputs are probabilities, measured values, or generated structures
Year of publication and publication venue (journal vs preprint)

3. Fetch the source code repository

Use WebFetch on the repository URL (and its README, and key code files if needed) to understand:

Number of model endpoints / output columns (cross-check with paper)
Whether the model calls an external API or runs locally
Whether the license/code is from external authors, the Ersilia team, or re-trained

4. Fill in each field

Work through the fields systematically:

Deployment + Source

Deployment: [Local] and Source: Local for the vast majority of models — the model runs in Ersilia's infrastructure.
Use Online only if the model posts predictions to an external third-party server/API that Ersilia does not control.
Deployment is a list; Source is a single string.

Source Type

External: the model was developed by third-party authors (most models incorporated from published papers).
Internal: developed by the Ersilia team themselves.
Replicated: Ersilia re-trained the model following the original authors' methodology.

Task + Subtask Choose Task first, then the corresponding Subtask:

Annotation → model assigns a label or score to a molecule
- Activity prediction if the output is biological/pharmacological activity
- Property calculation or prediction if the output is a physicochemical or ADMET property
Representation → model encodes a molecule into a numerical vector or projection
- Featurization if it produces a fixed-length embedding/descriptor vector
- Projection if it projects molecules into 2D/3D space
Sampling → model generates or retrieves molecules
- Generation if it generates new molecules
- Similarity search if it retrieves similar molecules from a database

Output

Score: a probability or likelihood (0–1 range, binary classification output)
Value: a numerical measurement (IC50, logP, pKa, molecular weight, descriptors, embeddings…)
Compound: a generated or retrieved molecule (SMILES or InChI)
Text: natural language output
Can be a list if the model has mixed output types. Example: a model returning both a probability and an associated value would be [Score, Value].

This is often explicit in the paper ("6 endpoints", "512-dimensional vector"). If not stated directly:

Count the number of prediction tasks/endpoints described
Check the repository: look at model output shapes, column headers in example outputs, or README tables
For embedding models, look for the vector size (e.g., 512, 1024, 2048)
If the vector size is configurable (e.g., n_components is a user parameter), ask the user — do not guess a default from examples in the docs

Output Consistency

Fixed: the model always returns the same output for the same input (most QSAR models, classifiers, regression models)
Variable: the model is stochastic and may return different outputs on repeated runs (generative models, models with dropout at inference, sampling-based methods)

Interpretation Write one short sentence describing what the output means and how to read it. Keep it under ~20 words.

Good examples:

Higher score indicates greater predicted probability of anti-malarial activity.
100 features encoding molecular structure from a pretrained MACAW autoencoder.
Predicted probability of AMES mutagenicity; values closer to 1 indicate higher risk.

Target Organism Use full scientific names where applicable:

Pathogens: Plasmodium falciparum, Mycobacterium tuberculosis, SARS-CoV-2, etc.
Human studies: Homo sapiens
Animal models: Mus musculus, Rattus norvegicus
Use Any if the model is not organism-specific

Publication Type

Check the Publication URL in the metadata: journal DOIs → Peer reviewed; bioRxiv/ChemRxiv/arXiv links → Preprint
Use Other only in exceptional cases (thesis, technical report)

Publication Year Extract the year of publication from the paper or publication URL.

5. Write the updated metadata.yml

6. Show the user what you changed

Print a brief summary of the fields you filled in and the values chosen. If any field required a judgment call or the evidence was ambiguous, say so clearly and invite the user to verify.

When you cannot determine a value

Related Skills

ersilia-os/literature-digest

testing

VerifiedTrustedCommunity

Produce the weekly Ersilia literature digest covering AI/ML for drug discovery, antibiotic and antimicrobial discovery, NTDs and AMR, and open science for global health — through an explicit LMIC and decolonisation lens. Use this skill whenever the user asks to prepare, run, or refresh the literature digest. Triggers include: "weekly literature digest", "literature digest for Ersilia", "/literature-digest", "lit digest this week", "what did we miss last week", "digest the literature". Always use this skill for digest requests even if the ask seems simple.

2SKILL.mdUpdated May 29, 2026

ersilia-os/literature-digest

ersilia-os/test-skill

testing

VerifiedTrustedCommunity

A minimal test skill to verify that the ersilia-skills repository and local setup (symlinks, git hook) are working correctly. Use this skill to confirm that skill loading, slash commands, and the setup.sh workflow are functioning as expected. Trigger on phrases like "run test skill", "check skill setup", or "verify ersilia skills".

2SKILL.mdUpdated May 26, 2026

ersilia-os/test-skill

ersilia-os/stylia-plotting

development

VerifiedTrustedCommunity

How to create Python plots using the stylia package — Ersilia's matplotlib wrapper for publication-ready figures. ALWAYS use this skill when the user says anything like "make a plot", "plot this", "plot the results", "visualize", "prepare a plotting function", "show me a chart", "can you plot", "add a figure", or any similar phrasing during a coding session. This includes scatter plots, line plots, bar charts, heatmaps, histograms, ROC curves, and any other chart type. Also trigger on requests to visualize data, compare values, show distributions, or create any kind of figure — even if the user does not mention stylia or matplotlib explicitly. Never generate matplotlib figures without stylia — always use stylia.create_figure() instead of plt.figure() or plt.subplots().

2SKILL.mdUpdated May 26, 2026

ersilia-os/stylia-plotting

ersilia-os/linkedin-posts

documentation

VerifiedTrustedCommunity

Create LinkedIn post drafts and end-of-month newsletter content for Ersilia Open Source Initiative. Use this skill whenever the user asks to plan LinkedIn posts, draft a monthly content schedule, write a weekly post, or create the monthly newsletter digest. Triggers include: "start of month", "end of month", "write a LinkedIn post", "prepare this month's posts", "draft the newsletter", "monthly update", "weekly post", or any request to create content for Ersilia's LinkedIn or newsletter. Also triggers when the user uploads a content calendar (PDF or text) and asks for posts for a given month. Always use this skill for any Ersilia content creation request, even if the ask seems simple.

2SKILL.mdUpdated May 26, 2026

ersilia-os/linkedin-posts

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/ersilia-os/claude-ersilia-skills.git

# Copy into Claude Code skills folder (global)
cp -r claude-ersilia-skills/skills/ersilia-metadata ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

ersilia-os/claude-ersilia-skills

2 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT