Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

hotdata-dev/hotdata-search

Name: hotdata-search
Author: hotdata-dev

skills/hotdata-search/SKILL.md

npx skillsauth add hotdata-dev/hotdata-cli hotdata-search

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Hotdata Search Skill

Retrieval workloads in Hotdata: BM25 full-text, vector similarity, and the indexes and embedding providers that power them.

Prerequisites: Authenticate and select a workspace (see the hotdata skill). Use fully qualified table names: <connection>.<schema>.<table>.

Related skills: hotdata-analytics (OLAP SQL, query history, materialized chains), hotdata-geospatial (PostGIS-style functions).

Search CLI

Both run server-side. --type and --column are optional when the table has exactly one search index — they are inferred automatically. Specify them when multiple indexes exist.

# BM25 (requires a BM25 index on the column)
hotdata search "<query>" --table <connection.schema.table> [--type bm25] [--column <column>] \
  [--select <columns>] [--limit <n>] [--workspace-id <workspace_id>] [--output table|json|csv]

# Vector (requires a vector index; server auto-embeds the query text)
hotdata search "<query>" --table <connection.schema.table> [--type vector] [--column <source_text_column>] \
  [--select <columns>] [--limit <n>] [--workspace-id <workspace_id>] [--output table|json|csv]

| Type | Behavior | |------|----------| | bm25 | Server generates bm25_search(table, col, 'text'). Results sort by score (descending). | | vector | Pass plain-text query; name the source text column (e.g. title). Server embeds using the same provider/metric/dimensions as the index. SQL uses vector_distance(col, 'text'). Results sort by distance (ascending). |

Inference: when --type or --column are omitted, the CLI fetches the table's indexes and selects the only BM25/vector index. If multiple exist, you must specify both flags.
No vector index, or custom embedding model? Use raw SQL via hotdata query (e.g. cosine_distance(col, [<vec>])). The removed --model / stdin-vector paths hardcoded l2_distance and are not supported.
Before search: create the right index (indexes create --type bm25 or --type vector). See references/INDEXES.md.
Default --limit is 10.

Indexes (BM25 and vector)

Indexes attach to a connection table (--connection-id + --schema + --table) or a dataset (--dataset-id). Scopes are mutually exclusive for create/delete.

# List — workspace scan on connection tables (filter with -c / --schema / --table)
hotdata indexes list [--connection-id <id>] [--schema <schema>] [--table <table>] [--workspace-id <ws>] [--output table|json|yaml]
hotdata indexes list --dataset-id <dataset_id> [--workspace-id <ws>] [--output table|json|yaml]

# Managed database (catalog alias — uses the active database when the catalog matches)
hotdata indexes create --catalog <alias> --schema <schema> --table <table> \
  --column <col> --type bm25|vector \
  [--name <name>] [--metric l2|cosine|dot] [--async] \
  [--embedding-provider-id <id>] [--dimensions <n>] [--output-column <name>] [--description <text>]

# Connection table (raw connection ID)
hotdata indexes create --connection-id <id> --schema <schema> --table <table> \
  --column <col> --type bm25|vector [--name <name>] ...
hotdata indexes delete --connection-id <id> --schema <schema> --table <table> --name <name>

# Dataset
hotdata indexes create --dataset-id <dataset_id> --column <col> --type bm25|vector [--name <name>] ...
hotdata indexes delete --dataset-id <dataset_id> --name <name>

--type is required on create: bm25 (one text column) or vector (exactly one column; often embeddings or auto-embedded text).
sorted indexes (range/equality for OLAP filters) are documented in hotdata-analytics — this skill focuses on retrieval types.
--async: poll with hotdata jobs <job_id> (see hotdata skill Jobs).
Auto-embedding: --type vector on a text column generates embeddings server-side. Optional --embedding-provider-id; default output column {column}_embedding (override with --output-column).

Full workflow (gather workload → compare existing → create → verify): references/INDEXES.md.

Embedding providers

hotdata embedding-providers list [--workspace-id <workspace_id>] [--output table|json|yaml]
hotdata embedding-providers get <id> [--workspace-id <workspace_id>] [--output table|json|yaml]
hotdata embedding-providers create --name <name> --provider-type service|local \
  [--config '<json>'] [--provider-api-key <key> | --secret-name <name>] [--workspace-id <workspace_id>]
hotdata embedding-providers update <id> [--name <name>] [--config '<json>'] [--provider-api-key <key> | --secret-name <name>] [--workspace-id <workspace_id>] [--output table|json|yaml]
hotdata embedding-providers delete <id> [--workspace-id <workspace_id>]

System providers (e.g. sys_emb_openai) are pre-configured; use list for IDs to pass to --embedding-provider-id.
--provider-api-key is the embedding service key (not Hotdata --api-key). --secret-name references an existing secret.

Quick workflow

hotdata tables list --connection-id <id> — confirm column types.
hotdata indexes list — avoid duplicate indexes.
hotdata indexes create --catalog <alias> --table <table> --column <col> --type bm25|vector (add --async if large).
hotdata search "..." --table <catalog.table> — --type and --column are inferred when there is one search index.
Record what exists in context:DATAMODEL (core skill) when the workspace should remember index choices.

hotdata-dev/hotdata-search

skills/hotdata-search/SKILL.md

Use this skill when the user wants full-text search, BM25 keyword search, vector similarity search, semantic search, embeddings, or retrieval indexes in Hotdata. Activate for "hotdata search", "BM25", "full-text", "vector search", "semantic search", "similarity", "embedding", "embedding provider", "create an index" (bm25 or vector), "list indexes" for search, or SQL using bm25_search or vector_distance. Do not load for general SQL analytics (aggregations, GROUP BY) or geospatial work — use hotdata-analytics or hotdata-geospatial instead. Requires the core hotdata skill for auth and workspace basics.

2 stars

data-ai

Updated Jun 5, 2026

$ install --global

skillsauth

npx skillsauth add hotdata-dev/hotdata-cli hotdata-search

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Jun 5, 2026, 2:35 AM30.4s2 files scanned

SKILL.md

name:: hotdata-search
description:: Use this skill when the user wants full-text search, BM25 keyword search, vector similarity search, semantic search, embeddings, or retrieval indexes in Hotdata. Activate for "hotdata search", "BM25", "full-text", "vector search", "semantic search", "similarity", "embedding", "embedding provider", "create an index" (bm25 or vector), "list indexes" for search, or SQL using bm25_search or vector_distance. Do not load for general SQL analytics (aggregations, GROUP BY) or geospatial work — use hotdata-analytics or hotdata-geospatial instead. Requires the core hotdata skill for auth and workspace basics.
version:: 0.4.0

Hotdata Search Skill

Retrieval workloads in Hotdata: BM25 full-text, vector similarity, and the indexes and embedding providers that power them.

Prerequisites: Authenticate and select a workspace (see the hotdata skill). Use fully qualified table names: <connection>.<schema>.<table>.

Related skills: hotdata-analytics (OLAP SQL, query history, materialized chains), hotdata-geospatial (PostGIS-style functions).

Search CLI

Both run server-side. --type and --column are optional when the table has exactly one search index — they are inferred automatically. Specify them when multiple indexes exist.

# BM25 (requires a BM25 index on the column)
hotdata search "<query>" --table <connection.schema.table> [--type bm25] [--column <column>] \
  [--select <columns>] [--limit <n>] [--workspace-id <workspace_id>] [--output table|json|csv]

# Vector (requires a vector index; server auto-embeds the query text)
hotdata search "<query>" --table <connection.schema.table> [--type vector] [--column <source_text_column>] \
  [--select <columns>] [--limit <n>] [--workspace-id <workspace_id>] [--output table|json|csv]

Inference: when --type or --column are omitted, the CLI fetches the table's indexes and selects the only BM25/vector index. If multiple exist, you must specify both flags.
No vector index, or custom embedding model? Use raw SQL via hotdata query (e.g. cosine_distance(col, [<vec>])). The removed --model / stdin-vector paths hardcoded l2_distance and are not supported.
Before search: create the right index (indexes create --type bm25 or --type vector). See references/INDEXES.md.
Default --limit is 10.

Indexes (BM25 and vector)

Indexes attach to a connection table (--connection-id + --schema + --table) or a dataset (--dataset-id). Scopes are mutually exclusive for create/delete.

# List — workspace scan on connection tables (filter with -c / --schema / --table)
hotdata indexes list [--connection-id <id>] [--schema <schema>] [--table <table>] [--workspace-id <ws>] [--output table|json|yaml]
hotdata indexes list --dataset-id <dataset_id> [--workspace-id <ws>] [--output table|json|yaml]

# Managed database (catalog alias — uses the active database when the catalog matches)
hotdata indexes create --catalog <alias> --schema <schema> --table <table> \
  --column <col> --type bm25|vector \
  [--name <name>] [--metric l2|cosine|dot] [--async] \
  [--embedding-provider-id <id>] [--dimensions <n>] [--output-column <name>] [--description <text>]

# Connection table (raw connection ID)
hotdata indexes create --connection-id <id> --schema <schema> --table <table> \
  --column <col> --type bm25|vector [--name <name>] ...
hotdata indexes delete --connection-id <id> --schema <schema> --table <table> --name <name>

# Dataset
hotdata indexes create --dataset-id <dataset_id> --column <col> --type bm25|vector [--name <name>] ...
hotdata indexes delete --dataset-id <dataset_id> --name <name>

--type is required on create: bm25 (one text column) or vector (exactly one column; often embeddings or auto-embedded text).
sorted indexes (range/equality for OLAP filters) are documented in hotdata-analytics — this skill focuses on retrieval types.
--async: poll with hotdata jobs <job_id> (see hotdata skill Jobs).
Auto-embedding: --type vector on a text column generates embeddings server-side. Optional --embedding-provider-id; default output column {column}_embedding (override with --output-column).

Full workflow (gather workload → compare existing → create → verify): references/INDEXES.md.

Embedding providers

hotdata embedding-providers list [--workspace-id <workspace_id>] [--output table|json|yaml]
hotdata embedding-providers get <id> [--workspace-id <workspace_id>] [--output table|json|yaml]
hotdata embedding-providers create --name <name> --provider-type service|local \
  [--config '<json>'] [--provider-api-key <key> | --secret-name <name>] [--workspace-id <workspace_id>]
hotdata embedding-providers update <id> [--name <name>] [--config '<json>'] [--provider-api-key <key> | --secret-name <name>] [--workspace-id <workspace_id>] [--output table|json|yaml]
hotdata embedding-providers delete <id> [--workspace-id <workspace_id>]

System providers (e.g. sys_emb_openai) are pre-configured; use list for IDs to pass to --embedding-provider-id.
--provider-api-key is the embedding service key (not Hotdata --api-key). --secret-name references an existing secret.

Quick workflow

hotdata tables list --connection-id <id> — confirm column types.
hotdata indexes list — avoid duplicate indexes.
hotdata indexes create --catalog <alias> --table <table> --column <col> --type bm25|vector (add --async if large).
hotdata search "..." --table <catalog.table> — --type and --column are inferred when there is one search index.
Record what exists in context:DATAMODEL (core skill) when the workspace should remember index choices.

Related Skills

hotdata-dev/hotdata-analytics

data-ai

VerifiedTrustedCommunity

Use this skill when the user wants OLAP-style SQL analytics in Hotdata — aggregations, GROUP BY, JOINs, reporting, exploratory queries, query run history, stored results, or materialized follow-up tables (Chain via datasets or managed databases). Activate for "analyze", "aggregate", "rollup", "pivot", "report", "metrics", "GROUP BY", "query history", "past queries", "query runs", "stored results", "materialize", "chain", "intermediate table", or sorted indexes for filters/range scans. Do not load for BM25/vector search or geospatial SQL — use hotdata-search or hotdata-geospatial. Requires the core hotdata skill for connections, tables, datasets, and auth.

2SKILL.mdUpdated May 20, 2026

hotdata-dev/hotdata-analytics

hotdata-dev/hotdata-geospatial

development

VerifiedTrustedCommunity

Use this skill only when the user is working with geospatial data in Hotdata (PostGIS-style SQL like ST_* functions, geometry/WKB, bbox filtering, point-in-polygon, distance/area, lat/lon, spatial joins, “geospatial”, “GIS”, “PostGIS”). Do not load this skill for non-geospatial SQL or general Hotdata usage.

2SKILL.mdUpdated Apr 30, 2026

hotdata-dev/hotdata-geospatial

hotdata-dev/hotdata

tools

VerifiedTrustedCommunity

Use this skill when the user wants to run core hotdata CLI commands — auth, workspaces, connections, managed databases, datasets, tables, basic SQL query, database context (context:DATAMODEL), jobs, and skill install. Activate for "run hotdata", "list workspaces", "list connections", "create a connection", "list databases", "managed database", "load parquet", "list tables", "list datasets", "create a dataset", "execute a query", "database context", "context:DATAMODEL", or general Hotdata CLI usage. For full-text/vector search and retrieval indexes use hotdata-search; for OLAP analytics, query history, stored results, and Chain materializations use hotdata-analytics; for geospatial/GIS use hotdata-geospatial.

2SKILL.mdUpdated Apr 18, 2026

openclaw/taskflow-inbox-triage

data-ai

VerifiedTrustedCommunity

Example TaskFlow authoring pattern for inbox triage. Use when messages need different treatment based on intent, with some routes notifying immediately, some waiting on outside answers, and others rolling into a later summary.

357,764SKILL.mdUpdated Apr 10, 2026

openclaw/taskflow-inbox-triage

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/hotdata-dev/hotdata-cli.git

# Copy into Claude Code skills folder (global)
cp -r hotdata-cli/skills/hotdata-search ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

hotdata-dev/hotdata-cli

2 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT