skills/codex/databricks-genie/SKILL.md
<!-- AUTO-GENERATED by export-skills.py — DO NOT EDIT --> --- name: databricks-genie --- # Databricks Genie Create and query Databricks Genie Spaces - natural language interfaces for SQL-based data exploration. ## Overview Genie Spaces allow users to ask natural language questions about structured data in Unity Catalog. The system translates questions into SQL queries, executes them on a SQL warehouse, and presents results conversationally. ## When to Use This Skill Use this skill when: -
npx skillsauth add frank-luongt/faos-skills-marketplace skills/codex/databricks-genieInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Create and query Databricks Genie Spaces - natural language interfaces for SQL-based data exploration.
Genie Spaces allow users to ask natural language questions about structured data in Unity Catalog. The system translates questions into SQL queries, executes them on a SQL warehouse, and presents results conversationally.
Use this skill when:
| Tool | Purpose |
| ------------------------ | --------------------------------------- |
| list_genie | List all Genie Spaces accessible to you |
| create_or_update_genie | Create or update a Genie Space |
| get_genie | Get Genie Space details |
| delete_genie | Delete a Genie Space |
| Tool | Purpose |
| -------------------- | -------------------------------------------------- |
| ask_genie | Ask a question to a Genie Space, get SQL + results |
| ask_genie_followup | Ask follow-up question in existing conversation |
| Tool | Purpose |
| ------------------- | --------------------------------------------- |
| get_table_details | Inspect table schemas before creating a space |
| execute_sql | Test SQL queries directly |
Before creating a Genie Space, understand your data:
get_table_details(
catalog="my_catalog",
schema="sales",
table_stat_level="SIMPLE"
)
create_or_update_genie(
display_name="Sales Analytics",
table_identifiers=[
"my_catalog.sales.customers",
"my_catalog.sales.orders"
],
description="Explore sales data with natural language",
sample_questions=[
"What were total sales last month?",
"Who are our top 10 customers?"
]
)
ask_genie(
space_id="your_space_id",
question="What were total sales last month?"
)
# Returns: SQL, columns, data, row_count
1. Inspect tables → get_table_details
2. Create space → create_or_update_genie
3. Query space → ask_genie (or test in Databricks UI)
4. Curate (optional) → Use Databricks UI to add instructions
Before creating a Genie Space:
Use these skills in sequence:
synthetic-data-generation - Generate raw parquet filesspark-declarative-pipelines - Create bronze/silver/gold tables| Issue | Solution |
| -------------------------- | ------------------------------------------------------------------------ |
| No warehouse available | Create a SQL warehouse or provide warehouse_id explicitly |
| Poor query generation | Add instructions and sample questions that reference actual column names |
| Slow queries | Ensure warehouse is running; use OPTIMIZE on tables |
development
<!-- AUTO-GENERATED by export-skills.py — DO NOT EDIT --> --- name: databricks-mlflow-evaluation --- # MLflow 3 GenAI Evaluation ## Before Writing Any Code 1. **Read GOTCHAS.md** - 15+ common mistakes that cause failures 2. **Read CRITICAL-interfaces.md** - Exact API signatures and data schemas ## End-to-End Workflows Follow these workflows based on your goal. Each step indicates which reference files to read. ### Workflow 1: First-Time Evaluation Setup For users new to MLflow GenAI evalu
development
<!-- AUTO-GENERATED by export-skills.py — DO NOT EDIT --> --- name: databricks-lakebase-provisioned --- # Lakebase Provisioned Patterns and best practices for using Lakebase Provisioned (Databricks managed PostgreSQL) for OLTP workloads. ## When to Use Use this skill when: - Building applications that need a PostgreSQL database for transactional workloads - Adding persistent state to Databricks Apps - Implementing reverse ETL from Delta Lake to an operational database - Storing chat/agent m
tools
<!-- AUTO-GENERATED by export-skills.py — DO NOT EDIT --> --- name: databricks-jobs --- # Databricks Lakeflow Jobs ## Overview Databricks Jobs orchestrate data workflows with multi-task DAGs, flexible triggers, and comprehensive monitoring. Jobs support diverse task types and can be managed via Python SDK, CLI, or Asset Bundles. ## Reference Files | Use Case | Reference File | | ----------------------
tools
<!-- AUTO-GENERATED by export-skills.py — DO NOT EDIT --> --- name: databricks-docs --- # Databricks Documentation Reference This skill provides access to the complete Databricks documentation index via llms.txt - use it as a **reference resource** to supplement other skills and inform your use of MCP tools. ## Role of This Skill This is a **reference skill**, not an action skill. Use it to: - Look up documentation when other skills don't cover a topic - Get authoritative guidance on Databr