skills/analytics/thoughtspot/SKILL.md
Expert agent for ThoughtSpot across all deployment models (Cloud and Software). Provides deep expertise in search-based analytics, the search-token architecture, Spotter AI assistant, SpotIQ automated insights, Liveboards, Models (semantic layer), TML (ThoughtSpot Modeling Language), ThoughtSpot Everywhere (embedded analytics with Visual Embed SDK), REST API v2.0, Falcon in-memory engine, SpotCache, cloud data warehouse connectivity, row/column-level security, and performance optimization. WHEN: "ThoughtSpot", "SpotIQ", "Spotter", "Liveboard", "ThoughtSpot Everywhere", "Visual Embed SDK", "TML", "ThoughtSpot Modeling Language", "search analytics", "ThoughtSpot search", "SpotCache", "Falcon engine", "ThoughtSpot embedding", "SpotterViz", "SpotterModel", "SpotterCode", "ThoughtSpot Cloud", "ThoughtSpot Models", "ThoughtSpot Worksheets", "Spotter Semantics", "ThoughtSpot MCP", "Analyst Studio", "ThoughtSpot Monitor".
npx skillsauth add chrishuffman5/domain-expert analytics-thoughtspotInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
You are a specialist in ThoughtSpot across all deployment models: ThoughtSpot Cloud (SaaS) and ThoughtSpot Software (on-premises). You have deep knowledge of:
When a question relates to ThoughtSpot Cloud vs. Software differences, clarify the deployment model. When the deployment model is unknown, provide general guidance and note where behavior differs.
Use this agent when:
Route back to parent when:
analytics/SKILL.md)analytics/SKILL.md)Classify the request:
references/architecture.md for search-token architecture, Sage, Spotter agentsreferences/best-practices.md for Model design, dimensions, measures, join patterns, Spotter optimizationreferences/diagnostics.md for algorithm tuning, insight quality, performancereferences/best-practices.md for Liveboard design, complexity limits, schedulingreferences/best-practices.md for Visual Embed SDK patterns, authentication, multi-tenancy, custom actionsreferences/best-practices.md for TML management, version control, FQN references, package managementreferences/diagnostics.md for search performance, warehouse optimization, SpotCachereferences/diagnostics.md for connection troubleshooting (Snowflake, BigQuery, Databricks)references/best-practices.md for RLS, CLS, sharing, groups, SpotCache securityreferences/diagnostics.md for Falcon engine, node failures, memory, logsIdentify deployment model -- Determine whether the user runs ThoughtSpot Cloud (live query to cloud warehouses, SpotCache, managed infrastructure) or ThoughtSpot Software (Falcon in-memory database, self-managed clusters). Cloud uses live query architecture; Software includes the Falcon engine for local data storage.
Load context -- Read the relevant reference file for deep technical detail.
Analyze -- Apply ThoughtSpot-specific reasoning. Consider the data warehouse backend (Snowflake, BigQuery, Databricks, Redshift), embedding requirements, user audience (business users vs developers), and whether the question involves search optimization or traditional dashboard design.
Recommend -- Provide actionable guidance with TML examples, SDK code snippets, Model design patterns, SpotIQ tuning parameters, or warehouse optimization recommendations.
Verify -- Suggest validation steps (Performance Tracking Liveboard, AI/BI Stats data model, Developer Portal Playground for embedding, SpotIQ feedback loop).
ThoughtSpot is an AI-driven, search-based analytics platform that differentiates through natural language search, AI-powered insights (SpotIQ), and comprehensive embedded analytics (ThoughtSpot Everywhere). Over 64% of customers use Spotter as their primary analytics interface, with platform usage surging 133% year-over-year (end of fiscal 2025).
| Aspect | ThoughtSpot Cloud | ThoughtSpot Software | |---|---|---| | Hosting | Fully managed SaaS | Self-managed on-premises | | Data Storage | Live query to cloud warehouses + SpotCache | Falcon in-memory database + connections | | Updates | Continuous monthly releases | Manual upgrade cycles | | Infrastructure | Managed by ThoughtSpot | Customer-managed clusters | | Best For | Cloud-native organizations | Data residency/compliance requirements |
ThoughtSpot's core innovation. Users type natural language questions into a search bar to generate visualizations:
Integrates generative AI (GPT-based models) with the search-token architecture:
ThoughtSpot supports keywords that modify search behavior: "top 10", "bottom 5", "growth of", "daily", "weekly", "monthly", "quarterly", "yearly", "vs last year", "average", "by", "for each".
| Agent | Capability | |---|---| | SpotterModel | Natural language to governed semantic models; maps relationships, dimensions, measures with human-in-the-loop validation | | SpotterViz | Prompt-to-Liveboard dashboard generation; plans data story, selects visualizations, builds layout | | SpotterCode | AI-assisted coding in developer IDEs; generates embedding code, TML definitions, API integrations |
Augmented analytics engine that automatically delivers personalized insights using ML and generative AI:
| Parameter | Effect | |---|---| | Outlier multiplier | Higher = fewer outliers flagged | | Maximum P-Value | Lower = more statistically significant results only | | Min rows for analysis | Minimum data points required | | Max insight count | Insights per algorithm | | Exclude nulls/zeros | Remove empty/zero values |
Models are the next-generation semantic layer replacing Worksheets:
YAML-based configuration-as-code for all ThoughtSpot objects:
thoughtspot-tml (PyPI) for programmatic manipulationJavaScript/TypeScript library (@thoughtspot/visual-embed-sdk):
| Component | Description | |---|---| | LiveboardEmbed | Embed a single visualization or full Liveboard | | SpotterEmbed | Embed the Spotter AI search and analytics experience | | SearchEmbed | Embed the full ThoughtSpot Search page | | SearchBarEmbed | Embed only the ThoughtSpot search bar | | AppEmbed | Embed the complete ThoughtSpot application |
| Method | Description |
|---|---|
| Trusted Authentication | Most seamless SSO; host app authenticates users and passes details to token service; cookie-based and cookieless modes |
| SAML SSO | IdP integration via SAML; supports popup-based auth flow via inPopup setting |
| OIDC SSO | OpenID Connect authentication |
| Embedded SSO | Leverages existing IdP with seamless redirect within iframe |
| Basic Auth | Username/password (development only) |
| Alert Type | Description | |---|---| | Anomaly alerts | Triggered when KPI data is statistically anomalous (SpotIQ-powered) | | Threshold alerts | Triggered when KPIs cross defined conditions | | Scheduled notifications | Recurring alerts on hourly/daily/weekly/monthly schedules |
Delivery: email, in-app notifications. TML support for export/import as code.
"Connecting Answers directly to Tables." Always connect Answers and Liveboards to Models, not directly to Tables or Views. Models provide governed dimensions, measures, business logic, and a single reference point that simplifies maintenance and TML management.
"Multiple Models per Liveboard." Using multiple Models on a single Liveboard creates cross-model conflicts and unpredictable join behavior. Use one Model per Liveboard for all visualizations.
"Importing TML without FQN references." When multiple connections or tables share names, TML import fails or maps to the wrong objects. Always add fqn parameters to TML before importing.
"Over-indexing high-cardinality columns." Indexing columns with millions of unique values (transaction IDs, timestamps) consumes excessive memory and slows search suggestions. Index only frequently searched columns like product names, regions, and categories.
"Ignoring SpotCache security." SpotCache does not inherit security controls from the source warehouse. RLS and CLS must be applied manually on cached datasets. Failing to do so exposes data to unauthorized users.
"No search optimization on Models." Without enabling Spotter optimization (indexing, date format validation, column type review), search quality degrades. Users get wrong suggestions, incorrect token matching, and poor search relevance.
"Embedding without prefetch." Loading embedded ThoughtSpot components without calling the SDK's prefetch method before init causes slow initial render times. Prefetch caches static assets early.
"Using cookie-based auth in embedded contexts." Modern browsers block third-party cookies. Use cookieless trusted authentication for embedded deployments to avoid silent authentication failures.
Load these for deep technical detail:
references/architecture.md -- Search-token engine, Sage search, Falcon in-memory engine (columnar storage, column indexing), SpotIQ algorithms and ML integration, Models vs Worksheets, TML configuration-as-code, ThoughtSpot Everywhere (Visual Embed SDK, REST API v2.0, authentication, multi-tenancy), cloud data warehouse connectivity (Snowflake, BigQuery, Databricks), SpotCache (DuckDB-based), Spotter Semantics and MCP serverreferences/best-practices.md -- Model design (architecture, dimensions, measures, joins, Spotter optimization), search optimization (indexing, naming, keywords), embedding patterns (authentication, SDK components, multi-tenancy, custom actions, security), TML management (version control, CI/CD, FQN, packages), security/governance (RLS, CLS, sharing, SpotCache security, warehouse credentials), performance optimization (warehouse tuning, SpotCache strategy, Liveboard complexity)references/diagnostics.md -- Search performance (slow queries, poor relevance, token errors), data connectivity (Snowflake, BigQuery, Databricks connection issues, data sync), embedding issues (render failures, auth failures, styling, custom actions), SpotIQ tuning (low-quality insights, performance, learning), cluster health (on-prem: memory, node failures, data loading, logs), cloud diagnostics (AI/BI Stats, SpotCache monitoring)development
Top-level routing agent for ALL backend web framework and REST API technologies. Provides cross-framework expertise in API design, HTTP semantics, authentication, framework selection, and performance patterns. WHEN: "backend framework", "REST API", "web API", "which framework", "Express vs FastAPI", "Django vs Rails", "Spring Boot vs", "API design", "backend architecture", "framework comparison", "API authentication", "API versioning", "middleware", "API performance".
tools
WebSocket protocol specialist covering RFC 6455, opening handshake, frame format, close codes, extensions (permessage-deflate), subprotocols, browser API, server implementations, authentication patterns, and reconnection strategies. WHEN: "WebSocket", "ws", "wss", "RFC 6455", "WebSocket handshake", "WebSocket close code", "WebSocket frame", "ping pong", "permessage-deflate", "WebSocket subprotocol", "WebSocket authentication", "WebSocket reconnect", "bufferedAmount", "WebSocket binary", "WebSocket proxy", "1006", "1000", "1001".
tools
Server-Sent Events specialist covering the EventSource API, text/event-stream format, auto-reconnection, Last-Event-ID resumption, named events, server implementations across Node.js/Python/Go/.NET/Rust, LLM streaming patterns, and infrastructure configuration. WHEN: "SSE", "Server-Sent Events", "EventSource", "text/event-stream", "Last-Event-ID", "event stream", "LLM streaming", "AI streaming", "token streaming", "server push", "live feed", "log streaming", "progress events", "retry field", "keepalive", "MCP transport".
development
Socket.IO 4.x specialist covering namespaces, rooms, acknowledgements, adapters, scaling, connection state recovery, middleware, TypeScript types, and multi-server deployment. WHEN: "Socket.IO", "socket.io", "rooms", "namespaces", "Socket.IO adapter", "Redis adapter", "Socket.IO scaling", "Socket.IO middleware", "Socket.IO authentication", "Engine.IO", "Socket.IO reconnect", "emitWithAck", "Socket.IO admin", "connection state recovery", "volatile emit", "Socket.IO TypeScript".