ux-for-ai/SKILL.md
AI interface design framework for building AI-powered features that feel premium, trustworthy, and world-class — not sloppy or robotic. Covers trust/transparency principles, avoiding AI slop, human oversight requirements, onboarding AI features...
npx skillsauth add peterbamuhigire/skills-web-dev ux-for-aiInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
ux-for-ai or would be better handled by a more specific companion skill.SKILL.md first, then load only the referenced deep-dive files that are necessary for the task.Based on Nudelman (2024) UX for AI: A Framework for Product Design.
Load this skill when:
Five non-negotiable principles for trustworthy AI UX:
Restate — Before acting, the AI must echo back what it understood. "Showing results for: [interpreted query]." This closes the gap between what the user meant and what AI understood without requiring re-entry.
Calibrate — AI must know what it doesn't know. Communicate confidence levels. When confidence is low, fall back to deterministic baselines or say "I don't know." An AI that cannot self-assess accuracy is a liability.
Explain — Show the "why" behind AI decisions. Citations, recommended reasons, translated queries. "Recommended because you viewed X." Users who understand AI reasoning trust it more and use it better.
Transparent — State what data the AI uses and whether conversations are stored. Don't bury privacy stance in documentation — state it directly in the AI interface.
Human oversight — Never remove manual override controls. Always provide stop, pause, and override buttons for any AI action. The Boeing 737 Max crash (346 deaths) is the foundational case study for what happens when AI silently overrides humans with no visible intervention path.
Users want outcomes, not AI. The mechanism is irrelevant. If a simpler deterministic solution delivers the outcome, use it. Never force AI into a product just because it is technically possible.
Algorithm aversion is real and manageable. Users tolerate bad AI output less than equivalent bad human output. However, aversion decreases when users can slightly modify the AI output, or when the system demonstrates it is improving over time.
AI must not claim to replace experts. Positioning AI as a direct replacement for an established expert (farmer, doctor, analyst) provokes distrust and resistance. AI should assist and augment the expert — never compete with their core skill.
Users expect contextual intelligence. Modern AI should infer context from calendar data, location, time of day, and prior conversation — without making users re-specify what they clearly mean.
Deskilling warning. Over-reliance on AI erodes the user's own capability. Design for augmentation, not replacement, especially in high-stakes domains (medicine, security, finance).
Domain-specific training is the single biggest differentiator. Generic ChatGPT answers feel wrong for your product. Fine-tune on your specific dataset or use RAG (retrieval-augmented generation). An AI that gives domain-accurate answers feels intelligent; one that gives generic LLM responses feels broken.
Stateful AI feels intelligent; stateless AI feels dumb. Allow users to resume investigations, maintain context across sessions, and build on prior interactions. Statelessness makes AI feel disconnected and forgetful.
Size the copilot to match task importance:
Context-aware initial suggestions feel like mind-reading. Surface the right suggestions before the user types anything — based on time of day, data type, prior sessions, and usage patterns. Generic "What can I help you with?" prompts feel cheap.
Use Promptbooks to reduce prompt engineering burden. Users should never need to write complex prompts. Provide pre-built recipes for common workflows. Curated, plain-language queries that interact with domain data in predictable ways is the target.
Give AI a deliberate personality. Define the persona explicitly (supportive health coach, expert security analyst, etc.). Avoid pedantic or preachy tone. Generic LLM personality is a missed differentiator.
Test with real AI output, not mocked text. Products that demo well on polished placeholder copy but fail with actual AI responses are caught in research, not production. Use Python notebooks + paper prototype ("Wizard of Oz") for honest testing.
The core product must be resilient to bad AI output. AI autocomplete and pre-fill should save time when correct but must never block the user when wrong. Always preserve the manual fallback path.
Wrap all AI output in business logic guardrails. Never pass raw model output directly to users or downstream systems. Hard-code limits (max bid, medication dose, allowed actions) regardless of what the AI recommends.
The Value Matrix principle. False positive vs false negative costs are not equal. Calculate the real cost of each error type before choosing how to handle AI failures. Optimise for real-world ROI, not data science accuracy metrics. A "Balanced AI" regularly outperforms a "Accurate (Conservative) AI" in real-world ROI.
Progressive accuracy improvement is critical for retention. Users will abandon an AI product within 3–5 interactions if accuracy does not visibly improve with use. Design AI systems to learn from user corrections.
Guardrails layer required for all LLM outputs. Define explicitly what the AI will and will not output. Code explicit failure responses for denied or malformed requests. Guardrails must be designed, not improvised.
"Set it and forget it" for low-stakes AI. Activate with zero setup when the downside is minimal and immediate value is clear. Zoom AI Companion activating automatically on meeting start is the benchmark.
Lead with immediate, tangible value. The benefit must be clear in the first interaction. Users should not need to invest effort to see value.
Promptbooks and initial suggestions cure cold-start anxiety. An empty prompt box paralyses users. Offer context-aware suggestions based on data type, time of day, and prior sessions.
Validate the use case with storyboards first. A use case that sounds reasonable as text often falls apart when storyboarded step-by-step. The conclusion panel of the storyboard must follow naturally from the rest. If it doesn't, the use case is wrong.
Train on 2,000+ domain-specific examples before launch. An untrained LLM answering domain-specific queries feels broken. Users who encounter generic responses in their first interaction rarely return.
Start with the "one thing." Every AI feature needs a single, clear primary job. What is the one thing users should always be able to do with this AI? Everything else is secondary.
Reveal AI capabilities through contextual next steps, not documentation. After each AI interaction, surface the 2–3 most valuable next questions the user is likely to have. This teaches users what the AI can do — inside their actual workflow.
Use Mad Lib fill-in-the-blank for complex AI configuration. Write the configuration as a complete English sentence with embedded drop-downs: "Trigger a [Critical] alert whenever the value exceeds [90%] for [1 minute]" is far more comprehensible than a table of parameters.
Information architecture still matters in AI-first apps. Chat alone is not a complete interface. Users need predetermined starting points, context-aware alerts, and structured navigation. A blank chat box communicates nothing about what the product does.
In agentic AI, the human's job is to accept or reject evidence, not direct every step. Design accept/reject flows for observations and hypotheses. The human is a reviewer and quality gate, not a director.
Agentic AI requires start, stop, pause, and approval controls. Agents are asynchronous and semi-autonomous. Without stop and pause controls, agents create runaway costs ("Sorcerer's Apprentice" problem). Approval gates for expensive or irreversible actions are mandatory.
Agents require RBAC, data isolation, and versioning. Treat AI agents like employees — with access controls, data isolation, and version management. This governance UI is a new and urgent design challenge.
Augmented intelligence, not replacement. Machines handle pattern recognition, data processing, and hypothesis generation. Humans handle empathy, ethics, judgment, and creative synthesis.
data-ai
Use when adding AI-powered analytics to a SaaS platform — semantic search over business data, natural language queries, trend detection, anomaly alerts, and AI-generated insights for dashboards. Covers embeddings, NL2SQL, and per-tenant analytics...
data-ai
Design AI-powered analytics dashboards — what metrics to show, how to display AI predictions and confidence, drill-down patterns, KPI cards, trend visualisation, AI Insights panels, export design, and role-based dashboard variants. Invoke when...
development
Use when designing, building, reviewing, or upgrading production software systems that must be secure, performant, maintainable, scalable, and user-centered. Apply before writing specs, code, architecture, APIs, databases, mobile apps, SaaS platforms, or ERP systems.
development
Professional web app UI using commercial templates (Tabler/Bootstrap 5) with strong frontend design direction when needed. Use for CRUD interfaces, dashboards, admin panels with SweetAlert2, DataTables, Flatpickr. Clone seeder-page.php, use...