Johnny Chan logo
AI ResearchMay 15, 20265 min read

By Johnny Chan · UI/UX Designer, Hong Kong

How to Usability-Test AI Features Before You Ship

Test plans for probabilistic UI — task scripts, trust questions, and what to measure when every session can feel different.

How to Usability-Test AI Features Before You Ship

Classic usability testing assumes stable screens. AI features change answers run to run. You still need structured tests — you just add questions about trust, comprehension, and recovery when the model misfires.

Script realistic jobs, not prompt engineering

Ask participants to complete goals (“find why my order failed,” “draft a reply to this customer”) without telling them magic words. You are testing product UX, not their ability to prompt.

Add AI-specific probes

  • Did you trust this answer? Why or why not?
  • What would you do if this looked wrong?
  • Did you notice sources / confidence cues?

Run multiple sessions — variance is data

The same prompt may yield different outputs. Note when inconsistency confuses users versus when it is harmless. If variance breaks comprehension, add structure: templates, constrained choices, or post-processing in UI.

Synthesize into product fixes

Cluster issues into model policy, prompt design, and pure UI. Many “AI failures” are fixable with clearer empty states, better suggested actions, or forcing confirmation before destructive steps — no retraining required.

Let's work together

Open to UI/UX projects, collaborations, and product design support in Hong Kong and remotely.

Let's Connect