By Johnny Chan · UI/UX Designer, Hong Kong
How to Usability-Test AI Features Before You Ship
Test plans for probabilistic UI — task scripts, trust questions, and what to measure when every session can feel different.

Classic usability testing assumes stable screens. AI features change answers run to run. You still need structured tests — you just add questions about trust, comprehension, and recovery when the model misfires.
Script realistic jobs, not prompt engineering
Ask participants to complete goals (“find why my order failed,” “draft a reply to this customer”) without telling them magic words. You are testing product UX, not their ability to prompt.
Add AI-specific probes
- Did you trust this answer? Why or why not?
- What would you do if this looked wrong?
- Did you notice sources / confidence cues?
Run multiple sessions — variance is data
The same prompt may yield different outputs. Note when inconsistency confuses users versus when it is harmless. If variance breaks comprehension, add structure: templates, constrained choices, or post-processing in UI.
Synthesize into product fixes
Cluster issues into model policy, prompt design, and pure UI. Many “AI failures” are fixable with clearer empty states, better suggested actions, or forcing confirmation before destructive steps — no retraining required.
Let's work together
Open to UI/UX projects, collaborations, and product design support in Hong Kong and remotely.
Let's Connect