Log in or create a free Rosenverse account to watch this video.
Log in Create free account100s of community videos are available to free members. Conference talks are generally available to Gold members.
UX Lessons from running more than 1,200 A/B Tests
Summary
Knowing how to solve the right problem is only one small part of the product-success equation. And nailing the execution is harder than most people think. As Erin says in this talk, “There are far more ways to fail than there are to succeed” when it comes to experimentation. To help you learn, Erin shares some of the silly mistakes she made while A/B testing her designs. That way you can avoid those pitfalls as you work to make your digital product better—not just different.
Key Insights
-
•
Nine out of ten AB tests at Booking.com fail, but the few that succeed compound into massive growth.
-
•
Execution often fails on good ideas rather than the ideas themselves.
-
•
Technical details like page load time can negate positive design changes if not carefully managed.
-
•
Edge case bugs in different languages or currencies can silently kill conversions.
-
•
Minor typography choices, such as switching from Times New Roman to Arial, have a large impact on conversion.
-
•
Using system fonts improves legibility and page load speed, positively affecting conversions.
-
•
Tracking should only include users exposed to the tested change to avoid noisy data.
-
•
Qualitative user research drives many experiment ideas by revealing real user challenges.
-
•
Guardrail metrics like customer service calls and loyalty prevent short-term wins from harming long-term value.
-
•
Products should be retested over time as technology and user behaviors evolve, since what fails today may succeed tomorrow.
Notable Quotes
"I’m actually a really big failure — nine out of ten tests fail with no positive measurable impact."
"Compound effect means good upon good upon good eventually builds to incredibly fast growth."
"If a problem keeps coming up, it’s usually the execution of the idea that’s failing, not the concept."
"Increasing page load time by three seconds can nullify all your design improvements."
"Picking the wrong size image can impact customer experience as much as what you see on screen."
"There are as many versions of the website as stars in the sky because of language, currency, and device variations."
"No tracking is perfect because numbers tell you what likely happened, but not why it happened."
"System fonts load faster, improve legibility, and outperform fancy brand fonts in conversion."
"Guardrail metrics help catch unintended consequences like increased customer service calls despite higher conversions."
"You can’t fail 90 times out of 10 without laughing at yourself to keep going."
Or choose a question:
More Videos
"Cycles of successive approximation help us get closer and closer to great design through iteration."
Rebecca GimenezWork in Progress: Service Design at Airbnb
December 3, 2024
"Mastercard designers grew meaningfully for both people and business despite uncertainty."
Lada GorlenkoTheme 2 Intro
June 9, 2022
"We partnered with Consent Kit to create and maintain an open source consent form builder that addresses multiple legal frameworks."
Brigette Metzler Dana ChrisfieldResearch Repositories: A global project by the ResearchOps Community
August 27, 2020
"Align on what you’re looking for before the interview to evaluate non-traditional candidates fairly."
Discussion
June 9, 2017
"In traditional government projects, policy writers are often gone long before implementations fail or need revising."
Sarah Brooks Jennifer PahlkaFireside chat with Sarah Brooks and Jen Pahlka
October 21, 2021
"We constantly analyze our recruitment data and note that we still struggle to reach Hispanic or Latino origin individuals, older adults, and lower-income participants."
Lisa Spitz Nikki BrandBuilding Trust Through Equitable Research Practices
November 18, 2022
"Designers almost always partner with clinicians who are the domain experts in healthcare projects."
Theresa NeilDesigning for Wellness: Specializing in Healthcare
May 22, 2024
"Using tools like 11 Labs, we instantly generated voices with emotion and appropriate cadence for characters."
Maverick Chan Claire LinFrom Doodle to Demo: AI as Our Storytelling Partner
October 23, 2025
"Our nervous system does not know the difference between a tiger and an angry email from a manager—it just senses danger."
Alla WeinbergDesign Teams Need Psychological Safety: Here’s How to Create It
September 8, 2022