Experimenting on Humans - Advanced A/B Tests - QCon SF 2014

Posted on 14-Jun-2015

Description

How do you know what 55 million users like? Wix.com conducts hundreds of experiments on production every month to understand which features our users like and which hurt or improve our business. In this talk we’ll explain how our engineering team supports our product managers in making the right decisions and keeping our product road map on the right path. We will also present some of the open source tools we developed that help us run experiments on humans. While A/B testing is a well-known methodology for conducting experiments on production, doing it at a scale where system behavior changes every 9 minutes entails many challenges at the organizational level, spanning developers, product managers, QA, marketing, and management. In this talk we will explain the life cycle of an experiment, some of the challenges we faced, and the effect on our development process and product evolution.

Transcript of Experimenting on Humans - Advanced A/B Tests - QCon SF 2014

Experimenting on Humans

Aviran Mordo

Head of Back-end Engineering

@aviranm

http://www.linkedin.com/in/aviran

http://www.aviransplace.com

Talya Gendler

Back-end Team Leader

www.linkedin.com/in/talyagendler

Wix In Numbers

Over 55M users + 1M new users/month

Static storage is >1.5 PB of data

3 data centers + 3 clouds (Google, Amazon, Azure)

1.5B HTTP requests/day

800 people work at Wix, of which ~300 are in R&D

1,542 A/B tests in 3 months

Agenda

Basic A/B testing

Experiment driven development

PETRI – Wix’s 3rd generation open source experiment system

Challenges and best practices

Complexities and effect on product

A/B Test

To B or NOT to B?

A

B

Home page results (How many registered)

Experiment Driven Development

This is the Wix editor

Our gallery manager

What can we improve?

Is this better?

Don’t be a loser

Product Experiments Toggles & Reporting

Infrastructure

How do you know what is running?

If I “know” it is better, do I really need to test it?

Why so many?

Sign-up → Choose Template → Edit site → Publish → Premium

The theory

Result = Fail

Intent matters

EVERY new feature is A/B tested

We open the new feature to a % of users

Measure success

If it is better, we keep it

If worse, we check why and improve

If flawed, the impact is just for % of our users
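The gradual-exposure step above can be sketched with a deterministic hash bucket (a simplified illustration, not PETRI’s actual code): each user falls into a stable bucket from 0 to 99 and is exposed only while that bucket is below the current exposure percentage.

```python
import hashlib

def in_test_group(user_id: str, experiment: str, exposure_pct: int) -> bool:
    """Map a user to a stable bucket in 0..99 for a given experiment.

    Hashing the experiment name together with the user ID keeps the
    assignment stable per experiment, yet uncorrelated across experiments.
    """
    digest = hashlib.sha256(f"{experiment}:{user_id}".encode()).hexdigest()
    return int(digest, 16) % 100 < exposure_pct
```

Because the bucket is fixed per user, raising the exposure from, say, 10% to 50% only adds newly exposed users; nobody who already saw the new feature is silently switched back.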

Conclusion

Start with 50% / 50% ?

New code can have bugs

Conversion can drop

Usage can drop

Unexpected cross test dependencies

Sh*t happens (Test could fail)

Language

GEO

Browser

User-agent

OS

Minimize affected users (in case of failure)

Gradual exposure (percentage of…)

Company employees

User roles

Any other criteria you have (extendable)
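Criteria like these compose naturally as predicates over a request context. The sketch below is illustrative only; the key names and helper functions are invented, and PETRI’s real filter API differs:

```python
# Each filter is a predicate over a request-context dict; the keys
# (language, geo, is_employee) are illustrative names, not PETRI's.
def language_is(lang):
    return lambda ctx: ctx.get("language") == lang

def geo_in(*countries):
    return lambda ctx: ctx.get("geo") in countries

def company_employees_only():
    return lambda ctx: ctx.get("is_employee", False)

def eligible(ctx, filters):
    """A user enters the experiment only if every filter accepts them."""
    return all(f(ctx) for f in filters)
```

Adding a new criterion is then just one more predicate, which is what “extendable” buys you.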

All users

First time visitors = Never visited wix.com

New registered users = Untainted users

Not all users are equal

Start new experiment (limited population)

We need that feature

…and failure is not an option

Adding a mobile view

First trial failed

Performance had to be improved

Halting the test results in loss of data.

What can we do about it?

Solution – Pause the experiment!

• Maintain NEW experience for already exposed users

• No additional users will be exposed to the NEW feature

PETRI’s pause implementation

Use cookies to persist assignment

If the user changes browser, the assignment is unknown

Server side persistence solves this

You pay in performance & scalability
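The pause behavior can be sketched as follows (illustrative: a dict stands in for the cookie or server-side record of prior assignments):

```python
import hashlib

def _toss(experiment, user_id):
    # Deterministic toss for newly arriving users (illustrative).
    digest = hashlib.sha256(f"{experiment['name']}:{user_id}".encode()).hexdigest()
    return "B" if int(digest, 16) % 100 < experiment["exposure_pct"] else "A"

def conduct(experiment, user_id, prior_assignments):
    """Return 'A' or 'B', honoring pause semantics.

    prior_assignments stands in for the persisted assignment (cookie or
    server side): a paused experiment keeps already-exposed users on the
    NEW experience but exposes nobody new.
    """
    prior = prior_assignments.get(user_id)
    if experiment["state"] == "paused":
        return prior if prior is not None else "A"
    group = prior if prior is not None else _toss(experiment, user_id)
    prior_assignments[user_id] = group
    return group
```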

Decision

Keep feature

Drop feature

Improve code & resume experiment

Keep backwards compatibility for exposed users forever?

Migrate users to another equivalent feature

Drop it altogether (users lose data/work)

The road to success

Numbers look good but sample size is small

We need more data!

Expand

Reaching statistical significance

Test Group (B): 25% → 50% → 75% → 100%
Control Group (A): 75% → 50% → 25% → 0%

Keep user experience consistent

Control Group (A)

Test Group (B)

Signed-in user (Editor): test group assignment is determined by the user ID, which guarantees toss persistency across browsers

Anonymous user (Home page): test group assignment is randomly determined; we cannot guarantee a persistent experience if the user changes browser

11% of Wix users use more than one desktop browser

Keeping persistent UX
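The two assignment paths above can be sketched like this (a simplified illustration; reading and writing the actual cookie is left to the caller):

```python
import hashlib
import random

def assign_group(experiment, exposure_pct, user_id=None, cookie_group=None):
    """Signed-in: derive the toss from the user ID, so it survives a
    browser switch. Anonymous: reuse the cookie value if present,
    otherwise toss randomly (the caller persists it in a cookie, which
    is exactly why a browser change loses the assignment)."""
    if user_id is not None:
        digest = hashlib.sha256(f"{experiment}:{user_id}".encode()).hexdigest()
        return "B" if int(digest, 16) % 100 < exposure_pct else "A"
    if cookie_group in ("A", "B"):
        return cookie_group
    return "B" if random.randrange(100) < exposure_pct else "A"
```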

Robots are users too!

Always exclude robots

Don’t let Google index a losing page

Don’t let bots affect statistics
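A minimal user-agent check conveys the idea (the marker list is illustrative; real bot detection is more involved):

```python
BOT_MARKERS = ("bot", "crawler", "spider", "slurp")

def is_robot(user_agent: str) -> bool:
    """Crude substring sniffing on the User-Agent header."""
    ua = user_agent.lower()
    return any(marker in ua for marker in BOT_MARKERS)

def group_for(user_agent: str, toss: str) -> str:
    # Robots always see the control (A) and are excluded from reporting,
    # so Google never indexes a losing page and bots never skew the stats.
    return "A" if is_robot(user_agent) else toss
```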

There is MORE than one

# of active experiments → possible # of states:
10 → 1,024
20 → 1,048,576
30 → 1,073,741,824

Possible states >= 2^(# experiments)

Wix has ~200 active experiments → ~1.606938e+60 possible states
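These numbers follow directly from the formula and are easy to check:

```python
# Each active experiment is (at least) a binary switch, so N independent
# experiments yield 2**N possible combined user states.
def possible_states(n_experiments: int) -> int:
    return 2 ** n_experiments

assert possible_states(10) == 1024
assert possible_states(20) == 1_048_576
assert possible_states(30) == 1_073_741_824
# ~200 active experiments:
assert f"{possible_states(200):.6e}" == "1.606938e+60"
```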

Supporting 2^N different user states is challenging

How do you know which experiment causes errors?

Managing an ever-changing production environment

Override options (URL parameters, cookies, headers…)
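A URL-parameter override can be sketched with the standard library (the "petri_ovr" parameter name and the "experiment:group" convention are assumptions for illustration, not PETRI’s documented syntax):

```python
from urllib.parse import parse_qs, urlparse

def overrides_from_url(url: str) -> dict:
    """Collect forced experiment assignments from the query string.

    e.g. ?petri_ovr=new-gallery:B forces 'new-gallery' into group B for
    this request only, which helps pinpoint the experiment causing errors.
    """
    query = parse_qs(urlparse(url).query)
    overrides = {}
    for item in query.get("petri_ovr", []):
        name, _, group = item.partition(":")
        if group:
            overrides[name] = group
    return overrides
```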

Near real time user BI tools

Specialized tools

Integrated into the product

Why should product care about the system architecture?

Share document with other users

Document owner is part of a test that enables a new video component

What will the other user experience when editing a shared document ?

Assignment may be different than owner’s

Owner (B) Friend (A)

Enable features by existing content

Enable features by document owner’s assignment

Exclude experimental features from shared documents

Possible solutions

A/B testing introduces complexity

Petri is more than just an A/B test framework

Feature toggle

A/B Test

Personalization

Internal testing

Continuous deployment

Jira integration

Experiments

Dynamic configuration

QA

Automated testing

Q&A

Aviran Mordo

Head of Back-end Engineering

@aviranm

http://www.linkedin.com/in/aviran

http://www.aviransplace.com

Talya Gendler

Back-end Team Leader

www.linkedin.com/in/talyagendler

https://github.com/wix/petri

http://goo.gl/L7pHnd

Credits

http://upload.wikimedia.org/wikipedia/commons/b/b2/Fiber_optics_testing.jpg

http://goo.gl/nEiepT

https://www.flickr.com/photos/ilo_oli/2421536836

https://www.flickr.com/photos/dexxus/5791228117

http://goo.gl/SdeJ0o

https://www.flickr.com/photos/112923805@N05/15005456062

https://www.flickr.com/photos/wiertz/8537791164

https://www.flickr.com/photos/laenulfean/5943132296

https://www.flickr.com/photos/torek/3470257377

https://www.flickr.com/photos/i5design/5393934753

https://www.flickr.com/photos/argonavigo/5320119828

Why Petri

Modeled experiment lifecycle

Open source (developed using TDD from day 1)

Running at scale on production

No deployment necessary

Both back-end and front-end experiments

Flexible architecture

[Architecture diagram: your app embeds the Laboratory client library, which talks to the PETRI server; experiment state and results go to the DB and logs]