Cargando…

Online randomized controlled experiments at scale: lessons and extensions to medicine

BACKGROUND: Many technology companies, including Airbnb, Amazon, Booking.com, eBay, Facebook, Google, LinkedIn, Lyft, Microsoft, Netflix, Twitter, Uber, and Yahoo!/Oath, run online randomized controlled experiments at scale, namely hundreds of concurrent controlled experiments on millions of users e...

Descripción completa

Detalles Bibliográficos
Autores principales: Kohavi, Ron, Tang, Diane, Xu, Ya, Hemkens, Lars G., Ioannidis, John P. A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7007661/
https://www.ncbi.nlm.nih.gov/pubmed/32033614
http://dx.doi.org/10.1186/s13063-020-4084-y
_version_ 1783495351008755712
author Kohavi, Ron
Tang, Diane
Xu, Ya
Hemkens, Lars G.
Ioannidis, John P. A.
author_facet Kohavi, Ron
Tang, Diane
Xu, Ya
Hemkens, Lars G.
Ioannidis, John P. A.
author_sort Kohavi, Ron
collection PubMed
description BACKGROUND: Many technology companies, including Airbnb, Amazon, Booking.com, eBay, Facebook, Google, LinkedIn, Lyft, Microsoft, Netflix, Twitter, Uber, and Yahoo!/Oath, run online randomized controlled experiments at scale, namely hundreds of concurrent controlled experiments on millions of users each, commonly referred to as A/B tests. Originally derived from the same statistical roots, randomized controlled trials (RCTs) in medicine are now criticized for being expensive and difficult, while in technology, the marginal cost of such experiments is approaching zero and the value for data-driven decision-making is broadly recognized. METHODS AND RESULTS: This is an overview of key scaling lessons learned in the technology field. They include (1) a focus on metrics, an overall evaluation criterion and thousands of metrics for insights and debugging, automatically computed for every experiment; (2) quick release cycles with automated ramp-up and shut-down that afford agile and safe experimentation, leading to consistent incremental progress over time; and (3) a culture of ‘test everything’ because most ideas fail and tiny changes sometimes show surprising outcomes worth millions of dollars annually. Technological advances, online interactions, and the availability of large-scale data allowed technology companies to take the science of RCTs and use them as online randomized controlled experiments at large scale with hundreds of such concurrent experiments running on any given day on a wide range of software products, be they web sites, mobile applications, or desktop applications. Rather than hindering innovation, these experiments enabled accelerated innovation with clear improvements to key metrics, including user experience and revenue. As healthcare increases interactions with patients utilizing these modern channels of web sites and digital health applications, many of the lessons apply. The most innovative technological field has recognized that systematic series of randomized trials with numerous failures of the most promising ideas leads to sustainable improvement. CONCLUSION: While there are many differences between technology and medicine, it is worth considering whether and how similar designs can be applied via simple RCTs that focus on healthcare decision-making or service delivery. Changes – small and large – should undergo continuous and repeated evaluations in randomized trials and learning from their results will enable accelerated healthcare improvements.
format Online
Article
Text
id pubmed-7007661
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-70076612020-02-13 Online randomized controlled experiments at scale: lessons and extensions to medicine Kohavi, Ron Tang, Diane Xu, Ya Hemkens, Lars G. Ioannidis, John P. A. Trials Methodology BACKGROUND: Many technology companies, including Airbnb, Amazon, Booking.com, eBay, Facebook, Google, LinkedIn, Lyft, Microsoft, Netflix, Twitter, Uber, and Yahoo!/Oath, run online randomized controlled experiments at scale, namely hundreds of concurrent controlled experiments on millions of users each, commonly referred to as A/B tests. Originally derived from the same statistical roots, randomized controlled trials (RCTs) in medicine are now criticized for being expensive and difficult, while in technology, the marginal cost of such experiments is approaching zero and the value for data-driven decision-making is broadly recognized. METHODS AND RESULTS: This is an overview of key scaling lessons learned in the technology field. They include (1) a focus on metrics, an overall evaluation criterion and thousands of metrics for insights and debugging, automatically computed for every experiment; (2) quick release cycles with automated ramp-up and shut-down that afford agile and safe experimentation, leading to consistent incremental progress over time; and (3) a culture of ‘test everything’ because most ideas fail and tiny changes sometimes show surprising outcomes worth millions of dollars annually. Technological advances, online interactions, and the availability of large-scale data allowed technology companies to take the science of RCTs and use them as online randomized controlled experiments at large scale with hundreds of such concurrent experiments running on any given day on a wide range of software products, be they web sites, mobile applications, or desktop applications. Rather than hindering innovation, these experiments enabled accelerated innovation with clear improvements to key metrics, including user experience and revenue. As healthcare increases interactions with patients utilizing these modern channels of web sites and digital health applications, many of the lessons apply. The most innovative technological field has recognized that systematic series of randomized trials with numerous failures of the most promising ideas leads to sustainable improvement. CONCLUSION: While there are many differences between technology and medicine, it is worth considering whether and how similar designs can be applied via simple RCTs that focus on healthcare decision-making or service delivery. Changes – small and large – should undergo continuous and repeated evaluations in randomized trials and learning from their results will enable accelerated healthcare improvements. BioMed Central 2020-02-07 /pmc/articles/PMC7007661/ /pubmed/32033614 http://dx.doi.org/10.1186/s13063-020-4084-y Text en © The Author(s). 2020 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology
Kohavi, Ron
Tang, Diane
Xu, Ya
Hemkens, Lars G.
Ioannidis, John P. A.
Online randomized controlled experiments at scale: lessons and extensions to medicine
title Online randomized controlled experiments at scale: lessons and extensions to medicine
title_full Online randomized controlled experiments at scale: lessons and extensions to medicine
title_fullStr Online randomized controlled experiments at scale: lessons and extensions to medicine
title_full_unstemmed Online randomized controlled experiments at scale: lessons and extensions to medicine
title_short Online randomized controlled experiments at scale: lessons and extensions to medicine
title_sort online randomized controlled experiments at scale: lessons and extensions to medicine
topic Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7007661/
https://www.ncbi.nlm.nih.gov/pubmed/32033614
http://dx.doi.org/10.1186/s13063-020-4084-y
work_keys_str_mv AT kohaviron onlinerandomizedcontrolledexperimentsatscalelessonsandextensionstomedicine
AT tangdiane onlinerandomizedcontrolledexperimentsatscalelessonsandextensionstomedicine
AT xuya onlinerandomizedcontrolledexperimentsatscalelessonsandextensionstomedicine
AT hemkenslarsg onlinerandomizedcontrolledexperimentsatscalelessonsandextensionstomedicine
AT ioannidisjohnpa onlinerandomizedcontrolledexperimentsatscalelessonsandextensionstomedicine