Cargando…

Metalearners for estimating heterogeneous treatment effects using machine learning

There is growing interest in estimating and analyzing heterogeneous treatment effects in experimental and observational studies. We describe a number of metaalgorithms that can take advantage of any supervised learning or regression method in machine learning and statistics to estimate the condition...

Descripción completa

Detalles Bibliográficos
Autores principales: Künzel, Sören R., Sekhon, Jasjeet S., Bickel, Peter J., Yu, Bin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: National Academy of Sciences 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6410831/
https://www.ncbi.nlm.nih.gov/pubmed/30770453
http://dx.doi.org/10.1073/pnas.1804597116
_version_ 1783402316137758720
author Künzel, Sören R.
Sekhon, Jasjeet S.
Bickel, Peter J.
Yu, Bin
author_facet Künzel, Sören R.
Sekhon, Jasjeet S.
Bickel, Peter J.
Yu, Bin
author_sort Künzel, Sören R.
collection PubMed
description There is growing interest in estimating and analyzing heterogeneous treatment effects in experimental and observational studies. We describe a number of metaalgorithms that can take advantage of any supervised learning or regression method in machine learning and statistics to estimate the conditional average treatment effect (CATE) function. Metaalgorithms build on base algorithms—such as random forests (RFs), Bayesian additive regression trees (BARTs), or neural networks—to estimate the CATE, a function that the base algorithms are not designed to estimate directly. We introduce a metaalgorithm, the X-learner, that is provably efficient when the number of units in one treatment group is much larger than in the other and can exploit structural properties of the CATE function. For example, if the CATE function is linear and the response functions in treatment and control are Lipschitz-continuous, the X-learner can still achieve the parametric rate under regularity conditions. We then introduce versions of the X-learner that use RF and BART as base learners. In extensive simulation studies, the X-learner performs favorably, although none of the metalearners is uniformly the best. In two persuasion field experiments from political science, we demonstrate how our X-learner can be used to target treatment regimes and to shed light on underlying mechanisms. A software package is provided that implements our methods.
format Online
Article
Text
id pubmed-6410831
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher National Academy of Sciences
record_format MEDLINE/PubMed
spelling pubmed-64108312019-03-13 Metalearners for estimating heterogeneous treatment effects using machine learning Künzel, Sören R. Sekhon, Jasjeet S. Bickel, Peter J. Yu, Bin Proc Natl Acad Sci U S A PNAS Plus There is growing interest in estimating and analyzing heterogeneous treatment effects in experimental and observational studies. We describe a number of metaalgorithms that can take advantage of any supervised learning or regression method in machine learning and statistics to estimate the conditional average treatment effect (CATE) function. Metaalgorithms build on base algorithms—such as random forests (RFs), Bayesian additive regression trees (BARTs), or neural networks—to estimate the CATE, a function that the base algorithms are not designed to estimate directly. We introduce a metaalgorithm, the X-learner, that is provably efficient when the number of units in one treatment group is much larger than in the other and can exploit structural properties of the CATE function. For example, if the CATE function is linear and the response functions in treatment and control are Lipschitz-continuous, the X-learner can still achieve the parametric rate under regularity conditions. We then introduce versions of the X-learner that use RF and BART as base learners. In extensive simulation studies, the X-learner performs favorably, although none of the metalearners is uniformly the best. In two persuasion field experiments from political science, we demonstrate how our X-learner can be used to target treatment regimes and to shed light on underlying mechanisms. A software package is provided that implements our methods. National Academy of Sciences 2019-03-05 2019-02-15 /pmc/articles/PMC6410831/ /pubmed/30770453 http://dx.doi.org/10.1073/pnas.1804597116 Text en Copyright © 2019 the Author(s). Published by PNAS. https://creativecommons.org/licenses/by-nc-nd/4.0/ This open access article is distributed under Creative Commons Attribution-NonCommercial-NoDerivatives License 4.0 (CC BY-NC-ND) (https://creativecommons.org/licenses/by-nc-nd/4.0/) .
spellingShingle PNAS Plus
Künzel, Sören R.
Sekhon, Jasjeet S.
Bickel, Peter J.
Yu, Bin
Metalearners for estimating heterogeneous treatment effects using machine learning
title Metalearners for estimating heterogeneous treatment effects using machine learning
title_full Metalearners for estimating heterogeneous treatment effects using machine learning
title_fullStr Metalearners for estimating heterogeneous treatment effects using machine learning
title_full_unstemmed Metalearners for estimating heterogeneous treatment effects using machine learning
title_short Metalearners for estimating heterogeneous treatment effects using machine learning
title_sort metalearners for estimating heterogeneous treatment effects using machine learning
topic PNAS Plus
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6410831/
https://www.ncbi.nlm.nih.gov/pubmed/30770453
http://dx.doi.org/10.1073/pnas.1804597116
work_keys_str_mv AT kunzelsorenr metalearnersforestimatingheterogeneoustreatmenteffectsusingmachinelearning
AT sekhonjasjeets metalearnersforestimatingheterogeneoustreatmenteffectsusingmachinelearning
AT bickelpeterj metalearnersforestimatingheterogeneoustreatmenteffectsusingmachinelearning
AT yubin metalearnersforestimatingheterogeneoustreatmenteffectsusingmachinelearning