Cargando…

Targeted maximum likelihood estimation for a binary treatment: A tutorial

When estimating the average effect of a binary treatment (or exposure) on an outcome, methods that incorporate propensity scores, the G‐formula, or targeted maximum likelihood estimation (TMLE) are preferred over naïve regression approaches, which are biased under misspecification of a parametric ou...

Descripción completa

Detalles Bibliográficos
Autores principales: Luque‐Fernandez, Miguel Angel, Schomaker, Michael, Rachet, Bernard, Schnitzer, Mireille E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6032875/
https://www.ncbi.nlm.nih.gov/pubmed/29687470
http://dx.doi.org/10.1002/sim.7628
_version_ 1783337590567469056
author Luque‐Fernandez, Miguel Angel
Schomaker, Michael
Rachet, Bernard
Schnitzer, Mireille E.
author_facet Luque‐Fernandez, Miguel Angel
Schomaker, Michael
Rachet, Bernard
Schnitzer, Mireille E.
author_sort Luque‐Fernandez, Miguel Angel
collection PubMed
description When estimating the average effect of a binary treatment (or exposure) on an outcome, methods that incorporate propensity scores, the G‐formula, or targeted maximum likelihood estimation (TMLE) are preferred over naïve regression approaches, which are biased under misspecification of a parametric outcome model. In contrast propensity score methods require the correct specification of an exposure model. Double‐robust methods only require correct specification of either the outcome or the exposure model. Targeted maximum likelihood estimation is a semiparametric double‐robust method that improves the chances of correct model specification by allowing for flexible estimation using (nonparametric) machine‐learning methods. It therefore requires weaker assumptions than its competitors. We provide a step‐by‐step guided implementation of TMLE and illustrate it in a realistic scenario based on cancer epidemiology where assumptions about correct model specification and positivity (ie, when a study participant had 0 probability of receiving the treatment) are nearly violated. This article provides a concise and reproducible educational introduction to TMLE for a binary outcome and exposure. The reader should gain sufficient understanding of TMLE from this introductory tutorial to be able to apply the method in practice. Extensive R‐code is provided in easy‐to‐read boxes throughout the article for replicability. Stata users will find a testing implementation of TMLE and additional material in the Appendix S1 and at the following GitHub repository: https://github.com/migariane/SIM-TMLE-tutorial
format Online
Article
Text
id pubmed-6032875
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-60328752018-07-12 Targeted maximum likelihood estimation for a binary treatment: A tutorial Luque‐Fernandez, Miguel Angel Schomaker, Michael Rachet, Bernard Schnitzer, Mireille E. Stat Med Tutorial in Biostatistics When estimating the average effect of a binary treatment (or exposure) on an outcome, methods that incorporate propensity scores, the G‐formula, or targeted maximum likelihood estimation (TMLE) are preferred over naïve regression approaches, which are biased under misspecification of a parametric outcome model. In contrast propensity score methods require the correct specification of an exposure model. Double‐robust methods only require correct specification of either the outcome or the exposure model. Targeted maximum likelihood estimation is a semiparametric double‐robust method that improves the chances of correct model specification by allowing for flexible estimation using (nonparametric) machine‐learning methods. It therefore requires weaker assumptions than its competitors. We provide a step‐by‐step guided implementation of TMLE and illustrate it in a realistic scenario based on cancer epidemiology where assumptions about correct model specification and positivity (ie, when a study participant had 0 probability of receiving the treatment) are nearly violated. This article provides a concise and reproducible educational introduction to TMLE for a binary outcome and exposure. The reader should gain sufficient understanding of TMLE from this introductory tutorial to be able to apply the method in practice. Extensive R‐code is provided in easy‐to‐read boxes throughout the article for replicability. Stata users will find a testing implementation of TMLE and additional material in the Appendix S1 and at the following GitHub repository: https://github.com/migariane/SIM-TMLE-tutorial John Wiley and Sons Inc. 2018-04-23 2018-07-20 /pmc/articles/PMC6032875/ /pubmed/29687470 http://dx.doi.org/10.1002/sim.7628 Text en © 2018 The Authors. Statistics in Medicine published by John Wiley & Sons Ltd. This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Tutorial in Biostatistics
Luque‐Fernandez, Miguel Angel
Schomaker, Michael
Rachet, Bernard
Schnitzer, Mireille E.
Targeted maximum likelihood estimation for a binary treatment: A tutorial
title Targeted maximum likelihood estimation for a binary treatment: A tutorial
title_full Targeted maximum likelihood estimation for a binary treatment: A tutorial
title_fullStr Targeted maximum likelihood estimation for a binary treatment: A tutorial
title_full_unstemmed Targeted maximum likelihood estimation for a binary treatment: A tutorial
title_short Targeted maximum likelihood estimation for a binary treatment: A tutorial
title_sort targeted maximum likelihood estimation for a binary treatment: a tutorial
topic Tutorial in Biostatistics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6032875/
https://www.ncbi.nlm.nih.gov/pubmed/29687470
http://dx.doi.org/10.1002/sim.7628
work_keys_str_mv AT luquefernandezmiguelangel targetedmaximumlikelihoodestimationforabinarytreatmentatutorial
AT schomakermichael targetedmaximumlikelihoodestimationforabinarytreatmentatutorial
AT rachetbernard targetedmaximumlikelihoodestimationforabinarytreatmentatutorial
AT schnitzermireillee targetedmaximumlikelihoodestimationforabinarytreatmentatutorial