Cargando…

Targeted L(1)-Regularization and Joint Modeling of Neural Networks for Causal Inference

The calculation of the Augmented Inverse Probability Weighting (AIPW) estimator of the Average Treatment Effect (ATE) is carried out in two steps, where in the first step, the treatment and outcome are modeled, and in the second step, the predictions are inserted into the AIPW estimator. The model m...

Descripción completa

Detalles Bibliográficos
Autores principales: Rostami, Mehdi, Saarela, Olli
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9497603/
https://www.ncbi.nlm.nih.gov/pubmed/36141175
http://dx.doi.org/10.3390/e24091290
_version_ 1784794546527272960
author Rostami, Mehdi
Saarela, Olli
author_facet Rostami, Mehdi
Saarela, Olli
author_sort Rostami, Mehdi
collection PubMed
description The calculation of the Augmented Inverse Probability Weighting (AIPW) estimator of the Average Treatment Effect (ATE) is carried out in two steps, where in the first step, the treatment and outcome are modeled, and in the second step, the predictions are inserted into the AIPW estimator. The model misspecification in the first step has led researchers to utilize Machine Learning algorithms instead of parametric algorithms. However, the existence of strong confounders and/or Instrumental Variables (IVs) can lead the complex ML algorithms to provide perfect predictions for the treatment model which can violate the positivity assumption and elevate the variance of AIPW estimators. Thus the complexity of ML algorithms must be controlled to avoid perfect predictions for the treatment model while still learning the relationship between the confounders and the treatment and outcome. We use two NN architectures with an [Formula: see text]-regularization on specific NN parameters and investigate how their certain hyperparameters should be tuned in the presence of confounders and IVs to achieve a low bias-variance tradeoff for ATE estimators such as AIPW estimator. Through simulation results, we will provide recommendations as to how NNs can be employed for ATE estimation.
format Online
Article
Text
id pubmed-9497603
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-94976032022-09-23 Targeted L(1)-Regularization and Joint Modeling of Neural Networks for Causal Inference Rostami, Mehdi Saarela, Olli Entropy (Basel) Article The calculation of the Augmented Inverse Probability Weighting (AIPW) estimator of the Average Treatment Effect (ATE) is carried out in two steps, where in the first step, the treatment and outcome are modeled, and in the second step, the predictions are inserted into the AIPW estimator. The model misspecification in the first step has led researchers to utilize Machine Learning algorithms instead of parametric algorithms. However, the existence of strong confounders and/or Instrumental Variables (IVs) can lead the complex ML algorithms to provide perfect predictions for the treatment model which can violate the positivity assumption and elevate the variance of AIPW estimators. Thus the complexity of ML algorithms must be controlled to avoid perfect predictions for the treatment model while still learning the relationship between the confounders and the treatment and outcome. We use two NN architectures with an [Formula: see text]-regularization on specific NN parameters and investigate how their certain hyperparameters should be tuned in the presence of confounders and IVs to achieve a low bias-variance tradeoff for ATE estimators such as AIPW estimator. Through simulation results, we will provide recommendations as to how NNs can be employed for ATE estimation. MDPI 2022-09-13 /pmc/articles/PMC9497603/ /pubmed/36141175 http://dx.doi.org/10.3390/e24091290 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Rostami, Mehdi
Saarela, Olli
Targeted L(1)-Regularization and Joint Modeling of Neural Networks for Causal Inference
title Targeted L(1)-Regularization and Joint Modeling of Neural Networks for Causal Inference
title_full Targeted L(1)-Regularization and Joint Modeling of Neural Networks for Causal Inference
title_fullStr Targeted L(1)-Regularization and Joint Modeling of Neural Networks for Causal Inference
title_full_unstemmed Targeted L(1)-Regularization and Joint Modeling of Neural Networks for Causal Inference
title_short Targeted L(1)-Regularization and Joint Modeling of Neural Networks for Causal Inference
title_sort targeted l(1)-regularization and joint modeling of neural networks for causal inference
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9497603/
https://www.ncbi.nlm.nih.gov/pubmed/36141175
http://dx.doi.org/10.3390/e24091290
work_keys_str_mv AT rostamimehdi targetedl1regularizationandjointmodelingofneuralnetworksforcausalinference
AT saarelaolli targetedl1regularizationandjointmodelingofneuralnetworksforcausalinference