Cargando…
Targeted L(1)-Regularization and Joint Modeling of Neural Networks for Causal Inference
The calculation of the Augmented Inverse Probability Weighting (AIPW) estimator of the Average Treatment Effect (ATE) is carried out in two steps, where in the first step, the treatment and outcome are modeled, and in the second step, the predictions are inserted into the AIPW estimator. The model m...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9497603/ https://www.ncbi.nlm.nih.gov/pubmed/36141175 http://dx.doi.org/10.3390/e24091290 |
_version_ | 1784794546527272960 |
---|---|
author | Rostami, Mehdi Saarela, Olli |
author_facet | Rostami, Mehdi Saarela, Olli |
author_sort | Rostami, Mehdi |
collection | PubMed |
description | The calculation of the Augmented Inverse Probability Weighting (AIPW) estimator of the Average Treatment Effect (ATE) is carried out in two steps, where in the first step, the treatment and outcome are modeled, and in the second step, the predictions are inserted into the AIPW estimator. The model misspecification in the first step has led researchers to utilize Machine Learning algorithms instead of parametric algorithms. However, the existence of strong confounders and/or Instrumental Variables (IVs) can lead the complex ML algorithms to provide perfect predictions for the treatment model which can violate the positivity assumption and elevate the variance of AIPW estimators. Thus the complexity of ML algorithms must be controlled to avoid perfect predictions for the treatment model while still learning the relationship between the confounders and the treatment and outcome. We use two NN architectures with an [Formula: see text]-regularization on specific NN parameters and investigate how their certain hyperparameters should be tuned in the presence of confounders and IVs to achieve a low bias-variance tradeoff for ATE estimators such as AIPW estimator. Through simulation results, we will provide recommendations as to how NNs can be employed for ATE estimation. |
format | Online Article Text |
id | pubmed-9497603 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-94976032022-09-23 Targeted L(1)-Regularization and Joint Modeling of Neural Networks for Causal Inference Rostami, Mehdi Saarela, Olli Entropy (Basel) Article The calculation of the Augmented Inverse Probability Weighting (AIPW) estimator of the Average Treatment Effect (ATE) is carried out in two steps, where in the first step, the treatment and outcome are modeled, and in the second step, the predictions are inserted into the AIPW estimator. The model misspecification in the first step has led researchers to utilize Machine Learning algorithms instead of parametric algorithms. However, the existence of strong confounders and/or Instrumental Variables (IVs) can lead the complex ML algorithms to provide perfect predictions for the treatment model which can violate the positivity assumption and elevate the variance of AIPW estimators. Thus the complexity of ML algorithms must be controlled to avoid perfect predictions for the treatment model while still learning the relationship between the confounders and the treatment and outcome. We use two NN architectures with an [Formula: see text]-regularization on specific NN parameters and investigate how their certain hyperparameters should be tuned in the presence of confounders and IVs to achieve a low bias-variance tradeoff for ATE estimators such as AIPW estimator. Through simulation results, we will provide recommendations as to how NNs can be employed for ATE estimation. MDPI 2022-09-13 /pmc/articles/PMC9497603/ /pubmed/36141175 http://dx.doi.org/10.3390/e24091290 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Rostami, Mehdi Saarela, Olli Targeted L(1)-Regularization and Joint Modeling of Neural Networks for Causal Inference |
title | Targeted L(1)-Regularization and Joint Modeling of Neural Networks for Causal Inference |
title_full | Targeted L(1)-Regularization and Joint Modeling of Neural Networks for Causal Inference |
title_fullStr | Targeted L(1)-Regularization and Joint Modeling of Neural Networks for Causal Inference |
title_full_unstemmed | Targeted L(1)-Regularization and Joint Modeling of Neural Networks for Causal Inference |
title_short | Targeted L(1)-Regularization and Joint Modeling of Neural Networks for Causal Inference |
title_sort | targeted l(1)-regularization and joint modeling of neural networks for causal inference |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9497603/ https://www.ncbi.nlm.nih.gov/pubmed/36141175 http://dx.doi.org/10.3390/e24091290 |
work_keys_str_mv | AT rostamimehdi targetedl1regularizationandjointmodelingofneuralnetworksforcausalinference AT saarelaolli targetedl1regularizationandjointmodelingofneuralnetworksforcausalinference |