Cargando…

Propensity score synthetic augmentation matching using generative adversarial networks (PSSAM-GAN)

Understanding causality is of crucial importance in biomedical sciences, where developing prediction models is insufficient because the models need to be actionable. However, data sources, such as electronic health records, are observational and often plagued with various types of biases, e.g. confo...

Descripción completa

Detalles Bibliográficos
Autores principales:	Ghosh, Shantanu, Boucher, Christina, Bian, Jiang, Prosperi, Mattia
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	2021
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8357304/ https://www.ncbi.nlm.nih.gov/pubmed/34386786 http://dx.doi.org/10.1016/j.cmpbup.2021.100020

_version_	1783737105714774016
author	Ghosh, Shantanu Boucher, Christina Bian, Jiang Prosperi, Mattia
author_facet	Ghosh, Shantanu Boucher, Christina Bian, Jiang Prosperi, Mattia
author_sort	Ghosh, Shantanu
collection	PubMed
description	Understanding causality is of crucial importance in biomedical sciences, where developing prediction models is insufficient because the models need to be actionable. However, data sources, such as electronic health records, are observational and often plagued with various types of biases, e.g. confounding. Although randomized controlled trials are the gold standard to estimate the causal effects of treatment interventions on health outcomes, they are not always possible. Propensity score matching (PSM) is a popular statistical technique for observational data that aims at balancing the characteristics of the population assigned either to a treatment or to a control group, making treatment assignment and outcome independent upon these characteristics. However, matching subjects can reduce the sample size. Inverse probability weighting (IPW) maintains the sample size, but extreme values can lead to instability. While PSM and IPW have been historically used in conjunction with linear regression, machine learning methods –including deep learning with propensity dropout– have been proposed to account for nonlinear treatment assignments. In this work, we propose a novel deep learning approach –the Propensity Score Synthetic Augmentation Matching using Generative Adversarial Networks (PSSAM-GAN)– that aims at keeping the sample size, without IPW, by generating synthetic matches. PSSAM-GAN can be used in conjunction with any other prediction method to estimate treatment effects. Experiments performed on both semi-synthetic (perinatal interventions) and real-world observational data (antibiotic treatments, and job interventions) show that the PSSAM-GAN approach effectively creates balanced datasets, relaxing the weighting/dropout needs for downstream methods, and providing competitive performance in effects estimation as compared to simple GAN and in conjunction with other deep counterfactual learning architectures, e.g. TARNet.
format	Online Article Text
id	pubmed-8357304
institution	National Center for Biotechnology Information
language	English
publishDate	2021
record_format	MEDLINE/PubMed
spelling	pubmed-83573042021-08-11 Propensity score synthetic augmentation matching using generative adversarial networks (PSSAM-GAN) Ghosh, Shantanu Boucher, Christina Bian, Jiang Prosperi, Mattia Comput Methods Programs Biomed Update Article Understanding causality is of crucial importance in biomedical sciences, where developing prediction models is insufficient because the models need to be actionable. However, data sources, such as electronic health records, are observational and often plagued with various types of biases, e.g. confounding. Although randomized controlled trials are the gold standard to estimate the causal effects of treatment interventions on health outcomes, they are not always possible. Propensity score matching (PSM) is a popular statistical technique for observational data that aims at balancing the characteristics of the population assigned either to a treatment or to a control group, making treatment assignment and outcome independent upon these characteristics. However, matching subjects can reduce the sample size. Inverse probability weighting (IPW) maintains the sample size, but extreme values can lead to instability. While PSM and IPW have been historically used in conjunction with linear regression, machine learning methods –including deep learning with propensity dropout– have been proposed to account for nonlinear treatment assignments. In this work, we propose a novel deep learning approach –the Propensity Score Synthetic Augmentation Matching using Generative Adversarial Networks (PSSAM-GAN)– that aims at keeping the sample size, without IPW, by generating synthetic matches. PSSAM-GAN can be used in conjunction with any other prediction method to estimate treatment effects. Experiments performed on both semi-synthetic (perinatal interventions) and real-world observational data (antibiotic treatments, and job interventions) show that the PSSAM-GAN approach effectively creates balanced datasets, relaxing the weighting/dropout needs for downstream methods, and providing competitive performance in effects estimation as compared to simple GAN and in conjunction with other deep counterfactual learning architectures, e.g. TARNet. 2021-07-16 2021 /pmc/articles/PMC8357304/ /pubmed/34386786 http://dx.doi.org/10.1016/j.cmpbup.2021.100020 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/ (https://creativecommons.org/licenses/by-nc-nd/4.0/) ).
spellingShingle	Article Ghosh, Shantanu Boucher, Christina Bian, Jiang Prosperi, Mattia Propensity score synthetic augmentation matching using generative adversarial networks (PSSAM-GAN)
title	Propensity score synthetic augmentation matching using generative adversarial networks (PSSAM-GAN)
title_full	Propensity score synthetic augmentation matching using generative adversarial networks (PSSAM-GAN)
title_fullStr	Propensity score synthetic augmentation matching using generative adversarial networks (PSSAM-GAN)
title_full_unstemmed	Propensity score synthetic augmentation matching using generative adversarial networks (PSSAM-GAN)
title_short	Propensity score synthetic augmentation matching using generative adversarial networks (PSSAM-GAN)
title_sort	propensity score synthetic augmentation matching using generative adversarial networks (pssam-gan)
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8357304/ https://www.ncbi.nlm.nih.gov/pubmed/34386786 http://dx.doi.org/10.1016/j.cmpbup.2021.100020
work_keys_str_mv	AT ghoshshantanu propensityscoresyntheticaugmentationmatchingusinggenerativeadversarialnetworkspssamgan AT boucherchristina propensityscoresyntheticaugmentationmatchingusinggenerativeadversarialnetworkspssamgan AT bianjiang propensityscoresyntheticaugmentationmatchingusinggenerativeadversarialnetworkspssamgan AT prosperimattia propensityscoresyntheticaugmentationmatchingusinggenerativeadversarialnetworkspssamgan

Propensity score synthetic augmentation matching using generative adversarial networks (PSSAM-GAN)

Ejemplares similares