Cargando…

Predicting prenatal depression and assessing model bias using machine learning models

Perinatal depression (PND) is one of the most common medical complications during pregnancy and postpartum period, affecting 10–20% of pregnant individuals. Black and Latina women have higher rates of PND, yet they are less likely to be diagnosed and receive treatment. Machine learning (ML) models b...

Descripción completa

Detalles Bibliográficos
Autores principales: Huang, Yongchao, Alvernaz, Suzanne, Kim, Sage J., Maki, Pauline, Dai, Yang, Bernabé, Beatriz Penñalver
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10371186/
https://www.ncbi.nlm.nih.gov/pubmed/37503225
http://dx.doi.org/10.1101/2023.07.17.23292587
_version_ 1785078101283176448
author Huang, Yongchao
Alvernaz, Suzanne
Kim, Sage J.
Maki, Pauline
Dai, Yang
Bernabé, Beatriz Penñalver
author_facet Huang, Yongchao
Alvernaz, Suzanne
Kim, Sage J.
Maki, Pauline
Dai, Yang
Bernabé, Beatriz Penñalver
author_sort Huang, Yongchao
collection PubMed
description Perinatal depression (PND) is one of the most common medical complications during pregnancy and postpartum period, affecting 10–20% of pregnant individuals. Black and Latina women have higher rates of PND, yet they are less likely to be diagnosed and receive treatment. Machine learning (ML) models based on Electronic Medical Records (EMRs) have been effective in predicting postpartum depression in middle-class White women but have rarely included sufficient proportions of racial and ethnic minorities, which contributed to biases in ML models for minority women. Our goal is to determine whether ML models could serve to predict depression in early pregnancy in racial/ethnic minority women by leveraging EMR data. We extracted EMRs from a hospital in a large urban city that mostly served low-income Black and Hispanic women (N=5,875) in the U.S. Depressive symptom severity was assessed from a self-reported questionnaire, PHQ-9. We investigated multiple ML classifiers, used Shapley Additive Explanations (SHAP) for model interpretation, and determined model prediction bias with two metrics, Disparate Impact, and Equal Opportunity Difference. While ML model (Elastic Net) performance was low (ROCAUC=0.67), we identified well-known factors associated with PND, such as unplanned pregnancy and being single, as well as underexplored factors, such as self-report pain levels, lower levels of prenatal vitamin supplement intake, asthma, carrying a male fetus, and lower platelet levels blood. Our findings showed that despite being based on a sample mostly composed of 75% low-income minority women (54% Black and 27% Latina), the model performance was lower for these communities. In conclusion, ML models based on EMRs could moderately predict depression in early pregnancy, but their performance is biased against low-income minority women.
format Online
Article
Text
id pubmed-10371186
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-103711862023-07-27 Predicting prenatal depression and assessing model bias using machine learning models Huang, Yongchao Alvernaz, Suzanne Kim, Sage J. Maki, Pauline Dai, Yang Bernabé, Beatriz Penñalver medRxiv Article Perinatal depression (PND) is one of the most common medical complications during pregnancy and postpartum period, affecting 10–20% of pregnant individuals. Black and Latina women have higher rates of PND, yet they are less likely to be diagnosed and receive treatment. Machine learning (ML) models based on Electronic Medical Records (EMRs) have been effective in predicting postpartum depression in middle-class White women but have rarely included sufficient proportions of racial and ethnic minorities, which contributed to biases in ML models for minority women. Our goal is to determine whether ML models could serve to predict depression in early pregnancy in racial/ethnic minority women by leveraging EMR data. We extracted EMRs from a hospital in a large urban city that mostly served low-income Black and Hispanic women (N=5,875) in the U.S. Depressive symptom severity was assessed from a self-reported questionnaire, PHQ-9. We investigated multiple ML classifiers, used Shapley Additive Explanations (SHAP) for model interpretation, and determined model prediction bias with two metrics, Disparate Impact, and Equal Opportunity Difference. While ML model (Elastic Net) performance was low (ROCAUC=0.67), we identified well-known factors associated with PND, such as unplanned pregnancy and being single, as well as underexplored factors, such as self-report pain levels, lower levels of prenatal vitamin supplement intake, asthma, carrying a male fetus, and lower platelet levels blood. Our findings showed that despite being based on a sample mostly composed of 75% low-income minority women (54% Black and 27% Latina), the model performance was lower for these communities. In conclusion, ML models based on EMRs could moderately predict depression in early pregnancy, but their performance is biased against low-income minority women. Cold Spring Harbor Laboratory 2023-07-18 /pmc/articles/PMC10371186/ /pubmed/37503225 http://dx.doi.org/10.1101/2023.07.17.23292587 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator.
spellingShingle Article
Huang, Yongchao
Alvernaz, Suzanne
Kim, Sage J.
Maki, Pauline
Dai, Yang
Bernabé, Beatriz Penñalver
Predicting prenatal depression and assessing model bias using machine learning models
title Predicting prenatal depression and assessing model bias using machine learning models
title_full Predicting prenatal depression and assessing model bias using machine learning models
title_fullStr Predicting prenatal depression and assessing model bias using machine learning models
title_full_unstemmed Predicting prenatal depression and assessing model bias using machine learning models
title_short Predicting prenatal depression and assessing model bias using machine learning models
title_sort predicting prenatal depression and assessing model bias using machine learning models
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10371186/
https://www.ncbi.nlm.nih.gov/pubmed/37503225
http://dx.doi.org/10.1101/2023.07.17.23292587
work_keys_str_mv AT huangyongchao predictingprenataldepressionandassessingmodelbiasusingmachinelearningmodels
AT alvernazsuzanne predictingprenataldepressionandassessingmodelbiasusingmachinelearningmodels
AT kimsagej predictingprenataldepressionandassessingmodelbiasusingmachinelearningmodels
AT makipauline predictingprenataldepressionandassessingmodelbiasusingmachinelearningmodels
AT daiyang predictingprenataldepressionandassessingmodelbiasusingmachinelearningmodels
AT bernabebeatrizpennalver predictingprenataldepressionandassessingmodelbiasusingmachinelearningmodels