Cargando…
Predicting prenatal depression and assessing model bias using machine learning models
Perinatal depression (PND) is one of the most common medical complications during pregnancy and postpartum period, affecting 10–20% of pregnant individuals. Black and Latina women have higher rates of PND, yet they are less likely to be diagnosed and receive treatment. Machine learning (ML) models b...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10371186/ https://www.ncbi.nlm.nih.gov/pubmed/37503225 http://dx.doi.org/10.1101/2023.07.17.23292587 |
_version_ | 1785078101283176448 |
---|---|
author | Huang, Yongchao Alvernaz, Suzanne Kim, Sage J. Maki, Pauline Dai, Yang Bernabé, Beatriz Penñalver |
author_facet | Huang, Yongchao Alvernaz, Suzanne Kim, Sage J. Maki, Pauline Dai, Yang Bernabé, Beatriz Penñalver |
author_sort | Huang, Yongchao |
collection | PubMed |
description | Perinatal depression (PND) is one of the most common medical complications during pregnancy and postpartum period, affecting 10–20% of pregnant individuals. Black and Latina women have higher rates of PND, yet they are less likely to be diagnosed and receive treatment. Machine learning (ML) models based on Electronic Medical Records (EMRs) have been effective in predicting postpartum depression in middle-class White women but have rarely included sufficient proportions of racial and ethnic minorities, which contributed to biases in ML models for minority women. Our goal is to determine whether ML models could serve to predict depression in early pregnancy in racial/ethnic minority women by leveraging EMR data. We extracted EMRs from a hospital in a large urban city that mostly served low-income Black and Hispanic women (N=5,875) in the U.S. Depressive symptom severity was assessed from a self-reported questionnaire, PHQ-9. We investigated multiple ML classifiers, used Shapley Additive Explanations (SHAP) for model interpretation, and determined model prediction bias with two metrics, Disparate Impact, and Equal Opportunity Difference. While ML model (Elastic Net) performance was low (ROCAUC=0.67), we identified well-known factors associated with PND, such as unplanned pregnancy and being single, as well as underexplored factors, such as self-report pain levels, lower levels of prenatal vitamin supplement intake, asthma, carrying a male fetus, and lower platelet levels blood. Our findings showed that despite being based on a sample mostly composed of 75% low-income minority women (54% Black and 27% Latina), the model performance was lower for these communities. In conclusion, ML models based on EMRs could moderately predict depression in early pregnancy, but their performance is biased against low-income minority women. |
format | Online Article Text |
id | pubmed-10371186 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Cold Spring Harbor Laboratory |
record_format | MEDLINE/PubMed |
spelling | pubmed-103711862023-07-27 Predicting prenatal depression and assessing model bias using machine learning models Huang, Yongchao Alvernaz, Suzanne Kim, Sage J. Maki, Pauline Dai, Yang Bernabé, Beatriz Penñalver medRxiv Article Perinatal depression (PND) is one of the most common medical complications during pregnancy and postpartum period, affecting 10–20% of pregnant individuals. Black and Latina women have higher rates of PND, yet they are less likely to be diagnosed and receive treatment. Machine learning (ML) models based on Electronic Medical Records (EMRs) have been effective in predicting postpartum depression in middle-class White women but have rarely included sufficient proportions of racial and ethnic minorities, which contributed to biases in ML models for minority women. Our goal is to determine whether ML models could serve to predict depression in early pregnancy in racial/ethnic minority women by leveraging EMR data. We extracted EMRs from a hospital in a large urban city that mostly served low-income Black and Hispanic women (N=5,875) in the U.S. Depressive symptom severity was assessed from a self-reported questionnaire, PHQ-9. We investigated multiple ML classifiers, used Shapley Additive Explanations (SHAP) for model interpretation, and determined model prediction bias with two metrics, Disparate Impact, and Equal Opportunity Difference. While ML model (Elastic Net) performance was low (ROCAUC=0.67), we identified well-known factors associated with PND, such as unplanned pregnancy and being single, as well as underexplored factors, such as self-report pain levels, lower levels of prenatal vitamin supplement intake, asthma, carrying a male fetus, and lower platelet levels blood. Our findings showed that despite being based on a sample mostly composed of 75% low-income minority women (54% Black and 27% Latina), the model performance was lower for these communities. In conclusion, ML models based on EMRs could moderately predict depression in early pregnancy, but their performance is biased against low-income minority women. Cold Spring Harbor Laboratory 2023-07-18 /pmc/articles/PMC10371186/ /pubmed/37503225 http://dx.doi.org/10.1101/2023.07.17.23292587 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator. |
spellingShingle | Article Huang, Yongchao Alvernaz, Suzanne Kim, Sage J. Maki, Pauline Dai, Yang Bernabé, Beatriz Penñalver Predicting prenatal depression and assessing model bias using machine learning models |
title | Predicting prenatal depression and assessing model bias using machine learning models |
title_full | Predicting prenatal depression and assessing model bias using machine learning models |
title_fullStr | Predicting prenatal depression and assessing model bias using machine learning models |
title_full_unstemmed | Predicting prenatal depression and assessing model bias using machine learning models |
title_short | Predicting prenatal depression and assessing model bias using machine learning models |
title_sort | predicting prenatal depression and assessing model bias using machine learning models |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10371186/ https://www.ncbi.nlm.nih.gov/pubmed/37503225 http://dx.doi.org/10.1101/2023.07.17.23292587 |
work_keys_str_mv | AT huangyongchao predictingprenataldepressionandassessingmodelbiasusingmachinelearningmodels AT alvernazsuzanne predictingprenataldepressionandassessingmodelbiasusingmachinelearningmodels AT kimsagej predictingprenataldepressionandassessingmodelbiasusingmachinelearningmodels AT makipauline predictingprenataldepressionandassessingmodelbiasusingmachinelearningmodels AT daiyang predictingprenataldepressionandassessingmodelbiasusingmachinelearningmodels AT bernabebeatrizpennalver predictingprenataldepressionandassessingmodelbiasusingmachinelearningmodels |