Cargando…

Estimation of postpartum depression risk from electronic health records using machine learning

BACKGROUND: Postpartum depression is a widespread disorder, adversely affecting the well-being of mothers and their newborns. We aim to utilize machine learning for predicting risk of postpartum depression (PPD) using primary care electronic health records (EHR) data, and to evaluate the potential v...

Descripción completa

Detalles Bibliográficos
Autores principales: Amit, Guy, Girshovitz, Irena, Marcus, Karni, Zhang, Yiye, Pathak, Jyotishman, Bar, Vered, Akiva, Pinchas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8447665/
https://www.ncbi.nlm.nih.gov/pubmed/34535116
http://dx.doi.org/10.1186/s12884-021-04087-8
_version_ 1784569066665541632
author Amit, Guy
Girshovitz, Irena
Marcus, Karni
Zhang, Yiye
Pathak, Jyotishman
Bar, Vered
Akiva, Pinchas
author_facet Amit, Guy
Girshovitz, Irena
Marcus, Karni
Zhang, Yiye
Pathak, Jyotishman
Bar, Vered
Akiva, Pinchas
author_sort Amit, Guy
collection PubMed
description BACKGROUND: Postpartum depression is a widespread disorder, adversely affecting the well-being of mothers and their newborns. We aim to utilize machine learning for predicting risk of postpartum depression (PPD) using primary care electronic health records (EHR) data, and to evaluate the potential value of EHR-based prediction in improving the accuracy of PPD screening and in early identification of women at risk. METHODS: We analyzed EHR data of 266,544 women from the UK who gave first live birth between 2000 and 2017. We extracted a multitude of socio-demographic and medical variables and constructed a machine learning model that predicts the risk of PPD during the year following childbirth. We evaluated the model’s performance using multiple validation methodologies and measured its accuracy as a stand-alone tool and as an adjunct to the standard questionnaire-based screening by Edinburgh postnatal depression scale (EPDS). RESULTS: The prevalence of PPD in the analyzed cohort was 13.4%. Combing EHR-based prediction with EPDS score increased the area under the receiver operator characteristics curve (AUC) from 0.805 to 0.844 and the sensitivity from 0.72 to 0.76, at specificity of 0.80. The AUC of the EHR-based prediction model alone varied from 0.72 to 0.74 and decreased by only 0.01–0.02 when applied as early as before the beginning of pregnancy. CONCLUSIONS: PPD risk prediction using EHR data may provide a complementary quantitative and objective tool for PPD screening, allowing earlier (pre-pregnancy) and more accurate identification of women at risk, timely interventions and potentially improved outcomes for the mother and child. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12884-021-04087-8.
format Online
Article
Text
id pubmed-8447665
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-84476652021-09-17 Estimation of postpartum depression risk from electronic health records using machine learning Amit, Guy Girshovitz, Irena Marcus, Karni Zhang, Yiye Pathak, Jyotishman Bar, Vered Akiva, Pinchas BMC Pregnancy Childbirth Research BACKGROUND: Postpartum depression is a widespread disorder, adversely affecting the well-being of mothers and their newborns. We aim to utilize machine learning for predicting risk of postpartum depression (PPD) using primary care electronic health records (EHR) data, and to evaluate the potential value of EHR-based prediction in improving the accuracy of PPD screening and in early identification of women at risk. METHODS: We analyzed EHR data of 266,544 women from the UK who gave first live birth between 2000 and 2017. We extracted a multitude of socio-demographic and medical variables and constructed a machine learning model that predicts the risk of PPD during the year following childbirth. We evaluated the model’s performance using multiple validation methodologies and measured its accuracy as a stand-alone tool and as an adjunct to the standard questionnaire-based screening by Edinburgh postnatal depression scale (EPDS). RESULTS: The prevalence of PPD in the analyzed cohort was 13.4%. Combing EHR-based prediction with EPDS score increased the area under the receiver operator characteristics curve (AUC) from 0.805 to 0.844 and the sensitivity from 0.72 to 0.76, at specificity of 0.80. The AUC of the EHR-based prediction model alone varied from 0.72 to 0.74 and decreased by only 0.01–0.02 when applied as early as before the beginning of pregnancy. CONCLUSIONS: PPD risk prediction using EHR data may provide a complementary quantitative and objective tool for PPD screening, allowing earlier (pre-pregnancy) and more accurate identification of women at risk, timely interventions and potentially improved outcomes for the mother and child. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12884-021-04087-8. BioMed Central 2021-09-17 /pmc/articles/PMC8447665/ /pubmed/34535116 http://dx.doi.org/10.1186/s12884-021-04087-8 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Amit, Guy
Girshovitz, Irena
Marcus, Karni
Zhang, Yiye
Pathak, Jyotishman
Bar, Vered
Akiva, Pinchas
Estimation of postpartum depression risk from electronic health records using machine learning
title Estimation of postpartum depression risk from electronic health records using machine learning
title_full Estimation of postpartum depression risk from electronic health records using machine learning
title_fullStr Estimation of postpartum depression risk from electronic health records using machine learning
title_full_unstemmed Estimation of postpartum depression risk from electronic health records using machine learning
title_short Estimation of postpartum depression risk from electronic health records using machine learning
title_sort estimation of postpartum depression risk from electronic health records using machine learning
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8447665/
https://www.ncbi.nlm.nih.gov/pubmed/34535116
http://dx.doi.org/10.1186/s12884-021-04087-8
work_keys_str_mv AT amitguy estimationofpostpartumdepressionriskfromelectronichealthrecordsusingmachinelearning
AT girshovitzirena estimationofpostpartumdepressionriskfromelectronichealthrecordsusingmachinelearning
AT marcuskarni estimationofpostpartumdepressionriskfromelectronichealthrecordsusingmachinelearning
AT zhangyiye estimationofpostpartumdepressionriskfromelectronichealthrecordsusingmachinelearning
AT pathakjyotishman estimationofpostpartumdepressionriskfromelectronichealthrecordsusingmachinelearning
AT barvered estimationofpostpartumdepressionriskfromelectronichealthrecordsusingmachinelearning
AT akivapinchas estimationofpostpartumdepressionriskfromelectronichealthrecordsusingmachinelearning