Cargando…

Automated classification of lay health articles using natural language processing: a case study on pregnancy health and postpartum depression

OBJECTIVE: Evidence suggests that high-quality health education and effective communication within the framework of social support hold significant potential in preventing postpartum depression. Yet, developing trustworthy and engaging health education and communication materials requires extensive...

Descripción completa

Detalles Bibliográficos
Autores principales: Patra, Braja Gopal, Sun, Zhaoyi, Cheng, Zilin, Kumar, Praneet Kasi Reddy Jagadeesh, Altammami, Abdullah, Liu, Yiyang, Joly, Rochelle, Jedlicka, Caroline, Delgado, Diana, Pathak, Jyotishman, Peng, Yifan, Zhang, Yiye
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10694448/
http://dx.doi.org/10.3389/fpsyt.2023.1258887
_version_ 1785153379679338496
author Patra, Braja Gopal
Sun, Zhaoyi
Cheng, Zilin
Kumar, Praneet Kasi Reddy Jagadeesh
Altammami, Abdullah
Liu, Yiyang
Joly, Rochelle
Jedlicka, Caroline
Delgado, Diana
Pathak, Jyotishman
Peng, Yifan
Zhang, Yiye
author_facet Patra, Braja Gopal
Sun, Zhaoyi
Cheng, Zilin
Kumar, Praneet Kasi Reddy Jagadeesh
Altammami, Abdullah
Liu, Yiyang
Joly, Rochelle
Jedlicka, Caroline
Delgado, Diana
Pathak, Jyotishman
Peng, Yifan
Zhang, Yiye
author_sort Patra, Braja Gopal
collection PubMed
description OBJECTIVE: Evidence suggests that high-quality health education and effective communication within the framework of social support hold significant potential in preventing postpartum depression. Yet, developing trustworthy and engaging health education and communication materials requires extensive expertise and substantial resources. In light of this, we propose an innovative approach that involves leveraging natural language processing (NLP) to classify publicly accessible lay articles based on their relevance and subject matter to pregnancy and mental health. MATERIALS AND METHODS: We manually reviewed online lay articles from credible and medically validated sources to create a gold standard corpus. This manual review process categorized the articles based on their pertinence to pregnancy and related subtopics. To streamline and expand the classification procedure for relevance and topics, we employed advanced NLP models such as Random Forest, Bidirectional Encoder Representations from Transformers (BERT), and Generative Pre-trained Transformer model (gpt-3.5-turbo). RESULTS: The gold standard corpus included 392 pregnancy-related articles. Our manual review process categorized the reading materials according to lifestyle factors associated with postpartum depression: diet, exercise, mental health, and health literacy. A BERT-based model performed best (F1 = 0.974) in an end-to-end classification of relevance and topics. In a two-step approach, given articles already classified as pregnancy-related, gpt-3.5-turbo performed best (F1 = 0.972) in classifying the above topics. DISCUSSION: Utilizing NLP, we can guide patients to high-quality lay reading materials as cost-effective, readily available health education and communication sources. This approach allows us to scale the information delivery specifically to individuals, enhancing the relevance and impact of the materials provided.
format Online
Article
Text
id pubmed-10694448
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-106944482023-12-05 Automated classification of lay health articles using natural language processing: a case study on pregnancy health and postpartum depression Patra, Braja Gopal Sun, Zhaoyi Cheng, Zilin Kumar, Praneet Kasi Reddy Jagadeesh Altammami, Abdullah Liu, Yiyang Joly, Rochelle Jedlicka, Caroline Delgado, Diana Pathak, Jyotishman Peng, Yifan Zhang, Yiye Front Psychiatry Psychiatry OBJECTIVE: Evidence suggests that high-quality health education and effective communication within the framework of social support hold significant potential in preventing postpartum depression. Yet, developing trustworthy and engaging health education and communication materials requires extensive expertise and substantial resources. In light of this, we propose an innovative approach that involves leveraging natural language processing (NLP) to classify publicly accessible lay articles based on their relevance and subject matter to pregnancy and mental health. MATERIALS AND METHODS: We manually reviewed online lay articles from credible and medically validated sources to create a gold standard corpus. This manual review process categorized the articles based on their pertinence to pregnancy and related subtopics. To streamline and expand the classification procedure for relevance and topics, we employed advanced NLP models such as Random Forest, Bidirectional Encoder Representations from Transformers (BERT), and Generative Pre-trained Transformer model (gpt-3.5-turbo). RESULTS: The gold standard corpus included 392 pregnancy-related articles. Our manual review process categorized the reading materials according to lifestyle factors associated with postpartum depression: diet, exercise, mental health, and health literacy. A BERT-based model performed best (F1 = 0.974) in an end-to-end classification of relevance and topics. In a two-step approach, given articles already classified as pregnancy-related, gpt-3.5-turbo performed best (F1 = 0.972) in classifying the above topics. DISCUSSION: Utilizing NLP, we can guide patients to high-quality lay reading materials as cost-effective, readily available health education and communication sources. This approach allows us to scale the information delivery specifically to individuals, enhancing the relevance and impact of the materials provided. Frontiers Media S.A. 2023-11-20 /pmc/articles/PMC10694448/ http://dx.doi.org/10.3389/fpsyt.2023.1258887 Text en Copyright © 2023 Patra, Sun, Cheng, Kumar, Altammami, Liu, Joly, Jedlicka, Delgado, Pathak, Peng and Zhang. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Psychiatry
Patra, Braja Gopal
Sun, Zhaoyi
Cheng, Zilin
Kumar, Praneet Kasi Reddy Jagadeesh
Altammami, Abdullah
Liu, Yiyang
Joly, Rochelle
Jedlicka, Caroline
Delgado, Diana
Pathak, Jyotishman
Peng, Yifan
Zhang, Yiye
Automated classification of lay health articles using natural language processing: a case study on pregnancy health and postpartum depression
title Automated classification of lay health articles using natural language processing: a case study on pregnancy health and postpartum depression
title_full Automated classification of lay health articles using natural language processing: a case study on pregnancy health and postpartum depression
title_fullStr Automated classification of lay health articles using natural language processing: a case study on pregnancy health and postpartum depression
title_full_unstemmed Automated classification of lay health articles using natural language processing: a case study on pregnancy health and postpartum depression
title_short Automated classification of lay health articles using natural language processing: a case study on pregnancy health and postpartum depression
title_sort automated classification of lay health articles using natural language processing: a case study on pregnancy health and postpartum depression
topic Psychiatry
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10694448/
http://dx.doi.org/10.3389/fpsyt.2023.1258887
work_keys_str_mv AT patrabrajagopal automatedclassificationoflayhealtharticlesusingnaturallanguageprocessingacasestudyonpregnancyhealthandpostpartumdepression
AT sunzhaoyi automatedclassificationoflayhealtharticlesusingnaturallanguageprocessingacasestudyonpregnancyhealthandpostpartumdepression
AT chengzilin automatedclassificationoflayhealtharticlesusingnaturallanguageprocessingacasestudyonpregnancyhealthandpostpartumdepression
AT kumarpraneetkasireddyjagadeesh automatedclassificationoflayhealtharticlesusingnaturallanguageprocessingacasestudyonpregnancyhealthandpostpartumdepression
AT altammamiabdullah automatedclassificationoflayhealtharticlesusingnaturallanguageprocessingacasestudyonpregnancyhealthandpostpartumdepression
AT liuyiyang automatedclassificationoflayhealtharticlesusingnaturallanguageprocessingacasestudyonpregnancyhealthandpostpartumdepression
AT jolyrochelle automatedclassificationoflayhealtharticlesusingnaturallanguageprocessingacasestudyonpregnancyhealthandpostpartumdepression
AT jedlickacaroline automatedclassificationoflayhealtharticlesusingnaturallanguageprocessingacasestudyonpregnancyhealthandpostpartumdepression
AT delgadodiana automatedclassificationoflayhealtharticlesusingnaturallanguageprocessingacasestudyonpregnancyhealthandpostpartumdepression
AT pathakjyotishman automatedclassificationoflayhealtharticlesusingnaturallanguageprocessingacasestudyonpregnancyhealthandpostpartumdepression
AT pengyifan automatedclassificationoflayhealtharticlesusingnaturallanguageprocessingacasestudyonpregnancyhealthandpostpartumdepression
AT zhangyiye automatedclassificationoflayhealtharticlesusingnaturallanguageprocessingacasestudyonpregnancyhealthandpostpartumdepression