Cargando…

Predicting congenital syphilis cases: A performance evaluation of different machine learning models

BACKGROUND: Communicable diseases represent a huge economic burden for healthcare systems and for society. Sexually transmitted infections (STIs) are a concerning issue, especially in developing and underdeveloped countries, in which environmental factors and other determinants of health play a role...

Descripción completa

Detalles Bibliográficos
Autores principales: Teixeira, Igor Vitor, da Silva Leite, Morgana Thalita, de Morais Melo, Flávio Leandro, da Silva Rocha, Élisson, Sadok, Sara, Pessoa da Costa Carrarine, Ana Sofia, Santana, Marília, Pinheiro Rodrigues, Cristina, de Lima Oliveira, Ana Maria, Vieira Gadelha, Keduly, de Morais, Cleber Matos, Kelner, Judith, Endo, Patricia Takako
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10237406/
https://www.ncbi.nlm.nih.gov/pubmed/37267293
http://dx.doi.org/10.1371/journal.pone.0276150
_version_ 1785053150568251392
author Teixeira, Igor Vitor
da Silva Leite, Morgana Thalita
de Morais Melo, Flávio Leandro
da Silva Rocha, Élisson
Sadok, Sara
Pessoa da Costa Carrarine, Ana Sofia
Santana, Marília
Pinheiro Rodrigues, Cristina
de Lima Oliveira, Ana Maria
Vieira Gadelha, Keduly
de Morais, Cleber Matos
Kelner, Judith
Endo, Patricia Takako
author_facet Teixeira, Igor Vitor
da Silva Leite, Morgana Thalita
de Morais Melo, Flávio Leandro
da Silva Rocha, Élisson
Sadok, Sara
Pessoa da Costa Carrarine, Ana Sofia
Santana, Marília
Pinheiro Rodrigues, Cristina
de Lima Oliveira, Ana Maria
Vieira Gadelha, Keduly
de Morais, Cleber Matos
Kelner, Judith
Endo, Patricia Takako
author_sort Teixeira, Igor Vitor
collection PubMed
description BACKGROUND: Communicable diseases represent a huge economic burden for healthcare systems and for society. Sexually transmitted infections (STIs) are a concerning issue, especially in developing and underdeveloped countries, in which environmental factors and other determinants of health play a role in contributing to its fast spread. In light of this situation, machine learning techniques have been explored to assess the incidence of syphilis and contribute to the epidemiological surveillance in this scenario. OBJECTIVE: The main goal of this work is to evaluate the performance of different machine learning models on predicting undesirable outcomes of congenital syphilis in order to assist resources allocation and optimize the healthcare actions, especially in a constrained health environment. METHOD: We use clinical and sociodemographic data from pregnant women that were assisted by a social program in Pernambuco, Brazil, named Mãe Coruja Pernambucana Program (PMCP). Based on a rigorous methodology, we propose six experiments using three feature selection techniques to select the most relevant attributes, pre-process and clean the data, apply hyperparameter optimization to tune the machine learning models, and train and test models to have a fair evaluation and discussion. RESULTS: The AdaBoost-BODS-Expert model, an Adaptive Boosting (AdaBoost) model that used attributes selected by health experts, presented the best results in terms of evaluation metrics and acceptance by health experts from PMCP. By using this model, the results are more reliable and allows adoption on a daily usage to classify possible outcomes of congenital syphilis using clinical and sociodemographic data.
format Online
Article
Text
id pubmed-10237406
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-102374062023-06-03 Predicting congenital syphilis cases: A performance evaluation of different machine learning models Teixeira, Igor Vitor da Silva Leite, Morgana Thalita de Morais Melo, Flávio Leandro da Silva Rocha, Élisson Sadok, Sara Pessoa da Costa Carrarine, Ana Sofia Santana, Marília Pinheiro Rodrigues, Cristina de Lima Oliveira, Ana Maria Vieira Gadelha, Keduly de Morais, Cleber Matos Kelner, Judith Endo, Patricia Takako PLoS One Research Article BACKGROUND: Communicable diseases represent a huge economic burden for healthcare systems and for society. Sexually transmitted infections (STIs) are a concerning issue, especially in developing and underdeveloped countries, in which environmental factors and other determinants of health play a role in contributing to its fast spread. In light of this situation, machine learning techniques have been explored to assess the incidence of syphilis and contribute to the epidemiological surveillance in this scenario. OBJECTIVE: The main goal of this work is to evaluate the performance of different machine learning models on predicting undesirable outcomes of congenital syphilis in order to assist resources allocation and optimize the healthcare actions, especially in a constrained health environment. METHOD: We use clinical and sociodemographic data from pregnant women that were assisted by a social program in Pernambuco, Brazil, named Mãe Coruja Pernambucana Program (PMCP). Based on a rigorous methodology, we propose six experiments using three feature selection techniques to select the most relevant attributes, pre-process and clean the data, apply hyperparameter optimization to tune the machine learning models, and train and test models to have a fair evaluation and discussion. RESULTS: The AdaBoost-BODS-Expert model, an Adaptive Boosting (AdaBoost) model that used attributes selected by health experts, presented the best results in terms of evaluation metrics and acceptance by health experts from PMCP. By using this model, the results are more reliable and allows adoption on a daily usage to classify possible outcomes of congenital syphilis using clinical and sociodemographic data. Public Library of Science 2023-06-02 /pmc/articles/PMC10237406/ /pubmed/37267293 http://dx.doi.org/10.1371/journal.pone.0276150 Text en © 2023 Teixeira et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Teixeira, Igor Vitor
da Silva Leite, Morgana Thalita
de Morais Melo, Flávio Leandro
da Silva Rocha, Élisson
Sadok, Sara
Pessoa da Costa Carrarine, Ana Sofia
Santana, Marília
Pinheiro Rodrigues, Cristina
de Lima Oliveira, Ana Maria
Vieira Gadelha, Keduly
de Morais, Cleber Matos
Kelner, Judith
Endo, Patricia Takako
Predicting congenital syphilis cases: A performance evaluation of different machine learning models
title Predicting congenital syphilis cases: A performance evaluation of different machine learning models
title_full Predicting congenital syphilis cases: A performance evaluation of different machine learning models
title_fullStr Predicting congenital syphilis cases: A performance evaluation of different machine learning models
title_full_unstemmed Predicting congenital syphilis cases: A performance evaluation of different machine learning models
title_short Predicting congenital syphilis cases: A performance evaluation of different machine learning models
title_sort predicting congenital syphilis cases: a performance evaluation of different machine learning models
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10237406/
https://www.ncbi.nlm.nih.gov/pubmed/37267293
http://dx.doi.org/10.1371/journal.pone.0276150
work_keys_str_mv AT teixeiraigorvitor predictingcongenitalsyphiliscasesaperformanceevaluationofdifferentmachinelearningmodels
AT dasilvaleitemorganathalita predictingcongenitalsyphiliscasesaperformanceevaluationofdifferentmachinelearningmodels
AT demoraismeloflavioleandro predictingcongenitalsyphiliscasesaperformanceevaluationofdifferentmachinelearningmodels
AT dasilvarochaelisson predictingcongenitalsyphiliscasesaperformanceevaluationofdifferentmachinelearningmodels
AT sadoksara predictingcongenitalsyphiliscasesaperformanceevaluationofdifferentmachinelearningmodels
AT pessoadacostacarrarineanasofia predictingcongenitalsyphiliscasesaperformanceevaluationofdifferentmachinelearningmodels
AT santanamarilia predictingcongenitalsyphiliscasesaperformanceevaluationofdifferentmachinelearningmodels
AT pinheirorodriguescristina predictingcongenitalsyphiliscasesaperformanceevaluationofdifferentmachinelearningmodels
AT delimaoliveiraanamaria predictingcongenitalsyphiliscasesaperformanceevaluationofdifferentmachinelearningmodels
AT vieiragadelhakeduly predictingcongenitalsyphiliscasesaperformanceevaluationofdifferentmachinelearningmodels
AT demoraisclebermatos predictingcongenitalsyphiliscasesaperformanceevaluationofdifferentmachinelearningmodels
AT kelnerjudith predictingcongenitalsyphiliscasesaperformanceevaluationofdifferentmachinelearningmodels
AT endopatriciatakako predictingcongenitalsyphiliscasesaperformanceevaluationofdifferentmachinelearningmodels