An Integrated Machine Learning Scheme for Predicting Mammographic Anomalies in High-Risk Individuals Using Questionnaire-Based Predictors

This study aimed to investigate the important predictors related to predicting positive mammographic findings based on questionnaire-based demographic and obstetric/gynecological parameters using the proposed integrated machine learning (ML) scheme. The scheme combines the benefits of two well-known...

Bibliographic Details
Main authors: Sun, Cheuk-Kay; Tang, Yun-Xuan; Liu, Tzu-Chi; Lu, Chi-Jie
Format: Online Article Text
Language: English
Published: MDPI 2022
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9368335/
https://www.ncbi.nlm.nih.gov/pubmed/35955112
http://dx.doi.org/10.3390/ijerph19159756
author Sun, Cheuk-Kay
Tang, Yun-Xuan
Liu, Tzu-Chi
Lu, Chi-Jie
collection PubMed
description This study aimed to investigate the important predictors related to predicting positive mammographic findings based on questionnaire-based demographic and obstetric/gynecological parameters using the proposed integrated machine learning (ML) scheme. The scheme combines the benefits of two well-known ML algorithms, namely, least absolute shrinkage and selection operator (Lasso) logistic regression and extreme gradient boosting (XGB), to provide adequate prediction for mammographic anomalies in high-risk individuals and the identification of significant risk factors. We collected questionnaire data on 18 breast-cancer-related risk factors from women who participated in a national mammographic screening program between January 2017 and December 2020 at a single tertiary referral hospital to correlate with their mammographic findings. The acquired data were retrospectively analyzed using the proposed integrated ML scheme. Based on the data from 21,107 valid questionnaires, the results showed that the Lasso logistic regression models with variable combinations generated by XGB could provide more effective prediction results. The top five significant predictors for positive mammography results were younger age, breast self-examination, older age at first childbirth, nulliparity, and history of mammography within 2 years, suggesting a need for timely mammographic screening for women with these risk factors.
format Online
Article
Text
id pubmed-9368335
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-9368335 2022-08-12 An Integrated Machine Learning Scheme for Predicting Mammographic Anomalies in High-Risk Individuals Using Questionnaire-Based Predictors Sun, Cheuk-Kay Tang, Yun-Xuan Liu, Tzu-Chi Lu, Chi-Jie Int J Environ Res Public Health Article This study aimed to investigate the important predictors related to predicting positive mammographic findings based on questionnaire-based demographic and obstetric/gynecological parameters using the proposed integrated machine learning (ML) scheme. The scheme combines the benefits of two well-known ML algorithms, namely, least absolute shrinkage and selection operator (Lasso) logistic regression and extreme gradient boosting (XGB), to provide adequate prediction for mammographic anomalies in high-risk individuals and the identification of significant risk factors. We collected questionnaire data on 18 breast-cancer-related risk factors from women who participated in a national mammographic screening program between January 2017 and December 2020 at a single tertiary referral hospital to correlate with their mammographic findings. The acquired data were retrospectively analyzed using the proposed integrated ML scheme. Based on the data from 21,107 valid questionnaires, the results showed that the Lasso logistic regression models with variable combinations generated by XGB could provide more effective prediction results. The top five significant predictors for positive mammography results were younger age, breast self-examination, older age at first childbirth, nulliparity, and history of mammography within 2 years, suggesting a need for timely mammographic screening for women with these risk factors. MDPI 2022-08-08 /pmc/articles/PMC9368335/ /pubmed/35955112 http://dx.doi.org/10.3390/ijerph19159756 Text en © 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
title An Integrated Machine Learning Scheme for Predicting Mammographic Anomalies in High-Risk Individuals Using Questionnaire-Based Predictors
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9368335/
https://www.ncbi.nlm.nih.gov/pubmed/35955112
http://dx.doi.org/10.3390/ijerph19159756