Cargando…

Machine Learning Algorithm for Distinguishing Ductal Carcinoma In Situ from Invasive Breast Cancer

SIMPLE SUMMARY: Breast cancer nowadays is the most common cancer among women. Two types refer to whether cancer has spread or not: Non-invasive and invasive breast cancers. Invasive ductal carcinoma is responsible for approximately 80% of all breast cancers, and ductal carcinoma in situ accounts for...

Descripción completa

Detalles Bibliográficos
Autores principales: Vy, Vu Pham Thao, Yao, Melissa Min-Szu, Khanh Le, Nguyen Quoc, Chan, Wing P.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9139618/
https://www.ncbi.nlm.nih.gov/pubmed/35626043
http://dx.doi.org/10.3390/cancers14102437
_version_ 1784714901488402432
author Vy, Vu Pham Thao
Yao, Melissa Min-Szu
Khanh Le, Nguyen Quoc
Chan, Wing P.
author_facet Vy, Vu Pham Thao
Yao, Melissa Min-Szu
Khanh Le, Nguyen Quoc
Chan, Wing P.
author_sort Vy, Vu Pham Thao
collection PubMed
description SIMPLE SUMMARY: Breast cancer nowadays is the most common cancer among women. Two types refer to whether cancer has spread or not: Non-invasive and invasive breast cancers. Invasive ductal carcinoma is responsible for approximately 80% of all breast cancers, and ductal carcinoma in situ accounts for the majority of the remainder. Early identification of types of breast cancers provides breast cancer patients with more options for less invasive therapy. Our study aimed to develop a machine-learning classification model to differentiate ductal carcinoma in situ and minimally invasive breast cancer using clinical characteristics, mammography findings, ultrasound findings, and histopathology features. Our model showed that the five most important features were calcifications on mammograms, lymph node presence, microcalcifications on histopathology, the shape of the mass on ultrasound, and the orientation of the mass on ultrasound. ABSTRACT: Purpose: Given that early identification of breast cancer type allows for less-invasive therapies, we aimed to develop a machine learning model to discriminate between ductal carcinoma in situ (DCIS) and minimally invasive breast cancer (MIBC). Methods: In this retrospective study, the health records of 420 women who underwent biopsies between 2010 and 2020 to confirm breast cancer were collected. A trained XGBoost algorithm was used to classify cancers as either DCIS or MIBC using clinical characteristics, mammographic findings, ultrasonographic findings, and histopathological features. Its performance was measured against other methods using area under the receiver operating characteristic curve (AUC), sensitivity, specificity, accuracy, precision, and F1 score. Results: The model was trained using 357 women and tested using 63 women with an overall 420 patients (mean [standard deviation] age, 57.1 [12.0] years). The model performed well when feature importance was determined, reaching an accuracy of 0.84 (95% confidence interval [CI], 0.76–0.91), an AUC of 0.93 (95% CI, 0.87–0.95), a specificity of 0.75 (95% CI, 0.67–0.83), and a sensitivity of 0.91 (95% CI, 0.76–0.94). Conclusion: The XGBoost model, combining clinical, mammographic, ultrasonographic, and histopathologic findings, can be used to discriminate DCIS from MIBC with an accuracy equivalent to that of experienced radiologists, thereby giving patients the widest range of therapeutic options.
format Online
Article
Text
id pubmed-9139618
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-91396182022-05-28 Machine Learning Algorithm for Distinguishing Ductal Carcinoma In Situ from Invasive Breast Cancer Vy, Vu Pham Thao Yao, Melissa Min-Szu Khanh Le, Nguyen Quoc Chan, Wing P. Cancers (Basel) Article SIMPLE SUMMARY: Breast cancer nowadays is the most common cancer among women. Two types refer to whether cancer has spread or not: Non-invasive and invasive breast cancers. Invasive ductal carcinoma is responsible for approximately 80% of all breast cancers, and ductal carcinoma in situ accounts for the majority of the remainder. Early identification of types of breast cancers provides breast cancer patients with more options for less invasive therapy. Our study aimed to develop a machine-learning classification model to differentiate ductal carcinoma in situ and minimally invasive breast cancer using clinical characteristics, mammography findings, ultrasound findings, and histopathology features. Our model showed that the five most important features were calcifications on mammograms, lymph node presence, microcalcifications on histopathology, the shape of the mass on ultrasound, and the orientation of the mass on ultrasound. ABSTRACT: Purpose: Given that early identification of breast cancer type allows for less-invasive therapies, we aimed to develop a machine learning model to discriminate between ductal carcinoma in situ (DCIS) and minimally invasive breast cancer (MIBC). Methods: In this retrospective study, the health records of 420 women who underwent biopsies between 2010 and 2020 to confirm breast cancer were collected. A trained XGBoost algorithm was used to classify cancers as either DCIS or MIBC using clinical characteristics, mammographic findings, ultrasonographic findings, and histopathological features. Its performance was measured against other methods using area under the receiver operating characteristic curve (AUC), sensitivity, specificity, accuracy, precision, and F1 score. Results: The model was trained using 357 women and tested using 63 women with an overall 420 patients (mean [standard deviation] age, 57.1 [12.0] years). The model performed well when feature importance was determined, reaching an accuracy of 0.84 (95% confidence interval [CI], 0.76–0.91), an AUC of 0.93 (95% CI, 0.87–0.95), a specificity of 0.75 (95% CI, 0.67–0.83), and a sensitivity of 0.91 (95% CI, 0.76–0.94). Conclusion: The XGBoost model, combining clinical, mammographic, ultrasonographic, and histopathologic findings, can be used to discriminate DCIS from MIBC with an accuracy equivalent to that of experienced radiologists, thereby giving patients the widest range of therapeutic options. MDPI 2022-05-15 /pmc/articles/PMC9139618/ /pubmed/35626043 http://dx.doi.org/10.3390/cancers14102437 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Vy, Vu Pham Thao
Yao, Melissa Min-Szu
Khanh Le, Nguyen Quoc
Chan, Wing P.
Machine Learning Algorithm for Distinguishing Ductal Carcinoma In Situ from Invasive Breast Cancer
title Machine Learning Algorithm for Distinguishing Ductal Carcinoma In Situ from Invasive Breast Cancer
title_full Machine Learning Algorithm for Distinguishing Ductal Carcinoma In Situ from Invasive Breast Cancer
title_fullStr Machine Learning Algorithm for Distinguishing Ductal Carcinoma In Situ from Invasive Breast Cancer
title_full_unstemmed Machine Learning Algorithm for Distinguishing Ductal Carcinoma In Situ from Invasive Breast Cancer
title_short Machine Learning Algorithm for Distinguishing Ductal Carcinoma In Situ from Invasive Breast Cancer
title_sort machine learning algorithm for distinguishing ductal carcinoma in situ from invasive breast cancer
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9139618/
https://www.ncbi.nlm.nih.gov/pubmed/35626043
http://dx.doi.org/10.3390/cancers14102437
work_keys_str_mv AT vyvuphamthao machinelearningalgorithmfordistinguishingductalcarcinomainsitufrominvasivebreastcancer
AT yaomelissaminszu machinelearningalgorithmfordistinguishingductalcarcinomainsitufrominvasivebreastcancer
AT khanhlenguyenquoc machinelearningalgorithmfordistinguishingductalcarcinomainsitufrominvasivebreastcancer
AT chanwingp machinelearningalgorithmfordistinguishingductalcarcinomainsitufrominvasivebreastcancer