Cargando…

Data Driven for Early Breast Cancer Staging using Integrated Mammography and Biopsy

OBJECTIVE: Breast cancer patients who have a rapid diagnosis have been better prognosis than late diagnosis. The popular screening is mammogram or ultrasound. In recent years, researchers try to develop data driven models to predict early cancer staging from the first screening. However, data elemen...

Descripción completa

Detalles Bibliográficos
Autores principales: Yampaka, Tongjai, Noolek, Duangjai
Formato: Online Artículo Texto
Lenguaje:English
Publicado: West Asia Organization for Cancer Prevention 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9080351/
https://www.ncbi.nlm.nih.gov/pubmed/34967591
http://dx.doi.org/10.31557/APJCP.2021.22.12.4069
_version_ 1784702765650411520
author Yampaka, Tongjai
Noolek, Duangjai
author_facet Yampaka, Tongjai
Noolek, Duangjai
author_sort Yampaka, Tongjai
collection PubMed
description OBJECTIVE: Breast cancer patients who have a rapid diagnosis have been better prognosis than late diagnosis. The popular screening is mammogram or ultrasound. In recent years, researchers try to develop data driven models to predict early cancer staging from the first screening. However, data elements are not complete such as lymph node status. Therefore, the Integrated dataset approach will be challenging. METHODS: Because the data elements are not collected from the same source, joining between mammography and biopsy data were performed using latent variables that determine by tumor severity. The datasets consist of 445 mammography reports and 183 pathological reports. The latent variables of the mammogram dataset were determined by the severity of mass, while latent variables of the pathological dataset were determined by TNM Staging. The latent variables were used to join between two datasets. Then, the prediction models were built using the machine learning technique. The modeling is divided into three steps; staging prediction, lymph node prediction, and prognosis. RESULTS: Integrated dataset from mammography and biopsy extend more factors and built the models to predict breast cancer staging in the mammography process. The staging prediction is 100% accuracy. The lymph node prediction are 72.47% accuracy, 73.94% specificity, and 72.5% sensitivity. An area under ROC curve is 0.74. The prognosis model prediction are 72.72% accuracy, 80% specificity, and 77% sensitivity. An area under ROC curve is 0.87. There are also built the rule for early staging, diagnosis, and prognosis. CONCLUSION: This study aims to build the models for early staging, diagnosis, and prognosis using the less aggressive method. The advantages are (1) predict staging from the first screening (2) estimate the lymph node metastases for planning to ALND or SLNB (3) evaluate overall survival time. These advantages help the physician planning the best treatment for cancer patients.
format Online
Article
Text
id pubmed-9080351
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher West Asia Organization for Cancer Prevention
record_format MEDLINE/PubMed
spelling pubmed-90803512022-07-06 Data Driven for Early Breast Cancer Staging using Integrated Mammography and Biopsy Yampaka, Tongjai Noolek, Duangjai Asian Pac J Cancer Prev Research Article OBJECTIVE: Breast cancer patients who have a rapid diagnosis have been better prognosis than late diagnosis. The popular screening is mammogram or ultrasound. In recent years, researchers try to develop data driven models to predict early cancer staging from the first screening. However, data elements are not complete such as lymph node status. Therefore, the Integrated dataset approach will be challenging. METHODS: Because the data elements are not collected from the same source, joining between mammography and biopsy data were performed using latent variables that determine by tumor severity. The datasets consist of 445 mammography reports and 183 pathological reports. The latent variables of the mammogram dataset were determined by the severity of mass, while latent variables of the pathological dataset were determined by TNM Staging. The latent variables were used to join between two datasets. Then, the prediction models were built using the machine learning technique. The modeling is divided into three steps; staging prediction, lymph node prediction, and prognosis. RESULTS: Integrated dataset from mammography and biopsy extend more factors and built the models to predict breast cancer staging in the mammography process. The staging prediction is 100% accuracy. The lymph node prediction are 72.47% accuracy, 73.94% specificity, and 72.5% sensitivity. An area under ROC curve is 0.74. The prognosis model prediction are 72.72% accuracy, 80% specificity, and 77% sensitivity. An area under ROC curve is 0.87. There are also built the rule for early staging, diagnosis, and prognosis. CONCLUSION: This study aims to build the models for early staging, diagnosis, and prognosis using the less aggressive method. The advantages are (1) predict staging from the first screening (2) estimate the lymph node metastases for planning to ALND or SLNB (3) evaluate overall survival time. These advantages help the physician planning the best treatment for cancer patients. West Asia Organization for Cancer Prevention 2021-12 /pmc/articles/PMC9080351/ /pubmed/34967591 http://dx.doi.org/10.31557/APJCP.2021.22.12.4069 Text en https://creativecommons.org/licenses/by-nc/4.0/This work is licensed under a Creative Commons Attribution-Non Commercial 4.0 International License. https://creativecommons.org/licenses/by-nc/4.0/
spellingShingle Research Article
Yampaka, Tongjai
Noolek, Duangjai
Data Driven for Early Breast Cancer Staging using Integrated Mammography and Biopsy
title Data Driven for Early Breast Cancer Staging using Integrated Mammography and Biopsy
title_full Data Driven for Early Breast Cancer Staging using Integrated Mammography and Biopsy
title_fullStr Data Driven for Early Breast Cancer Staging using Integrated Mammography and Biopsy
title_full_unstemmed Data Driven for Early Breast Cancer Staging using Integrated Mammography and Biopsy
title_short Data Driven for Early Breast Cancer Staging using Integrated Mammography and Biopsy
title_sort data driven for early breast cancer staging using integrated mammography and biopsy
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9080351/
https://www.ncbi.nlm.nih.gov/pubmed/34967591
http://dx.doi.org/10.31557/APJCP.2021.22.12.4069
work_keys_str_mv AT yampakatongjai datadrivenforearlybreastcancerstagingusingintegratedmammographyandbiopsy
AT noolekduangjai datadrivenforearlybreastcancerstagingusingintegratedmammographyandbiopsy