Cargando…
Improving the robustness and stability of a machine learning model for breast cancer prognosis through the use of multi-modal classifiers
Breast cancer is a deadly disease with a high mortality rate among PAN cancers. The advancements in biomedical information retrieval techniques have been beneficial in developing early prognosis and diagnosis systems for cancer patients. These systems provide the oncologist with plenty of informatio...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10008603/ https://www.ncbi.nlm.nih.gov/pubmed/36906618 http://dx.doi.org/10.1038/s41598-023-30143-8 |
_version_ | 1784905794108522496 |
---|---|
author | Arya, Nikhilanand Saha, Sriparna Mathur, Archana Saha, Snehanshu |
author_facet | Arya, Nikhilanand Saha, Sriparna Mathur, Archana Saha, Snehanshu |
author_sort | Arya, Nikhilanand |
collection | PubMed |
description | Breast cancer is a deadly disease with a high mortality rate among PAN cancers. The advancements in biomedical information retrieval techniques have been beneficial in developing early prognosis and diagnosis systems for cancer patients. These systems provide the oncologist with plenty of information from several modalities to make the correct and feasible treatment plan for breast cancer patients and protect them from unnecessary therapies and their toxic side effects. The cancer patient’s related information can be collected using various modalities like clinical, copy number variation, DNA-methylation, microRNA sequencing, gene expression, and histopathological whole slide images. High dimensionality and heterogeneity in these modalities demand the development of some intelligent systems to understand related features to the prognosis and diagnosis of diseases and make correct predictions. In this work, we have studied some end-to-end systems having two main components : (a) dimensionality reduction techniques applied to original features from different modalities and (b) classification techniques applied to the fusion of reduced feature vectors from different modalities for automatic predictions of breast cancer patients into two categories: short-time and long-time survivors. Principal component analysis (PCA) and variational auto-encoders (VAEs) are used as the dimensionality reduction techniques, followed by support vector machines (SVM) or random forest as the machine learning classifiers. The study utilizes raw, PCA, and VAE extracted features of the TCGA-BRCA dataset from six different modalities as input to the machine learning classifiers. We conclude this study by suggesting that adding more modalities to the classifiers provides complementary information to the classifier and increases the stability and robustness of the classifiers. In this study, the multimodal classifiers have not been validated on primary data prospectively. |
format | Online Article Text |
id | pubmed-10008603 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-100086032023-03-13 Improving the robustness and stability of a machine learning model for breast cancer prognosis through the use of multi-modal classifiers Arya, Nikhilanand Saha, Sriparna Mathur, Archana Saha, Snehanshu Sci Rep Article Breast cancer is a deadly disease with a high mortality rate among PAN cancers. The advancements in biomedical information retrieval techniques have been beneficial in developing early prognosis and diagnosis systems for cancer patients. These systems provide the oncologist with plenty of information from several modalities to make the correct and feasible treatment plan for breast cancer patients and protect them from unnecessary therapies and their toxic side effects. The cancer patient’s related information can be collected using various modalities like clinical, copy number variation, DNA-methylation, microRNA sequencing, gene expression, and histopathological whole slide images. High dimensionality and heterogeneity in these modalities demand the development of some intelligent systems to understand related features to the prognosis and diagnosis of diseases and make correct predictions. In this work, we have studied some end-to-end systems having two main components : (a) dimensionality reduction techniques applied to original features from different modalities and (b) classification techniques applied to the fusion of reduced feature vectors from different modalities for automatic predictions of breast cancer patients into two categories: short-time and long-time survivors. Principal component analysis (PCA) and variational auto-encoders (VAEs) are used as the dimensionality reduction techniques, followed by support vector machines (SVM) or random forest as the machine learning classifiers. The study utilizes raw, PCA, and VAE extracted features of the TCGA-BRCA dataset from six different modalities as input to the machine learning classifiers. We conclude this study by suggesting that adding more modalities to the classifiers provides complementary information to the classifier and increases the stability and robustness of the classifiers. In this study, the multimodal classifiers have not been validated on primary data prospectively. Nature Publishing Group UK 2023-03-11 /pmc/articles/PMC10008603/ /pubmed/36906618 http://dx.doi.org/10.1038/s41598-023-30143-8 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Arya, Nikhilanand Saha, Sriparna Mathur, Archana Saha, Snehanshu Improving the robustness and stability of a machine learning model for breast cancer prognosis through the use of multi-modal classifiers |
title | Improving the robustness and stability of a machine learning model for breast cancer prognosis through the use of multi-modal classifiers |
title_full | Improving the robustness and stability of a machine learning model for breast cancer prognosis through the use of multi-modal classifiers |
title_fullStr | Improving the robustness and stability of a machine learning model for breast cancer prognosis through the use of multi-modal classifiers |
title_full_unstemmed | Improving the robustness and stability of a machine learning model for breast cancer prognosis through the use of multi-modal classifiers |
title_short | Improving the robustness and stability of a machine learning model for breast cancer prognosis through the use of multi-modal classifiers |
title_sort | improving the robustness and stability of a machine learning model for breast cancer prognosis through the use of multi-modal classifiers |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10008603/ https://www.ncbi.nlm.nih.gov/pubmed/36906618 http://dx.doi.org/10.1038/s41598-023-30143-8 |
work_keys_str_mv | AT aryanikhilanand improvingtherobustnessandstabilityofamachinelearningmodelforbreastcancerprognosisthroughtheuseofmultimodalclassifiers AT sahasriparna improvingtherobustnessandstabilityofamachinelearningmodelforbreastcancerprognosisthroughtheuseofmultimodalclassifiers AT mathurarchana improvingtherobustnessandstabilityofamachinelearningmodelforbreastcancerprognosisthroughtheuseofmultimodalclassifiers AT sahasnehanshu improvingtherobustnessandstabilityofamachinelearningmodelforbreastcancerprognosisthroughtheuseofmultimodalclassifiers |