Cargando…

Improving the robustness and stability of a machine learning model for breast cancer prognosis through the use of multi-modal classifiers

Breast cancer is a deadly disease with a high mortality rate among PAN cancers. The advancements in biomedical information retrieval techniques have been beneficial in developing early prognosis and diagnosis systems for cancer patients. These systems provide the oncologist with plenty of informatio...

Descripción completa

Detalles Bibliográficos
Autores principales: Arya, Nikhilanand, Saha, Sriparna, Mathur, Archana, Saha, Snehanshu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10008603/
https://www.ncbi.nlm.nih.gov/pubmed/36906618
http://dx.doi.org/10.1038/s41598-023-30143-8
_version_ 1784905794108522496
author Arya, Nikhilanand
Saha, Sriparna
Mathur, Archana
Saha, Snehanshu
author_facet Arya, Nikhilanand
Saha, Sriparna
Mathur, Archana
Saha, Snehanshu
author_sort Arya, Nikhilanand
collection PubMed
description Breast cancer is a deadly disease with a high mortality rate among PAN cancers. The advancements in biomedical information retrieval techniques have been beneficial in developing early prognosis and diagnosis systems for cancer patients. These systems provide the oncologist with plenty of information from several modalities to make the correct and feasible treatment plan for breast cancer patients and protect them from unnecessary therapies and their toxic side effects. The cancer patient’s related information can be collected using various modalities like clinical, copy number variation, DNA-methylation, microRNA sequencing, gene expression, and histopathological whole slide images. High dimensionality and heterogeneity in these modalities demand the development of some intelligent systems to understand related features to the prognosis and diagnosis of diseases and make correct predictions. In this work, we have studied some end-to-end systems having two main components : (a) dimensionality reduction techniques applied to original features from different modalities and (b) classification techniques applied to the fusion of reduced feature vectors from different modalities for automatic predictions of breast cancer patients into two categories: short-time and long-time survivors. Principal component analysis (PCA) and variational auto-encoders (VAEs) are used as the dimensionality reduction techniques, followed by support vector machines (SVM) or random forest as the machine learning classifiers. The study utilizes raw, PCA, and VAE extracted features of the TCGA-BRCA dataset from six different modalities as input to the machine learning classifiers. We conclude this study by suggesting that adding more modalities to the classifiers provides complementary information to the classifier and increases the stability and robustness of the classifiers. In this study, the multimodal classifiers have not been validated on primary data prospectively.
format Online
Article
Text
id pubmed-10008603
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-100086032023-03-13 Improving the robustness and stability of a machine learning model for breast cancer prognosis through the use of multi-modal classifiers Arya, Nikhilanand Saha, Sriparna Mathur, Archana Saha, Snehanshu Sci Rep Article Breast cancer is a deadly disease with a high mortality rate among PAN cancers. The advancements in biomedical information retrieval techniques have been beneficial in developing early prognosis and diagnosis systems for cancer patients. These systems provide the oncologist with plenty of information from several modalities to make the correct and feasible treatment plan for breast cancer patients and protect them from unnecessary therapies and their toxic side effects. The cancer patient’s related information can be collected using various modalities like clinical, copy number variation, DNA-methylation, microRNA sequencing, gene expression, and histopathological whole slide images. High dimensionality and heterogeneity in these modalities demand the development of some intelligent systems to understand related features to the prognosis and diagnosis of diseases and make correct predictions. In this work, we have studied some end-to-end systems having two main components : (a) dimensionality reduction techniques applied to original features from different modalities and (b) classification techniques applied to the fusion of reduced feature vectors from different modalities for automatic predictions of breast cancer patients into two categories: short-time and long-time survivors. Principal component analysis (PCA) and variational auto-encoders (VAEs) are used as the dimensionality reduction techniques, followed by support vector machines (SVM) or random forest as the machine learning classifiers. The study utilizes raw, PCA, and VAE extracted features of the TCGA-BRCA dataset from six different modalities as input to the machine learning classifiers. We conclude this study by suggesting that adding more modalities to the classifiers provides complementary information to the classifier and increases the stability and robustness of the classifiers. In this study, the multimodal classifiers have not been validated on primary data prospectively. Nature Publishing Group UK 2023-03-11 /pmc/articles/PMC10008603/ /pubmed/36906618 http://dx.doi.org/10.1038/s41598-023-30143-8 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Arya, Nikhilanand
Saha, Sriparna
Mathur, Archana
Saha, Snehanshu
Improving the robustness and stability of a machine learning model for breast cancer prognosis through the use of multi-modal classifiers
title Improving the robustness and stability of a machine learning model for breast cancer prognosis through the use of multi-modal classifiers
title_full Improving the robustness and stability of a machine learning model for breast cancer prognosis through the use of multi-modal classifiers
title_fullStr Improving the robustness and stability of a machine learning model for breast cancer prognosis through the use of multi-modal classifiers
title_full_unstemmed Improving the robustness and stability of a machine learning model for breast cancer prognosis through the use of multi-modal classifiers
title_short Improving the robustness and stability of a machine learning model for breast cancer prognosis through the use of multi-modal classifiers
title_sort improving the robustness and stability of a machine learning model for breast cancer prognosis through the use of multi-modal classifiers
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10008603/
https://www.ncbi.nlm.nih.gov/pubmed/36906618
http://dx.doi.org/10.1038/s41598-023-30143-8
work_keys_str_mv AT aryanikhilanand improvingtherobustnessandstabilityofamachinelearningmodelforbreastcancerprognosisthroughtheuseofmultimodalclassifiers
AT sahasriparna improvingtherobustnessandstabilityofamachinelearningmodelforbreastcancerprognosisthroughtheuseofmultimodalclassifiers
AT mathurarchana improvingtherobustnessandstabilityofamachinelearningmodelforbreastcancerprognosisthroughtheuseofmultimodalclassifiers
AT sahasnehanshu improvingtherobustnessandstabilityofamachinelearningmodelforbreastcancerprognosisthroughtheuseofmultimodalclassifiers