A Robust Personalized Classification Method for Breast Cancer Metastasis Prediction
SIMPLE SUMMARY: Accurate prediction of breast cancer metastasis risks using gene expression data and machine learning can help improve cancer treatment and overall survival. However, breast cancer can be categorized into multiple subtypes, and a single predictive model may not work well for all pati...
Main authors: | Adnan, Nahim; Najnin, Tanzira; Ruan, Jianhua |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | MDPI 2022 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9658757/ https://www.ncbi.nlm.nih.gov/pubmed/36358745 http://dx.doi.org/10.3390/cancers14215327 |
_version_ | 1784830031737913344 |
---|---|
author | Adnan, Nahim Najnin, Tanzira Ruan, Jianhua |
author_facet | Adnan, Nahim Najnin, Tanzira Ruan, Jianhua |
author_sort | Adnan, Nahim |
collection | PubMed |
description | SIMPLE SUMMARY: Accurate prediction of breast cancer metastasis risks using gene expression data and machine learning can help improve cancer treatment and overall survival. However, breast cancer can be categorized into multiple subtypes, and a single predictive model may not work well for all patients. In this work, we propose a computational method to construct personalized models, where the key is to select a group of patients to train a different model for each testing patient. Experimental results on multiple datasets showed that the proposed method, termed Personalized Classifier with Multiple Thresholds (PCMT), achieved significantly better prediction accuracy than existing algorithms that train classifiers using all available patients or using patients belonging to a predefined subtype. In addition, the top features identified by PCMT are robust across different datasets, and include genes that are well known to be associated with subtype-specific metastasis. ABSTRACT: Accurate prediction of breast cancer metastasis in the early stages of cancer diagnosis is crucial to reduce cancer-related deaths. With the availability of gene expression datasets, many machine-learning models have been proposed to predict breast cancer metastasis using thousands of genes simultaneously. However, the prediction accuracy of the models using gene expression often suffers from the diverse molecular characteristics across different datasets. Additionally, breast cancer is known to have many subtypes, which hinders the performance of the models aimed at all subtypes. To overcome the heterogeneous nature of breast cancer, we propose a method to obtain personalized classifiers that are trained on subsets of patients selected using the similarities between training and testing patients. 
Results on multiple independent datasets showed that our proposed approach significantly improved prediction accuracy compared to the models trained on the complete training dataset and models trained on specific cancer subtypes. Our results also showed that personalized classifiers trained on positively and negatively correlated patients outperformed classifiers trained only on positively correlated patients, highlighting the importance of selecting proper patient subsets for constructing personalized classifiers. Additionally, our proposed approach obtained more robust features than the other models and identified different features for different patients, making it a promising tool for designing personalized medicine for cancer patients. |
format | Online Article Text |
id | pubmed-9658757 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-9658757 2022-11-15 A Robust Personalized Classification Method for Breast Cancer Metastasis Prediction Adnan, Nahim Najnin, Tanzira Ruan, Jianhua Cancers (Basel) Article MDPI 2022-10-29 /pmc/articles/PMC9658757/ /pubmed/36358745 http://dx.doi.org/10.3390/cancers14215327 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/ Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Adnan, Nahim Najnin, Tanzira Ruan, Jianhua A Robust Personalized Classification Method for Breast Cancer Metastasis Prediction |
title | A Robust Personalized Classification Method for Breast Cancer Metastasis Prediction |
title_full | A Robust Personalized Classification Method for Breast Cancer Metastasis Prediction |
title_fullStr | A Robust Personalized Classification Method for Breast Cancer Metastasis Prediction |
title_full_unstemmed | A Robust Personalized Classification Method for Breast Cancer Metastasis Prediction |
title_short | A Robust Personalized Classification Method for Breast Cancer Metastasis Prediction |
title_sort | robust personalized classification method for breast cancer metastasis prediction |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9658757/ https://www.ncbi.nlm.nih.gov/pubmed/36358745 http://dx.doi.org/10.3390/cancers14215327 |
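
The description above says that each testing patient gets its own classifier, trained on training patients selected by their similarity to that patient, and that including both positively and negatively correlated patients worked better than using positively correlated patients alone. The following is a minimal sketch of that idea, not the authors' implementation. Everything specific in it is an assumption made for illustration: Pearson correlation as the similarity measure, the threshold values, the nearest-centroid scorer standing in for a real classifier, averaging scores across thresholds, and all function and parameter names.

```python
import numpy as np


def select_patients(X_train, x_test, threshold, use_negative=True):
    """Indices of training patients whose Pearson correlation with the
    test patient passes the threshold (hypothetical selection rule)."""
    # Centering each expression profile turns a normalized dot product
    # into the Pearson correlation coefficient.
    Xc = X_train - X_train.mean(axis=1, keepdims=True)
    xc = x_test - x_test.mean()
    denom = np.linalg.norm(Xc, axis=1) * np.linalg.norm(xc)
    r = (Xc @ xc) / np.where(denom == 0, 1.0, denom)
    # With use_negative=True, strongly anti-correlated patients are kept too.
    score = np.abs(r) if use_negative else r
    return np.flatnonzero(score >= threshold)


def centroid_score(X, y, x_test):
    """Toy stand-in classifier: a score in [0, 1] that is higher when the
    test patient lies closer to the metastatic (label 1) centroid."""
    d_pos = np.linalg.norm(x_test - X[y == 1].mean(axis=0))
    d_neg = np.linalg.norm(x_test - X[y == 0].mean(axis=0))
    total = d_pos + d_neg
    return 0.5 if total == 0 else d_neg / total


def personalized_predict(X_train, y_train, x_test,
                         thresholds=(0.1, 0.2, 0.3), min_per_class=2):
    """Average the per-threshold scores for one test patient.
    Assumes y_train contains both classes (0 and 1)."""
    scores = []
    for t in thresholds:
        idx = select_patients(X_train, x_test, t)
        y_sub = y_train[idx]
        # Skip thresholds that leave too few patients in either class.
        if (y_sub == 1).sum() < min_per_class or (y_sub == 0).sum() < min_per_class:
            continue
        scores.append(centroid_score(X_train[idx], y_sub, x_test))
    if not scores:
        # Assumed fallback: use every training patient.
        scores.append(centroid_score(X_train, y_train, x_test))
    return float(np.mean(scores))
```

Here `X_train` is a patients-by-genes array, `y_train` holds 0/1 metastasis labels, and `x_test` is one patient's expression vector. Calling `personalized_predict(X_train, y_train, x_test)` once per testing patient yields a separate training subset, and therefore a separate model, for each of them.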