Cargando…

Breast Cancer Type Classification Using Machine Learning

Background: Breast cancer is a heterogeneous disease defined by molecular types and subtypes. Advances in genomic research have enabled use of precision medicine in clinical management of breast cancer. A critical unmet medical need is distinguishing triple negative breast cancer, the most aggressiv...

Descripción completa

Detalles Bibliográficos
Autores principales:	Wu, Jiande, Hicks, Chindo
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2021
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7909418/ https://www.ncbi.nlm.nih.gov/pubmed/33498339 http://dx.doi.org/10.3390/jpm11020061

_version_	1783655926923788288
author	Wu, Jiande Hicks, Chindo
author_facet	Wu, Jiande Hicks, Chindo
author_sort	Wu, Jiande
collection	PubMed
description	Background: Breast cancer is a heterogeneous disease defined by molecular types and subtypes. Advances in genomic research have enabled use of precision medicine in clinical management of breast cancer. A critical unmet medical need is distinguishing triple negative breast cancer, the most aggressive and lethal form of breast cancer, from non-triple negative breast cancer. Here we propose use of a machine learning (ML) approach for classification of triple negative breast cancer and non-triple negative breast cancer patients using gene expression data. Methods: We performed analysis of RNA-Sequence data from 110 triple negative and 992 non-triple negative breast cancer tumor samples from The Cancer Genome Atlas to select the features (genes) used in the development and validation of the classification models. We evaluated four different classification models including Support Vector Machines, K-nearest neighbor, Naïve Bayes and Decision tree using features selected at different threshold levels to train the models for classifying the two types of breast cancer. For performance evaluation and validation, the proposed methods were applied to independent gene expression datasets. Results: Among the four ML algorithms evaluated, the Support Vector Machine algorithm was able to classify breast cancer more accurately into triple negative and non-triple negative breast cancer and had less misclassification errors than the other three algorithms evaluated. Conclusions: The prediction results show that ML algorithms are efficient and can be used for classification of breast cancer into triple negative and non-triple negative breast cancer types.
format	Online Article Text
id	pubmed-7909418
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-79094182021-02-27 Breast Cancer Type Classification Using Machine Learning Wu, Jiande Hicks, Chindo J Pers Med Article Background: Breast cancer is a heterogeneous disease defined by molecular types and subtypes. Advances in genomic research have enabled use of precision medicine in clinical management of breast cancer. A critical unmet medical need is distinguishing triple negative breast cancer, the most aggressive and lethal form of breast cancer, from non-triple negative breast cancer. Here we propose use of a machine learning (ML) approach for classification of triple negative breast cancer and non-triple negative breast cancer patients using gene expression data. Methods: We performed analysis of RNA-Sequence data from 110 triple negative and 992 non-triple negative breast cancer tumor samples from The Cancer Genome Atlas to select the features (genes) used in the development and validation of the classification models. We evaluated four different classification models including Support Vector Machines, K-nearest neighbor, Naïve Bayes and Decision tree using features selected at different threshold levels to train the models for classifying the two types of breast cancer. For performance evaluation and validation, the proposed methods were applied to independent gene expression datasets. Results: Among the four ML algorithms evaluated, the Support Vector Machine algorithm was able to classify breast cancer more accurately into triple negative and non-triple negative breast cancer and had less misclassification errors than the other three algorithms evaluated. Conclusions: The prediction results show that ML algorithms are efficient and can be used for classification of breast cancer into triple negative and non-triple negative breast cancer types. MDPI 2021-01-20 /pmc/articles/PMC7909418/ /pubmed/33498339 http://dx.doi.org/10.3390/jpm11020061 Text en © 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Wu, Jiande Hicks, Chindo Breast Cancer Type Classification Using Machine Learning
title	Breast Cancer Type Classification Using Machine Learning
title_full	Breast Cancer Type Classification Using Machine Learning
title_fullStr	Breast Cancer Type Classification Using Machine Learning
title_full_unstemmed	Breast Cancer Type Classification Using Machine Learning
title_short	Breast Cancer Type Classification Using Machine Learning
title_sort	breast cancer type classification using machine learning
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7909418/ https://www.ncbi.nlm.nih.gov/pubmed/33498339 http://dx.doi.org/10.3390/jpm11020061
work_keys_str_mv	AT wujiande breastcancertypeclassificationusingmachinelearning AT hickschindo breastcancertypeclassificationusingmachinelearning

Breast Cancer Type Classification Using Machine Learning

Ejemplares similares