Cargando…

Identification of potential tissue-specific cancer biomarkers and development of cancer versus normal genomic classifiers

Machine learning techniques for cancer prediction and biomarker discovery can hasten cancer detection and significantly improve prognosis. Recent “OMICS” studies which include a variety of cancer and normal tissue samples along with machine learning approaches have the potential to further accelerat...

Descripción completa

Detalles Bibliográficos
Autores principales: Mohammed, Akram, Biegert, Greyson, Adamec, Jiri, Helikar, Tomáš
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Impact Journals LLC 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5689641/
https://www.ncbi.nlm.nih.gov/pubmed/29156751
http://dx.doi.org/10.18632/oncotarget.21127
_version_ 1783279424283607040
author Mohammed, Akram
Biegert, Greyson
Adamec, Jiri
Helikar, Tomáš
author_facet Mohammed, Akram
Biegert, Greyson
Adamec, Jiri
Helikar, Tomáš
author_sort Mohammed, Akram
collection PubMed
description Machine learning techniques for cancer prediction and biomarker discovery can hasten cancer detection and significantly improve prognosis. Recent “OMICS” studies which include a variety of cancer and normal tissue samples along with machine learning approaches have the potential to further accelerate such discovery. To demonstrate this potential, 2,175 gene expression samples from nine tissue types were obtained to identify gene sets whose expression is characteristic of each cancer class. Using random forests classification and ten-fold cross-validation, we developed nine single-tissue classifiers, two multi-tissue cancer-versus-normal classifiers, and one multi-tissue normal classifier. Given a sample of a specified tissue type, the single-tissue models classified samples as cancer or normal with a testing accuracy between 85.29% and 100%. Given a sample of non-specific tissue type, the multi-tissue bi-class model classified the sample as cancer versus normal with a testing accuracy of 97.89%. Given a sample of non-specific tissue type, the multi-tissue multi-class model classified the sample as cancer versus normal and as a specific tissue type with a testing accuracy of 97.43%. Given a normal sample of any of the nine tissue types, the multi-tissue normal model classified the sample as a particular tissue type with a testing accuracy of 97.35%. The machine learning classifiers developed in this study identify potential cancer biomarkers with sensitivity and specificity that exceed those of existing biomarkers and pointed to pathways that are critical to tissue-specific tumor development. This study demonstrates the feasibility of predicting the tissue origin of carcinoma in the context of multiple cancer classes.
format Online
Article
Text
id pubmed-5689641
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Impact Journals LLC
record_format MEDLINE/PubMed
spelling pubmed-56896412017-11-17 Identification of potential tissue-specific cancer biomarkers and development of cancer versus normal genomic classifiers Mohammed, Akram Biegert, Greyson Adamec, Jiri Helikar, Tomáš Oncotarget Research Paper Machine learning techniques for cancer prediction and biomarker discovery can hasten cancer detection and significantly improve prognosis. Recent “OMICS” studies which include a variety of cancer and normal tissue samples along with machine learning approaches have the potential to further accelerate such discovery. To demonstrate this potential, 2,175 gene expression samples from nine tissue types were obtained to identify gene sets whose expression is characteristic of each cancer class. Using random forests classification and ten-fold cross-validation, we developed nine single-tissue classifiers, two multi-tissue cancer-versus-normal classifiers, and one multi-tissue normal classifier. Given a sample of a specified tissue type, the single-tissue models classified samples as cancer or normal with a testing accuracy between 85.29% and 100%. Given a sample of non-specific tissue type, the multi-tissue bi-class model classified the sample as cancer versus normal with a testing accuracy of 97.89%. Given a sample of non-specific tissue type, the multi-tissue multi-class model classified the sample as cancer versus normal and as a specific tissue type with a testing accuracy of 97.43%. Given a normal sample of any of the nine tissue types, the multi-tissue normal model classified the sample as a particular tissue type with a testing accuracy of 97.35%. The machine learning classifiers developed in this study identify potential cancer biomarkers with sensitivity and specificity that exceed those of existing biomarkers and pointed to pathways that are critical to tissue-specific tumor development. This study demonstrates the feasibility of predicting the tissue origin of carcinoma in the context of multiple cancer classes. Impact Journals LLC 2017-09-21 /pmc/articles/PMC5689641/ /pubmed/29156751 http://dx.doi.org/10.18632/oncotarget.21127 Text en Copyright: © 2017 Mohammed et al. http://creativecommons.org/licenses/by/3.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0/) 3.0 (CC BY 3.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Paper
Mohammed, Akram
Biegert, Greyson
Adamec, Jiri
Helikar, Tomáš
Identification of potential tissue-specific cancer biomarkers and development of cancer versus normal genomic classifiers
title Identification of potential tissue-specific cancer biomarkers and development of cancer versus normal genomic classifiers
title_full Identification of potential tissue-specific cancer biomarkers and development of cancer versus normal genomic classifiers
title_fullStr Identification of potential tissue-specific cancer biomarkers and development of cancer versus normal genomic classifiers
title_full_unstemmed Identification of potential tissue-specific cancer biomarkers and development of cancer versus normal genomic classifiers
title_short Identification of potential tissue-specific cancer biomarkers and development of cancer versus normal genomic classifiers
title_sort identification of potential tissue-specific cancer biomarkers and development of cancer versus normal genomic classifiers
topic Research Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5689641/
https://www.ncbi.nlm.nih.gov/pubmed/29156751
http://dx.doi.org/10.18632/oncotarget.21127
work_keys_str_mv AT mohammedakram identificationofpotentialtissuespecificcancerbiomarkersanddevelopmentofcancerversusnormalgenomicclassifiers
AT biegertgreyson identificationofpotentialtissuespecificcancerbiomarkersanddevelopmentofcancerversusnormalgenomicclassifiers
AT adamecjiri identificationofpotentialtissuespecificcancerbiomarkersanddevelopmentofcancerversusnormalgenomicclassifiers
AT helikartomas identificationofpotentialtissuespecificcancerbiomarkersanddevelopmentofcancerversusnormalgenomicclassifiers