Cargando…

Classifying Ten Types of Major Cancers Based on Reverse Phase Protein Array Profiles

Gathering vast data sets of cancer genomes requires more efficient and autonomous procedures to classify cancer types and to discover a few essential genes to distinguish different cancers. Because protein expression is more stable than gene expression, we chose reverse phase protein array (RPPA) da...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Pei-Wei, Chen, Lei, Huang, Tao, Zhang, Ning, Kong, Xiang-Yin, Cai, Yu-Dong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4378934/
https://www.ncbi.nlm.nih.gov/pubmed/25822500
http://dx.doi.org/10.1371/journal.pone.0123147
_version_ 1782364111446736896
author Zhang, Pei-Wei
Chen, Lei
Huang, Tao
Zhang, Ning
Kong, Xiang-Yin
Cai, Yu-Dong
author_facet Zhang, Pei-Wei
Chen, Lei
Huang, Tao
Zhang, Ning
Kong, Xiang-Yin
Cai, Yu-Dong
author_sort Zhang, Pei-Wei
collection PubMed
description Gathering vast data sets of cancer genomes requires more efficient and autonomous procedures to classify cancer types and to discover a few essential genes to distinguish different cancers. Because protein expression is more stable than gene expression, we chose reverse phase protein array (RPPA) data, a powerful and robust antibody-based high-throughput approach for targeted proteomics, to perform our research. In this study, we proposed a computational framework to classify the patient samples into ten major cancer types based on the RPPA data using the SMO (Sequential minimal optimization) method. A careful feature selection procedure was employed to select 23 important proteins from the total of 187 proteins by mRMR (minimum Redundancy Maximum Relevance Feature Selection) and IFS (Incremental Feature Selection) on the training set. By using the 23 proteins, we successfully classified the ten cancer types with an MCC (Matthews Correlation Coefficient) of 0.904 on the training set, evaluated by 10-fold cross-validation, and an MCC of 0.936 on an independent test set. Further analysis of these 23 proteins was performed. Most of these proteins can present the hallmarks of cancer; Chk2, for example, plays an important role in the proliferation of cancer cells. Our analysis of these 23 proteins lends credence to the importance of these genes as indicators of cancer classification. We also believe our methods and findings may shed light on the discoveries of specific biomarkers of different types of cancers.
format Online
Article
Text
id pubmed-4378934
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-43789342015-04-09 Classifying Ten Types of Major Cancers Based on Reverse Phase Protein Array Profiles Zhang, Pei-Wei Chen, Lei Huang, Tao Zhang, Ning Kong, Xiang-Yin Cai, Yu-Dong PLoS One Research Article Gathering vast data sets of cancer genomes requires more efficient and autonomous procedures to classify cancer types and to discover a few essential genes to distinguish different cancers. Because protein expression is more stable than gene expression, we chose reverse phase protein array (RPPA) data, a powerful and robust antibody-based high-throughput approach for targeted proteomics, to perform our research. In this study, we proposed a computational framework to classify the patient samples into ten major cancer types based on the RPPA data using the SMO (Sequential minimal optimization) method. A careful feature selection procedure was employed to select 23 important proteins from the total of 187 proteins by mRMR (minimum Redundancy Maximum Relevance Feature Selection) and IFS (Incremental Feature Selection) on the training set. By using the 23 proteins, we successfully classified the ten cancer types with an MCC (Matthews Correlation Coefficient) of 0.904 on the training set, evaluated by 10-fold cross-validation, and an MCC of 0.936 on an independent test set. Further analysis of these 23 proteins was performed. Most of these proteins can present the hallmarks of cancer; Chk2, for example, plays an important role in the proliferation of cancer cells. Our analysis of these 23 proteins lends credence to the importance of these genes as indicators of cancer classification. We also believe our methods and findings may shed light on the discoveries of specific biomarkers of different types of cancers. Public Library of Science 2015-03-30 /pmc/articles/PMC4378934/ /pubmed/25822500 http://dx.doi.org/10.1371/journal.pone.0123147 Text en © 2015 Zhang et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Zhang, Pei-Wei
Chen, Lei
Huang, Tao
Zhang, Ning
Kong, Xiang-Yin
Cai, Yu-Dong
Classifying Ten Types of Major Cancers Based on Reverse Phase Protein Array Profiles
title Classifying Ten Types of Major Cancers Based on Reverse Phase Protein Array Profiles
title_full Classifying Ten Types of Major Cancers Based on Reverse Phase Protein Array Profiles
title_fullStr Classifying Ten Types of Major Cancers Based on Reverse Phase Protein Array Profiles
title_full_unstemmed Classifying Ten Types of Major Cancers Based on Reverse Phase Protein Array Profiles
title_short Classifying Ten Types of Major Cancers Based on Reverse Phase Protein Array Profiles
title_sort classifying ten types of major cancers based on reverse phase protein array profiles
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4378934/
https://www.ncbi.nlm.nih.gov/pubmed/25822500
http://dx.doi.org/10.1371/journal.pone.0123147
work_keys_str_mv AT zhangpeiwei classifyingtentypesofmajorcancersbasedonreversephaseproteinarrayprofiles
AT chenlei classifyingtentypesofmajorcancersbasedonreversephaseproteinarrayprofiles
AT huangtao classifyingtentypesofmajorcancersbasedonreversephaseproteinarrayprofiles
AT zhangning classifyingtentypesofmajorcancersbasedonreversephaseproteinarrayprofiles
AT kongxiangyin classifyingtentypesofmajorcancersbasedonreversephaseproteinarrayprofiles
AT caiyudong classifyingtentypesofmajorcancersbasedonreversephaseproteinarrayprofiles