Classifying Ten Types of Major Cancers Based on Reverse Phase Protein Array Profiles
Gathering vast data sets of cancer genomes requires more efficient and autonomous procedures to classify cancer types and to discover a few essential genes to distinguish different cancers. Because protein expression is more stable than gene expression, we chose reverse phase protein array (RPPA) da...
Main Authors: | Zhang, Pei-Wei; Chen, Lei; Huang, Tao; Zhang, Ning; Kong, Xiang-Yin; Cai, Yu-Dong |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Public Library of Science 2015 |
Subjects: | Research Article |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4378934/ https://www.ncbi.nlm.nih.gov/pubmed/25822500 http://dx.doi.org/10.1371/journal.pone.0123147 |
_version_ | 1782364111446736896 |
---|---|
author | Zhang, Pei-Wei Chen, Lei Huang, Tao Zhang, Ning Kong, Xiang-Yin Cai, Yu-Dong |
author_sort | Zhang, Pei-Wei |
collection | PubMed |
description | Gathering vast data sets of cancer genomes requires more efficient and autonomous procedures to classify cancer types and to discover a few essential genes to distinguish different cancers. Because protein expression is more stable than gene expression, we chose reverse phase protein array (RPPA) data, a powerful and robust antibody-based high-throughput approach for targeted proteomics, to perform our research. In this study, we proposed a computational framework to classify the patient samples into ten major cancer types based on the RPPA data using the SMO (Sequential minimal optimization) method. A careful feature selection procedure was employed to select 23 important proteins from the total of 187 proteins by mRMR (minimum Redundancy Maximum Relevance Feature Selection) and IFS (Incremental Feature Selection) on the training set. By using the 23 proteins, we successfully classified the ten cancer types with an MCC (Matthews Correlation Coefficient) of 0.904 on the training set, evaluated by 10-fold cross-validation, and an MCC of 0.936 on an independent test set. Further analysis of these 23 proteins was performed. Most of these proteins can present the hallmarks of cancer; Chk2, for example, plays an important role in the proliferation of cancer cells. Our analysis of these 23 proteins lends credence to the importance of these genes as indicators of cancer classification. We also believe our methods and findings may shed light on the discoveries of specific biomarkers of different types of cancers. |
format | Online Article Text |
id | pubmed-4378934 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-43789342015-04-09 Classifying Ten Types of Major Cancers Based on Reverse Phase Protein Array Profiles Zhang, Pei-Wei Chen, Lei Huang, Tao Zhang, Ning Kong, Xiang-Yin Cai, Yu-Dong PLoS One Research Article Gathering vast data sets of cancer genomes requires more efficient and autonomous procedures to classify cancer types and to discover a few essential genes to distinguish different cancers. Because protein expression is more stable than gene expression, we chose reverse phase protein array (RPPA) data, a powerful and robust antibody-based high-throughput approach for targeted proteomics, to perform our research. In this study, we proposed a computational framework to classify the patient samples into ten major cancer types based on the RPPA data using the SMO (Sequential minimal optimization) method. A careful feature selection procedure was employed to select 23 important proteins from the total of 187 proteins by mRMR (minimum Redundancy Maximum Relevance Feature Selection) and IFS (Incremental Feature Selection) on the training set. By using the 23 proteins, we successfully classified the ten cancer types with an MCC (Matthews Correlation Coefficient) of 0.904 on the training set, evaluated by 10-fold cross-validation, and an MCC of 0.936 on an independent test set. Further analysis of these 23 proteins was performed. Most of these proteins can present the hallmarks of cancer; Chk2, for example, plays an important role in the proliferation of cancer cells. Our analysis of these 23 proteins lends credence to the importance of these genes as indicators of cancer classification. We also believe our methods and findings may shed light on the discoveries of specific biomarkers of different types of cancers. 
Public Library of Science 2015-03-30 /pmc/articles/PMC4378934/ /pubmed/25822500 http://dx.doi.org/10.1371/journal.pone.0123147 Text en © 2015 Zhang et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
title | Classifying Ten Types of Major Cancers Based on Reverse Phase Protein Array Profiles |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4378934/ https://www.ncbi.nlm.nih.gov/pubmed/25822500 http://dx.doi.org/10.1371/journal.pone.0123147 |
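
The description field in this record reports classification quality as an MCC (Matthews Correlation Coefficient) over ten cancer types. The record does not say which multiclass form of MCC was used, so the following is only a minimal sketch, assuming the common multiclass generalization computed from label counts. The function name `multiclass_mcc` and the toy labels are hypothetical and are not taken from the article.

```python
from collections import Counter
from math import sqrt


def multiclass_mcc(y_true, y_pred):
    """Multiclass Matthews correlation coefficient (assumed generalization).

    Uses MCC = (c*s - sum_k p_k*t_k) / sqrt((s^2 - sum_k p_k^2) * (s^2 - sum_k t_k^2))
    where s is the number of samples, c the number of correct predictions,
    t_k the number of samples whose true label is k, and p_k the number of
    samples predicted as k.
    """
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")

    s = len(y_true)
    c = sum(t == p for t, p in zip(y_true, y_pred))
    t_counts = Counter(y_true)
    p_counts = Counter(y_pred)

    # Counter returns 0 for labels missing on one side.
    labels = set(t_counts) | set(p_counts)
    sum_pt = sum(p_counts[k] * t_counts[k] for k in labels)

    denom = sqrt(
        (s * s - sum(v * v for v in p_counts.values()))
        * (s * s - sum(v * v for v in t_counts.values()))
    )
    # Convention assumed here: return 0.0 when the denominator vanishes
    # (for example, when every sample is predicted as a single class).
    return (c * s - sum_pt) / denom if denom else 0.0


if __name__ == "__main__":
    # Hypothetical toy labels, not data from the article.
    truth = ["BRCA", "BRCA", "KIRC", "KIRC", "OV", "OV"]
    guess = ["BRCA", "KIRC", "KIRC", "KIRC", "OV", "OV"]
    print(round(multiclass_mcc(truth, guess), 3))
```

With two classes this expression reduces to the familiar binary MCC, so the same function covers both cases.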