Cargando…
Building a Statistical Model for Predicting Cancer Genes
More than 400 cancer genes have been identified in the human genome. The list is not yet complete. Statistical models predicting cancer genes may help with identification of novel cancer gene candidates. We used known prostate cancer (PCa) genes (identified through KnowledgeNet) as a training set to...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3499550/ https://www.ncbi.nlm.nih.gov/pubmed/23166609 http://dx.doi.org/10.1371/journal.pone.0049175 |
_version_ | 1782249988503371776 |
---|---|
author | Gorlov, Ivan P. Logothetis, Christopher J. Fang, Shenying Gorlova, Olga Y. Amos, Christopher |
author_facet | Gorlov, Ivan P. Logothetis, Christopher J. Fang, Shenying Gorlova, Olga Y. Amos, Christopher |
author_sort | Gorlov, Ivan P. |
collection | PubMed |
description | More than 400 cancer genes have been identified in the human genome. The list is not yet complete. Statistical models predicting cancer genes may help with identification of novel cancer gene candidates. We used known prostate cancer (PCa) genes (identified through KnowledgeNet) as a training set to build a binary logistic regression model identifying PCa genes. Internal and external validation of the model was conducted using a validation set (also from KnowledgeNet), permutations, and external data on genes with recurrent prostate tumor mutations. We evaluated a set of 33 gene characteristics as predictors. Sixteen of the original 33 predictors were significant in the model. We found that a typical PCa gene is a prostate-specific transcription factor, kinase, or phosphatase with high interindividual variance of the expression level in adjacent normal prostate tissue and differential expression between normal prostate tissue and primary tumor. PCa genes are likely to have an antiapoptotic effect and to play a role in cell proliferation, angiogenesis, and cell adhesion. Their proteins are likely to be ubiquitinated or sumoylated but not acetylated. A number of novel PCa candidates have been proposed. Functional annotations of novel candidates identified antiapoptosis, regulation of cell proliferation, positive regulation of kinase activity, positive regulation of transferase activity, angiogenesis, positive regulation of cell division, and cell adhesion as top functions. We provide the list of the top 200 predicted PCa genes, which can be used as candidates for experimental validation. The model may be modified to predict genes for other cancer sites. |
format | Online Article Text |
id | pubmed-3499550 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-34995502012-11-19 Building a Statistical Model for Predicting Cancer Genes Gorlov, Ivan P. Logothetis, Christopher J. Fang, Shenying Gorlova, Olga Y. Amos, Christopher PLoS One Research Article More than 400 cancer genes have been identified in the human genome. The list is not yet complete. Statistical models predicting cancer genes may help with identification of novel cancer gene candidates. We used known prostate cancer (PCa) genes (identified through KnowledgeNet) as a training set to build a binary logistic regression model identifying PCa genes. Internal and external validation of the model was conducted using a validation set (also from KnowledgeNet), permutations, and external data on genes with recurrent prostate tumor mutations. We evaluated a set of 33 gene characteristics as predictors. Sixteen of the original 33 predictors were significant in the model. We found that a typical PCa gene is a prostate-specific transcription factor, kinase, or phosphatase with high interindividual variance of the expression level in adjacent normal prostate tissue and differential expression between normal prostate tissue and primary tumor. PCa genes are likely to have an antiapoptotic effect and to play a role in cell proliferation, angiogenesis, and cell adhesion. Their proteins are likely to be ubiquitinated or sumoylated but not acetylated. A number of novel PCa candidates have been proposed. Functional annotations of novel candidates identified antiapoptosis, regulation of cell proliferation, positive regulation of kinase activity, positive regulation of transferase activity, angiogenesis, positive regulation of cell division, and cell adhesion as top functions. We provide the list of the top 200 predicted PCa genes, which can be used as candidates for experimental validation. The model may be modified to predict genes for other cancer sites. Public Library of Science 2012-11-15 /pmc/articles/PMC3499550/ /pubmed/23166609 http://dx.doi.org/10.1371/journal.pone.0049175 Text en © 2012 Gorlov et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Gorlov, Ivan P. Logothetis, Christopher J. Fang, Shenying Gorlova, Olga Y. Amos, Christopher Building a Statistical Model for Predicting Cancer Genes |
title | Building a Statistical Model for Predicting Cancer Genes |
title_full | Building a Statistical Model for Predicting Cancer Genes |
title_fullStr | Building a Statistical Model for Predicting Cancer Genes |
title_full_unstemmed | Building a Statistical Model for Predicting Cancer Genes |
title_short | Building a Statistical Model for Predicting Cancer Genes |
title_sort | building a statistical model for predicting cancer genes |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3499550/ https://www.ncbi.nlm.nih.gov/pubmed/23166609 http://dx.doi.org/10.1371/journal.pone.0049175 |
work_keys_str_mv | AT gorlovivanp buildingastatisticalmodelforpredictingcancergenes AT logothetischristopherj buildingastatisticalmodelforpredictingcancergenes AT fangshenying buildingastatisticalmodelforpredictingcancergenes AT gorlovaolgay buildingastatisticalmodelforpredictingcancergenes AT amoschristopher buildingastatisticalmodelforpredictingcancergenes |