Cargando…

Building a Statistical Model for Predicting Cancer Genes

More than 400 cancer genes have been identified in the human genome. The list is not yet complete. Statistical models predicting cancer genes may help with identification of novel cancer gene candidates. We used known prostate cancer (PCa) genes (identified through KnowledgeNet) as a training set to...

Descripción completa

Detalles Bibliográficos
Autores principales: Gorlov, Ivan P., Logothetis, Christopher J., Fang, Shenying, Gorlova, Olga Y., Amos, Christopher
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3499550/
https://www.ncbi.nlm.nih.gov/pubmed/23166609
http://dx.doi.org/10.1371/journal.pone.0049175
_version_ 1782249988503371776
author Gorlov, Ivan P.
Logothetis, Christopher J.
Fang, Shenying
Gorlova, Olga Y.
Amos, Christopher
author_facet Gorlov, Ivan P.
Logothetis, Christopher J.
Fang, Shenying
Gorlova, Olga Y.
Amos, Christopher
author_sort Gorlov, Ivan P.
collection PubMed
description More than 400 cancer genes have been identified in the human genome. The list is not yet complete. Statistical models predicting cancer genes may help with identification of novel cancer gene candidates. We used known prostate cancer (PCa) genes (identified through KnowledgeNet) as a training set to build a binary logistic regression model identifying PCa genes. Internal and external validation of the model was conducted using a validation set (also from KnowledgeNet), permutations, and external data on genes with recurrent prostate tumor mutations. We evaluated a set of 33 gene characteristics as predictors. Sixteen of the original 33 predictors were significant in the model. We found that a typical PCa gene is a prostate-specific transcription factor, kinase, or phosphatase with high interindividual variance of the expression level in adjacent normal prostate tissue and differential expression between normal prostate tissue and primary tumor. PCa genes are likely to have an antiapoptotic effect and to play a role in cell proliferation, angiogenesis, and cell adhesion. Their proteins are likely to be ubiquitinated or sumoylated but not acetylated. A number of novel PCa candidates have been proposed. Functional annotations of novel candidates identified antiapoptosis, regulation of cell proliferation, positive regulation of kinase activity, positive regulation of transferase activity, angiogenesis, positive regulation of cell division, and cell adhesion as top functions. We provide the list of the top 200 predicted PCa genes, which can be used as candidates for experimental validation. The model may be modified to predict genes for other cancer sites.
format Online
Article
Text
id pubmed-3499550
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-34995502012-11-19 Building a Statistical Model for Predicting Cancer Genes Gorlov, Ivan P. Logothetis, Christopher J. Fang, Shenying Gorlova, Olga Y. Amos, Christopher PLoS One Research Article More than 400 cancer genes have been identified in the human genome. The list is not yet complete. Statistical models predicting cancer genes may help with identification of novel cancer gene candidates. We used known prostate cancer (PCa) genes (identified through KnowledgeNet) as a training set to build a binary logistic regression model identifying PCa genes. Internal and external validation of the model was conducted using a validation set (also from KnowledgeNet), permutations, and external data on genes with recurrent prostate tumor mutations. We evaluated a set of 33 gene characteristics as predictors. Sixteen of the original 33 predictors were significant in the model. We found that a typical PCa gene is a prostate-specific transcription factor, kinase, or phosphatase with high interindividual variance of the expression level in adjacent normal prostate tissue and differential expression between normal prostate tissue and primary tumor. PCa genes are likely to have an antiapoptotic effect and to play a role in cell proliferation, angiogenesis, and cell adhesion. Their proteins are likely to be ubiquitinated or sumoylated but not acetylated. A number of novel PCa candidates have been proposed. Functional annotations of novel candidates identified antiapoptosis, regulation of cell proliferation, positive regulation of kinase activity, positive regulation of transferase activity, angiogenesis, positive regulation of cell division, and cell adhesion as top functions. We provide the list of the top 200 predicted PCa genes, which can be used as candidates for experimental validation. The model may be modified to predict genes for other cancer sites. Public Library of Science 2012-11-15 /pmc/articles/PMC3499550/ /pubmed/23166609 http://dx.doi.org/10.1371/journal.pone.0049175 Text en © 2012 Gorlov et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Gorlov, Ivan P.
Logothetis, Christopher J.
Fang, Shenying
Gorlova, Olga Y.
Amos, Christopher
Building a Statistical Model for Predicting Cancer Genes
title Building a Statistical Model for Predicting Cancer Genes
title_full Building a Statistical Model for Predicting Cancer Genes
title_fullStr Building a Statistical Model for Predicting Cancer Genes
title_full_unstemmed Building a Statistical Model for Predicting Cancer Genes
title_short Building a Statistical Model for Predicting Cancer Genes
title_sort building a statistical model for predicting cancer genes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3499550/
https://www.ncbi.nlm.nih.gov/pubmed/23166609
http://dx.doi.org/10.1371/journal.pone.0049175
work_keys_str_mv AT gorlovivanp buildingastatisticalmodelforpredictingcancergenes
AT logothetischristopherj buildingastatisticalmodelforpredictingcancergenes
AT fangshenying buildingastatisticalmodelforpredictingcancergenes
AT gorlovaolgay buildingastatisticalmodelforpredictingcancergenes
AT amoschristopher buildingastatisticalmodelforpredictingcancergenes