Cargando…
Discovering cancer genes by integrating network and functional properties
BACKGROUND: Identification of novel cancer-causing genes is one of the main goals in cancer research. The rapid accumulation of genome-wide protein-protein interaction (PPI) data in humans has provided a new basis for studying the topological features of cancer genes in cellular networks. It is impo...
Autores principales: | , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2009
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2758898/ https://www.ncbi.nlm.nih.gov/pubmed/19765316 http://dx.doi.org/10.1186/1755-8794-2-61 |
_version_ | 1782172627252543488 |
---|---|
author | Li, Li Zhang, Kangyu Lee, James Cordes, Shaun Davis, David P Tang, Zhijun |
author_facet | Li, Li Zhang, Kangyu Lee, James Cordes, Shaun Davis, David P Tang, Zhijun |
author_sort | Li, Li |
collection | PubMed |
description | BACKGROUND: Identification of novel cancer-causing genes is one of the main goals in cancer research. The rapid accumulation of genome-wide protein-protein interaction (PPI) data in humans has provided a new basis for studying the topological features of cancer genes in cellular networks. It is important to integrate multiple genomic data sources, including PPI networks, protein domains and Gene Ontology (GO) annotations, to facilitate the identification of cancer genes. METHODS: Topological features of the PPI network, as well as protein domain compositions, enrichment of gene ontology categories, sequence and evolutionary conservation features were extracted and compared between cancer genes and other genes. The predictive power of various classifiers for identification of cancer genes was evaluated by cross validation. Experimental validation of a subset of the prediction results was conducted using siRNA knockdown and viability assays in human colon cancer cell line DLD-1. RESULTS: Cross validation demonstrated advantageous performance of classifiers based on support vector machines (SVMs) with the inclusion of the topological features from the PPI network, protein domain compositions and GO annotations. We then applied the trained SVM classifier to human genes to prioritize putative cancer genes. siRNA knock-down of several SVM predicted cancer genes displayed greatly reduced cell viability in human colon cancer cell line DLD-1. CONCLUSION: Topological features of PPI networks, protein domain compositions and GO annotations are good predictors of cancer genes. The SVM classifier integrates multiple features and as such is useful for prioritizing candidate cancer genes for experimental validations. |
format | Text |
id | pubmed-2758898 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2009 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-27588982009-10-08 Discovering cancer genes by integrating network and functional properties Li, Li Zhang, Kangyu Lee, James Cordes, Shaun Davis, David P Tang, Zhijun BMC Med Genomics Research Article BACKGROUND: Identification of novel cancer-causing genes is one of the main goals in cancer research. The rapid accumulation of genome-wide protein-protein interaction (PPI) data in humans has provided a new basis for studying the topological features of cancer genes in cellular networks. It is important to integrate multiple genomic data sources, including PPI networks, protein domains and Gene Ontology (GO) annotations, to facilitate the identification of cancer genes. METHODS: Topological features of the PPI network, as well as protein domain compositions, enrichment of gene ontology categories, sequence and evolutionary conservation features were extracted and compared between cancer genes and other genes. The predictive power of various classifiers for identification of cancer genes was evaluated by cross validation. Experimental validation of a subset of the prediction results was conducted using siRNA knockdown and viability assays in human colon cancer cell line DLD-1. RESULTS: Cross validation demonstrated advantageous performance of classifiers based on support vector machines (SVMs) with the inclusion of the topological features from the PPI network, protein domain compositions and GO annotations. We then applied the trained SVM classifier to human genes to prioritize putative cancer genes. siRNA knock-down of several SVM predicted cancer genes displayed greatly reduced cell viability in human colon cancer cell line DLD-1. CONCLUSION: Topological features of PPI networks, protein domain compositions and GO annotations are good predictors of cancer genes. The SVM classifier integrates multiple features and as such is useful for prioritizing candidate cancer genes for experimental validations. BioMed Central 2009-09-19 /pmc/articles/PMC2758898/ /pubmed/19765316 http://dx.doi.org/10.1186/1755-8794-2-61 Text en Copyright © 2009 Li et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Li, Li Zhang, Kangyu Lee, James Cordes, Shaun Davis, David P Tang, Zhijun Discovering cancer genes by integrating network and functional properties |
title | Discovering cancer genes by integrating network and functional properties |
title_full | Discovering cancer genes by integrating network and functional properties |
title_fullStr | Discovering cancer genes by integrating network and functional properties |
title_full_unstemmed | Discovering cancer genes by integrating network and functional properties |
title_short | Discovering cancer genes by integrating network and functional properties |
title_sort | discovering cancer genes by integrating network and functional properties |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2758898/ https://www.ncbi.nlm.nih.gov/pubmed/19765316 http://dx.doi.org/10.1186/1755-8794-2-61 |
work_keys_str_mv | AT lili discoveringcancergenesbyintegratingnetworkandfunctionalproperties AT zhangkangyu discoveringcancergenesbyintegratingnetworkandfunctionalproperties AT leejames discoveringcancergenesbyintegratingnetworkandfunctionalproperties AT cordesshaun discoveringcancergenesbyintegratingnetworkandfunctionalproperties AT davisdavidp discoveringcancergenesbyintegratingnetworkandfunctionalproperties AT tangzhijun discoveringcancergenesbyintegratingnetworkandfunctionalproperties |