Cargando…

Predicting cancer involvement of genes from heterogeneous data

BACKGROUND: Systematic approaches for identifying proteins involved in different types of cancer are needed. Experimental techniques such as microarrays are being used to characterize cancer, but validating their results can be a laborious task. Computational approaches are used to prioritize betwee...

Descripción completa

Detalles Bibliográficos
Autores principales: Aragues, Ramon, Sander, Chris, Oliva, Baldo
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2330045/
https://www.ncbi.nlm.nih.gov/pubmed/18371197
http://dx.doi.org/10.1186/1471-2105-9-172
_version_ 1782152779747295232
author Aragues, Ramon
Sander, Chris
Oliva, Baldo
author_facet Aragues, Ramon
Sander, Chris
Oliva, Baldo
author_sort Aragues, Ramon
collection PubMed
description BACKGROUND: Systematic approaches for identifying proteins involved in different types of cancer are needed. Experimental techniques such as microarrays are being used to characterize cancer, but validating their results can be a laborious task. Computational approaches are used to prioritize between genes putatively involved in cancer, usually based on further analyzing experimental data. RESULTS: We implemented a systematic method using the PIANA software that predicts cancer involvement of genes by integrating heterogeneous datasets. Specifically, we produced lists of genes likely to be involved in cancer by relying on: (i) protein-protein interactions; (ii) differential expression data; and (iii) structural and functional properties of cancer genes. The integrative approach that combines multiple sources of data obtained positive predictive values ranging from 23% (on a list of 811 genes) to 73% (on a list of 22 genes), outperforming the use of any of the data sources alone. We analyze a list of 20 cancer gene predictions, finding that most of them have been recently linked to cancer in literature. CONCLUSION: Our approach to identifying and prioritizing candidate cancer genes can be used to produce lists of genes likely to be involved in cancer. Our results suggest that differential expression studies yielding high numbers of candidate cancer genes can be filtered using protein interaction networks.
format Text
id pubmed-2330045
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-23300452008-04-24 Predicting cancer involvement of genes from heterogeneous data Aragues, Ramon Sander, Chris Oliva, Baldo BMC Bioinformatics Research Article BACKGROUND: Systematic approaches for identifying proteins involved in different types of cancer are needed. Experimental techniques such as microarrays are being used to characterize cancer, but validating their results can be a laborious task. Computational approaches are used to prioritize between genes putatively involved in cancer, usually based on further analyzing experimental data. RESULTS: We implemented a systematic method using the PIANA software that predicts cancer involvement of genes by integrating heterogeneous datasets. Specifically, we produced lists of genes likely to be involved in cancer by relying on: (i) protein-protein interactions; (ii) differential expression data; and (iii) structural and functional properties of cancer genes. The integrative approach that combines multiple sources of data obtained positive predictive values ranging from 23% (on a list of 811 genes) to 73% (on a list of 22 genes), outperforming the use of any of the data sources alone. We analyze a list of 20 cancer gene predictions, finding that most of them have been recently linked to cancer in literature. CONCLUSION: Our approach to identifying and prioritizing candidate cancer genes can be used to produce lists of genes likely to be involved in cancer. Our results suggest that differential expression studies yielding high numbers of candidate cancer genes can be filtered using protein interaction networks. BioMed Central 2008-03-27 /pmc/articles/PMC2330045/ /pubmed/18371197 http://dx.doi.org/10.1186/1471-2105-9-172 Text en Copyright © 2008 Aragues et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Aragues, Ramon
Sander, Chris
Oliva, Baldo
Predicting cancer involvement of genes from heterogeneous data
title Predicting cancer involvement of genes from heterogeneous data
title_full Predicting cancer involvement of genes from heterogeneous data
title_fullStr Predicting cancer involvement of genes from heterogeneous data
title_full_unstemmed Predicting cancer involvement of genes from heterogeneous data
title_short Predicting cancer involvement of genes from heterogeneous data
title_sort predicting cancer involvement of genes from heterogeneous data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2330045/
https://www.ncbi.nlm.nih.gov/pubmed/18371197
http://dx.doi.org/10.1186/1471-2105-9-172
work_keys_str_mv AT araguesramon predictingcancerinvolvementofgenesfromheterogeneousdata
AT sanderchris predictingcancerinvolvementofgenesfromheterogeneousdata
AT olivabaldo predictingcancerinvolvementofgenesfromheterogeneousdata