Cargando…

Functional Annotation and Identification of Candidate Disease Genes by Computational Analysis of Normal Tissue Gene Expression Data

BACKGROUND: High-throughput gene expression data can predict gene function through the “guilt by association” principle: coexpressed genes are likely to be functionally associated. METHODOLOGY/PRINCIPAL FINDINGS: We analyzed publicly available expression data on normal human tissues. The analysis is...

Descripción completa

Detalles Bibliográficos
Autores principales: Miozzi, Laura, Piro, Rosario Michael, Rosa, Fabio, Ala, Ugo, Silengo, Lorenzo, Di Cunto, Ferdinando, Provero, Paolo
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2409962/
https://www.ncbi.nlm.nih.gov/pubmed/18560577
http://dx.doi.org/10.1371/journal.pone.0002439
_version_ 1782155911056326656
author Miozzi, Laura
Piro, Rosario Michael
Rosa, Fabio
Ala, Ugo
Silengo, Lorenzo
Di Cunto, Ferdinando
Provero, Paolo
author_facet Miozzi, Laura
Piro, Rosario Michael
Rosa, Fabio
Ala, Ugo
Silengo, Lorenzo
Di Cunto, Ferdinando
Provero, Paolo
author_sort Miozzi, Laura
collection PubMed
description BACKGROUND: High-throughput gene expression data can predict gene function through the “guilt by association” principle: coexpressed genes are likely to be functionally associated. METHODOLOGY/PRINCIPAL FINDINGS: We analyzed publicly available expression data on normal human tissues. The analysis is based on the integration of data obtained with two experimental platforms (microarrays and SAGE) and of various measures of dissimilarity between expression profiles. The building blocks of the procedure are the Ranked Coexpression Groups (RCG), small sets of tightly coexpressed genes which are analyzed in terms of functional annotation. Functionally characterized RCGs are selected by means of the majority rule and used to predict new functional annotations. Functionally characterized RCGs are enriched in groups of genes associated to similar phenotypes. We exploit this fact to find new candidate disease genes for many OMIM phenotypes of unknown molecular origin. CONCLUSIONS/SIGNIFICANCE: We predict new functional annotations for many human genes, showing that the integration of different data sets and coexpression measures significantly improves the scope of the results. Combining gene expression data, functional annotation and known phenotype-gene associations we provide candidate genes for several genetic diseases of unknown molecular basis.
format Text
id pubmed-2409962
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-24099622008-06-18 Functional Annotation and Identification of Candidate Disease Genes by Computational Analysis of Normal Tissue Gene Expression Data Miozzi, Laura Piro, Rosario Michael Rosa, Fabio Ala, Ugo Silengo, Lorenzo Di Cunto, Ferdinando Provero, Paolo PLoS One Research Article BACKGROUND: High-throughput gene expression data can predict gene function through the “guilt by association” principle: coexpressed genes are likely to be functionally associated. METHODOLOGY/PRINCIPAL FINDINGS: We analyzed publicly available expression data on normal human tissues. The analysis is based on the integration of data obtained with two experimental platforms (microarrays and SAGE) and of various measures of dissimilarity between expression profiles. The building blocks of the procedure are the Ranked Coexpression Groups (RCG), small sets of tightly coexpressed genes which are analyzed in terms of functional annotation. Functionally characterized RCGs are selected by means of the majority rule and used to predict new functional annotations. Functionally characterized RCGs are enriched in groups of genes associated to similar phenotypes. We exploit this fact to find new candidate disease genes for many OMIM phenotypes of unknown molecular origin. CONCLUSIONS/SIGNIFICANCE: We predict new functional annotations for many human genes, showing that the integration of different data sets and coexpression measures significantly improves the scope of the results. Combining gene expression data, functional annotation and known phenotype-gene associations we provide candidate genes for several genetic diseases of unknown molecular basis. Public Library of Science 2008-06-18 /pmc/articles/PMC2409962/ /pubmed/18560577 http://dx.doi.org/10.1371/journal.pone.0002439 Text en Miozzi et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Miozzi, Laura
Piro, Rosario Michael
Rosa, Fabio
Ala, Ugo
Silengo, Lorenzo
Di Cunto, Ferdinando
Provero, Paolo
Functional Annotation and Identification of Candidate Disease Genes by Computational Analysis of Normal Tissue Gene Expression Data
title Functional Annotation and Identification of Candidate Disease Genes by Computational Analysis of Normal Tissue Gene Expression Data
title_full Functional Annotation and Identification of Candidate Disease Genes by Computational Analysis of Normal Tissue Gene Expression Data
title_fullStr Functional Annotation and Identification of Candidate Disease Genes by Computational Analysis of Normal Tissue Gene Expression Data
title_full_unstemmed Functional Annotation and Identification of Candidate Disease Genes by Computational Analysis of Normal Tissue Gene Expression Data
title_short Functional Annotation and Identification of Candidate Disease Genes by Computational Analysis of Normal Tissue Gene Expression Data
title_sort functional annotation and identification of candidate disease genes by computational analysis of normal tissue gene expression data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2409962/
https://www.ncbi.nlm.nih.gov/pubmed/18560577
http://dx.doi.org/10.1371/journal.pone.0002439
work_keys_str_mv AT miozzilaura functionalannotationandidentificationofcandidatediseasegenesbycomputationalanalysisofnormaltissuegeneexpressiondata
AT pirorosariomichael functionalannotationandidentificationofcandidatediseasegenesbycomputationalanalysisofnormaltissuegeneexpressiondata
AT rosafabio functionalannotationandidentificationofcandidatediseasegenesbycomputationalanalysisofnormaltissuegeneexpressiondata
AT alaugo functionalannotationandidentificationofcandidatediseasegenesbycomputationalanalysisofnormaltissuegeneexpressiondata
AT silengolorenzo functionalannotationandidentificationofcandidatediseasegenesbycomputationalanalysisofnormaltissuegeneexpressiondata
AT dicuntoferdinando functionalannotationandidentificationofcandidatediseasegenesbycomputationalanalysisofnormaltissuegeneexpressiondata
AT proveropaolo functionalannotationandidentificationofcandidatediseasegenesbycomputationalanalysisofnormaltissuegeneexpressiondata