Cargando…

Integrating phenotype and gene expression data for predicting gene function

BACKGROUND: This paper presents a framework for integrating disparate data sets to predict gene function. The algorithm constructs a graph, called an integrated similarity graph, by computing similarities based upon both gene expression and textual phenotype data. This integrated graph is then used...

Descripción completa

Detalles Bibliográficos
Autores principales:	Malone, Brandon M, Perkins, Andy D, Bridges, Susan M
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2009
Materias:	Proceedings
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3226192/ https://www.ncbi.nlm.nih.gov/pubmed/19811686 http://dx.doi.org/10.1186/1471-2105-10-S11-S20

_version_	1782217579401576448
author	Malone, Brandon M Perkins, Andy D Bridges, Susan M
author_facet	Malone, Brandon M Perkins, Andy D Bridges, Susan M
author_sort	Malone, Brandon M
collection	PubMed
description	BACKGROUND: This paper presents a framework for integrating disparate data sets to predict gene function. The algorithm constructs a graph, called an integrated similarity graph, by computing similarities based upon both gene expression and textual phenotype data. This integrated graph is then used to make predictions about whether individual genes should be assigned a particular annotation from the Gene Ontology. RESULTS: A combined graph was generated from publicly-available gene expression data and phenotypic information from Saccharomyces cerevisiae. This graph was used to assign annotations to genes, as were graphs constructed from gene expression data and textual phenotype information alone. While the F-measure appeared similar for all three methods, annotations based upon the integrated similarity graph exhibited a better overall precision than gene expression or phenotype information alone can generate. The integrated approach was also able to assign almost as many annotations as the gene expression method alone, and generated significantly more total and correct assignments than the phenotype information could provide. CONCLUSION: These results suggest that augmenting standard gene expression data sets with publicly-available textual phenotype data can help generate more precise functional annotation predictions while mitigating the weaknesses of a standard textual phenotype approach.
format	Online Article Text
id	pubmed-3226192
institution	National Center for Biotechnology Information
language	English
publishDate	2009
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-32261922011-11-30 Integrating phenotype and gene expression data for predicting gene function Malone, Brandon M Perkins, Andy D Bridges, Susan M BMC Bioinformatics Proceedings BACKGROUND: This paper presents a framework for integrating disparate data sets to predict gene function. The algorithm constructs a graph, called an integrated similarity graph, by computing similarities based upon both gene expression and textual phenotype data. This integrated graph is then used to make predictions about whether individual genes should be assigned a particular annotation from the Gene Ontology. RESULTS: A combined graph was generated from publicly-available gene expression data and phenotypic information from Saccharomyces cerevisiae. This graph was used to assign annotations to genes, as were graphs constructed from gene expression data and textual phenotype information alone. While the F-measure appeared similar for all three methods, annotations based upon the integrated similarity graph exhibited a better overall precision than gene expression or phenotype information alone can generate. The integrated approach was also able to assign almost as many annotations as the gene expression method alone, and generated significantly more total and correct assignments than the phenotype information could provide. CONCLUSION: These results suggest that augmenting standard gene expression data sets with publicly-available textual phenotype data can help generate more precise functional annotation predictions while mitigating the weaknesses of a standard textual phenotype approach. BioMed Central 2009-10-08 /pmc/articles/PMC3226192/ /pubmed/19811686 http://dx.doi.org/10.1186/1471-2105-10-S11-S20 Text en Copyright ©2009 Malone et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Proceedings Malone, Brandon M Perkins, Andy D Bridges, Susan M Integrating phenotype and gene expression data for predicting gene function
title	Integrating phenotype and gene expression data for predicting gene function
title_full	Integrating phenotype and gene expression data for predicting gene function
title_fullStr	Integrating phenotype and gene expression data for predicting gene function
title_full_unstemmed	Integrating phenotype and gene expression data for predicting gene function
title_short	Integrating phenotype and gene expression data for predicting gene function
title_sort	integrating phenotype and gene expression data for predicting gene function
topic	Proceedings
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3226192/ https://www.ncbi.nlm.nih.gov/pubmed/19811686 http://dx.doi.org/10.1186/1471-2105-10-S11-S20
work_keys_str_mv	AT malonebrandonm integratingphenotypeandgeneexpressiondataforpredictinggenefunction AT perkinsandyd integratingphenotypeandgeneexpressiondataforpredictinggenefunction AT bridgessusanm integratingphenotypeandgeneexpressiondataforpredictinggenefunction

Integrating phenotype and gene expression data for predicting gene function

Ejemplares similares