Cargando…

Combining guilt-by-association and guilt-by-profiling to predict Saccharomyces cerevisiae gene function

BACKGROUND: Learning the function of genes is a major goal of computational genomics. Methods for inferring gene function have typically fallen into two categories: 'guilt-by-profiling', which exploits correlation between function and other gene characteristics; and 'guilt-by-associat...

Descripción completa

Detalles Bibliográficos
Autores principales: Tian, Weidong, Zhang, Lan V, Taşan, Murat, Gibbons, Francis D, King, Oliver D, Park, Julie, Wunderlich, Zeba, Cherry, J Michael, Roth, Frederick P
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2447541/
https://www.ncbi.nlm.nih.gov/pubmed/18613951
http://dx.doi.org/10.1186/gb-2008-9-s1-s7
_version_ 1782156963285565440
author Tian, Weidong
Zhang, Lan V
Taşan, Murat
Gibbons, Francis D
King, Oliver D
Park, Julie
Wunderlich, Zeba
Cherry, J Michael
Roth, Frederick P
author_facet Tian, Weidong
Zhang, Lan V
Taşan, Murat
Gibbons, Francis D
King, Oliver D
Park, Julie
Wunderlich, Zeba
Cherry, J Michael
Roth, Frederick P
author_sort Tian, Weidong
collection PubMed
description BACKGROUND: Learning the function of genes is a major goal of computational genomics. Methods for inferring gene function have typically fallen into two categories: 'guilt-by-profiling', which exploits correlation between function and other gene characteristics; and 'guilt-by-association', which transfers function from one gene to another via biological relationships. RESULTS: We have developed a strategy ('Funckenstein') that performs guilt-by-profiling and guilt-by-association and combines the results. Using a benchmark set of functional categories and input data for protein-coding genes in Saccharomyces cerevisiae, Funckenstein was compared with a previous combined strategy. Subsequently, we applied Funckenstein to 2,455 Gene Ontology terms. In the process, we developed 2,455 guilt-by-profiling classifiers based on 8,848 gene characteristics and 12 functional linkage graphs based on 23 biological relationships. CONCLUSION: Funckenstein outperforms a previous combined strategy using a common benchmark dataset. The combination of 'guilt-by-profiling' and 'guilt-by-association' gave significant improvement over the component classifiers, showing the greatest synergy for the most specific functions. Performance was evaluated by cross-validation and by literature examination of the top-scoring novel predictions. These quantitative predictions should help prioritize experimental study of yeast gene functions.
format Text
id pubmed-2447541
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-24475412008-07-10 Combining guilt-by-association and guilt-by-profiling to predict Saccharomyces cerevisiae gene function Tian, Weidong Zhang, Lan V Taşan, Murat Gibbons, Francis D King, Oliver D Park, Julie Wunderlich, Zeba Cherry, J Michael Roth, Frederick P Genome Biol Method BACKGROUND: Learning the function of genes is a major goal of computational genomics. Methods for inferring gene function have typically fallen into two categories: 'guilt-by-profiling', which exploits correlation between function and other gene characteristics; and 'guilt-by-association', which transfers function from one gene to another via biological relationships. RESULTS: We have developed a strategy ('Funckenstein') that performs guilt-by-profiling and guilt-by-association and combines the results. Using a benchmark set of functional categories and input data for protein-coding genes in Saccharomyces cerevisiae, Funckenstein was compared with a previous combined strategy. Subsequently, we applied Funckenstein to 2,455 Gene Ontology terms. In the process, we developed 2,455 guilt-by-profiling classifiers based on 8,848 gene characteristics and 12 functional linkage graphs based on 23 biological relationships. CONCLUSION: Funckenstein outperforms a previous combined strategy using a common benchmark dataset. The combination of 'guilt-by-profiling' and 'guilt-by-association' gave significant improvement over the component classifiers, showing the greatest synergy for the most specific functions. Performance was evaluated by cross-validation and by literature examination of the top-scoring novel predictions. These quantitative predictions should help prioritize experimental study of yeast gene functions. BioMed Central 2008 2008-06-27 /pmc/articles/PMC2447541/ /pubmed/18613951 http://dx.doi.org/10.1186/gb-2008-9-s1-s7 Text en Copyright © 2008 Tian et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Method
Tian, Weidong
Zhang, Lan V
Taşan, Murat
Gibbons, Francis D
King, Oliver D
Park, Julie
Wunderlich, Zeba
Cherry, J Michael
Roth, Frederick P
Combining guilt-by-association and guilt-by-profiling to predict Saccharomyces cerevisiae gene function
title Combining guilt-by-association and guilt-by-profiling to predict Saccharomyces cerevisiae gene function
title_full Combining guilt-by-association and guilt-by-profiling to predict Saccharomyces cerevisiae gene function
title_fullStr Combining guilt-by-association and guilt-by-profiling to predict Saccharomyces cerevisiae gene function
title_full_unstemmed Combining guilt-by-association and guilt-by-profiling to predict Saccharomyces cerevisiae gene function
title_short Combining guilt-by-association and guilt-by-profiling to predict Saccharomyces cerevisiae gene function
title_sort combining guilt-by-association and guilt-by-profiling to predict saccharomyces cerevisiae gene function
topic Method
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2447541/
https://www.ncbi.nlm.nih.gov/pubmed/18613951
http://dx.doi.org/10.1186/gb-2008-9-s1-s7
work_keys_str_mv AT tianweidong combiningguiltbyassociationandguiltbyprofilingtopredictsaccharomycescerevisiaegenefunction
AT zhanglanv combiningguiltbyassociationandguiltbyprofilingtopredictsaccharomycescerevisiaegenefunction
AT tasanmurat combiningguiltbyassociationandguiltbyprofilingtopredictsaccharomycescerevisiaegenefunction
AT gibbonsfrancisd combiningguiltbyassociationandguiltbyprofilingtopredictsaccharomycescerevisiaegenefunction
AT kingoliverd combiningguiltbyassociationandguiltbyprofilingtopredictsaccharomycescerevisiaegenefunction
AT parkjulie combiningguiltbyassociationandguiltbyprofilingtopredictsaccharomycescerevisiaegenefunction
AT wunderlichzeba combiningguiltbyassociationandguiltbyprofilingtopredictsaccharomycescerevisiaegenefunction
AT cherryjmichael combiningguiltbyassociationandguiltbyprofilingtopredictsaccharomycescerevisiaegenefunction
AT rothfrederickp combiningguiltbyassociationandguiltbyprofilingtopredictsaccharomycescerevisiaegenefunction