Cargando…

Combining modularity, conservation, and interactions of proteins significantly increases precision and coverage of protein function prediction

BACKGROUND: While the number of newly sequenced genomes and genes is constantly increasing, elucidation of their function still is a laborious and time-consuming task. This has led to the development of a wide range of methods for predicting protein functions in silico. We report on a new method tha...

Descripción completa

Detalles Bibliográficos
Autores principales: Jaeger, Samira, Sers, Christine T, Leser, Ulf
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3017542/
https://www.ncbi.nlm.nih.gov/pubmed/21171995
http://dx.doi.org/10.1186/1471-2164-11-717
_version_ 1782195911406911488
author Jaeger, Samira
Sers, Christine T
Leser, Ulf
author_facet Jaeger, Samira
Sers, Christine T
Leser, Ulf
author_sort Jaeger, Samira
collection PubMed
description BACKGROUND: While the number of newly sequenced genomes and genes is constantly increasing, elucidation of their function still is a laborious and time-consuming task. This has led to the development of a wide range of methods for predicting protein functions in silico. We report on a new method that predicts function based on a combination of information about protein interactions, orthology, and the conservation of protein networks in different species. RESULTS: We show that aggregation of these independent sources of evidence leads to a drastic increase in number and quality of predictions when compared to baselines and other methods reported in the literature. For instance, our method generates more than 12,000 novel protein functions for human with an estimated precision of ~76%, among which are 7,500 new functional annotations for 1,973 human proteins that previously had zero or only one function annotated. We also verified our predictions on a set of genes that play an important role in colorectal cancer (MLH1, PMS2, EPHB4 ) and could confirm more than 73% of them based on evidence in the literature. CONCLUSIONS: The combination of different methods into a single, comprehensive prediction method infers thousands of protein functions for every species included in the analysis at varying, yet always high levels of precision and very good coverage.
format Text
id pubmed-3017542
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-30175422011-01-10 Combining modularity, conservation, and interactions of proteins significantly increases precision and coverage of protein function prediction Jaeger, Samira Sers, Christine T Leser, Ulf BMC Genomics Research Article BACKGROUND: While the number of newly sequenced genomes and genes is constantly increasing, elucidation of their function still is a laborious and time-consuming task. This has led to the development of a wide range of methods for predicting protein functions in silico. We report on a new method that predicts function based on a combination of information about protein interactions, orthology, and the conservation of protein networks in different species. RESULTS: We show that aggregation of these independent sources of evidence leads to a drastic increase in number and quality of predictions when compared to baselines and other methods reported in the literature. For instance, our method generates more than 12,000 novel protein functions for human with an estimated precision of ~76%, among which are 7,500 new functional annotations for 1,973 human proteins that previously had zero or only one function annotated. We also verified our predictions on a set of genes that play an important role in colorectal cancer (MLH1, PMS2, EPHB4 ) and could confirm more than 73% of them based on evidence in the literature. CONCLUSIONS: The combination of different methods into a single, comprehensive prediction method infers thousands of protein functions for every species included in the analysis at varying, yet always high levels of precision and very good coverage. BioMed Central 2010-12-20 /pmc/articles/PMC3017542/ /pubmed/21171995 http://dx.doi.org/10.1186/1471-2164-11-717 Text en Copyright ©2010 Jaeger et al; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Jaeger, Samira
Sers, Christine T
Leser, Ulf
Combining modularity, conservation, and interactions of proteins significantly increases precision and coverage of protein function prediction
title Combining modularity, conservation, and interactions of proteins significantly increases precision and coverage of protein function prediction
title_full Combining modularity, conservation, and interactions of proteins significantly increases precision and coverage of protein function prediction
title_fullStr Combining modularity, conservation, and interactions of proteins significantly increases precision and coverage of protein function prediction
title_full_unstemmed Combining modularity, conservation, and interactions of proteins significantly increases precision and coverage of protein function prediction
title_short Combining modularity, conservation, and interactions of proteins significantly increases precision and coverage of protein function prediction
title_sort combining modularity, conservation, and interactions of proteins significantly increases precision and coverage of protein function prediction
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3017542/
https://www.ncbi.nlm.nih.gov/pubmed/21171995
http://dx.doi.org/10.1186/1471-2164-11-717
work_keys_str_mv AT jaegersamira combiningmodularityconservationandinteractionsofproteinssignificantlyincreasesprecisionandcoverageofproteinfunctionprediction
AT serschristinet combiningmodularityconservationandinteractionsofproteinssignificantlyincreasesprecisionandcoverageofproteinfunctionprediction
AT leserulf combiningmodularityconservationandinteractionsofproteinssignificantlyincreasesprecisionandcoverageofproteinfunctionprediction