MetaPhOrs: orthology and paralogy predictions from multiple phylogenetic evidence using a consistency-based confidence score

Bibliographic Details
Main Authors: Pryszcz, Leszek P., Huerta-Cepas, Jaime, Gabaldón, Toni
Format: Text
Language: English
Published: Oxford University Press 2011
Subjects: Methods Online
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3061081/
https://www.ncbi.nlm.nih.gov/pubmed/21149260
http://dx.doi.org/10.1093/nar/gkq953
author Pryszcz, Leszek P.
Huerta-Cepas, Jaime
Gabaldón, Toni
collection PubMed
description Reliable prediction of orthology is central to comparative genomics. Approaches based on phylogenetic analyses closely resemble the original definition of orthology and paralogy and are known to be highly accurate. However, the large computational cost associated with these analyses is a limiting factor that often prevents their use at genomic scales. Recently, several projects have addressed the reconstruction of large collections of high-quality phylogenetic trees from which orthology and paralogy relationships can be inferred. This provides us with the opportunity to infer the evolutionary relationships of genes from multiple independent phylogenetic trees. Using such a strategy, we combine phylogenetic information derived from different databases to predict orthology and paralogy relationships for 4.1 million proteins in 829 fully sequenced genomes. We show that the number of independent sources from which a prediction is made, as well as the level of consistency across predictions, can be used as reliable confidence scores. A web server (http://orthology.phylomedb.org) has been developed to provide easy access to these data, offering users a global repository of phylogeny-based orthology and paralogy predictions.
format Text
id pubmed-3061081
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-3061081 2011-03-21 MetaPhOrs: orthology and paralogy predictions from multiple phylogenetic evidence using a consistency-based confidence score Pryszcz, Leszek P. Huerta-Cepas, Jaime Gabaldón, Toni Nucleic Acids Res Methods Online Oxford University Press 2011-03 2010-12-11 /pmc/articles/PMC3061081/ /pubmed/21149260 http://dx.doi.org/10.1093/nar/gkq953 Text en © The Author(s) 2010. Published by Oxford University Press.
http://creativecommons.org/licenses/by-nc/2.5 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
title MetaPhOrs: orthology and paralogy predictions from multiple phylogenetic evidence using a consistency-based confidence score
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3061081/
https://www.ncbi.nlm.nih.gov/pubmed/21149260
http://dx.doi.org/10.1093/nar/gkq953