Cargando…

ProPhylo: partial phylogenetic profiling to guide protein family construction and assignment of biological process

BACKGROUND: Phylogenetic profiling is a technique of scoring co-occurrence between a protein family and some other trait, usually another protein family, across a set of taxonomic groups. In spite of several refinements in recent years, the technique still invites significant improvement. To be its...

Descripción completa

Detalles Bibliográficos
Autores principales: Basu, Malay K, Selengut, Jeremy D, Haft, Daniel H
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3226654/
https://www.ncbi.nlm.nih.gov/pubmed/22070167
http://dx.doi.org/10.1186/1471-2105-12-434
_version_ 1782217662965743616
author Basu, Malay K
Selengut, Jeremy D
Haft, Daniel H
author_facet Basu, Malay K
Selengut, Jeremy D
Haft, Daniel H
author_sort Basu, Malay K
collection PubMed
description BACKGROUND: Phylogenetic profiling is a technique of scoring co-occurrence between a protein family and some other trait, usually another protein family, across a set of taxonomic groups. In spite of several refinements in recent years, the technique still invites significant improvement. To be its most effective, a phylogenetic profiling algorithm must be able to examine co-occurrences among protein families whose boundaries are uncertain within large homologous protein superfamilies. RESULTS: Partial Phylogenetic Profiling (PPP) is an iterative algorithm that scores a given taxonomic profile against the taxonomic distribution of families for all proteins in a genome. The method works through optimizing the boundary of each protein family, rather than by relying on prebuilt protein families or fixed sequence similarity thresholds. Double Partial Phylogenetic Profiling (DPPP) is a related procedure that begins with a single sequence and searches for optimal granularities for its surrounding protein family in order to generate the best query profiles for PPP. We present ProPhylo, a high-performance software package for phylogenetic profiling studies through creating individually optimized protein family boundaries. ProPhylo provides precomputed databases for immediate use and tools for manipulating the taxonomic profiles used as queries. CONCLUSION: ProPhylo results show universal markers of methanogenesis, a new DNA phosphorothioation-dependent restriction enzyme, and efficacy in guiding protein family construction. The software and the associated databases are freely available under the open source Perl Artistic License from ftp://ftp.jcvi.org/pub/data/ppp/.
format Online
Article
Text
id pubmed-3226654
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32266542011-11-30 ProPhylo: partial phylogenetic profiling to guide protein family construction and assignment of biological process Basu, Malay K Selengut, Jeremy D Haft, Daniel H BMC Bioinformatics Software BACKGROUND: Phylogenetic profiling is a technique of scoring co-occurrence between a protein family and some other trait, usually another protein family, across a set of taxonomic groups. In spite of several refinements in recent years, the technique still invites significant improvement. To be its most effective, a phylogenetic profiling algorithm must be able to examine co-occurrences among protein families whose boundaries are uncertain within large homologous protein superfamilies. RESULTS: Partial Phylogenetic Profiling (PPP) is an iterative algorithm that scores a given taxonomic profile against the taxonomic distribution of families for all proteins in a genome. The method works through optimizing the boundary of each protein family, rather than by relying on prebuilt protein families or fixed sequence similarity thresholds. Double Partial Phylogenetic Profiling (DPPP) is a related procedure that begins with a single sequence and searches for optimal granularities for its surrounding protein family in order to generate the best query profiles for PPP. We present ProPhylo, a high-performance software package for phylogenetic profiling studies through creating individually optimized protein family boundaries. ProPhylo provides precomputed databases for immediate use and tools for manipulating the taxonomic profiles used as queries. CONCLUSION: ProPhylo results show universal markers of methanogenesis, a new DNA phosphorothioation-dependent restriction enzyme, and efficacy in guiding protein family construction. The software and the associated databases are freely available under the open source Perl Artistic License from ftp://ftp.jcvi.org/pub/data/ppp/. BioMed Central 2011-11-09 /pmc/articles/PMC3226654/ /pubmed/22070167 http://dx.doi.org/10.1186/1471-2105-12-434 Text en Copyright © 2011 Basu et al; licensee BioMed Central Ltd. https://creativecommons.org/licenses/by/2.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0 (https://creativecommons.org/licenses/by/2.0/) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software
Basu, Malay K
Selengut, Jeremy D
Haft, Daniel H
ProPhylo: partial phylogenetic profiling to guide protein family construction and assignment of biological process
title ProPhylo: partial phylogenetic profiling to guide protein family construction and assignment of biological process
title_full ProPhylo: partial phylogenetic profiling to guide protein family construction and assignment of biological process
title_fullStr ProPhylo: partial phylogenetic profiling to guide protein family construction and assignment of biological process
title_full_unstemmed ProPhylo: partial phylogenetic profiling to guide protein family construction and assignment of biological process
title_short ProPhylo: partial phylogenetic profiling to guide protein family construction and assignment of biological process
title_sort prophylo: partial phylogenetic profiling to guide protein family construction and assignment of biological process
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3226654/
https://www.ncbi.nlm.nih.gov/pubmed/22070167
http://dx.doi.org/10.1186/1471-2105-12-434
work_keys_str_mv AT basumalayk prophylopartialphylogeneticprofilingtoguideproteinfamilyconstructionandassignmentofbiologicalprocess
AT selengutjeremyd prophylopartialphylogeneticprofilingtoguideproteinfamilyconstructionandassignmentofbiologicalprocess
AT haftdanielh prophylopartialphylogeneticprofilingtoguideproteinfamilyconstructionandassignmentofbiologicalprocess