Cargando…

PhyloPattern: regular expressions to identify complex patterns in phylogenetic trees

BACKGROUND: To effectively apply evolutionary concepts in genome-scale studies, large numbers of phylogenetic trees have to be automatically analysed, at a level approaching human expertise. Complex architectures must be recognized within the trees, so that associated information can be extracted. R...

Descripción completa

Detalles Bibliográficos
Autores principales: Gouret, Philippe, Thompson, Julie D, Pontarotti, Pierre
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2759962/
https://www.ncbi.nlm.nih.gov/pubmed/19765311
http://dx.doi.org/10.1186/1471-2105-10-298
_version_ 1782172715741872128
author Gouret, Philippe
Thompson, Julie D
Pontarotti, Pierre
author_facet Gouret, Philippe
Thompson, Julie D
Pontarotti, Pierre
author_sort Gouret, Philippe
collection PubMed
description BACKGROUND: To effectively apply evolutionary concepts in genome-scale studies, large numbers of phylogenetic trees have to be automatically analysed, at a level approaching human expertise. Complex architectures must be recognized within the trees, so that associated information can be extracted. RESULTS: Here, we present a new software library, PhyloPattern, for automating tree manipulations and analysis. PhyloPattern includes three main modules, which address essential tasks in high-throughput phylogenetic tree analysis: node annotation, pattern matching, and tree comparison. PhyloPattern thus allows the programmer to focus on: i) the use of predefined or user defined annotation functions to perform immediate or deferred evaluation of node properties, ii) the search for user-defined patterns in large phylogenetic trees, iii) the pairwise comparison of trees by dynamically generating patterns from one tree and applying them to the other. CONCLUSION: PhyloPattern greatly simplifies and accelerates the work of the computer scientist in the evolutionary biology field. The library has been used to automatically identify phylogenetic evidence for domain shuffling or gene loss events in the evolutionary histories of protein sequences. However any workflow that relies on phylogenetic tree analysis, could be automated with PhyloPattern.
format Text
id pubmed-2759962
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-27599622009-10-11 PhyloPattern: regular expressions to identify complex patterns in phylogenetic trees Gouret, Philippe Thompson, Julie D Pontarotti, Pierre BMC Bioinformatics Software BACKGROUND: To effectively apply evolutionary concepts in genome-scale studies, large numbers of phylogenetic trees have to be automatically analysed, at a level approaching human expertise. Complex architectures must be recognized within the trees, so that associated information can be extracted. RESULTS: Here, we present a new software library, PhyloPattern, for automating tree manipulations and analysis. PhyloPattern includes three main modules, which address essential tasks in high-throughput phylogenetic tree analysis: node annotation, pattern matching, and tree comparison. PhyloPattern thus allows the programmer to focus on: i) the use of predefined or user defined annotation functions to perform immediate or deferred evaluation of node properties, ii) the search for user-defined patterns in large phylogenetic trees, iii) the pairwise comparison of trees by dynamically generating patterns from one tree and applying them to the other. CONCLUSION: PhyloPattern greatly simplifies and accelerates the work of the computer scientist in the evolutionary biology field. The library has been used to automatically identify phylogenetic evidence for domain shuffling or gene loss events in the evolutionary histories of protein sequences. However any workflow that relies on phylogenetic tree analysis, could be automated with PhyloPattern. BioMed Central 2009-09-19 /pmc/articles/PMC2759962/ /pubmed/19765311 http://dx.doi.org/10.1186/1471-2105-10-298 Text en Copyright © 2009 Gouret et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software
Gouret, Philippe
Thompson, Julie D
Pontarotti, Pierre
PhyloPattern: regular expressions to identify complex patterns in phylogenetic trees
title PhyloPattern: regular expressions to identify complex patterns in phylogenetic trees
title_full PhyloPattern: regular expressions to identify complex patterns in phylogenetic trees
title_fullStr PhyloPattern: regular expressions to identify complex patterns in phylogenetic trees
title_full_unstemmed PhyloPattern: regular expressions to identify complex patterns in phylogenetic trees
title_short PhyloPattern: regular expressions to identify complex patterns in phylogenetic trees
title_sort phylopattern: regular expressions to identify complex patterns in phylogenetic trees
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2759962/
https://www.ncbi.nlm.nih.gov/pubmed/19765311
http://dx.doi.org/10.1186/1471-2105-10-298
work_keys_str_mv AT gouretphilippe phylopatternregularexpressionstoidentifycomplexpatternsinphylogenetictrees
AT thompsonjulied phylopatternregularexpressionstoidentifycomplexpatternsinphylogenetictrees
AT pontarottipierre phylopatternregularexpressionstoidentifycomplexpatternsinphylogenetictrees