Cargando…

PhyloToL: A Taxon/Gene-Rich Phylogenomic Pipeline to Explore Genome Evolution of Diverse Eukaryotes

Estimating multiple sequence alignments (MSAs) and inferring phylogenies are essential for many aspects of comparative biology. Yet, many bioinformatics tools for such analyses have focused on specific clades, with greatest attention paid to plants, animals, and fungi. The rapid increase in high-thr...

Descripción completa

Detalles Bibliográficos
Autores principales: Cerón-Romero, Mario A, Maurer-Alcalá, Xyrus X, Grattepanche, Jean-David, Yan, Ying, Fonseca, Miguel M, Katz, L A
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6657734/
https://www.ncbi.nlm.nih.gov/pubmed/31062861
http://dx.doi.org/10.1093/molbev/msz103
_version_ 1783438842924105728
author Cerón-Romero, Mario A
Maurer-Alcalá, Xyrus X
Grattepanche, Jean-David
Yan, Ying
Fonseca, Miguel M
Katz, L A
author_facet Cerón-Romero, Mario A
Maurer-Alcalá, Xyrus X
Grattepanche, Jean-David
Yan, Ying
Fonseca, Miguel M
Katz, L A
author_sort Cerón-Romero, Mario A
collection PubMed
description Estimating multiple sequence alignments (MSAs) and inferring phylogenies are essential for many aspects of comparative biology. Yet, many bioinformatics tools for such analyses have focused on specific clades, with greatest attention paid to plants, animals, and fungi. The rapid increase in high-throughput sequencing (HTS) data from diverse lineages now provides opportunities to estimate evolutionary relationships and gene family evolution across the eukaryotic tree of life. At the same time, these types of data are known to be error-prone (e.g., substitutions, contamination). To address these opportunities and challenges, we have refined a phylogenomic pipeline, now named PhyloToL, to allow easy incorporation of data from HTS studies, to automate production of both MSAs and gene trees, and to identify and remove contaminants. PhyloToL is designed for phylogenomic analyses of diverse lineages across the tree of life (i.e., at scales of >100 My). We demonstrate the power of PhyloToL by assessing stop codon usage in Ciliophora, identifying contamination in a taxon- and gene-rich database and exploring the evolutionary history of chromosomes in the kinetoplastid parasite Trypanosoma brucei, the causative agent of African sleeping sickness. Benchmarking PhyloToL’s homology assessment against that of OrthoMCL and a published paper on superfamilies of bacterial and eukaryotic organellar outer membrane pore-forming proteins demonstrates the power of our approach for determining gene family membership and inferring gene trees. PhyloToL is highly flexible and allows users to easily explore HTS data, test hypotheses about phylogeny and gene family evolution and combine outputs with third-party tools (e.g., PhyloChromoMap, iGTP).
format Online
Article
Text
id pubmed-6657734
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-66577342019-08-02 PhyloToL: A Taxon/Gene-Rich Phylogenomic Pipeline to Explore Genome Evolution of Diverse Eukaryotes Cerón-Romero, Mario A Maurer-Alcalá, Xyrus X Grattepanche, Jean-David Yan, Ying Fonseca, Miguel M Katz, L A Mol Biol Evol Resources Estimating multiple sequence alignments (MSAs) and inferring phylogenies are essential for many aspects of comparative biology. Yet, many bioinformatics tools for such analyses have focused on specific clades, with greatest attention paid to plants, animals, and fungi. The rapid increase in high-throughput sequencing (HTS) data from diverse lineages now provides opportunities to estimate evolutionary relationships and gene family evolution across the eukaryotic tree of life. At the same time, these types of data are known to be error-prone (e.g., substitutions, contamination). To address these opportunities and challenges, we have refined a phylogenomic pipeline, now named PhyloToL, to allow easy incorporation of data from HTS studies, to automate production of both MSAs and gene trees, and to identify and remove contaminants. PhyloToL is designed for phylogenomic analyses of diverse lineages across the tree of life (i.e., at scales of >100 My). We demonstrate the power of PhyloToL by assessing stop codon usage in Ciliophora, identifying contamination in a taxon- and gene-rich database and exploring the evolutionary history of chromosomes in the kinetoplastid parasite Trypanosoma brucei, the causative agent of African sleeping sickness. Benchmarking PhyloToL’s homology assessment against that of OrthoMCL and a published paper on superfamilies of bacterial and eukaryotic organellar outer membrane pore-forming proteins demonstrates the power of our approach for determining gene family membership and inferring gene trees. PhyloToL is highly flexible and allows users to easily explore HTS data, test hypotheses about phylogeny and gene family evolution and combine outputs with third-party tools (e.g., PhyloChromoMap, iGTP). Oxford University Press 2019-08 2019-05-07 /pmc/articles/PMC6657734/ /pubmed/31062861 http://dx.doi.org/10.1093/molbev/msz103 Text en © The Author(s) 2019. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Resources
Cerón-Romero, Mario A
Maurer-Alcalá, Xyrus X
Grattepanche, Jean-David
Yan, Ying
Fonseca, Miguel M
Katz, L A
PhyloToL: A Taxon/Gene-Rich Phylogenomic Pipeline to Explore Genome Evolution of Diverse Eukaryotes
title PhyloToL: A Taxon/Gene-Rich Phylogenomic Pipeline to Explore Genome Evolution of Diverse Eukaryotes
title_full PhyloToL: A Taxon/Gene-Rich Phylogenomic Pipeline to Explore Genome Evolution of Diverse Eukaryotes
title_fullStr PhyloToL: A Taxon/Gene-Rich Phylogenomic Pipeline to Explore Genome Evolution of Diverse Eukaryotes
title_full_unstemmed PhyloToL: A Taxon/Gene-Rich Phylogenomic Pipeline to Explore Genome Evolution of Diverse Eukaryotes
title_short PhyloToL: A Taxon/Gene-Rich Phylogenomic Pipeline to Explore Genome Evolution of Diverse Eukaryotes
title_sort phylotol: a taxon/gene-rich phylogenomic pipeline to explore genome evolution of diverse eukaryotes
topic Resources
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6657734/
https://www.ncbi.nlm.nih.gov/pubmed/31062861
http://dx.doi.org/10.1093/molbev/msz103
work_keys_str_mv AT ceronromeromarioa phylotolataxongenerichphylogenomicpipelinetoexploregenomeevolutionofdiverseeukaryotes
AT maureralcalaxyrusx phylotolataxongenerichphylogenomicpipelinetoexploregenomeevolutionofdiverseeukaryotes
AT grattepanchejeandavid phylotolataxongenerichphylogenomicpipelinetoexploregenomeevolutionofdiverseeukaryotes
AT yanying phylotolataxongenerichphylogenomicpipelinetoexploregenomeevolutionofdiverseeukaryotes
AT fonsecamiguelm phylotolataxongenerichphylogenomicpipelinetoexploregenomeevolutionofdiverseeukaryotes
AT katzla phylotolataxongenerichphylogenomicpipelinetoexploregenomeevolutionofdiverseeukaryotes