Cargando…

PanACoTA: a modular tool for massive microbial comparative genomics

The study of the gene repertoires of microbial species, their pangenomes, has become a key part of microbial evolution and functional genomics. Yet, the increasing number of genomes available complicates the establishment of the basic building blocks of comparative genomics. Here, we present PanACoT...

Descripción completa

Detalles Bibliográficos
Autores principales: Perrin, Amandine, Rocha, Eduardo P C
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7803007/
https://www.ncbi.nlm.nih.gov/pubmed/33575648
http://dx.doi.org/10.1093/nargab/lqaa106
_version_ 1783635860519911424
author Perrin, Amandine
Rocha, Eduardo P C
author_facet Perrin, Amandine
Rocha, Eduardo P C
author_sort Perrin, Amandine
collection PubMed
description The study of the gene repertoires of microbial species, their pangenomes, has become a key part of microbial evolution and functional genomics. Yet, the increasing number of genomes available complicates the establishment of the basic building blocks of comparative genomics. Here, we present PanACoTA (https://github.com/gem-pasteur/PanACoTA), a tool that allows to download all genomes of a species, build a database with those passing quality and redundancy controls, uniformly annotate and then build their pangenome, several variants of core genomes, their alignments and a rapid but accurate phylogenetic tree. While many programs building pangenomes have become available in the last few years, we have focused on a modular method, that tackles all the key steps of the process, from download to phylogenetic inference. While all steps are integrated, they can also be run separately and multiple times to allow rapid and extensive exploration of the parameters of interest. PanACoTA is built in Python3, includes a singularity container and features to facilitate its future development. We believe PanACoTa is an interesting addition to the current set of comparative genomics tools, since it will accelerate and standardize the more routine parts of the work, allowing microbial genomicists to more quickly tackle their specific questions.
format Online
Article
Text
id pubmed-7803007
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-78030072021-02-10 PanACoTA: a modular tool for massive microbial comparative genomics Perrin, Amandine Rocha, Eduardo P C NAR Genom Bioinform Standard Article The study of the gene repertoires of microbial species, their pangenomes, has become a key part of microbial evolution and functional genomics. Yet, the increasing number of genomes available complicates the establishment of the basic building blocks of comparative genomics. Here, we present PanACoTA (https://github.com/gem-pasteur/PanACoTA), a tool that allows to download all genomes of a species, build a database with those passing quality and redundancy controls, uniformly annotate and then build their pangenome, several variants of core genomes, their alignments and a rapid but accurate phylogenetic tree. While many programs building pangenomes have become available in the last few years, we have focused on a modular method, that tackles all the key steps of the process, from download to phylogenetic inference. While all steps are integrated, they can also be run separately and multiple times to allow rapid and extensive exploration of the parameters of interest. PanACoTA is built in Python3, includes a singularity container and features to facilitate its future development. We believe PanACoTa is an interesting addition to the current set of comparative genomics tools, since it will accelerate and standardize the more routine parts of the work, allowing microbial genomicists to more quickly tackle their specific questions. Oxford University Press 2021-01-12 /pmc/articles/PMC7803007/ /pubmed/33575648 http://dx.doi.org/10.1093/nargab/lqaa106 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Standard Article
Perrin, Amandine
Rocha, Eduardo P C
PanACoTA: a modular tool for massive microbial comparative genomics
title PanACoTA: a modular tool for massive microbial comparative genomics
title_full PanACoTA: a modular tool for massive microbial comparative genomics
title_fullStr PanACoTA: a modular tool for massive microbial comparative genomics
title_full_unstemmed PanACoTA: a modular tool for massive microbial comparative genomics
title_short PanACoTA: a modular tool for massive microbial comparative genomics
title_sort panacota: a modular tool for massive microbial comparative genomics
topic Standard Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7803007/
https://www.ncbi.nlm.nih.gov/pubmed/33575648
http://dx.doi.org/10.1093/nargab/lqaa106
work_keys_str_mv AT perrinamandine panacotaamodulartoolformassivemicrobialcomparativegenomics
AT rochaeduardopc panacotaamodulartoolformassivemicrobialcomparativegenomics