Cargando…

Automated analysis of genomic sequences facilitates high-throughput and comprehensive description of bacteria

The study of microbial communities is hampered by the large fraction of still unknown bacteria. However, many of these species have been isolated, yet lack a validly published name or description. The validation of names for novel bacteria requires that the uniqueness of those taxa is demonstrated a...

Descripción completa

Detalles Bibliográficos
Autores principales: Hitch, Thomas C. A., Riedel, Thomas, Oren, Aharon, Overmann, Jörg, Lawley, Trevor D., Clavel, Thomas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9723785/
https://www.ncbi.nlm.nih.gov/pubmed/36732617
http://dx.doi.org/10.1038/s43705-021-00017-z
_version_ 1784844266156064768
author Hitch, Thomas C. A.
Riedel, Thomas
Oren, Aharon
Overmann, Jörg
Lawley, Trevor D.
Clavel, Thomas
author_facet Hitch, Thomas C. A.
Riedel, Thomas
Oren, Aharon
Overmann, Jörg
Lawley, Trevor D.
Clavel, Thomas
author_sort Hitch, Thomas C. A.
collection PubMed
description The study of microbial communities is hampered by the large fraction of still unknown bacteria. However, many of these species have been isolated, yet lack a validly published name or description. The validation of names for novel bacteria requires that the uniqueness of those taxa is demonstrated and their properties are described. The accepted format for this is the protologue, which can be time-consuming to create. Hence, many research fields in microbiology and biotechnology will greatly benefit from new approaches that reduce the workload and harmonise the generation of protologues. We have developed Protologger, a bioinformatic tool that automatically generates all the necessary readouts for writing a detailed protologue. By producing multiple taxonomic outputs, functional features and ecological analysis using the 16S rRNA gene and genome sequences from a single species, the time needed to gather the information for describing novel taxa is substantially reduced. The usefulness of Protologger was demonstrated by using three published isolate collections to describe 34 novel taxa, encompassing 17 novel species and 17 novel genera, including the automatic generation of ecologically and functionally relevant names. We also highlight the need to utilise multiple taxonomic delineation methods, as while inconsistencies between each method occur, a combined approach provides robust placement. Protologger is open source; all scripts and datasets are available, along with a webserver at www.protologger.de
format Online
Article
Text
id pubmed-9723785
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-97237852023-01-04 Automated analysis of genomic sequences facilitates high-throughput and comprehensive description of bacteria Hitch, Thomas C. A. Riedel, Thomas Oren, Aharon Overmann, Jörg Lawley, Trevor D. Clavel, Thomas ISME Commun Article The study of microbial communities is hampered by the large fraction of still unknown bacteria. However, many of these species have been isolated, yet lack a validly published name or description. The validation of names for novel bacteria requires that the uniqueness of those taxa is demonstrated and their properties are described. The accepted format for this is the protologue, which can be time-consuming to create. Hence, many research fields in microbiology and biotechnology will greatly benefit from new approaches that reduce the workload and harmonise the generation of protologues. We have developed Protologger, a bioinformatic tool that automatically generates all the necessary readouts for writing a detailed protologue. By producing multiple taxonomic outputs, functional features and ecological analysis using the 16S rRNA gene and genome sequences from a single species, the time needed to gather the information for describing novel taxa is substantially reduced. The usefulness of Protologger was demonstrated by using three published isolate collections to describe 34 novel taxa, encompassing 17 novel species and 17 novel genera, including the automatic generation of ecologically and functionally relevant names. We also highlight the need to utilise multiple taxonomic delineation methods, as while inconsistencies between each method occur, a combined approach provides robust placement. Protologger is open source; all scripts and datasets are available, along with a webserver at www.protologger.de Nature Publishing Group UK 2021-05-20 /pmc/articles/PMC9723785/ /pubmed/36732617 http://dx.doi.org/10.1038/s43705-021-00017-z Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Hitch, Thomas C. A.
Riedel, Thomas
Oren, Aharon
Overmann, Jörg
Lawley, Trevor D.
Clavel, Thomas
Automated analysis of genomic sequences facilitates high-throughput and comprehensive description of bacteria
title Automated analysis of genomic sequences facilitates high-throughput and comprehensive description of bacteria
title_full Automated analysis of genomic sequences facilitates high-throughput and comprehensive description of bacteria
title_fullStr Automated analysis of genomic sequences facilitates high-throughput and comprehensive description of bacteria
title_full_unstemmed Automated analysis of genomic sequences facilitates high-throughput and comprehensive description of bacteria
title_short Automated analysis of genomic sequences facilitates high-throughput and comprehensive description of bacteria
title_sort automated analysis of genomic sequences facilitates high-throughput and comprehensive description of bacteria
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9723785/
https://www.ncbi.nlm.nih.gov/pubmed/36732617
http://dx.doi.org/10.1038/s43705-021-00017-z
work_keys_str_mv AT hitchthomasca automatedanalysisofgenomicsequencesfacilitateshighthroughputandcomprehensivedescriptionofbacteria
AT riedelthomas automatedanalysisofgenomicsequencesfacilitateshighthroughputandcomprehensivedescriptionofbacteria
AT orenaharon automatedanalysisofgenomicsequencesfacilitateshighthroughputandcomprehensivedescriptionofbacteria
AT overmannjorg automatedanalysisofgenomicsequencesfacilitateshighthroughputandcomprehensivedescriptionofbacteria
AT lawleytrevord automatedanalysisofgenomicsequencesfacilitateshighthroughputandcomprehensivedescriptionofbacteria
AT clavelthomas automatedanalysisofgenomicsequencesfacilitateshighthroughputandcomprehensivedescriptionofbacteria