Cargando…
Automated analysis of genomic sequences facilitates high-throughput and comprehensive description of bacteria
The study of microbial communities is hampered by the large fraction of still unknown bacteria. However, many of these species have been isolated, yet lack a validly published name or description. The validation of names for novel bacteria requires that the uniqueness of those taxa is demonstrated a...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9723785/ https://www.ncbi.nlm.nih.gov/pubmed/36732617 http://dx.doi.org/10.1038/s43705-021-00017-z |
_version_ | 1784844266156064768 |
---|---|
author | Hitch, Thomas C. A. Riedel, Thomas Oren, Aharon Overmann, Jörg Lawley, Trevor D. Clavel, Thomas |
author_facet | Hitch, Thomas C. A. Riedel, Thomas Oren, Aharon Overmann, Jörg Lawley, Trevor D. Clavel, Thomas |
author_sort | Hitch, Thomas C. A. |
collection | PubMed |
description | The study of microbial communities is hampered by the large fraction of still unknown bacteria. However, many of these species have been isolated, yet lack a validly published name or description. The validation of names for novel bacteria requires that the uniqueness of those taxa is demonstrated and their properties are described. The accepted format for this is the protologue, which can be time-consuming to create. Hence, many research fields in microbiology and biotechnology will greatly benefit from new approaches that reduce the workload and harmonise the generation of protologues. We have developed Protologger, a bioinformatic tool that automatically generates all the necessary readouts for writing a detailed protologue. By producing multiple taxonomic outputs, functional features and ecological analysis using the 16S rRNA gene and genome sequences from a single species, the time needed to gather the information for describing novel taxa is substantially reduced. The usefulness of Protologger was demonstrated by using three published isolate collections to describe 34 novel taxa, encompassing 17 novel species and 17 novel genera, including the automatic generation of ecologically and functionally relevant names. We also highlight the need to utilise multiple taxonomic delineation methods, as while inconsistencies between each method occur, a combined approach provides robust placement. Protologger is open source; all scripts and datasets are available, along with a webserver at www.protologger.de |
format | Online Article Text |
id | pubmed-9723785 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-97237852023-01-04 Automated analysis of genomic sequences facilitates high-throughput and comprehensive description of bacteria Hitch, Thomas C. A. Riedel, Thomas Oren, Aharon Overmann, Jörg Lawley, Trevor D. Clavel, Thomas ISME Commun Article The study of microbial communities is hampered by the large fraction of still unknown bacteria. However, many of these species have been isolated, yet lack a validly published name or description. The validation of names for novel bacteria requires that the uniqueness of those taxa is demonstrated and their properties are described. The accepted format for this is the protologue, which can be time-consuming to create. Hence, many research fields in microbiology and biotechnology will greatly benefit from new approaches that reduce the workload and harmonise the generation of protologues. We have developed Protologger, a bioinformatic tool that automatically generates all the necessary readouts for writing a detailed protologue. By producing multiple taxonomic outputs, functional features and ecological analysis using the 16S rRNA gene and genome sequences from a single species, the time needed to gather the information for describing novel taxa is substantially reduced. The usefulness of Protologger was demonstrated by using three published isolate collections to describe 34 novel taxa, encompassing 17 novel species and 17 novel genera, including the automatic generation of ecologically and functionally relevant names. We also highlight the need to utilise multiple taxonomic delineation methods, as while inconsistencies between each method occur, a combined approach provides robust placement. Protologger is open source; all scripts and datasets are available, along with a webserver at www.protologger.de Nature Publishing Group UK 2021-05-20 /pmc/articles/PMC9723785/ /pubmed/36732617 http://dx.doi.org/10.1038/s43705-021-00017-z Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Hitch, Thomas C. A. Riedel, Thomas Oren, Aharon Overmann, Jörg Lawley, Trevor D. Clavel, Thomas Automated analysis of genomic sequences facilitates high-throughput and comprehensive description of bacteria |
title | Automated analysis of genomic sequences facilitates high-throughput and comprehensive description of bacteria |
title_full | Automated analysis of genomic sequences facilitates high-throughput and comprehensive description of bacteria |
title_fullStr | Automated analysis of genomic sequences facilitates high-throughput and comprehensive description of bacteria |
title_full_unstemmed | Automated analysis of genomic sequences facilitates high-throughput and comprehensive description of bacteria |
title_short | Automated analysis of genomic sequences facilitates high-throughput and comprehensive description of bacteria |
title_sort | automated analysis of genomic sequences facilitates high-throughput and comprehensive description of bacteria |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9723785/ https://www.ncbi.nlm.nih.gov/pubmed/36732617 http://dx.doi.org/10.1038/s43705-021-00017-z |
work_keys_str_mv | AT hitchthomasca automatedanalysisofgenomicsequencesfacilitateshighthroughputandcomprehensivedescriptionofbacteria AT riedelthomas automatedanalysisofgenomicsequencesfacilitateshighthroughputandcomprehensivedescriptionofbacteria AT orenaharon automatedanalysisofgenomicsequencesfacilitateshighthroughputandcomprehensivedescriptionofbacteria AT overmannjorg automatedanalysisofgenomicsequencesfacilitateshighthroughputandcomprehensivedescriptionofbacteria AT lawleytrevord automatedanalysisofgenomicsequencesfacilitateshighthroughputandcomprehensivedescriptionofbacteria AT clavelthomas automatedanalysisofgenomicsequencesfacilitateshighthroughputandcomprehensivedescriptionofbacteria |