Cargando…
Rapid Bacterial Species Delineation Based on Parameters Derived From Genome Numerical Representations
Species delineation based on bacterial genomes is an essential part of the research of prokaryotes. In silico genome-to-genome comparison methods are computationally demanding, but much less tedious and error prone than the wet-lab methods. In this paper, we present a novel method for the delineatio...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Research Network of Computational and Structural Biotechnology
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6352304/ https://www.ncbi.nlm.nih.gov/pubmed/30728919 http://dx.doi.org/10.1016/j.csbj.2018.12.006 |
_version_ | 1783390805448196096 |
---|---|
author | Maderankova, Denisa Jugas, Robin Sedlar, Karel Vitek, Martin Skutkova, Helena |
author_facet | Maderankova, Denisa Jugas, Robin Sedlar, Karel Vitek, Martin Skutkova, Helena |
author_sort | Maderankova, Denisa |
collection | PubMed |
description | Species delineation based on bacterial genomes is an essential part of the research of prokaryotes. In silico genome-to-genome comparison methods are computationally demanding, but much less tedious and error prone than the wet-lab methods. In this paper, we present a novel method for the delineation of bacterial genomes based on genomic signal processing. The proposed method uses numerical representations of whole bacterial genomes, phase signal and cumulated phase signal, from which four parameters are derived for each genome. The parameters characterize a genome and their calculation is independent of the other genomes comprising a delineation dataset. The delineation itself is processed as a calculation of the parameters' average similarity. The method was statistically verified on 1826 bacterial genomes. A similarity threshold of 96% was set based on the receiver operating characteristic curve that featured sensitivity of 99.78% and specificity of 97.25%. Additionally, comparative analysis on another 33 bacterial genomes was conducted using standard delineation tools as these tools were not able to process the dataset of 1826 genomes using desktop computer. The proposed method achieved comparable or better delineation results in comparison with the standard tools. Besides the excellent delineation results, another great advantage of the method is its small computational demands, which enables the delineation of thousands of genomes on a desktop computer. The calculation of the parameters takes tens of minutes for thousands of genomes. Moreover, they can be calculated in advance by creating a database, meaning the delineation itself is then completed in a matter of seconds. |
format | Online Article Text |
id | pubmed-6352304 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Research Network of Computational and Structural Biotechnology |
record_format | MEDLINE/PubMed |
spelling | pubmed-63523042019-02-06 Rapid Bacterial Species Delineation Based on Parameters Derived From Genome Numerical Representations Maderankova, Denisa Jugas, Robin Sedlar, Karel Vitek, Martin Skutkova, Helena Comput Struct Biotechnol J Short Survey Species delineation based on bacterial genomes is an essential part of the research of prokaryotes. In silico genome-to-genome comparison methods are computationally demanding, but much less tedious and error prone than the wet-lab methods. In this paper, we present a novel method for the delineation of bacterial genomes based on genomic signal processing. The proposed method uses numerical representations of whole bacterial genomes, phase signal and cumulated phase signal, from which four parameters are derived for each genome. The parameters characterize a genome and their calculation is independent of the other genomes comprising a delineation dataset. The delineation itself is processed as a calculation of the parameters' average similarity. The method was statistically verified on 1826 bacterial genomes. A similarity threshold of 96% was set based on the receiver operating characteristic curve that featured sensitivity of 99.78% and specificity of 97.25%. Additionally, comparative analysis on another 33 bacterial genomes was conducted using standard delineation tools as these tools were not able to process the dataset of 1826 genomes using desktop computer. The proposed method achieved comparable or better delineation results in comparison with the standard tools. Besides the excellent delineation results, another great advantage of the method is its small computational demands, which enables the delineation of thousands of genomes on a desktop computer. The calculation of the parameters takes tens of minutes for thousands of genomes. Moreover, they can be calculated in advance by creating a database, meaning the delineation itself is then completed in a matter of seconds. Research Network of Computational and Structural Biotechnology 2019-01-09 /pmc/articles/PMC6352304/ /pubmed/30728919 http://dx.doi.org/10.1016/j.csbj.2018.12.006 Text en © 2019 The Authors http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Short Survey Maderankova, Denisa Jugas, Robin Sedlar, Karel Vitek, Martin Skutkova, Helena Rapid Bacterial Species Delineation Based on Parameters Derived From Genome Numerical Representations |
title | Rapid Bacterial Species Delineation Based on Parameters Derived From Genome Numerical Representations |
title_full | Rapid Bacterial Species Delineation Based on Parameters Derived From Genome Numerical Representations |
title_fullStr | Rapid Bacterial Species Delineation Based on Parameters Derived From Genome Numerical Representations |
title_full_unstemmed | Rapid Bacterial Species Delineation Based on Parameters Derived From Genome Numerical Representations |
title_short | Rapid Bacterial Species Delineation Based on Parameters Derived From Genome Numerical Representations |
title_sort | rapid bacterial species delineation based on parameters derived from genome numerical representations |
topic | Short Survey |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6352304/ https://www.ncbi.nlm.nih.gov/pubmed/30728919 http://dx.doi.org/10.1016/j.csbj.2018.12.006 |
work_keys_str_mv | AT maderankovadenisa rapidbacterialspeciesdelineationbasedonparametersderivedfromgenomenumericalrepresentations AT jugasrobin rapidbacterialspeciesdelineationbasedonparametersderivedfromgenomenumericalrepresentations AT sedlarkarel rapidbacterialspeciesdelineationbasedonparametersderivedfromgenomenumericalrepresentations AT vitekmartin rapidbacterialspeciesdelineationbasedonparametersderivedfromgenomenumericalrepresentations AT skutkovahelena rapidbacterialspeciesdelineationbasedonparametersderivedfromgenomenumericalrepresentations |