Cargando…
A high-resolution genomic composition-based method with the ability to distinguish similar bacterial organisms
BACKGROUND: Genomic composition has been found to be species specific and is used to differentiate bacterial species. To date, almost no published composition-based approaches are able to distinguish between most closely related organisms, including intra-genus species and intra-species strains. Thu...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6805505/ https://www.ncbi.nlm.nih.gov/pubmed/31638897 http://dx.doi.org/10.1186/s12864-019-6119-x |
_version_ | 1783461401139871744 |
---|---|
author | Zhou, Yizhuang Zhang, Wenting Wu, Huixian Huang, Kai Jin, Junfei |
author_facet | Zhou, Yizhuang Zhang, Wenting Wu, Huixian Huang, Kai Jin, Junfei |
author_sort | Zhou, Yizhuang |
collection | PubMed |
description | BACKGROUND: Genomic composition has been found to be species specific and is used to differentiate bacterial species. To date, almost no published composition-based approaches are able to distinguish between most closely related organisms, including intra-genus species and intra-species strains. Thus, it is necessary to develop a novel approach to address this problem. RESULTS: Here, we initially determine that the “tetranucleotide-derived z-value Pearson correlation coefficient” (TETRA) approach is representative of other published statistical methods. Then, we devise a novel method called “Tetranucleotide-derived Z-value Manhattan Distance” (TZMD) and compare it with the TETRA approach. Our results show that TZMD reflects the maximal genome difference, while TETRA does not in most conditions, demonstrating in theory that TZMD provides improved resolution. Additionally, our analysis of real data shows that TZMD improves species differentiation and clearly differentiates similar organisms, including similar species belonging to the same genospecies, subspecies and intraspecific strains, most of which cannot be distinguished by TETRA. Furthermore, TZMD is able to determine clonal strains with the TZMD = 0 criterion, which intrinsically encompasses identical composition, high average nucleotide identity and high percentage of shared genomes. CONCLUSIONS: Our extensive assessment demonstrates that TZMD has high resolution. This study is the first to propose a composition-based method for differentiating bacteria at the strain level and to demonstrate that composition is also strain specific. TZMD is a powerful tool and the first easy-to-use approach for differentiating clonal and non-clonal strains. Therefore, as the first composition-based algorithm for strain typing, TZMD will facilitate bacterial studies in the future. |
format | Online Article Text |
id | pubmed-6805505 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-68055052019-10-24 A high-resolution genomic composition-based method with the ability to distinguish similar bacterial organisms Zhou, Yizhuang Zhang, Wenting Wu, Huixian Huang, Kai Jin, Junfei BMC Genomics Methodology Article BACKGROUND: Genomic composition has been found to be species specific and is used to differentiate bacterial species. To date, almost no published composition-based approaches are able to distinguish between most closely related organisms, including intra-genus species and intra-species strains. Thus, it is necessary to develop a novel approach to address this problem. RESULTS: Here, we initially determine that the “tetranucleotide-derived z-value Pearson correlation coefficient” (TETRA) approach is representative of other published statistical methods. Then, we devise a novel method called “Tetranucleotide-derived Z-value Manhattan Distance” (TZMD) and compare it with the TETRA approach. Our results show that TZMD reflects the maximal genome difference, while TETRA does not in most conditions, demonstrating in theory that TZMD provides improved resolution. Additionally, our analysis of real data shows that TZMD improves species differentiation and clearly differentiates similar organisms, including similar species belonging to the same genospecies, subspecies and intraspecific strains, most of which cannot be distinguished by TETRA. Furthermore, TZMD is able to determine clonal strains with the TZMD = 0 criterion, which intrinsically encompasses identical composition, high average nucleotide identity and high percentage of shared genomes. CONCLUSIONS: Our extensive assessment demonstrates that TZMD has high resolution. This study is the first to propose a composition-based method for differentiating bacteria at the strain level and to demonstrate that composition is also strain specific. TZMD is a powerful tool and the first easy-to-use approach for differentiating clonal and non-clonal strains. Therefore, as the first composition-based algorithm for strain typing, TZMD will facilitate bacterial studies in the future. BioMed Central 2019-10-21 /pmc/articles/PMC6805505/ /pubmed/31638897 http://dx.doi.org/10.1186/s12864-019-6119-x Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Zhou, Yizhuang Zhang, Wenting Wu, Huixian Huang, Kai Jin, Junfei A high-resolution genomic composition-based method with the ability to distinguish similar bacterial organisms |
title | A high-resolution genomic composition-based method with the ability to distinguish similar bacterial organisms |
title_full | A high-resolution genomic composition-based method with the ability to distinguish similar bacterial organisms |
title_fullStr | A high-resolution genomic composition-based method with the ability to distinguish similar bacterial organisms |
title_full_unstemmed | A high-resolution genomic composition-based method with the ability to distinguish similar bacterial organisms |
title_short | A high-resolution genomic composition-based method with the ability to distinguish similar bacterial organisms |
title_sort | high-resolution genomic composition-based method with the ability to distinguish similar bacterial organisms |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6805505/ https://www.ncbi.nlm.nih.gov/pubmed/31638897 http://dx.doi.org/10.1186/s12864-019-6119-x |
work_keys_str_mv | AT zhouyizhuang ahighresolutiongenomiccompositionbasedmethodwiththeabilitytodistinguishsimilarbacterialorganisms AT zhangwenting ahighresolutiongenomiccompositionbasedmethodwiththeabilitytodistinguishsimilarbacterialorganisms AT wuhuixian ahighresolutiongenomiccompositionbasedmethodwiththeabilitytodistinguishsimilarbacterialorganisms AT huangkai ahighresolutiongenomiccompositionbasedmethodwiththeabilitytodistinguishsimilarbacterialorganisms AT jinjunfei ahighresolutiongenomiccompositionbasedmethodwiththeabilitytodistinguishsimilarbacterialorganisms AT zhouyizhuang highresolutiongenomiccompositionbasedmethodwiththeabilitytodistinguishsimilarbacterialorganisms AT zhangwenting highresolutiongenomiccompositionbasedmethodwiththeabilitytodistinguishsimilarbacterialorganisms AT wuhuixian highresolutiongenomiccompositionbasedmethodwiththeabilitytodistinguishsimilarbacterialorganisms AT huangkai highresolutiongenomiccompositionbasedmethodwiththeabilitytodistinguishsimilarbacterialorganisms AT jinjunfei highresolutiongenomiccompositionbasedmethodwiththeabilitytodistinguishsimilarbacterialorganisms |