Cargando…

A high-resolution genomic composition-based method with the ability to distinguish similar bacterial organisms

BACKGROUND: Genomic composition has been found to be species specific and is used to differentiate bacterial species. To date, almost no published composition-based approaches are able to distinguish between most closely related organisms, including intra-genus species and intra-species strains. Thu...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhou, Yizhuang, Zhang, Wenting, Wu, Huixian, Huang, Kai, Jin, Junfei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6805505/
https://www.ncbi.nlm.nih.gov/pubmed/31638897
http://dx.doi.org/10.1186/s12864-019-6119-x
_version_ 1783461401139871744
author Zhou, Yizhuang
Zhang, Wenting
Wu, Huixian
Huang, Kai
Jin, Junfei
author_facet Zhou, Yizhuang
Zhang, Wenting
Wu, Huixian
Huang, Kai
Jin, Junfei
author_sort Zhou, Yizhuang
collection PubMed
description BACKGROUND: Genomic composition has been found to be species specific and is used to differentiate bacterial species. To date, almost no published composition-based approaches are able to distinguish between most closely related organisms, including intra-genus species and intra-species strains. Thus, it is necessary to develop a novel approach to address this problem. RESULTS: Here, we initially determine that the “tetranucleotide-derived z-value Pearson correlation coefficient” (TETRA) approach is representative of other published statistical methods. Then, we devise a novel method called “Tetranucleotide-derived Z-value Manhattan Distance” (TZMD) and compare it with the TETRA approach. Our results show that TZMD reflects the maximal genome difference, while TETRA does not in most conditions, demonstrating in theory that TZMD provides improved resolution. Additionally, our analysis of real data shows that TZMD improves species differentiation and clearly differentiates similar organisms, including similar species belonging to the same genospecies, subspecies and intraspecific strains, most of which cannot be distinguished by TETRA. Furthermore, TZMD is able to determine clonal strains with the TZMD = 0 criterion, which intrinsically encompasses identical composition, high average nucleotide identity and high percentage of shared genomes. CONCLUSIONS: Our extensive assessment demonstrates that TZMD has high resolution. This study is the first to propose a composition-based method for differentiating bacteria at the strain level and to demonstrate that composition is also strain specific. TZMD is a powerful tool and the first easy-to-use approach for differentiating clonal and non-clonal strains. Therefore, as the first composition-based algorithm for strain typing, TZMD will facilitate bacterial studies in the future.
format Online
Article
Text
id pubmed-6805505
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-68055052019-10-24 A high-resolution genomic composition-based method with the ability to distinguish similar bacterial organisms Zhou, Yizhuang Zhang, Wenting Wu, Huixian Huang, Kai Jin, Junfei BMC Genomics Methodology Article BACKGROUND: Genomic composition has been found to be species specific and is used to differentiate bacterial species. To date, almost no published composition-based approaches are able to distinguish between most closely related organisms, including intra-genus species and intra-species strains. Thus, it is necessary to develop a novel approach to address this problem. RESULTS: Here, we initially determine that the “tetranucleotide-derived z-value Pearson correlation coefficient” (TETRA) approach is representative of other published statistical methods. Then, we devise a novel method called “Tetranucleotide-derived Z-value Manhattan Distance” (TZMD) and compare it with the TETRA approach. Our results show that TZMD reflects the maximal genome difference, while TETRA does not in most conditions, demonstrating in theory that TZMD provides improved resolution. Additionally, our analysis of real data shows that TZMD improves species differentiation and clearly differentiates similar organisms, including similar species belonging to the same genospecies, subspecies and intraspecific strains, most of which cannot be distinguished by TETRA. Furthermore, TZMD is able to determine clonal strains with the TZMD = 0 criterion, which intrinsically encompasses identical composition, high average nucleotide identity and high percentage of shared genomes. CONCLUSIONS: Our extensive assessment demonstrates that TZMD has high resolution. This study is the first to propose a composition-based method for differentiating bacteria at the strain level and to demonstrate that composition is also strain specific. TZMD is a powerful tool and the first easy-to-use approach for differentiating clonal and non-clonal strains. Therefore, as the first composition-based algorithm for strain typing, TZMD will facilitate bacterial studies in the future. BioMed Central 2019-10-21 /pmc/articles/PMC6805505/ /pubmed/31638897 http://dx.doi.org/10.1186/s12864-019-6119-x Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Zhou, Yizhuang
Zhang, Wenting
Wu, Huixian
Huang, Kai
Jin, Junfei
A high-resolution genomic composition-based method with the ability to distinguish similar bacterial organisms
title A high-resolution genomic composition-based method with the ability to distinguish similar bacterial organisms
title_full A high-resolution genomic composition-based method with the ability to distinguish similar bacterial organisms
title_fullStr A high-resolution genomic composition-based method with the ability to distinguish similar bacterial organisms
title_full_unstemmed A high-resolution genomic composition-based method with the ability to distinguish similar bacterial organisms
title_short A high-resolution genomic composition-based method with the ability to distinguish similar bacterial organisms
title_sort high-resolution genomic composition-based method with the ability to distinguish similar bacterial organisms
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6805505/
https://www.ncbi.nlm.nih.gov/pubmed/31638897
http://dx.doi.org/10.1186/s12864-019-6119-x
work_keys_str_mv AT zhouyizhuang ahighresolutiongenomiccompositionbasedmethodwiththeabilitytodistinguishsimilarbacterialorganisms
AT zhangwenting ahighresolutiongenomiccompositionbasedmethodwiththeabilitytodistinguishsimilarbacterialorganisms
AT wuhuixian ahighresolutiongenomiccompositionbasedmethodwiththeabilitytodistinguishsimilarbacterialorganisms
AT huangkai ahighresolutiongenomiccompositionbasedmethodwiththeabilitytodistinguishsimilarbacterialorganisms
AT jinjunfei ahighresolutiongenomiccompositionbasedmethodwiththeabilitytodistinguishsimilarbacterialorganisms
AT zhouyizhuang highresolutiongenomiccompositionbasedmethodwiththeabilitytodistinguishsimilarbacterialorganisms
AT zhangwenting highresolutiongenomiccompositionbasedmethodwiththeabilitytodistinguishsimilarbacterialorganisms
AT wuhuixian highresolutiongenomiccompositionbasedmethodwiththeabilitytodistinguishsimilarbacterialorganisms
AT huangkai highresolutiongenomiccompositionbasedmethodwiththeabilitytodistinguishsimilarbacterialorganisms
AT jinjunfei highresolutiongenomiccompositionbasedmethodwiththeabilitytodistinguishsimilarbacterialorganisms