Cargando…

Genome-Based Comparison of Clostridioides difficile: Average Amino Acid Identity Analysis of Core Genomes

Infections due to Clostridioides difficile (previously known as Clostridium difficile) are a major problem in hospitals, where cases can be caused by community-acquired strains as well as by nosocomial spread. Whole genome sequences from clinical samples contain a lot of information but that needs t...

Descripción completa

Detalles Bibliográficos
Autores principales:	Cabal, Adriana, Jun, Se-Ran, Jenjaroenpun, Piroon, Wanchai, Visanu, Nookaew, Intawat, Wongsurawat, Thidathip, Burgess, Mary J., Kothari, Atul, Wassenaar, Trudy M., Ussery, David W.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Springer US 2018
Materias:	Genes and Genomes
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6132499/ https://www.ncbi.nlm.nih.gov/pubmed/29445826 http://dx.doi.org/10.1007/s00248-018-1155-7

_version_	1783354335821824000
author	Cabal, Adriana Jun, Se-Ran Jenjaroenpun, Piroon Wanchai, Visanu Nookaew, Intawat Wongsurawat, Thidathip Burgess, Mary J. Kothari, Atul Wassenaar, Trudy M. Ussery, David W.
author_facet	Cabal, Adriana Jun, Se-Ran Jenjaroenpun, Piroon Wanchai, Visanu Nookaew, Intawat Wongsurawat, Thidathip Burgess, Mary J. Kothari, Atul Wassenaar, Trudy M. Ussery, David W.
author_sort	Cabal, Adriana
collection	PubMed
description	Infections due to Clostridioides difficile (previously known as Clostridium difficile) are a major problem in hospitals, where cases can be caused by community-acquired strains as well as by nosocomial spread. Whole genome sequences from clinical samples contain a lot of information but that needs to be analyzed and compared in such a way that the outcome is useful for clinicians or epidemiologists. Here, we compare 663 public available complete genome sequences of C. difficile using average amino acid identity (AAI) scores. This analysis revealed that most of these genomes (640, 96.5%) clearly belong to the same species, while the remaining 23 genomes produce four distinct clusters within the Clostridioides genus. The main C. difficile cluster can be further divided into sub-clusters, depending on the chosen cutoff. We demonstrate that MLST, either based on partial or full gene-length, results in biased estimates of genetic differences and does not capture the true degree of similarity or differences of complete genomes. Presence of genes coding for C. difficile toxins A and B (ToxA/B), as well as the binary C. difficile toxin (CDT), was deduced from their unique PfamA domain architectures. Out of the 663 C. difficile genomes, 535 (80.7%) contained at least one copy of ToxA or ToxB, while these genes were missing from 128 genomes. Although some clusters were enriched for toxin presence, these genes are variably present in a given genetic background. The CDT genes were found in 191 genomes, which were restricted to a few clusters only, and only one cluster lacked the toxin A/B genes consistently. A total of 310 genomes contained ToxA/B without CDT (47%). Further, published metagenomic data from stools were used to assess the presence of C. difficile sequences in blinded cases of C. difficile infection (CDI) and controls, to test if metagenomic analysis is sensitive enough to detect the pathogen, and to establish strain relationships between cases from the same hospital. We conclude that metagenomics can contribute to the identification of CDI and can assist in characterization of the most probable causative strain in CDI patients. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1007/s00248-018-1155-7) contains supplementary material, which is available to authorized users.
format	Online Article Text
id	pubmed-6132499
institution	National Center for Biotechnology Information
language	English
publishDate	2018
publisher	Springer US
record_format	MEDLINE/PubMed
spelling	pubmed-61324992018-09-14 Genome-Based Comparison of Clostridioides difficile: Average Amino Acid Identity Analysis of Core Genomes Cabal, Adriana Jun, Se-Ran Jenjaroenpun, Piroon Wanchai, Visanu Nookaew, Intawat Wongsurawat, Thidathip Burgess, Mary J. Kothari, Atul Wassenaar, Trudy M. Ussery, David W. Microb Ecol Genes and Genomes Infections due to Clostridioides difficile (previously known as Clostridium difficile) are a major problem in hospitals, where cases can be caused by community-acquired strains as well as by nosocomial spread. Whole genome sequences from clinical samples contain a lot of information but that needs to be analyzed and compared in such a way that the outcome is useful for clinicians or epidemiologists. Here, we compare 663 public available complete genome sequences of C. difficile using average amino acid identity (AAI) scores. This analysis revealed that most of these genomes (640, 96.5%) clearly belong to the same species, while the remaining 23 genomes produce four distinct clusters within the Clostridioides genus. The main C. difficile cluster can be further divided into sub-clusters, depending on the chosen cutoff. We demonstrate that MLST, either based on partial or full gene-length, results in biased estimates of genetic differences and does not capture the true degree of similarity or differences of complete genomes. Presence of genes coding for C. difficile toxins A and B (ToxA/B), as well as the binary C. difficile toxin (CDT), was deduced from their unique PfamA domain architectures. Out of the 663 C. difficile genomes, 535 (80.7%) contained at least one copy of ToxA or ToxB, while these genes were missing from 128 genomes. Although some clusters were enriched for toxin presence, these genes are variably present in a given genetic background. The CDT genes were found in 191 genomes, which were restricted to a few clusters only, and only one cluster lacked the toxin A/B genes consistently. A total of 310 genomes contained ToxA/B without CDT (47%). Further, published metagenomic data from stools were used to assess the presence of C. difficile sequences in blinded cases of C. difficile infection (CDI) and controls, to test if metagenomic analysis is sensitive enough to detect the pathogen, and to establish strain relationships between cases from the same hospital. We conclude that metagenomics can contribute to the identification of CDI and can assist in characterization of the most probable causative strain in CDI patients. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1007/s00248-018-1155-7) contains supplementary material, which is available to authorized users. Springer US 2018-02-14 2018 /pmc/articles/PMC6132499/ /pubmed/29445826 http://dx.doi.org/10.1007/s00248-018-1155-7 Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
spellingShingle	Genes and Genomes Cabal, Adriana Jun, Se-Ran Jenjaroenpun, Piroon Wanchai, Visanu Nookaew, Intawat Wongsurawat, Thidathip Burgess, Mary J. Kothari, Atul Wassenaar, Trudy M. Ussery, David W. Genome-Based Comparison of Clostridioides difficile: Average Amino Acid Identity Analysis of Core Genomes
title	Genome-Based Comparison of Clostridioides difficile: Average Amino Acid Identity Analysis of Core Genomes
title_full	Genome-Based Comparison of Clostridioides difficile: Average Amino Acid Identity Analysis of Core Genomes
title_fullStr	Genome-Based Comparison of Clostridioides difficile: Average Amino Acid Identity Analysis of Core Genomes
title_full_unstemmed	Genome-Based Comparison of Clostridioides difficile: Average Amino Acid Identity Analysis of Core Genomes
title_short	Genome-Based Comparison of Clostridioides difficile: Average Amino Acid Identity Analysis of Core Genomes
title_sort	genome-based comparison of clostridioides difficile: average amino acid identity analysis of core genomes
topic	Genes and Genomes
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6132499/ https://www.ncbi.nlm.nih.gov/pubmed/29445826 http://dx.doi.org/10.1007/s00248-018-1155-7
work_keys_str_mv	AT cabaladriana genomebasedcomparisonofclostridioidesdifficileaverageaminoacididentityanalysisofcoregenomes AT junseran genomebasedcomparisonofclostridioidesdifficileaverageaminoacididentityanalysisofcoregenomes AT jenjaroenpunpiroon genomebasedcomparisonofclostridioidesdifficileaverageaminoacididentityanalysisofcoregenomes AT wanchaivisanu genomebasedcomparisonofclostridioidesdifficileaverageaminoacididentityanalysisofcoregenomes AT nookaewintawat genomebasedcomparisonofclostridioidesdifficileaverageaminoacididentityanalysisofcoregenomes AT wongsurawatthidathip genomebasedcomparisonofclostridioidesdifficileaverageaminoacididentityanalysisofcoregenomes AT burgessmaryj genomebasedcomparisonofclostridioidesdifficileaverageaminoacididentityanalysisofcoregenomes AT kothariatul genomebasedcomparisonofclostridioidesdifficileaverageaminoacididentityanalysisofcoregenomes AT wassenaartrudym genomebasedcomparisonofclostridioidesdifficileaverageaminoacididentityanalysisofcoregenomes AT usserydavidw genomebasedcomparisonofclostridioidesdifficileaverageaminoacididentityanalysisofcoregenomes

Genome-Based Comparison of Clostridioides difficile: Average Amino Acid Identity Analysis of Core Genomes

Ejemplares similares