Cargando…

FDA Escherichia coli Identification (FDA-ECID) Microarray: a Pangenome Molecular Toolbox for Serotyping, Virulence Profiling, Molecular Epidemiology, and Phylogeny

Most Escherichia coli strains are nonpathogenic. However, for clinical diagnosis and food safety analysis, current identification methods for pathogenic E. coli either are time-consuming and/or provide limited information. Here, we utilized a custom DNA microarray with informative genetic features e...

Descripción completa

Detalles Bibliográficos
Autores principales: Patel, Isha R., Gangiredla, Jayanthi, Lacher, David W., Mammel, Mark K., Jackson, Scott A., Lampel, Keith A., Elkins, Christopher A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Society for Microbiology 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4959244/
https://www.ncbi.nlm.nih.gov/pubmed/27037122
http://dx.doi.org/10.1128/AEM.04077-15
_version_ 1782444374922100736
author Patel, Isha R.
Gangiredla, Jayanthi
Lacher, David W.
Mammel, Mark K.
Jackson, Scott A.
Lampel, Keith A.
Elkins, Christopher A.
author_facet Patel, Isha R.
Gangiredla, Jayanthi
Lacher, David W.
Mammel, Mark K.
Jackson, Scott A.
Lampel, Keith A.
Elkins, Christopher A.
author_sort Patel, Isha R.
collection PubMed
description Most Escherichia coli strains are nonpathogenic. However, for clinical diagnosis and food safety analysis, current identification methods for pathogenic E. coli either are time-consuming and/or provide limited information. Here, we utilized a custom DNA microarray with informative genetic features extracted from 368 sequence sets for rapid and high-throughput pathogen identification. The FDA Escherichia coli Identification (FDA-ECID) platform contains three sets of molecularly informative features that together stratify strain identification and relatedness. First, 53 known flagellin alleles, 103 alleles of wzx and wzy, and 5 alleles of wzm provide molecular serotyping utility. Second, 41,932 probe sets representing the pan-genome of E. coli provide strain-level gene content information. Third, approximately 125,000 single nucleotide polymorphisms (SNPs) of available whole-genome sequences (WGS) were distilled to 9,984 SNPs capable of recapitulating the E. coli phylogeny. We analyzed 103 diverse E. coli strains with available WGS data, including those associated with past foodborne illnesses, to determine robustness and accuracy. The array was able to accurately identify the molecular O and H serotypes, potentially correcting serological failures and providing better resolution for H-nontypeable/nonmotile phenotypes. In addition, molecular risk assessment was possible with key virulence marker identifications. Epidemiologically, each strain had a unique comparative genomic fingerprint that was extended to an additional 507 food and clinical isolates. Finally, a 99.7% phylogenetic concordance was established between microarray analysis and WGS using SNP-level data for advanced genome typing. Our study demonstrates FDA-ECID as a powerful tool for epidemiology and molecular risk assessment with the capacity to profile the global landscape and diversity of E. coli. IMPORTANCE This study describes a robust, state-of-the-art platform developed from available whole-genome sequences of E. coli and Shigella spp. by distilling useful signatures for epidemiology and molecular risk assessment into one assay. The FDA-ECID microarray contains features that enable comprehensive molecular serotyping and virulence profiling along with genome-scale genotyping and SNP analysis. Hence, it is a molecular toolbox that stratifies strain identification and pathogenic potential in the contexts of epidemiology and phylogeny. We applied this tool to strains from food, environmental, and clinical sources, resulting in significantly greater phylogenetic and strain-specific resolution than previously reported for available typing methods.
format Online
Article
Text
id pubmed-4959244
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher American Society for Microbiology
record_format MEDLINE/PubMed
spelling pubmed-49592442016-07-26 FDA Escherichia coli Identification (FDA-ECID) Microarray: a Pangenome Molecular Toolbox for Serotyping, Virulence Profiling, Molecular Epidemiology, and Phylogeny Patel, Isha R. Gangiredla, Jayanthi Lacher, David W. Mammel, Mark K. Jackson, Scott A. Lampel, Keith A. Elkins, Christopher A. Appl Environ Microbiol Methods Most Escherichia coli strains are nonpathogenic. However, for clinical diagnosis and food safety analysis, current identification methods for pathogenic E. coli either are time-consuming and/or provide limited information. Here, we utilized a custom DNA microarray with informative genetic features extracted from 368 sequence sets for rapid and high-throughput pathogen identification. The FDA Escherichia coli Identification (FDA-ECID) platform contains three sets of molecularly informative features that together stratify strain identification and relatedness. First, 53 known flagellin alleles, 103 alleles of wzx and wzy, and 5 alleles of wzm provide molecular serotyping utility. Second, 41,932 probe sets representing the pan-genome of E. coli provide strain-level gene content information. Third, approximately 125,000 single nucleotide polymorphisms (SNPs) of available whole-genome sequences (WGS) were distilled to 9,984 SNPs capable of recapitulating the E. coli phylogeny. We analyzed 103 diverse E. coli strains with available WGS data, including those associated with past foodborne illnesses, to determine robustness and accuracy. The array was able to accurately identify the molecular O and H serotypes, potentially correcting serological failures and providing better resolution for H-nontypeable/nonmotile phenotypes. In addition, molecular risk assessment was possible with key virulence marker identifications. Epidemiologically, each strain had a unique comparative genomic fingerprint that was extended to an additional 507 food and clinical isolates. Finally, a 99.7% phylogenetic concordance was established between microarray analysis and WGS using SNP-level data for advanced genome typing. Our study demonstrates FDA-ECID as a powerful tool for epidemiology and molecular risk assessment with the capacity to profile the global landscape and diversity of E. coli. IMPORTANCE This study describes a robust, state-of-the-art platform developed from available whole-genome sequences of E. coli and Shigella spp. by distilling useful signatures for epidemiology and molecular risk assessment into one assay. The FDA-ECID microarray contains features that enable comprehensive molecular serotyping and virulence profiling along with genome-scale genotyping and SNP analysis. Hence, it is a molecular toolbox that stratifies strain identification and pathogenic potential in the contexts of epidemiology and phylogeny. We applied this tool to strains from food, environmental, and clinical sources, resulting in significantly greater phylogenetic and strain-specific resolution than previously reported for available typing methods. American Society for Microbiology 2016-05-16 /pmc/articles/PMC4959244/ /pubmed/27037122 http://dx.doi.org/10.1128/AEM.04077-15 Text en Copyright © 2016 Patel et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International license (http://creativecommons.org/licenses/by/4.0/) .
spellingShingle Methods
Patel, Isha R.
Gangiredla, Jayanthi
Lacher, David W.
Mammel, Mark K.
Jackson, Scott A.
Lampel, Keith A.
Elkins, Christopher A.
FDA Escherichia coli Identification (FDA-ECID) Microarray: a Pangenome Molecular Toolbox for Serotyping, Virulence Profiling, Molecular Epidemiology, and Phylogeny
title FDA Escherichia coli Identification (FDA-ECID) Microarray: a Pangenome Molecular Toolbox for Serotyping, Virulence Profiling, Molecular Epidemiology, and Phylogeny
title_full FDA Escherichia coli Identification (FDA-ECID) Microarray: a Pangenome Molecular Toolbox for Serotyping, Virulence Profiling, Molecular Epidemiology, and Phylogeny
title_fullStr FDA Escherichia coli Identification (FDA-ECID) Microarray: a Pangenome Molecular Toolbox for Serotyping, Virulence Profiling, Molecular Epidemiology, and Phylogeny
title_full_unstemmed FDA Escherichia coli Identification (FDA-ECID) Microarray: a Pangenome Molecular Toolbox for Serotyping, Virulence Profiling, Molecular Epidemiology, and Phylogeny
title_short FDA Escherichia coli Identification (FDA-ECID) Microarray: a Pangenome Molecular Toolbox for Serotyping, Virulence Profiling, Molecular Epidemiology, and Phylogeny
title_sort fda escherichia coli identification (fda-ecid) microarray: a pangenome molecular toolbox for serotyping, virulence profiling, molecular epidemiology, and phylogeny
topic Methods
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4959244/
https://www.ncbi.nlm.nih.gov/pubmed/27037122
http://dx.doi.org/10.1128/AEM.04077-15
work_keys_str_mv AT patelishar fdaescherichiacoliidentificationfdaecidmicroarrayapangenomemoleculartoolboxforserotypingvirulenceprofilingmolecularepidemiologyandphylogeny
AT gangiredlajayanthi fdaescherichiacoliidentificationfdaecidmicroarrayapangenomemoleculartoolboxforserotypingvirulenceprofilingmolecularepidemiologyandphylogeny
AT lacherdavidw fdaescherichiacoliidentificationfdaecidmicroarrayapangenomemoleculartoolboxforserotypingvirulenceprofilingmolecularepidemiologyandphylogeny
AT mammelmarkk fdaescherichiacoliidentificationfdaecidmicroarrayapangenomemoleculartoolboxforserotypingvirulenceprofilingmolecularepidemiologyandphylogeny
AT jacksonscotta fdaescherichiacoliidentificationfdaecidmicroarrayapangenomemoleculartoolboxforserotypingvirulenceprofilingmolecularepidemiologyandphylogeny
AT lampelkeitha fdaescherichiacoliidentificationfdaecidmicroarrayapangenomemoleculartoolboxforserotypingvirulenceprofilingmolecularepidemiologyandphylogeny
AT elkinschristophera fdaescherichiacoliidentificationfdaecidmicroarrayapangenomemoleculartoolboxforserotypingvirulenceprofilingmolecularepidemiologyandphylogeny