Cargando…

In silico serotyping of E. coli from short read data identifies limited novel O-loci but extensive diversity of O:H serotype combinations within and between pathogenic lineages

The lipopolysaccharide (O) and flagellar (H) surface antigens of Escherichia coli are targets for serotyping that have traditionally been used to identify pathogenic lineages. These surface antigens are important for the survival of E. coli within mammalian hosts. However, traditional serotyping has...

Descripción completa

Detalles Bibliográficos
Autores principales: Ingle, Danielle J., Valcanis, Mary, Kuzevski, Alex, Tauschek, Marija, Inouye, Michael, Stinear, Tim, Levine, Myron M., Robins-Browne, Roy M., Holt, Kathryn E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Microbiology Society 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5343136/
https://www.ncbi.nlm.nih.gov/pubmed/28348859
http://dx.doi.org/10.1099/mgen.0.000064
_version_ 1782513313843773440
author Ingle, Danielle J.
Valcanis, Mary
Kuzevski, Alex
Tauschek, Marija
Inouye, Michael
Stinear, Tim
Levine, Myron M.
Robins-Browne, Roy M.
Holt, Kathryn E.
author_facet Ingle, Danielle J.
Valcanis, Mary
Kuzevski, Alex
Tauschek, Marija
Inouye, Michael
Stinear, Tim
Levine, Myron M.
Robins-Browne, Roy M.
Holt, Kathryn E.
author_sort Ingle, Danielle J.
collection PubMed
description The lipopolysaccharide (O) and flagellar (H) surface antigens of Escherichia coli are targets for serotyping that have traditionally been used to identify pathogenic lineages. These surface antigens are important for the survival of E. coli within mammalian hosts. However, traditional serotyping has several limitations, and public health reference laboratories are increasingly moving towards whole genome sequencing (WGS) to characterize bacterial isolates. Here we present a method to rapidly and accurately serotype E. coli isolates from raw, short read WGS data. Our approach bypasses the need for de novo genome assembly by directly screening WGS reads against a curated database of alleles linked to known and novel E. coli O-groups and H-types (the EcOH database) using the software package srst2. We validated the approach by comparing in silico results for 197 enteropathogenic E. coli isolates with those obtained by serological phenotyping in an independent laboratory. We then demonstrated the utility of our method to characterize isolates in public health and clinical settings, and to explore the genetic diversity of >1500 E. coli genomes from multiple sources. Importantly, we showed that transfer of O- and H-antigen loci between E. coli chromosomal backbones is common, with little evidence of constraints by host or pathotype, suggesting that E. coli ‘strain space’ may be virtually unlimited, even within specific pathotypes. Our findings show that serotyping is most useful when used in combination with strain genotyping to characterize microevolution events within an inferred population structure.
format Online
Article
Text
id pubmed-5343136
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Microbiology Society
record_format MEDLINE/PubMed
spelling pubmed-53431362017-03-27 In silico serotyping of E. coli from short read data identifies limited novel O-loci but extensive diversity of O:H serotype combinations within and between pathogenic lineages Ingle, Danielle J. Valcanis, Mary Kuzevski, Alex Tauschek, Marija Inouye, Michael Stinear, Tim Levine, Myron M. Robins-Browne, Roy M. Holt, Kathryn E. Microb Genom Research Paper The lipopolysaccharide (O) and flagellar (H) surface antigens of Escherichia coli are targets for serotyping that have traditionally been used to identify pathogenic lineages. These surface antigens are important for the survival of E. coli within mammalian hosts. However, traditional serotyping has several limitations, and public health reference laboratories are increasingly moving towards whole genome sequencing (WGS) to characterize bacterial isolates. Here we present a method to rapidly and accurately serotype E. coli isolates from raw, short read WGS data. Our approach bypasses the need for de novo genome assembly by directly screening WGS reads against a curated database of alleles linked to known and novel E. coli O-groups and H-types (the EcOH database) using the software package srst2. We validated the approach by comparing in silico results for 197 enteropathogenic E. coli isolates with those obtained by serological phenotyping in an independent laboratory. We then demonstrated the utility of our method to characterize isolates in public health and clinical settings, and to explore the genetic diversity of >1500 E. coli genomes from multiple sources. Importantly, we showed that transfer of O- and H-antigen loci between E. coli chromosomal backbones is common, with little evidence of constraints by host or pathotype, suggesting that E. coli ‘strain space’ may be virtually unlimited, even within specific pathotypes. Our findings show that serotyping is most useful when used in combination with strain genotyping to characterize microevolution events within an inferred population structure. Microbiology Society 2016-07-11 /pmc/articles/PMC5343136/ /pubmed/28348859 http://dx.doi.org/10.1099/mgen.0.000064 Text en © 2016 The Authors http://creativecommons.org/licenses/by/4.0/ This is an open access article under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Paper
Ingle, Danielle J.
Valcanis, Mary
Kuzevski, Alex
Tauschek, Marija
Inouye, Michael
Stinear, Tim
Levine, Myron M.
Robins-Browne, Roy M.
Holt, Kathryn E.
In silico serotyping of E. coli from short read data identifies limited novel O-loci but extensive diversity of O:H serotype combinations within and between pathogenic lineages
title In silico serotyping of E. coli from short read data identifies limited novel O-loci but extensive diversity of O:H serotype combinations within and between pathogenic lineages
title_full In silico serotyping of E. coli from short read data identifies limited novel O-loci but extensive diversity of O:H serotype combinations within and between pathogenic lineages
title_fullStr In silico serotyping of E. coli from short read data identifies limited novel O-loci but extensive diversity of O:H serotype combinations within and between pathogenic lineages
title_full_unstemmed In silico serotyping of E. coli from short read data identifies limited novel O-loci but extensive diversity of O:H serotype combinations within and between pathogenic lineages
title_short In silico serotyping of E. coli from short read data identifies limited novel O-loci but extensive diversity of O:H serotype combinations within and between pathogenic lineages
title_sort in silico serotyping of e. coli from short read data identifies limited novel o-loci but extensive diversity of o:h serotype combinations within and between pathogenic lineages
topic Research Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5343136/
https://www.ncbi.nlm.nih.gov/pubmed/28348859
http://dx.doi.org/10.1099/mgen.0.000064
work_keys_str_mv AT ingledaniellej insilicoserotypingofecolifromshortreaddataidentifieslimitednovelolocibutextensivediversityofohserotypecombinationswithinandbetweenpathogeniclineages
AT valcanismary insilicoserotypingofecolifromshortreaddataidentifieslimitednovelolocibutextensivediversityofohserotypecombinationswithinandbetweenpathogeniclineages
AT kuzevskialex insilicoserotypingofecolifromshortreaddataidentifieslimitednovelolocibutextensivediversityofohserotypecombinationswithinandbetweenpathogeniclineages
AT tauschekmarija insilicoserotypingofecolifromshortreaddataidentifieslimitednovelolocibutextensivediversityofohserotypecombinationswithinandbetweenpathogeniclineages
AT inouyemichael insilicoserotypingofecolifromshortreaddataidentifieslimitednovelolocibutextensivediversityofohserotypecombinationswithinandbetweenpathogeniclineages
AT stineartim insilicoserotypingofecolifromshortreaddataidentifieslimitednovelolocibutextensivediversityofohserotypecombinationswithinandbetweenpathogeniclineages
AT levinemyronm insilicoserotypingofecolifromshortreaddataidentifieslimitednovelolocibutextensivediversityofohserotypecombinationswithinandbetweenpathogeniclineages
AT robinsbrowneroym insilicoserotypingofecolifromshortreaddataidentifieslimitednovelolocibutextensivediversityofohserotypecombinationswithinandbetweenpathogeniclineages
AT holtkathryne insilicoserotypingofecolifromshortreaddataidentifieslimitednovelolocibutextensivediversityofohserotypecombinationswithinandbetweenpathogeniclineages