Cargando…

Genomic diversity of Salmonella enterica -The UoWUCC 10K genomes project

Background: Most publicly available genomes of Salmonella enterica are from human disease in the US and the UK, or from domesticated animals in the US. Methods: Here we describe a historical collection of 10,000 strains isolated between 1891-2010 in 73 different countries. They encompass a broad ran...

Descripción completa

Detalles Bibliográficos
Autores principales: Achtman, Mark, Zhou, Zhemin, Alikhan, Nabil-Fareed, Tyne, William, Parkhill, Julian, Cormican, Martin, Chiou, Chien-Shun, Torpdahl, Mia, Litrup, Eva, Prendergast, Deirdre M., Moore, John E., Strain, Sam, Kornschober, Christian, Meinersmann, Richard, Uesbeck, Alexandra, Weill, François-Xavier, Coffey, Aidan, Andrews-Polymenis, Helene, Curtiss 3rd, Roy, Fanning, Séamus
Formato: Online Artículo Texto
Lenguaje:English
Publicado: F1000 Research Limited 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7869069/
https://www.ncbi.nlm.nih.gov/pubmed/33614977
http://dx.doi.org/10.12688/wellcomeopenres.16291.2
_version_ 1783648558432387072
author Achtman, Mark
Zhou, Zhemin
Alikhan, Nabil-Fareed
Tyne, William
Parkhill, Julian
Cormican, Martin
Chiou, Chien-Shun
Torpdahl, Mia
Litrup, Eva
Prendergast, Deirdre M.
Moore, John E.
Strain, Sam
Kornschober, Christian
Meinersmann, Richard
Uesbeck, Alexandra
Weill, François-Xavier
Coffey, Aidan
Andrews-Polymenis, Helene
Curtiss 3rd, Roy
Fanning, Séamus
author_facet Achtman, Mark
Zhou, Zhemin
Alikhan, Nabil-Fareed
Tyne, William
Parkhill, Julian
Cormican, Martin
Chiou, Chien-Shun
Torpdahl, Mia
Litrup, Eva
Prendergast, Deirdre M.
Moore, John E.
Strain, Sam
Kornschober, Christian
Meinersmann, Richard
Uesbeck, Alexandra
Weill, François-Xavier
Coffey, Aidan
Andrews-Polymenis, Helene
Curtiss 3rd, Roy
Fanning, Séamus
author_sort Achtman, Mark
collection PubMed
description Background: Most publicly available genomes of Salmonella enterica are from human disease in the US and the UK, or from domesticated animals in the US. Methods: Here we describe a historical collection of 10,000 strains isolated between 1891-2010 in 73 different countries. They encompass a broad range of sources, ranging from rivers through reptiles to the diversity of all S. enterica isolated on the island of Ireland between 2000 and 2005. Genomic DNA was isolated, and sequenced by Illumina short read sequencing. Results: The short reads are publicly available in the Short Reads Archive. They were also uploaded to EnteroBase, which assembled and annotated draft genomes. 9769 draft genomes which passed quality control were genotyped with multiple levels of multilocus sequence typing, and used to predict serovars. Genomes were assigned to hierarchical clusters on the basis of numbers of pair-wise allelic differences in core genes, which were mapped to genetic Lineages within phylogenetic trees. Conclusions: The University of Warwick/University College Cork (UoWUCC) project greatly extends the geographic sources, dates and core genomic diversity of publicly available S. enterica genomes. We illustrate these features by an overview of core genomic Lineages within 33,000 publicly available Salmonella genomes whose strains were isolated before 2011. We also present detailed examinations of HC400, HC900 and HC2000 hierarchical clusters within exemplar Lineages, including serovars Typhimurium, Enteritidis and Mbandaka. These analyses confirm the polyphyletic nature of multiple serovars while showing that discrete clusters with geographical specificity can be reliably recognized by hierarchical clustering approaches. The results also demonstrate that the genomes sequenced here provide an important counterbalance to the sampling bias which is so dominant in current genomic sequencing.
format Online
Article
Text
id pubmed-7869069
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher F1000 Research Limited
record_format MEDLINE/PubMed
spelling pubmed-78690692021-02-19 Genomic diversity of Salmonella enterica -The UoWUCC 10K genomes project Achtman, Mark Zhou, Zhemin Alikhan, Nabil-Fareed Tyne, William Parkhill, Julian Cormican, Martin Chiou, Chien-Shun Torpdahl, Mia Litrup, Eva Prendergast, Deirdre M. Moore, John E. Strain, Sam Kornschober, Christian Meinersmann, Richard Uesbeck, Alexandra Weill, François-Xavier Coffey, Aidan Andrews-Polymenis, Helene Curtiss 3rd, Roy Fanning, Séamus Wellcome Open Res Research Article Background: Most publicly available genomes of Salmonella enterica are from human disease in the US and the UK, or from domesticated animals in the US. Methods: Here we describe a historical collection of 10,000 strains isolated between 1891-2010 in 73 different countries. They encompass a broad range of sources, ranging from rivers through reptiles to the diversity of all S. enterica isolated on the island of Ireland between 2000 and 2005. Genomic DNA was isolated, and sequenced by Illumina short read sequencing. Results: The short reads are publicly available in the Short Reads Archive. They were also uploaded to EnteroBase, which assembled and annotated draft genomes. 9769 draft genomes which passed quality control were genotyped with multiple levels of multilocus sequence typing, and used to predict serovars. Genomes were assigned to hierarchical clusters on the basis of numbers of pair-wise allelic differences in core genes, which were mapped to genetic Lineages within phylogenetic trees. Conclusions: The University of Warwick/University College Cork (UoWUCC) project greatly extends the geographic sources, dates and core genomic diversity of publicly available S. enterica genomes. We illustrate these features by an overview of core genomic Lineages within 33,000 publicly available Salmonella genomes whose strains were isolated before 2011. We also present detailed examinations of HC400, HC900 and HC2000 hierarchical clusters within exemplar Lineages, including serovars Typhimurium, Enteritidis and Mbandaka. These analyses confirm the polyphyletic nature of multiple serovars while showing that discrete clusters with geographical specificity can be reliably recognized by hierarchical clustering approaches. The results also demonstrate that the genomes sequenced here provide an important counterbalance to the sampling bias which is so dominant in current genomic sequencing. F1000 Research Limited 2021-02-01 /pmc/articles/PMC7869069/ /pubmed/33614977 http://dx.doi.org/10.12688/wellcomeopenres.16291.2 Text en Copyright: © 2021 Achtman M et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Achtman, Mark
Zhou, Zhemin
Alikhan, Nabil-Fareed
Tyne, William
Parkhill, Julian
Cormican, Martin
Chiou, Chien-Shun
Torpdahl, Mia
Litrup, Eva
Prendergast, Deirdre M.
Moore, John E.
Strain, Sam
Kornschober, Christian
Meinersmann, Richard
Uesbeck, Alexandra
Weill, François-Xavier
Coffey, Aidan
Andrews-Polymenis, Helene
Curtiss 3rd, Roy
Fanning, Séamus
Genomic diversity of Salmonella enterica -The UoWUCC 10K genomes project
title Genomic diversity of Salmonella enterica -The UoWUCC 10K genomes project
title_full Genomic diversity of Salmonella enterica -The UoWUCC 10K genomes project
title_fullStr Genomic diversity of Salmonella enterica -The UoWUCC 10K genomes project
title_full_unstemmed Genomic diversity of Salmonella enterica -The UoWUCC 10K genomes project
title_short Genomic diversity of Salmonella enterica -The UoWUCC 10K genomes project
title_sort genomic diversity of salmonella enterica -the uowucc 10k genomes project
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7869069/
https://www.ncbi.nlm.nih.gov/pubmed/33614977
http://dx.doi.org/10.12688/wellcomeopenres.16291.2
work_keys_str_mv AT achtmanmark genomicdiversityofsalmonellaentericatheuowucc10kgenomesproject
AT zhouzhemin genomicdiversityofsalmonellaentericatheuowucc10kgenomesproject
AT alikhannabilfareed genomicdiversityofsalmonellaentericatheuowucc10kgenomesproject
AT tynewilliam genomicdiversityofsalmonellaentericatheuowucc10kgenomesproject
AT parkhilljulian genomicdiversityofsalmonellaentericatheuowucc10kgenomesproject
AT cormicanmartin genomicdiversityofsalmonellaentericatheuowucc10kgenomesproject
AT chiouchienshun genomicdiversityofsalmonellaentericatheuowucc10kgenomesproject
AT torpdahlmia genomicdiversityofsalmonellaentericatheuowucc10kgenomesproject
AT litrupeva genomicdiversityofsalmonellaentericatheuowucc10kgenomesproject
AT prendergastdeirdrem genomicdiversityofsalmonellaentericatheuowucc10kgenomesproject
AT moorejohne genomicdiversityofsalmonellaentericatheuowucc10kgenomesproject
AT strainsam genomicdiversityofsalmonellaentericatheuowucc10kgenomesproject
AT kornschoberchristian genomicdiversityofsalmonellaentericatheuowucc10kgenomesproject
AT meinersmannrichard genomicdiversityofsalmonellaentericatheuowucc10kgenomesproject
AT uesbeckalexandra genomicdiversityofsalmonellaentericatheuowucc10kgenomesproject
AT weillfrancoisxavier genomicdiversityofsalmonellaentericatheuowucc10kgenomesproject
AT coffeyaidan genomicdiversityofsalmonellaentericatheuowucc10kgenomesproject
AT andrewspolymenishelene genomicdiversityofsalmonellaentericatheuowucc10kgenomesproject
AT curtiss3rdroy genomicdiversityofsalmonellaentericatheuowucc10kgenomesproject
AT fanningseamus genomicdiversityofsalmonellaentericatheuowucc10kgenomesproject