Cargando…

MOST: a modified MLST typing tool based on short read sequencing

Multilocus sequence typing (MLST) is an effective method to describe bacterial populations. Conventionally, MLST involves Polymerase Chain Reaction (PCR) amplification of housekeeping genes followed by Sanger DNA sequencing. Public Health England (PHE) is in the process of replacing the conventional...

Descripción completa

Detalles Bibliográficos
Autores principales: Tewolde, Rediat, Dallman, Timothy, Schaefer, Ulf, Sheppard, Carmen L., Ashton, Philip, Pichon, Bruno, Ellington, Matthew, Swift, Craig, Green, Jonathan, Underwood, Anthony
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4991843/
https://www.ncbi.nlm.nih.gov/pubmed/27602279
http://dx.doi.org/10.7717/peerj.2308
_version_ 1782448915706019840
author Tewolde, Rediat
Dallman, Timothy
Schaefer, Ulf
Sheppard, Carmen L.
Ashton, Philip
Pichon, Bruno
Ellington, Matthew
Swift, Craig
Green, Jonathan
Underwood, Anthony
author_facet Tewolde, Rediat
Dallman, Timothy
Schaefer, Ulf
Sheppard, Carmen L.
Ashton, Philip
Pichon, Bruno
Ellington, Matthew
Swift, Craig
Green, Jonathan
Underwood, Anthony
author_sort Tewolde, Rediat
collection PubMed
description Multilocus sequence typing (MLST) is an effective method to describe bacterial populations. Conventionally, MLST involves Polymerase Chain Reaction (PCR) amplification of housekeeping genes followed by Sanger DNA sequencing. Public Health England (PHE) is in the process of replacing the conventional MLST methodology with a method based on short read sequence data derived from Whole Genome Sequencing (WGS). This paper reports the comparison of the reliability of MLST results derived from WGS data, comparing mapping and assembly-based approaches to conventional methods using 323 bacterial genomes of diverse species. The sensitivity of the two WGS based methods were further investigated with 26 mixed and 29 low coverage genomic data sets from Salmonella enteridis and Streptococcus pneumoniae. Of the 323 samples, 92.9% (n = 300), 97.5% (n = 315) and 99.7% (n = 322) full MLST profiles were derived by the conventional method, assembly- and mapping-based approaches, respectively. The concordance between samples that were typed by conventional (92.9%) and both WGS methods was 100%. From the 55 mixed and low coverage genomes, 89.1% (n = 49) and 67.3% (n = 37) full MLST profiles were derived from the mapping and assembly based approaches, respectively. In conclusion, deriving MLST from WGS data is more sensitive than the conventional method. When comparing WGS based methods, the mapping based approach was the most sensitive. In addition, the mapping based approach described here derives quality metrics, which are difficult to determine quantitatively using conventional and WGS-assembly based approaches.
format Online
Article
Text
id pubmed-4991843
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-49918432016-09-06 MOST: a modified MLST typing tool based on short read sequencing Tewolde, Rediat Dallman, Timothy Schaefer, Ulf Sheppard, Carmen L. Ashton, Philip Pichon, Bruno Ellington, Matthew Swift, Craig Green, Jonathan Underwood, Anthony PeerJ Bioinformatics Multilocus sequence typing (MLST) is an effective method to describe bacterial populations. Conventionally, MLST involves Polymerase Chain Reaction (PCR) amplification of housekeeping genes followed by Sanger DNA sequencing. Public Health England (PHE) is in the process of replacing the conventional MLST methodology with a method based on short read sequence data derived from Whole Genome Sequencing (WGS). This paper reports the comparison of the reliability of MLST results derived from WGS data, comparing mapping and assembly-based approaches to conventional methods using 323 bacterial genomes of diverse species. The sensitivity of the two WGS based methods were further investigated with 26 mixed and 29 low coverage genomic data sets from Salmonella enteridis and Streptococcus pneumoniae. Of the 323 samples, 92.9% (n = 300), 97.5% (n = 315) and 99.7% (n = 322) full MLST profiles were derived by the conventional method, assembly- and mapping-based approaches, respectively. The concordance between samples that were typed by conventional (92.9%) and both WGS methods was 100%. From the 55 mixed and low coverage genomes, 89.1% (n = 49) and 67.3% (n = 37) full MLST profiles were derived from the mapping and assembly based approaches, respectively. In conclusion, deriving MLST from WGS data is more sensitive than the conventional method. When comparing WGS based methods, the mapping based approach was the most sensitive. In addition, the mapping based approach described here derives quality metrics, which are difficult to determine quantitatively using conventional and WGS-assembly based approaches. PeerJ Inc. 2016-08-17 /pmc/articles/PMC4991843/ /pubmed/27602279 http://dx.doi.org/10.7717/peerj.2308 Text en ©2016 Tewolde et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
spellingShingle Bioinformatics
Tewolde, Rediat
Dallman, Timothy
Schaefer, Ulf
Sheppard, Carmen L.
Ashton, Philip
Pichon, Bruno
Ellington, Matthew
Swift, Craig
Green, Jonathan
Underwood, Anthony
MOST: a modified MLST typing tool based on short read sequencing
title MOST: a modified MLST typing tool based on short read sequencing
title_full MOST: a modified MLST typing tool based on short read sequencing
title_fullStr MOST: a modified MLST typing tool based on short read sequencing
title_full_unstemmed MOST: a modified MLST typing tool based on short read sequencing
title_short MOST: a modified MLST typing tool based on short read sequencing
title_sort most: a modified mlst typing tool based on short read sequencing
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4991843/
https://www.ncbi.nlm.nih.gov/pubmed/27602279
http://dx.doi.org/10.7717/peerj.2308
work_keys_str_mv AT tewolderediat mostamodifiedmlsttypingtoolbasedonshortreadsequencing
AT dallmantimothy mostamodifiedmlsttypingtoolbasedonshortreadsequencing
AT schaeferulf mostamodifiedmlsttypingtoolbasedonshortreadsequencing
AT sheppardcarmenl mostamodifiedmlsttypingtoolbasedonshortreadsequencing
AT ashtonphilip mostamodifiedmlsttypingtoolbasedonshortreadsequencing
AT pichonbruno mostamodifiedmlsttypingtoolbasedonshortreadsequencing
AT ellingtonmatthew mostamodifiedmlsttypingtoolbasedonshortreadsequencing
AT swiftcraig mostamodifiedmlsttypingtoolbasedonshortreadsequencing
AT greenjonathan mostamodifiedmlsttypingtoolbasedonshortreadsequencing
AT underwoodanthony mostamodifiedmlsttypingtoolbasedonshortreadsequencing