Cargando…
MOST: a modified MLST typing tool based on short read sequencing
Multilocus sequence typing (MLST) is an effective method to describe bacterial populations. Conventionally, MLST involves Polymerase Chain Reaction (PCR) amplification of housekeeping genes followed by Sanger DNA sequencing. Public Health England (PHE) is in the process of replacing the conventional...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
PeerJ Inc.
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4991843/ https://www.ncbi.nlm.nih.gov/pubmed/27602279 http://dx.doi.org/10.7717/peerj.2308 |
_version_ | 1782448915706019840 |
---|---|
author | Tewolde, Rediat Dallman, Timothy Schaefer, Ulf Sheppard, Carmen L. Ashton, Philip Pichon, Bruno Ellington, Matthew Swift, Craig Green, Jonathan Underwood, Anthony |
author_facet | Tewolde, Rediat Dallman, Timothy Schaefer, Ulf Sheppard, Carmen L. Ashton, Philip Pichon, Bruno Ellington, Matthew Swift, Craig Green, Jonathan Underwood, Anthony |
author_sort | Tewolde, Rediat |
collection | PubMed |
description | Multilocus sequence typing (MLST) is an effective method to describe bacterial populations. Conventionally, MLST involves Polymerase Chain Reaction (PCR) amplification of housekeeping genes followed by Sanger DNA sequencing. Public Health England (PHE) is in the process of replacing the conventional MLST methodology with a method based on short read sequence data derived from Whole Genome Sequencing (WGS). This paper reports the comparison of the reliability of MLST results derived from WGS data, comparing mapping and assembly-based approaches to conventional methods using 323 bacterial genomes of diverse species. The sensitivity of the two WGS based methods were further investigated with 26 mixed and 29 low coverage genomic data sets from Salmonella enteridis and Streptococcus pneumoniae. Of the 323 samples, 92.9% (n = 300), 97.5% (n = 315) and 99.7% (n = 322) full MLST profiles were derived by the conventional method, assembly- and mapping-based approaches, respectively. The concordance between samples that were typed by conventional (92.9%) and both WGS methods was 100%. From the 55 mixed and low coverage genomes, 89.1% (n = 49) and 67.3% (n = 37) full MLST profiles were derived from the mapping and assembly based approaches, respectively. In conclusion, deriving MLST from WGS data is more sensitive than the conventional method. When comparing WGS based methods, the mapping based approach was the most sensitive. In addition, the mapping based approach described here derives quality metrics, which are difficult to determine quantitatively using conventional and WGS-assembly based approaches. |
format | Online Article Text |
id | pubmed-4991843 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | PeerJ Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-49918432016-09-06 MOST: a modified MLST typing tool based on short read sequencing Tewolde, Rediat Dallman, Timothy Schaefer, Ulf Sheppard, Carmen L. Ashton, Philip Pichon, Bruno Ellington, Matthew Swift, Craig Green, Jonathan Underwood, Anthony PeerJ Bioinformatics Multilocus sequence typing (MLST) is an effective method to describe bacterial populations. Conventionally, MLST involves Polymerase Chain Reaction (PCR) amplification of housekeeping genes followed by Sanger DNA sequencing. Public Health England (PHE) is in the process of replacing the conventional MLST methodology with a method based on short read sequence data derived from Whole Genome Sequencing (WGS). This paper reports the comparison of the reliability of MLST results derived from WGS data, comparing mapping and assembly-based approaches to conventional methods using 323 bacterial genomes of diverse species. The sensitivity of the two WGS based methods were further investigated with 26 mixed and 29 low coverage genomic data sets from Salmonella enteridis and Streptococcus pneumoniae. Of the 323 samples, 92.9% (n = 300), 97.5% (n = 315) and 99.7% (n = 322) full MLST profiles were derived by the conventional method, assembly- and mapping-based approaches, respectively. The concordance between samples that were typed by conventional (92.9%) and both WGS methods was 100%. From the 55 mixed and low coverage genomes, 89.1% (n = 49) and 67.3% (n = 37) full MLST profiles were derived from the mapping and assembly based approaches, respectively. In conclusion, deriving MLST from WGS data is more sensitive than the conventional method. When comparing WGS based methods, the mapping based approach was the most sensitive. In addition, the mapping based approach described here derives quality metrics, which are difficult to determine quantitatively using conventional and WGS-assembly based approaches. PeerJ Inc. 2016-08-17 /pmc/articles/PMC4991843/ /pubmed/27602279 http://dx.doi.org/10.7717/peerj.2308 Text en ©2016 Tewolde et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited. |
spellingShingle | Bioinformatics Tewolde, Rediat Dallman, Timothy Schaefer, Ulf Sheppard, Carmen L. Ashton, Philip Pichon, Bruno Ellington, Matthew Swift, Craig Green, Jonathan Underwood, Anthony MOST: a modified MLST typing tool based on short read sequencing |
title | MOST: a modified MLST typing tool based on short read sequencing |
title_full | MOST: a modified MLST typing tool based on short read sequencing |
title_fullStr | MOST: a modified MLST typing tool based on short read sequencing |
title_full_unstemmed | MOST: a modified MLST typing tool based on short read sequencing |
title_short | MOST: a modified MLST typing tool based on short read sequencing |
title_sort | most: a modified mlst typing tool based on short read sequencing |
topic | Bioinformatics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4991843/ https://www.ncbi.nlm.nih.gov/pubmed/27602279 http://dx.doi.org/10.7717/peerj.2308 |
work_keys_str_mv | AT tewolderediat mostamodifiedmlsttypingtoolbasedonshortreadsequencing AT dallmantimothy mostamodifiedmlsttypingtoolbasedonshortreadsequencing AT schaeferulf mostamodifiedmlsttypingtoolbasedonshortreadsequencing AT sheppardcarmenl mostamodifiedmlsttypingtoolbasedonshortreadsequencing AT ashtonphilip mostamodifiedmlsttypingtoolbasedonshortreadsequencing AT pichonbruno mostamodifiedmlsttypingtoolbasedonshortreadsequencing AT ellingtonmatthew mostamodifiedmlsttypingtoolbasedonshortreadsequencing AT swiftcraig mostamodifiedmlsttypingtoolbasedonshortreadsequencing AT greenjonathan mostamodifiedmlsttypingtoolbasedonshortreadsequencing AT underwoodanthony mostamodifiedmlsttypingtoolbasedonshortreadsequencing |