Cargando…

SpolPred: rapid and accurate prediction of Mycobacterium tuberculosis spoligotypes from short genomic sequences

Summary: Spoligotyping is a well-established genotyping technique based on the presence of unique DNA sequences in Mycobacterium tuberculosis (Mtb), the causal agent of tuberculosis disease (TB). Although advances in sequencing technologies are leading to whole-genome bacterial characterization, ten...

Descripción completa

Detalles Bibliográficos
Autores principales: Coll, Francesc, Mallard, Kim, Preston, Mark D., Bentley, Stephen, Parkhill, Julian, McNerney, Ruth, Martin, Nigel, Clark, Taane G.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3496340/
https://www.ncbi.nlm.nih.gov/pubmed/23014632
http://dx.doi.org/10.1093/bioinformatics/bts544
_version_ 1782249619714998272
author Coll, Francesc
Mallard, Kim
Preston, Mark D.
Bentley, Stephen
Parkhill, Julian
McNerney, Ruth
Martin, Nigel
Clark, Taane G.
author_facet Coll, Francesc
Mallard, Kim
Preston, Mark D.
Bentley, Stephen
Parkhill, Julian
McNerney, Ruth
Martin, Nigel
Clark, Taane G.
author_sort Coll, Francesc
collection PubMed
description Summary: Spoligotyping is a well-established genotyping technique based on the presence of unique DNA sequences in Mycobacterium tuberculosis (Mtb), the causal agent of tuberculosis disease (TB). Although advances in sequencing technologies are leading to whole-genome bacterial characterization, tens of thousands of isolates have been spoligotyped, giving a global view of Mtb strain diversity. To bridge the gap, we have developed SpolPred, a software to predict the spoligotype from raw sequence reads. Our approach is compared with experimentally and de novo assembly determined strain types in a set of 44 Mtb isolates. In silico and experimental results are identical for almost all isolates (39/44). However, SpolPred detected five experimentally false spoligotypes and was more accurate and faster than the assembling strategy. Application of SpolPred to an additional seven isolates with no laboratory data led to types that clustered with identical experimental types in a phylogenetic analysis using single-nucleotide polymorphisms. Our results demonstrate the usefulness of the tool and its role in revealing experimental limitations. Availability and implementation: SpolPred is written in C and is available from www.pathogenseq.org/spolpred. Contact: francesc.coll@lshtm.ac.uk Supplementary information: Supplementary data are available at Bioinformatics Online.
format Online
Article
Text
id pubmed-3496340
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-34963402012-12-12 SpolPred: rapid and accurate prediction of Mycobacterium tuberculosis spoligotypes from short genomic sequences Coll, Francesc Mallard, Kim Preston, Mark D. Bentley, Stephen Parkhill, Julian McNerney, Ruth Martin, Nigel Clark, Taane G. Bioinformatics Applications Note Summary: Spoligotyping is a well-established genotyping technique based on the presence of unique DNA sequences in Mycobacterium tuberculosis (Mtb), the causal agent of tuberculosis disease (TB). Although advances in sequencing technologies are leading to whole-genome bacterial characterization, tens of thousands of isolates have been spoligotyped, giving a global view of Mtb strain diversity. To bridge the gap, we have developed SpolPred, a software to predict the spoligotype from raw sequence reads. Our approach is compared with experimentally and de novo assembly determined strain types in a set of 44 Mtb isolates. In silico and experimental results are identical for almost all isolates (39/44). However, SpolPred detected five experimentally false spoligotypes and was more accurate and faster than the assembling strategy. Application of SpolPred to an additional seven isolates with no laboratory data led to types that clustered with identical experimental types in a phylogenetic analysis using single-nucleotide polymorphisms. Our results demonstrate the usefulness of the tool and its role in revealing experimental limitations. Availability and implementation: SpolPred is written in C and is available from www.pathogenseq.org/spolpred. Contact: francesc.coll@lshtm.ac.uk Supplementary information: Supplementary data are available at Bioinformatics Online. Oxford University Press 2012-11-15 2012-09-26 /pmc/articles/PMC3496340/ /pubmed/23014632 http://dx.doi.org/10.1093/bioinformatics/bts544 Text en © The Author 2012. Published by Oxford University Press. http://creativecommons.org/licenses/by/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Applications Note
Coll, Francesc
Mallard, Kim
Preston, Mark D.
Bentley, Stephen
Parkhill, Julian
McNerney, Ruth
Martin, Nigel
Clark, Taane G.
SpolPred: rapid and accurate prediction of Mycobacterium tuberculosis spoligotypes from short genomic sequences
title SpolPred: rapid and accurate prediction of Mycobacterium tuberculosis spoligotypes from short genomic sequences
title_full SpolPred: rapid and accurate prediction of Mycobacterium tuberculosis spoligotypes from short genomic sequences
title_fullStr SpolPred: rapid and accurate prediction of Mycobacterium tuberculosis spoligotypes from short genomic sequences
title_full_unstemmed SpolPred: rapid and accurate prediction of Mycobacterium tuberculosis spoligotypes from short genomic sequences
title_short SpolPred: rapid and accurate prediction of Mycobacterium tuberculosis spoligotypes from short genomic sequences
title_sort spolpred: rapid and accurate prediction of mycobacterium tuberculosis spoligotypes from short genomic sequences
topic Applications Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3496340/
https://www.ncbi.nlm.nih.gov/pubmed/23014632
http://dx.doi.org/10.1093/bioinformatics/bts544
work_keys_str_mv AT collfrancesc spolpredrapidandaccuratepredictionofmycobacteriumtuberculosisspoligotypesfromshortgenomicsequences
AT mallardkim spolpredrapidandaccuratepredictionofmycobacteriumtuberculosisspoligotypesfromshortgenomicsequences
AT prestonmarkd spolpredrapidandaccuratepredictionofmycobacteriumtuberculosisspoligotypesfromshortgenomicsequences
AT bentleystephen spolpredrapidandaccuratepredictionofmycobacteriumtuberculosisspoligotypesfromshortgenomicsequences
AT parkhilljulian spolpredrapidandaccuratepredictionofmycobacteriumtuberculosisspoligotypesfromshortgenomicsequences
AT mcnerneyruth spolpredrapidandaccuratepredictionofmycobacteriumtuberculosisspoligotypesfromshortgenomicsequences
AT martinnigel spolpredrapidandaccuratepredictionofmycobacteriumtuberculosisspoligotypesfromshortgenomicsequences
AT clarktaaneg spolpredrapidandaccuratepredictionofmycobacteriumtuberculosisspoligotypesfromshortgenomicsequences