Cargando…

RIDOM: Comprehensive and public sequence database for identification of Mycobacterium species

BACKGROUND: Molecular identification of Mycobacterium species has two primary advantages when compared to phenotypic identification: rapid turn-around time and improved accuracy. The information content of the 5' end of the 16S ribosomal RNA gene (16S rDNA) is sufficient for identification of m...

Descripción completa

Detalles Bibliográficos
Autores principales: Harmsen, Dag, Dostal, Stefan, Roth, Andreas, Niemann, Stefan, Rothgänger, Jörg, Sammeth, Michael, Albert, Jürgen, Frosch, Matthias, Richter, Elvira
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2003
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC280682/
https://www.ncbi.nlm.nih.gov/pubmed/14611664
http://dx.doi.org/10.1186/1471-2334-3-26
_version_ 1782121062369067008
author Harmsen, Dag
Dostal, Stefan
Roth, Andreas
Niemann, Stefan
Rothgänger, Jörg
Sammeth, Michael
Albert, Jürgen
Frosch, Matthias
Richter, Elvira
author_facet Harmsen, Dag
Dostal, Stefan
Roth, Andreas
Niemann, Stefan
Rothgänger, Jörg
Sammeth, Michael
Albert, Jürgen
Frosch, Matthias
Richter, Elvira
author_sort Harmsen, Dag
collection PubMed
description BACKGROUND: Molecular identification of Mycobacterium species has two primary advantages when compared to phenotypic identification: rapid turn-around time and improved accuracy. The information content of the 5' end of the 16S ribosomal RNA gene (16S rDNA) is sufficient for identification of most bacterial species. However, reliable sequence-based identification is hampered by many faulty and some missing sequence entries in publicly accessible databases. METHODS: In order to establish an improved 16S rDNA sequence database for the identification of clinical and environmental isolates, we sequenced both strands of the 5' end of 16S rDNA (Escherichia coli positions 54 to 510) from 199 mycobacterial culture collection isolates. All validly described species (n = 89; up to March 21, 2000) and nearly all published sequevar variants were included. If the 16S rDNA sequences were not discriminatory, the internal transcribed spacer (ITS) region sequences (n = 84) were also determined. RESULTS: Using 5'-16S rDNA sequencing a total of 64 different mycobacterial species (71.9%) could be identified. With the additional input of the ITS sequence, a further 16 species or subspecies could be differentiated. Only Mycobacterium tuberculosis complex species, M. marinum / M. ulcerans and the M. avium subspecies could not be differentiated using 5'-16S rDNA or ITS sequencing. A total of 77 culture collection strain sequences, exhibiting an overlap of at least 80% and identical by strain number to the isolates used in this study, were found in the GenBank. Comparing these with our sequences revealed that an average of 4.31 nucleotide differences (SD ± 0.57) were present. CONCLUSIONS: The data from this analysis show that it is possible to differentiate most mycobacterial species by sequence analysis of partial 16S rDNA. The high-quality sequences reported here, together with ancillary information (e.g., taxonomic, medical), are available in a public database, which is currently being expanded in the RIDOM project ), for similarity searches.
format Text
id pubmed-280682
institution National Center for Biotechnology Information
language English
publishDate 2003
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-2806822003-12-02 RIDOM: Comprehensive and public sequence database for identification of Mycobacterium species Harmsen, Dag Dostal, Stefan Roth, Andreas Niemann, Stefan Rothgänger, Jörg Sammeth, Michael Albert, Jürgen Frosch, Matthias Richter, Elvira BMC Infect Dis Research Article BACKGROUND: Molecular identification of Mycobacterium species has two primary advantages when compared to phenotypic identification: rapid turn-around time and improved accuracy. The information content of the 5' end of the 16S ribosomal RNA gene (16S rDNA) is sufficient for identification of most bacterial species. However, reliable sequence-based identification is hampered by many faulty and some missing sequence entries in publicly accessible databases. METHODS: In order to establish an improved 16S rDNA sequence database for the identification of clinical and environmental isolates, we sequenced both strands of the 5' end of 16S rDNA (Escherichia coli positions 54 to 510) from 199 mycobacterial culture collection isolates. All validly described species (n = 89; up to March 21, 2000) and nearly all published sequevar variants were included. If the 16S rDNA sequences were not discriminatory, the internal transcribed spacer (ITS) region sequences (n = 84) were also determined. RESULTS: Using 5'-16S rDNA sequencing a total of 64 different mycobacterial species (71.9%) could be identified. With the additional input of the ITS sequence, a further 16 species or subspecies could be differentiated. Only Mycobacterium tuberculosis complex species, M. marinum / M. ulcerans and the M. avium subspecies could not be differentiated using 5'-16S rDNA or ITS sequencing. A total of 77 culture collection strain sequences, exhibiting an overlap of at least 80% and identical by strain number to the isolates used in this study, were found in the GenBank. Comparing these with our sequences revealed that an average of 4.31 nucleotide differences (SD ± 0.57) were present. CONCLUSIONS: The data from this analysis show that it is possible to differentiate most mycobacterial species by sequence analysis of partial 16S rDNA. The high-quality sequences reported here, together with ancillary information (e.g., taxonomic, medical), are available in a public database, which is currently being expanded in the RIDOM project ), for similarity searches. BioMed Central 2003-11-11 /pmc/articles/PMC280682/ /pubmed/14611664 http://dx.doi.org/10.1186/1471-2334-3-26 Text en Copyright © 2003 Harmsen et al; licensee BioMed Central Ltd. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose, provided this notice is preserved along with the article's original URL.
spellingShingle Research Article
Harmsen, Dag
Dostal, Stefan
Roth, Andreas
Niemann, Stefan
Rothgänger, Jörg
Sammeth, Michael
Albert, Jürgen
Frosch, Matthias
Richter, Elvira
RIDOM: Comprehensive and public sequence database for identification of Mycobacterium species
title RIDOM: Comprehensive and public sequence database for identification of Mycobacterium species
title_full RIDOM: Comprehensive and public sequence database for identification of Mycobacterium species
title_fullStr RIDOM: Comprehensive and public sequence database for identification of Mycobacterium species
title_full_unstemmed RIDOM: Comprehensive and public sequence database for identification of Mycobacterium species
title_short RIDOM: Comprehensive and public sequence database for identification of Mycobacterium species
title_sort ridom: comprehensive and public sequence database for identification of mycobacterium species
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC280682/
https://www.ncbi.nlm.nih.gov/pubmed/14611664
http://dx.doi.org/10.1186/1471-2334-3-26
work_keys_str_mv AT harmsendag ridomcomprehensiveandpublicsequencedatabaseforidentificationofmycobacteriumspecies
AT dostalstefan ridomcomprehensiveandpublicsequencedatabaseforidentificationofmycobacteriumspecies
AT rothandreas ridomcomprehensiveandpublicsequencedatabaseforidentificationofmycobacteriumspecies
AT niemannstefan ridomcomprehensiveandpublicsequencedatabaseforidentificationofmycobacteriumspecies
AT rothgangerjorg ridomcomprehensiveandpublicsequencedatabaseforidentificationofmycobacteriumspecies
AT sammethmichael ridomcomprehensiveandpublicsequencedatabaseforidentificationofmycobacteriumspecies
AT albertjurgen ridomcomprehensiveandpublicsequencedatabaseforidentificationofmycobacteriumspecies
AT froschmatthias ridomcomprehensiveandpublicsequencedatabaseforidentificationofmycobacteriumspecies
AT richterelvira ridomcomprehensiveandpublicsequencedatabaseforidentificationofmycobacteriumspecies