Cargando…

Rapid Classification and Identification of Multiple Microorganisms with Accurate Statistical Significance via High-Resolution Tandem Mass Spectrometry

Rapid and accurate identification and classification of microorganisms is of paramount importance to public health and safety. With the advance of mass spectrometry (MS) technology, the speed of identification can be greatly improved. However, the increasing number of microbes sequenced is complicat...

Descripción completa

Detalles Bibliográficos
Autores principales: Alves, Gelio, Wang, Guanghui, Ogurtsov, Aleksey Y., Drake, Steven K., Gucek, Marjan, Sacks, David B., Yu, Yi-Kuo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer US 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6061032/
https://www.ncbi.nlm.nih.gov/pubmed/29873019
http://dx.doi.org/10.1007/s13361-018-1986-y
_version_ 1783342135381065728
author Alves, Gelio
Wang, Guanghui
Ogurtsov, Aleksey Y.
Drake, Steven K.
Gucek, Marjan
Sacks, David B.
Yu, Yi-Kuo
author_facet Alves, Gelio
Wang, Guanghui
Ogurtsov, Aleksey Y.
Drake, Steven K.
Gucek, Marjan
Sacks, David B.
Yu, Yi-Kuo
author_sort Alves, Gelio
collection PubMed
description Rapid and accurate identification and classification of microorganisms is of paramount importance to public health and safety. With the advance of mass spectrometry (MS) technology, the speed of identification can be greatly improved. However, the increasing number of microbes sequenced is complicating correct microbial identification even in a simple sample due to the large number of candidates present. To properly untwine candidate microbes in samples containing one or more microbes, one needs to go beyond apparent morphology or simple “fingerprinting”; to correctly prioritize the candidate microbes, one needs to have accurate statistical significance in microbial identification. We meet these challenges by using peptide-centric representations of microbes to better separate them and by augmenting our earlier analysis method that yields accurate statistical significance. Here, we present an updated analysis workflow that uses tandem MS (MS/MS) spectra for microbial identification or classification. We have demonstrated, using 226 MS/MS publicly available data files (each containing from 2500 to nearly 100,000 MS/MS spectra) and 4000 additional MS/MS data files, that the updated workflow can correctly identify multiple microbes at the genus and often the species level for samples containing more than one microbe. We have also shown that the proposed workflow computes accurate statistical significances, i.e., E values for identified peptides and unified E values for identified microbes. Our updated analysis workflow MiCId, a freely available software for Microorganism Classification and Identification, is available for download at https://www.ncbi.nlm.nih.gov/CBBresearch/Yu/downloads.html. [Figure: see text] ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1007/s13361-018-1986-y) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6061032
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Springer US
record_format MEDLINE/PubMed
spelling pubmed-60610322018-08-09 Rapid Classification and Identification of Multiple Microorganisms with Accurate Statistical Significance via High-Resolution Tandem Mass Spectrometry Alves, Gelio Wang, Guanghui Ogurtsov, Aleksey Y. Drake, Steven K. Gucek, Marjan Sacks, David B. Yu, Yi-Kuo J Am Soc Mass Spectrom Research Article Rapid and accurate identification and classification of microorganisms is of paramount importance to public health and safety. With the advance of mass spectrometry (MS) technology, the speed of identification can be greatly improved. However, the increasing number of microbes sequenced is complicating correct microbial identification even in a simple sample due to the large number of candidates present. To properly untwine candidate microbes in samples containing one or more microbes, one needs to go beyond apparent morphology or simple “fingerprinting”; to correctly prioritize the candidate microbes, one needs to have accurate statistical significance in microbial identification. We meet these challenges by using peptide-centric representations of microbes to better separate them and by augmenting our earlier analysis method that yields accurate statistical significance. Here, we present an updated analysis workflow that uses tandem MS (MS/MS) spectra for microbial identification or classification. We have demonstrated, using 226 MS/MS publicly available data files (each containing from 2500 to nearly 100,000 MS/MS spectra) and 4000 additional MS/MS data files, that the updated workflow can correctly identify multiple microbes at the genus and often the species level for samples containing more than one microbe. We have also shown that the proposed workflow computes accurate statistical significances, i.e., E values for identified peptides and unified E values for identified microbes. Our updated analysis workflow MiCId, a freely available software for Microorganism Classification and Identification, is available for download at https://www.ncbi.nlm.nih.gov/CBBresearch/Yu/downloads.html. [Figure: see text] ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1007/s13361-018-1986-y) contains supplementary material, which is available to authorized users. Springer US 2018-06-05 2018 /pmc/articles/PMC6061032/ /pubmed/29873019 http://dx.doi.org/10.1007/s13361-018-1986-y Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
spellingShingle Research Article
Alves, Gelio
Wang, Guanghui
Ogurtsov, Aleksey Y.
Drake, Steven K.
Gucek, Marjan
Sacks, David B.
Yu, Yi-Kuo
Rapid Classification and Identification of Multiple Microorganisms with Accurate Statistical Significance via High-Resolution Tandem Mass Spectrometry
title Rapid Classification and Identification of Multiple Microorganisms with Accurate Statistical Significance via High-Resolution Tandem Mass Spectrometry
title_full Rapid Classification and Identification of Multiple Microorganisms with Accurate Statistical Significance via High-Resolution Tandem Mass Spectrometry
title_fullStr Rapid Classification and Identification of Multiple Microorganisms with Accurate Statistical Significance via High-Resolution Tandem Mass Spectrometry
title_full_unstemmed Rapid Classification and Identification of Multiple Microorganisms with Accurate Statistical Significance via High-Resolution Tandem Mass Spectrometry
title_short Rapid Classification and Identification of Multiple Microorganisms with Accurate Statistical Significance via High-Resolution Tandem Mass Spectrometry
title_sort rapid classification and identification of multiple microorganisms with accurate statistical significance via high-resolution tandem mass spectrometry
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6061032/
https://www.ncbi.nlm.nih.gov/pubmed/29873019
http://dx.doi.org/10.1007/s13361-018-1986-y
work_keys_str_mv AT alvesgelio rapidclassificationandidentificationofmultiplemicroorganismswithaccuratestatisticalsignificanceviahighresolutiontandemmassspectrometry
AT wangguanghui rapidclassificationandidentificationofmultiplemicroorganismswithaccuratestatisticalsignificanceviahighresolutiontandemmassspectrometry
AT ogurtsovalekseyy rapidclassificationandidentificationofmultiplemicroorganismswithaccuratestatisticalsignificanceviahighresolutiontandemmassspectrometry
AT drakestevenk rapidclassificationandidentificationofmultiplemicroorganismswithaccuratestatisticalsignificanceviahighresolutiontandemmassspectrometry
AT gucekmarjan rapidclassificationandidentificationofmultiplemicroorganismswithaccuratestatisticalsignificanceviahighresolutiontandemmassspectrometry
AT sacksdavidb rapidclassificationandidentificationofmultiplemicroorganismswithaccuratestatisticalsignificanceviahighresolutiontandemmassspectrometry
AT yuyikuo rapidclassificationandidentificationofmultiplemicroorganismswithaccuratestatisticalsignificanceviahighresolutiontandemmassspectrometry