Cargando…

Introducing SPeDE: High-Throughput Dereplication and Accurate Determination of Microbial Diversity from Matrix-Assisted Laser Desorption–Ionization Time of Flight Mass Spectrometry Data

The isolation of microorganisms from microbial community samples often yields a large number of conspecific isolates. Increasing the diversity covered by an isolate collection entails the implementation of methods and protocols to minimize the number of redundant isolates. Matrix-assisted laser deso...

Descripción completa

Detalles Bibliográficos
Autores principales: Dumolin, Charles, Aerts, Maarten, Verheyde, Bart, Schellaert, Simon, Vandamme, Tim, Van der Jeugt, Felix, De Canck, Evelien, Cnockaert, Margo, Wieme, Anneleen D., Cleenwerck, Ilse, Peiren, Jindrich, Dawyndt, Peter, Vandamme, Peter, Carlier, Aurélien
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Society for Microbiology 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6739102/
https://www.ncbi.nlm.nih.gov/pubmed/31506264
http://dx.doi.org/10.1128/mSystems.00437-19
_version_ 1783450895394013184
author Dumolin, Charles
Aerts, Maarten
Verheyde, Bart
Schellaert, Simon
Vandamme, Tim
Van der Jeugt, Felix
De Canck, Evelien
Cnockaert, Margo
Wieme, Anneleen D.
Cleenwerck, Ilse
Peiren, Jindrich
Dawyndt, Peter
Vandamme, Peter
Carlier, Aurélien
author_facet Dumolin, Charles
Aerts, Maarten
Verheyde, Bart
Schellaert, Simon
Vandamme, Tim
Van der Jeugt, Felix
De Canck, Evelien
Cnockaert, Margo
Wieme, Anneleen D.
Cleenwerck, Ilse
Peiren, Jindrich
Dawyndt, Peter
Vandamme, Peter
Carlier, Aurélien
author_sort Dumolin, Charles
collection PubMed
description The isolation of microorganisms from microbial community samples often yields a large number of conspecific isolates. Increasing the diversity covered by an isolate collection entails the implementation of methods and protocols to minimize the number of redundant isolates. Matrix-assisted laser desorption–ionization time-of-flight (MALDI-TOF) mass spectrometry methods are ideally suited to this dereplication problem because of their low cost and high throughput. However, the available software tools are cumbersome and rely either on the prior development of reference databases or on global similarity analyses, which are inconvenient and offer low taxonomic resolution. We introduce SPeDE, a user-friendly spectral data analysis tool for the dereplication of MALDI-TOF mass spectra. Rather than relying on global similarity approaches to classify spectra, SPeDE determines the number of unique spectral features by a mix of global and local peak comparisons. This approach allows the identification of a set of nonredundant spectra linked to operational isolation units. We evaluated SPeDE on a data set of 5,228 spectra representing 167 bacterial strains belonging to 132 genera across six phyla and on a data set of 312 spectra of 78 strains measured before and after lyophilization and subculturing. SPeDE was able to dereplicate with high efficiency by identifying redundant spectra while retrieving reference spectra for all strains in a sample. SPeDE can identify distinguishing features between spectra, and its performance exceeds that of established methods in speed and precision. SPeDE is open source under the MIT license and is available from https://github.com/LM-UGent/SPeDE. IMPORTANCE Estimation of the operational isolation units present in a MALDI-TOF mass spectral data set involves an essential dereplication step to identify redundant spectra in a rapid manner and without sacrificing biological resolution. We describe SPeDE, a new algorithm which facilitates culture-dependent clinical or environmental studies. SPeDE enables the rapid analysis and dereplication of isolates, a critical feature when long-term storage of cultures is limited or not feasible. We show that SPeDE can efficiently identify sets of similar spectra at the level of the species or strain, exceeding the taxonomic resolution of other methods. The high-throughput capacity, speed, and low cost of MALDI-TOF mass spectrometry and SPeDE dereplication over traditional gene marker-based sequencing approaches should facilitate adoption of the culturomics approach to bacterial isolation campaigns.
format Online
Article
Text
id pubmed-6739102
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher American Society for Microbiology
record_format MEDLINE/PubMed
spelling pubmed-67391022019-09-16 Introducing SPeDE: High-Throughput Dereplication and Accurate Determination of Microbial Diversity from Matrix-Assisted Laser Desorption–Ionization Time of Flight Mass Spectrometry Data Dumolin, Charles Aerts, Maarten Verheyde, Bart Schellaert, Simon Vandamme, Tim Van der Jeugt, Felix De Canck, Evelien Cnockaert, Margo Wieme, Anneleen D. Cleenwerck, Ilse Peiren, Jindrich Dawyndt, Peter Vandamme, Peter Carlier, Aurélien mSystems Methods and Protocols The isolation of microorganisms from microbial community samples often yields a large number of conspecific isolates. Increasing the diversity covered by an isolate collection entails the implementation of methods and protocols to minimize the number of redundant isolates. Matrix-assisted laser desorption–ionization time-of-flight (MALDI-TOF) mass spectrometry methods are ideally suited to this dereplication problem because of their low cost and high throughput. However, the available software tools are cumbersome and rely either on the prior development of reference databases or on global similarity analyses, which are inconvenient and offer low taxonomic resolution. We introduce SPeDE, a user-friendly spectral data analysis tool for the dereplication of MALDI-TOF mass spectra. Rather than relying on global similarity approaches to classify spectra, SPeDE determines the number of unique spectral features by a mix of global and local peak comparisons. This approach allows the identification of a set of nonredundant spectra linked to operational isolation units. We evaluated SPeDE on a data set of 5,228 spectra representing 167 bacterial strains belonging to 132 genera across six phyla and on a data set of 312 spectra of 78 strains measured before and after lyophilization and subculturing. SPeDE was able to dereplicate with high efficiency by identifying redundant spectra while retrieving reference spectra for all strains in a sample. SPeDE can identify distinguishing features between spectra, and its performance exceeds that of established methods in speed and precision. SPeDE is open source under the MIT license and is available from https://github.com/LM-UGent/SPeDE. IMPORTANCE Estimation of the operational isolation units present in a MALDI-TOF mass spectral data set involves an essential dereplication step to identify redundant spectra in a rapid manner and without sacrificing biological resolution. We describe SPeDE, a new algorithm which facilitates culture-dependent clinical or environmental studies. SPeDE enables the rapid analysis and dereplication of isolates, a critical feature when long-term storage of cultures is limited or not feasible. We show that SPeDE can efficiently identify sets of similar spectra at the level of the species or strain, exceeding the taxonomic resolution of other methods. The high-throughput capacity, speed, and low cost of MALDI-TOF mass spectrometry and SPeDE dereplication over traditional gene marker-based sequencing approaches should facilitate adoption of the culturomics approach to bacterial isolation campaigns. American Society for Microbiology 2019-09-10 /pmc/articles/PMC6739102/ /pubmed/31506264 http://dx.doi.org/10.1128/mSystems.00437-19 Text en Copyright © 2019 Dumolin et al. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International license (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Methods and Protocols
Dumolin, Charles
Aerts, Maarten
Verheyde, Bart
Schellaert, Simon
Vandamme, Tim
Van der Jeugt, Felix
De Canck, Evelien
Cnockaert, Margo
Wieme, Anneleen D.
Cleenwerck, Ilse
Peiren, Jindrich
Dawyndt, Peter
Vandamme, Peter
Carlier, Aurélien
Introducing SPeDE: High-Throughput Dereplication and Accurate Determination of Microbial Diversity from Matrix-Assisted Laser Desorption–Ionization Time of Flight Mass Spectrometry Data
title Introducing SPeDE: High-Throughput Dereplication and Accurate Determination of Microbial Diversity from Matrix-Assisted Laser Desorption–Ionization Time of Flight Mass Spectrometry Data
title_full Introducing SPeDE: High-Throughput Dereplication and Accurate Determination of Microbial Diversity from Matrix-Assisted Laser Desorption–Ionization Time of Flight Mass Spectrometry Data
title_fullStr Introducing SPeDE: High-Throughput Dereplication and Accurate Determination of Microbial Diversity from Matrix-Assisted Laser Desorption–Ionization Time of Flight Mass Spectrometry Data
title_full_unstemmed Introducing SPeDE: High-Throughput Dereplication and Accurate Determination of Microbial Diversity from Matrix-Assisted Laser Desorption–Ionization Time of Flight Mass Spectrometry Data
title_short Introducing SPeDE: High-Throughput Dereplication and Accurate Determination of Microbial Diversity from Matrix-Assisted Laser Desorption–Ionization Time of Flight Mass Spectrometry Data
title_sort introducing spede: high-throughput dereplication and accurate determination of microbial diversity from matrix-assisted laser desorption–ionization time of flight mass spectrometry data
topic Methods and Protocols
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6739102/
https://www.ncbi.nlm.nih.gov/pubmed/31506264
http://dx.doi.org/10.1128/mSystems.00437-19
work_keys_str_mv AT dumolincharles introducingspedehighthroughputdereplicationandaccuratedeterminationofmicrobialdiversityfrommatrixassistedlaserdesorptionionizationtimeofflightmassspectrometrydata
AT aertsmaarten introducingspedehighthroughputdereplicationandaccuratedeterminationofmicrobialdiversityfrommatrixassistedlaserdesorptionionizationtimeofflightmassspectrometrydata
AT verheydebart introducingspedehighthroughputdereplicationandaccuratedeterminationofmicrobialdiversityfrommatrixassistedlaserdesorptionionizationtimeofflightmassspectrometrydata
AT schellaertsimon introducingspedehighthroughputdereplicationandaccuratedeterminationofmicrobialdiversityfrommatrixassistedlaserdesorptionionizationtimeofflightmassspectrometrydata
AT vandammetim introducingspedehighthroughputdereplicationandaccuratedeterminationofmicrobialdiversityfrommatrixassistedlaserdesorptionionizationtimeofflightmassspectrometrydata
AT vanderjeugtfelix introducingspedehighthroughputdereplicationandaccuratedeterminationofmicrobialdiversityfrommatrixassistedlaserdesorptionionizationtimeofflightmassspectrometrydata
AT decanckevelien introducingspedehighthroughputdereplicationandaccuratedeterminationofmicrobialdiversityfrommatrixassistedlaserdesorptionionizationtimeofflightmassspectrometrydata
AT cnockaertmargo introducingspedehighthroughputdereplicationandaccuratedeterminationofmicrobialdiversityfrommatrixassistedlaserdesorptionionizationtimeofflightmassspectrometrydata
AT wiemeanneleend introducingspedehighthroughputdereplicationandaccuratedeterminationofmicrobialdiversityfrommatrixassistedlaserdesorptionionizationtimeofflightmassspectrometrydata
AT cleenwerckilse introducingspedehighthroughputdereplicationandaccuratedeterminationofmicrobialdiversityfrommatrixassistedlaserdesorptionionizationtimeofflightmassspectrometrydata
AT peirenjindrich introducingspedehighthroughputdereplicationandaccuratedeterminationofmicrobialdiversityfrommatrixassistedlaserdesorptionionizationtimeofflightmassspectrometrydata
AT dawyndtpeter introducingspedehighthroughputdereplicationandaccuratedeterminationofmicrobialdiversityfrommatrixassistedlaserdesorptionionizationtimeofflightmassspectrometrydata
AT vandammepeter introducingspedehighthroughputdereplicationandaccuratedeterminationofmicrobialdiversityfrommatrixassistedlaserdesorptionionizationtimeofflightmassspectrometrydata
AT carlieraurelien introducingspedehighthroughputdereplicationandaccuratedeterminationofmicrobialdiversityfrommatrixassistedlaserdesorptionionizationtimeofflightmassspectrometrydata