Cargando…

Towards Accurate Identification of Antibiotic-Resistant Pathogens through the Ensemble of Multiple Preprocessing Methods Based on MALDI-TOF Spectra

Matrix-assisted laser desorption/ionization time-of-flight (MALDI-TOF) mass spectrometry (MS) has been used to identify microorganisms and predict antibiotic resistance. The preprocessing method for the MS spectrum is key to extracting critical information from complicated MS spectral data. Differen...

Descripción completa

Detalles Bibliográficos
Autores principales: Chung, Chia-Ru, Wang, Hsin-Yao, Chou, Po-Han, Wu, Li-Ching, Lu, Jang-Jih, Horng, Jorng-Tzong, Lee, Tzong-Yi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9865071/
https://www.ncbi.nlm.nih.gov/pubmed/36674514
http://dx.doi.org/10.3390/ijms24020998
_version_ 1784875744743129088
author Chung, Chia-Ru
Wang, Hsin-Yao
Chou, Po-Han
Wu, Li-Ching
Lu, Jang-Jih
Horng, Jorng-Tzong
Lee, Tzong-Yi
author_facet Chung, Chia-Ru
Wang, Hsin-Yao
Chou, Po-Han
Wu, Li-Ching
Lu, Jang-Jih
Horng, Jorng-Tzong
Lee, Tzong-Yi
author_sort Chung, Chia-Ru
collection PubMed
description Matrix-assisted laser desorption/ionization time-of-flight (MALDI-TOF) mass spectrometry (MS) has been used to identify microorganisms and predict antibiotic resistance. The preprocessing method for the MS spectrum is key to extracting critical information from complicated MS spectral data. Different preprocessing methods yield different data, and the optimal approach is unclear. In this study, we adopted an ensemble of multiple preprocessing methods––FlexAnalysis, MALDIquant, and continuous wavelet transform-based methods––to detect peaks and build machine learning classifiers, including logistic regressions, naïve Bayes classifiers, random forests, and a support vector machine. The aim was to identify antibiotic resistance in Acinetobacter baumannii, Acinetobacter nosocomialis, Enterococcus faecium, and Group B Streptococci (GBS) based on MALDI-TOF MS spectra collected from two branches of a referral tertiary medical center. The ensemble method was compared with the individual methods. Random forest models built with the data preprocessed by the ensemble method outperformed individual preprocessing methods and achieved the highest accuracy, with values of 84.37% (A. baumannii), 90.96% (A. nosocomialis), 78.54% (E. faecium), and 70.12% (GBS) on independent testing datasets. Through feature selection, important peaks related to antibiotic resistance could be detected from integrated information. The prediction model can provide an opinion for clinicians. The discriminative peaks enabling better prediction performance can provide a reference for further investigation of the resistance mechanism.
format Online
Article
Text
id pubmed-9865071
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-98650712023-01-22 Towards Accurate Identification of Antibiotic-Resistant Pathogens through the Ensemble of Multiple Preprocessing Methods Based on MALDI-TOF Spectra Chung, Chia-Ru Wang, Hsin-Yao Chou, Po-Han Wu, Li-Ching Lu, Jang-Jih Horng, Jorng-Tzong Lee, Tzong-Yi Int J Mol Sci Article Matrix-assisted laser desorption/ionization time-of-flight (MALDI-TOF) mass spectrometry (MS) has been used to identify microorganisms and predict antibiotic resistance. The preprocessing method for the MS spectrum is key to extracting critical information from complicated MS spectral data. Different preprocessing methods yield different data, and the optimal approach is unclear. In this study, we adopted an ensemble of multiple preprocessing methods––FlexAnalysis, MALDIquant, and continuous wavelet transform-based methods––to detect peaks and build machine learning classifiers, including logistic regressions, naïve Bayes classifiers, random forests, and a support vector machine. The aim was to identify antibiotic resistance in Acinetobacter baumannii, Acinetobacter nosocomialis, Enterococcus faecium, and Group B Streptococci (GBS) based on MALDI-TOF MS spectra collected from two branches of a referral tertiary medical center. The ensemble method was compared with the individual methods. Random forest models built with the data preprocessed by the ensemble method outperformed individual preprocessing methods and achieved the highest accuracy, with values of 84.37% (A. baumannii), 90.96% (A. nosocomialis), 78.54% (E. faecium), and 70.12% (GBS) on independent testing datasets. Through feature selection, important peaks related to antibiotic resistance could be detected from integrated information. The prediction model can provide an opinion for clinicians. The discriminative peaks enabling better prediction performance can provide a reference for further investigation of the resistance mechanism. MDPI 2023-01-05 /pmc/articles/PMC9865071/ /pubmed/36674514 http://dx.doi.org/10.3390/ijms24020998 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Chung, Chia-Ru
Wang, Hsin-Yao
Chou, Po-Han
Wu, Li-Ching
Lu, Jang-Jih
Horng, Jorng-Tzong
Lee, Tzong-Yi
Towards Accurate Identification of Antibiotic-Resistant Pathogens through the Ensemble of Multiple Preprocessing Methods Based on MALDI-TOF Spectra
title Towards Accurate Identification of Antibiotic-Resistant Pathogens through the Ensemble of Multiple Preprocessing Methods Based on MALDI-TOF Spectra
title_full Towards Accurate Identification of Antibiotic-Resistant Pathogens through the Ensemble of Multiple Preprocessing Methods Based on MALDI-TOF Spectra
title_fullStr Towards Accurate Identification of Antibiotic-Resistant Pathogens through the Ensemble of Multiple Preprocessing Methods Based on MALDI-TOF Spectra
title_full_unstemmed Towards Accurate Identification of Antibiotic-Resistant Pathogens through the Ensemble of Multiple Preprocessing Methods Based on MALDI-TOF Spectra
title_short Towards Accurate Identification of Antibiotic-Resistant Pathogens through the Ensemble of Multiple Preprocessing Methods Based on MALDI-TOF Spectra
title_sort towards accurate identification of antibiotic-resistant pathogens through the ensemble of multiple preprocessing methods based on maldi-tof spectra
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9865071/
https://www.ncbi.nlm.nih.gov/pubmed/36674514
http://dx.doi.org/10.3390/ijms24020998
work_keys_str_mv AT chungchiaru towardsaccurateidentificationofantibioticresistantpathogensthroughtheensembleofmultiplepreprocessingmethodsbasedonmalditofspectra
AT wanghsinyao towardsaccurateidentificationofantibioticresistantpathogensthroughtheensembleofmultiplepreprocessingmethodsbasedonmalditofspectra
AT choupohan towardsaccurateidentificationofantibioticresistantpathogensthroughtheensembleofmultiplepreprocessingmethodsbasedonmalditofspectra
AT wuliching towardsaccurateidentificationofantibioticresistantpathogensthroughtheensembleofmultiplepreprocessingmethodsbasedonmalditofspectra
AT lujangjih towardsaccurateidentificationofantibioticresistantpathogensthroughtheensembleofmultiplepreprocessingmethodsbasedonmalditofspectra
AT horngjorngtzong towardsaccurateidentificationofantibioticresistantpathogensthroughtheensembleofmultiplepreprocessingmethodsbasedonmalditofspectra
AT leetzongyi towardsaccurateidentificationofantibioticresistantpathogensthroughtheensembleofmultiplepreprocessingmethodsbasedonmalditofspectra