Cargando…

Comparative Analysis of Machine Learning Algorithms on Surface Enhanced Raman Spectra of Clinical Staphylococcus Species

Raman spectroscopy (RS) is a widely used analytical technique based on the detection of molecular vibrations in a defined system, which generates Raman spectra that contain unique and highly resolved fingerprints of the system. However, the low intensity of normal Raman scattering effect greatly hin...

Descripción completa

Detalles Bibliográficos
Autores principales: Tang, Jia-Wei, Liu, Qing-Hua, Yin, Xiao-Cong, Pan, Ya-Cheng, Wen, Peng-Bo, Liu, Xin, Kang, Xing-Xing, Gu, Bing, Zhu, Zuo-Bin, Wang, Liang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8439569/
https://www.ncbi.nlm.nih.gov/pubmed/34531835
http://dx.doi.org/10.3389/fmicb.2021.696921
_version_ 1783752542681825280
author Tang, Jia-Wei
Liu, Qing-Hua
Yin, Xiao-Cong
Pan, Ya-Cheng
Wen, Peng-Bo
Liu, Xin
Kang, Xing-Xing
Gu, Bing
Zhu, Zuo-Bin
Wang, Liang
author_facet Tang, Jia-Wei
Liu, Qing-Hua
Yin, Xiao-Cong
Pan, Ya-Cheng
Wen, Peng-Bo
Liu, Xin
Kang, Xing-Xing
Gu, Bing
Zhu, Zuo-Bin
Wang, Liang
author_sort Tang, Jia-Wei
collection PubMed
description Raman spectroscopy (RS) is a widely used analytical technique based on the detection of molecular vibrations in a defined system, which generates Raman spectra that contain unique and highly resolved fingerprints of the system. However, the low intensity of normal Raman scattering effect greatly hinders its application. Recently, the newly emerged surface enhanced Raman spectroscopy (SERS) technique overcomes the problem by mixing metal nanoparticles such as gold and silver with samples, which greatly enhances signal intensity of Raman effects by orders of magnitudes when compared with regular RS. In clinical and research laboratories, SERS provides a great potential for fast, sensitive, label-free, and non-destructive microbial detection and identification with the assistance of appropriate machine learning (ML) algorithms. However, choosing an appropriate algorithm for a specific group of bacterial species remains challenging, because with the large volumes of data generated during SERS analysis not all algorithms could achieve a relatively high accuracy. In this study, we compared three unsupervised machine learning methods and 10 supervised machine learning methods, respectively, on 2,752 SERS spectra from 117 Staphylococcus strains belonging to nine clinically important Staphylococcus species in order to test the capacity of different machine learning methods for bacterial rapid differentiation and accurate prediction. According to the results, density-based spatial clustering of applications with noise (DBSCAN) showed the best clustering capacity (Rand index 0.9733) while convolutional neural network (CNN) topped all other supervised machine learning methods as the best model for predicting Staphylococcus species via SERS spectra (ACC 98.21%, AUC 99.93%). Taken together, this study shows that machine learning methods are capable of distinguishing closely related Staphylococcus species and therefore have great application potentials for bacterial pathogen diagnosis in clinical settings.
format Online
Article
Text
id pubmed-8439569
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-84395692021-09-15 Comparative Analysis of Machine Learning Algorithms on Surface Enhanced Raman Spectra of Clinical Staphylococcus Species Tang, Jia-Wei Liu, Qing-Hua Yin, Xiao-Cong Pan, Ya-Cheng Wen, Peng-Bo Liu, Xin Kang, Xing-Xing Gu, Bing Zhu, Zuo-Bin Wang, Liang Front Microbiol Microbiology Raman spectroscopy (RS) is a widely used analytical technique based on the detection of molecular vibrations in a defined system, which generates Raman spectra that contain unique and highly resolved fingerprints of the system. However, the low intensity of normal Raman scattering effect greatly hinders its application. Recently, the newly emerged surface enhanced Raman spectroscopy (SERS) technique overcomes the problem by mixing metal nanoparticles such as gold and silver with samples, which greatly enhances signal intensity of Raman effects by orders of magnitudes when compared with regular RS. In clinical and research laboratories, SERS provides a great potential for fast, sensitive, label-free, and non-destructive microbial detection and identification with the assistance of appropriate machine learning (ML) algorithms. However, choosing an appropriate algorithm for a specific group of bacterial species remains challenging, because with the large volumes of data generated during SERS analysis not all algorithms could achieve a relatively high accuracy. In this study, we compared three unsupervised machine learning methods and 10 supervised machine learning methods, respectively, on 2,752 SERS spectra from 117 Staphylococcus strains belonging to nine clinically important Staphylococcus species in order to test the capacity of different machine learning methods for bacterial rapid differentiation and accurate prediction. According to the results, density-based spatial clustering of applications with noise (DBSCAN) showed the best clustering capacity (Rand index 0.9733) while convolutional neural network (CNN) topped all other supervised machine learning methods as the best model for predicting Staphylococcus species via SERS spectra (ACC 98.21%, AUC 99.93%). Taken together, this study shows that machine learning methods are capable of distinguishing closely related Staphylococcus species and therefore have great application potentials for bacterial pathogen diagnosis in clinical settings. Frontiers Media S.A. 2021-08-31 /pmc/articles/PMC8439569/ /pubmed/34531835 http://dx.doi.org/10.3389/fmicb.2021.696921 Text en Copyright © 2021 Tang, Liu, Yin, Pan, Wen, Liu, Kang, Gu, Zhu and Wang. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Microbiology
Tang, Jia-Wei
Liu, Qing-Hua
Yin, Xiao-Cong
Pan, Ya-Cheng
Wen, Peng-Bo
Liu, Xin
Kang, Xing-Xing
Gu, Bing
Zhu, Zuo-Bin
Wang, Liang
Comparative Analysis of Machine Learning Algorithms on Surface Enhanced Raman Spectra of Clinical Staphylococcus Species
title Comparative Analysis of Machine Learning Algorithms on Surface Enhanced Raman Spectra of Clinical Staphylococcus Species
title_full Comparative Analysis of Machine Learning Algorithms on Surface Enhanced Raman Spectra of Clinical Staphylococcus Species
title_fullStr Comparative Analysis of Machine Learning Algorithms on Surface Enhanced Raman Spectra of Clinical Staphylococcus Species
title_full_unstemmed Comparative Analysis of Machine Learning Algorithms on Surface Enhanced Raman Spectra of Clinical Staphylococcus Species
title_short Comparative Analysis of Machine Learning Algorithms on Surface Enhanced Raman Spectra of Clinical Staphylococcus Species
title_sort comparative analysis of machine learning algorithms on surface enhanced raman spectra of clinical staphylococcus species
topic Microbiology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8439569/
https://www.ncbi.nlm.nih.gov/pubmed/34531835
http://dx.doi.org/10.3389/fmicb.2021.696921
work_keys_str_mv AT tangjiawei comparativeanalysisofmachinelearningalgorithmsonsurfaceenhancedramanspectraofclinicalstaphylococcusspecies
AT liuqinghua comparativeanalysisofmachinelearningalgorithmsonsurfaceenhancedramanspectraofclinicalstaphylococcusspecies
AT yinxiaocong comparativeanalysisofmachinelearningalgorithmsonsurfaceenhancedramanspectraofclinicalstaphylococcusspecies
AT panyacheng comparativeanalysisofmachinelearningalgorithmsonsurfaceenhancedramanspectraofclinicalstaphylococcusspecies
AT wenpengbo comparativeanalysisofmachinelearningalgorithmsonsurfaceenhancedramanspectraofclinicalstaphylococcusspecies
AT liuxin comparativeanalysisofmachinelearningalgorithmsonsurfaceenhancedramanspectraofclinicalstaphylococcusspecies
AT kangxingxing comparativeanalysisofmachinelearningalgorithmsonsurfaceenhancedramanspectraofclinicalstaphylococcusspecies
AT gubing comparativeanalysisofmachinelearningalgorithmsonsurfaceenhancedramanspectraofclinicalstaphylococcusspecies
AT zhuzuobin comparativeanalysisofmachinelearningalgorithmsonsurfaceenhancedramanspectraofclinicalstaphylococcusspecies
AT wangliang comparativeanalysisofmachinelearningalgorithmsonsurfaceenhancedramanspectraofclinicalstaphylococcusspecies