Cargando…

SPA-STOCSY: An Automated Tool for Identification of Annotated and Non-Annotated Metabolites in High-Throughput NMR Spectra

Nuclear Magnetic Resonance (NMR) spectroscopy is widely used to analyze metabolites in biological samples, but the analysis can be cumbersome and inaccurate. Here, we present a powerful automated tool, SPA-STOCSY (Spatial Clustering Algorithm - Statistical Total Correlation Spectroscopy), which over...

Descripción completa

Detalles Bibliográficos
Autores principales: Han, Xu, Wang, Wanli, Ma, Li-Hua, Al-Ramahi, Ismael, Botas, Juan, MacKenzie, Kevin, Allen, Genevera I., Young, Damian W., Liu, Zhandong, Maletic-Savatic, Mirjana
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9980041/
https://www.ncbi.nlm.nih.gov/pubmed/36865102
http://dx.doi.org/10.1101/2023.02.22.529564
_version_ 1784899841022754816
author Han, Xu
Wang, Wanli
Ma, Li-Hua
Al-Ramahi, Ismael
Botas, Juan
MacKenzie, Kevin
Allen, Genevera I.
Young, Damian W.
Liu, Zhandong
Maletic-Savatic, Mirjana
author_facet Han, Xu
Wang, Wanli
Ma, Li-Hua
Al-Ramahi, Ismael
Botas, Juan
MacKenzie, Kevin
Allen, Genevera I.
Young, Damian W.
Liu, Zhandong
Maletic-Savatic, Mirjana
author_sort Han, Xu
collection PubMed
description Nuclear Magnetic Resonance (NMR) spectroscopy is widely used to analyze metabolites in biological samples, but the analysis can be cumbersome and inaccurate. Here, we present a powerful automated tool, SPA-STOCSY (Spatial Clustering Algorithm - Statistical Total Correlation Spectroscopy), which overcomes the challenges by identifying metabolites in each sample with high accuracy. As a data-driven method, SPA-STOCSY estimates all parameters from the input dataset, first investigating the covariance pattern and then calculating the optimal threshold with which to cluster data points belonging to the same structural unit, i.e. metabolite. The generated clusters are then automatically linked to a compound library to identify candidates. To assess SPA-STOCSY’s efficiency and accuracy, we applied it to synthesized and real NMR data obtained from Drosophila melanogaster brains and human embryonic stem cells. In the synthesized spectra, SPA outperforms Statistical Recoupling of Variables, an existing method for clustering spectral peaks, by capturing a higher percentage of the signal regions and the close-to-zero noise regions. In the real spectra, SPA-STOCSY performs comparably to operator-based Chenomx analysis but avoids operator bias and performs the analyses in less than seven minutes of total computation time. Overall, SPA-STOCSY is a fast, accurate, and unbiased tool for untargeted analysis of metabolites in the NMR spectra. As such, it might accelerate the utilization of NMR for scientific discoveries, medical diagnostics, and patient-specific decision making.
format Online
Article
Text
id pubmed-9980041
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-99800412023-03-03 SPA-STOCSY: An Automated Tool for Identification of Annotated and Non-Annotated Metabolites in High-Throughput NMR Spectra Han, Xu Wang, Wanli Ma, Li-Hua Al-Ramahi, Ismael Botas, Juan MacKenzie, Kevin Allen, Genevera I. Young, Damian W. Liu, Zhandong Maletic-Savatic, Mirjana bioRxiv Article Nuclear Magnetic Resonance (NMR) spectroscopy is widely used to analyze metabolites in biological samples, but the analysis can be cumbersome and inaccurate. Here, we present a powerful automated tool, SPA-STOCSY (Spatial Clustering Algorithm - Statistical Total Correlation Spectroscopy), which overcomes the challenges by identifying metabolites in each sample with high accuracy. As a data-driven method, SPA-STOCSY estimates all parameters from the input dataset, first investigating the covariance pattern and then calculating the optimal threshold with which to cluster data points belonging to the same structural unit, i.e. metabolite. The generated clusters are then automatically linked to a compound library to identify candidates. To assess SPA-STOCSY’s efficiency and accuracy, we applied it to synthesized and real NMR data obtained from Drosophila melanogaster brains and human embryonic stem cells. In the synthesized spectra, SPA outperforms Statistical Recoupling of Variables, an existing method for clustering spectral peaks, by capturing a higher percentage of the signal regions and the close-to-zero noise regions. In the real spectra, SPA-STOCSY performs comparably to operator-based Chenomx analysis but avoids operator bias and performs the analyses in less than seven minutes of total computation time. Overall, SPA-STOCSY is a fast, accurate, and unbiased tool for untargeted analysis of metabolites in the NMR spectra. As such, it might accelerate the utilization of NMR for scientific discoveries, medical diagnostics, and patient-specific decision making. Cold Spring Harbor Laboratory 2023-02-22 /pmc/articles/PMC9980041/ /pubmed/36865102 http://dx.doi.org/10.1101/2023.02.22.529564 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator.
spellingShingle Article
Han, Xu
Wang, Wanli
Ma, Li-Hua
Al-Ramahi, Ismael
Botas, Juan
MacKenzie, Kevin
Allen, Genevera I.
Young, Damian W.
Liu, Zhandong
Maletic-Savatic, Mirjana
SPA-STOCSY: An Automated Tool for Identification of Annotated and Non-Annotated Metabolites in High-Throughput NMR Spectra
title SPA-STOCSY: An Automated Tool for Identification of Annotated and Non-Annotated Metabolites in High-Throughput NMR Spectra
title_full SPA-STOCSY: An Automated Tool for Identification of Annotated and Non-Annotated Metabolites in High-Throughput NMR Spectra
title_fullStr SPA-STOCSY: An Automated Tool for Identification of Annotated and Non-Annotated Metabolites in High-Throughput NMR Spectra
title_full_unstemmed SPA-STOCSY: An Automated Tool for Identification of Annotated and Non-Annotated Metabolites in High-Throughput NMR Spectra
title_short SPA-STOCSY: An Automated Tool for Identification of Annotated and Non-Annotated Metabolites in High-Throughput NMR Spectra
title_sort spa-stocsy: an automated tool for identification of annotated and non-annotated metabolites in high-throughput nmr spectra
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9980041/
https://www.ncbi.nlm.nih.gov/pubmed/36865102
http://dx.doi.org/10.1101/2023.02.22.529564
work_keys_str_mv AT hanxu spastocsyanautomatedtoolforidentificationofannotatedandnonannotatedmetabolitesinhighthroughputnmrspectra
AT wangwanli spastocsyanautomatedtoolforidentificationofannotatedandnonannotatedmetabolitesinhighthroughputnmrspectra
AT malihua spastocsyanautomatedtoolforidentificationofannotatedandnonannotatedmetabolitesinhighthroughputnmrspectra
AT alramahiismael spastocsyanautomatedtoolforidentificationofannotatedandnonannotatedmetabolitesinhighthroughputnmrspectra
AT botasjuan spastocsyanautomatedtoolforidentificationofannotatedandnonannotatedmetabolitesinhighthroughputnmrspectra
AT mackenziekevin spastocsyanautomatedtoolforidentificationofannotatedandnonannotatedmetabolitesinhighthroughputnmrspectra
AT allengeneverai spastocsyanautomatedtoolforidentificationofannotatedandnonannotatedmetabolitesinhighthroughputnmrspectra
AT youngdamianw spastocsyanautomatedtoolforidentificationofannotatedandnonannotatedmetabolitesinhighthroughputnmrspectra
AT liuzhandong spastocsyanautomatedtoolforidentificationofannotatedandnonannotatedmetabolitesinhighthroughputnmrspectra
AT maleticsavaticmirjana spastocsyanautomatedtoolforidentificationofannotatedandnonannotatedmetabolitesinhighthroughputnmrspectra