Cargando…

SPA-STOCSY: an automated tool for identifying annotated and non-annotated metabolites in high-throughput NMR spectra

MOTIVATION: Nuclear magnetic resonance spectroscopy (NMR) is widely used to analyze metabolites in biological samples, but the analysis requires specific expertise, it is time-consuming, and can be inaccurate. Here, we present a powerful automate tool, SPatial clustering Algorithm-Statistical TOtal...

Descripción completa

Detalles Bibliográficos
Autores principales: Han, Xu, Wang, Wanli, Ma, Li-Hua, AI-Ramahi, Ismael, Botas, Juan, MacKenzie, Kevin, Allen, Genevera I, Young, Damian W, Liu, Zhandong, Maletic-Savatic, Mirjana
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10568371/
https://www.ncbi.nlm.nih.gov/pubmed/37792497
http://dx.doi.org/10.1093/bioinformatics/btad593
_version_ 1785119347097731072
author Han, Xu
Wang, Wanli
Ma, Li-Hua
AI-Ramahi, Ismael
Botas, Juan
MacKenzie, Kevin
Allen, Genevera I
Young, Damian W
Liu, Zhandong
Maletic-Savatic, Mirjana
author_facet Han, Xu
Wang, Wanli
Ma, Li-Hua
AI-Ramahi, Ismael
Botas, Juan
MacKenzie, Kevin
Allen, Genevera I
Young, Damian W
Liu, Zhandong
Maletic-Savatic, Mirjana
author_sort Han, Xu
collection PubMed
description MOTIVATION: Nuclear magnetic resonance spectroscopy (NMR) is widely used to analyze metabolites in biological samples, but the analysis requires specific expertise, it is time-consuming, and can be inaccurate. Here, we present a powerful automate tool, SPatial clustering Algorithm-Statistical TOtal Correlation SpectroscopY (SPA-STOCSY), which overcomes challenges faced when analyzing NMR data and identifies metabolites in a sample with high accuracy. RESULTS: As a data-driven method, SPA-STOCSY estimates all parameters from the input dataset. It first investigates the covariance pattern among datapoints and then calculates the optimal threshold with which to cluster datapoints belonging to the same structural unit, i.e. the metabolite. Generated clusters are then automatically linked to a metabolite library to identify candidates. To assess SPA-STOCSY’s efficiency and accuracy, we applied it to synthesized spectra and spectra acquired on Drosophila melanogaster tissue and human embryonic stem cells. In the synthesized spectra, SPA outperformed Statistical Recoupling of Variables (SRV), an existing method for clustering spectral peaks, by capturing a higher percentage of the signal regions and the close-to-zero noise regions. In the biological data, SPA-STOCSY performed comparably to the operator-based Chenomx analysis while avoiding operator bias, and it required <7 min of total computation time. Overall, SPA-STOCSY is a fast, accurate, and unbiased tool for untargeted analysis of metabolites in the NMR spectra. It may thus accelerate the use of NMR for scientific discoveries, medical diagnostics, and patient-specific decision making. AVAILABILITY AND IMPLEMENTATION: The codes of SPA-STOCSY are available at https://github.com/LiuzLab/SPA-STOCSY.
format Online
Article
Text
id pubmed-10568371
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-105683712023-10-13 SPA-STOCSY: an automated tool for identifying annotated and non-annotated metabolites in high-throughput NMR spectra Han, Xu Wang, Wanli Ma, Li-Hua AI-Ramahi, Ismael Botas, Juan MacKenzie, Kevin Allen, Genevera I Young, Damian W Liu, Zhandong Maletic-Savatic, Mirjana Bioinformatics Original Paper MOTIVATION: Nuclear magnetic resonance spectroscopy (NMR) is widely used to analyze metabolites in biological samples, but the analysis requires specific expertise, it is time-consuming, and can be inaccurate. Here, we present a powerful automate tool, SPatial clustering Algorithm-Statistical TOtal Correlation SpectroscopY (SPA-STOCSY), which overcomes challenges faced when analyzing NMR data and identifies metabolites in a sample with high accuracy. RESULTS: As a data-driven method, SPA-STOCSY estimates all parameters from the input dataset. It first investigates the covariance pattern among datapoints and then calculates the optimal threshold with which to cluster datapoints belonging to the same structural unit, i.e. the metabolite. Generated clusters are then automatically linked to a metabolite library to identify candidates. To assess SPA-STOCSY’s efficiency and accuracy, we applied it to synthesized spectra and spectra acquired on Drosophila melanogaster tissue and human embryonic stem cells. In the synthesized spectra, SPA outperformed Statistical Recoupling of Variables (SRV), an existing method for clustering spectral peaks, by capturing a higher percentage of the signal regions and the close-to-zero noise regions. In the biological data, SPA-STOCSY performed comparably to the operator-based Chenomx analysis while avoiding operator bias, and it required <7 min of total computation time. Overall, SPA-STOCSY is a fast, accurate, and unbiased tool for untargeted analysis of metabolites in the NMR spectra. It may thus accelerate the use of NMR for scientific discoveries, medical diagnostics, and patient-specific decision making. AVAILABILITY AND IMPLEMENTATION: The codes of SPA-STOCSY are available at https://github.com/LiuzLab/SPA-STOCSY. Oxford University Press 2023-10-04 /pmc/articles/PMC10568371/ /pubmed/37792497 http://dx.doi.org/10.1093/bioinformatics/btad593 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Paper
Han, Xu
Wang, Wanli
Ma, Li-Hua
AI-Ramahi, Ismael
Botas, Juan
MacKenzie, Kevin
Allen, Genevera I
Young, Damian W
Liu, Zhandong
Maletic-Savatic, Mirjana
SPA-STOCSY: an automated tool for identifying annotated and non-annotated metabolites in high-throughput NMR spectra
title SPA-STOCSY: an automated tool for identifying annotated and non-annotated metabolites in high-throughput NMR spectra
title_full SPA-STOCSY: an automated tool for identifying annotated and non-annotated metabolites in high-throughput NMR spectra
title_fullStr SPA-STOCSY: an automated tool for identifying annotated and non-annotated metabolites in high-throughput NMR spectra
title_full_unstemmed SPA-STOCSY: an automated tool for identifying annotated and non-annotated metabolites in high-throughput NMR spectra
title_short SPA-STOCSY: an automated tool for identifying annotated and non-annotated metabolites in high-throughput NMR spectra
title_sort spa-stocsy: an automated tool for identifying annotated and non-annotated metabolites in high-throughput nmr spectra
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10568371/
https://www.ncbi.nlm.nih.gov/pubmed/37792497
http://dx.doi.org/10.1093/bioinformatics/btad593
work_keys_str_mv AT hanxu spastocsyanautomatedtoolforidentifyingannotatedandnonannotatedmetabolitesinhighthroughputnmrspectra
AT wangwanli spastocsyanautomatedtoolforidentifyingannotatedandnonannotatedmetabolitesinhighthroughputnmrspectra
AT malihua spastocsyanautomatedtoolforidentifyingannotatedandnonannotatedmetabolitesinhighthroughputnmrspectra
AT airamahiismael spastocsyanautomatedtoolforidentifyingannotatedandnonannotatedmetabolitesinhighthroughputnmrspectra
AT botasjuan spastocsyanautomatedtoolforidentifyingannotatedandnonannotatedmetabolitesinhighthroughputnmrspectra
AT mackenziekevin spastocsyanautomatedtoolforidentifyingannotatedandnonannotatedmetabolitesinhighthroughputnmrspectra
AT allengeneverai spastocsyanautomatedtoolforidentifyingannotatedandnonannotatedmetabolitesinhighthroughputnmrspectra
AT youngdamianw spastocsyanautomatedtoolforidentifyingannotatedandnonannotatedmetabolitesinhighthroughputnmrspectra
AT liuzhandong spastocsyanautomatedtoolforidentifyingannotatedandnonannotatedmetabolitesinhighthroughputnmrspectra
AT maleticsavaticmirjana spastocsyanautomatedtoolforidentifyingannotatedandnonannotatedmetabolitesinhighthroughputnmrspectra