Cargando…
SPA-STOCSY: an automated tool for identifying annotated and non-annotated metabolites in high-throughput NMR spectra
MOTIVATION: Nuclear magnetic resonance spectroscopy (NMR) is widely used to analyze metabolites in biological samples, but the analysis requires specific expertise, it is time-consuming, and can be inaccurate. Here, we present a powerful automate tool, SPatial clustering Algorithm-Statistical TOtal...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10568371/ https://www.ncbi.nlm.nih.gov/pubmed/37792497 http://dx.doi.org/10.1093/bioinformatics/btad593 |
_version_ | 1785119347097731072 |
---|---|
author | Han, Xu Wang, Wanli Ma, Li-Hua AI-Ramahi, Ismael Botas, Juan MacKenzie, Kevin Allen, Genevera I Young, Damian W Liu, Zhandong Maletic-Savatic, Mirjana |
author_facet | Han, Xu Wang, Wanli Ma, Li-Hua AI-Ramahi, Ismael Botas, Juan MacKenzie, Kevin Allen, Genevera I Young, Damian W Liu, Zhandong Maletic-Savatic, Mirjana |
author_sort | Han, Xu |
collection | PubMed |
description | MOTIVATION: Nuclear magnetic resonance spectroscopy (NMR) is widely used to analyze metabolites in biological samples, but the analysis requires specific expertise, it is time-consuming, and can be inaccurate. Here, we present a powerful automate tool, SPatial clustering Algorithm-Statistical TOtal Correlation SpectroscopY (SPA-STOCSY), which overcomes challenges faced when analyzing NMR data and identifies metabolites in a sample with high accuracy. RESULTS: As a data-driven method, SPA-STOCSY estimates all parameters from the input dataset. It first investigates the covariance pattern among datapoints and then calculates the optimal threshold with which to cluster datapoints belonging to the same structural unit, i.e. the metabolite. Generated clusters are then automatically linked to a metabolite library to identify candidates. To assess SPA-STOCSY’s efficiency and accuracy, we applied it to synthesized spectra and spectra acquired on Drosophila melanogaster tissue and human embryonic stem cells. In the synthesized spectra, SPA outperformed Statistical Recoupling of Variables (SRV), an existing method for clustering spectral peaks, by capturing a higher percentage of the signal regions and the close-to-zero noise regions. In the biological data, SPA-STOCSY performed comparably to the operator-based Chenomx analysis while avoiding operator bias, and it required <7 min of total computation time. Overall, SPA-STOCSY is a fast, accurate, and unbiased tool for untargeted analysis of metabolites in the NMR spectra. It may thus accelerate the use of NMR for scientific discoveries, medical diagnostics, and patient-specific decision making. AVAILABILITY AND IMPLEMENTATION: The codes of SPA-STOCSY are available at https://github.com/LiuzLab/SPA-STOCSY. |
format | Online Article Text |
id | pubmed-10568371 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-105683712023-10-13 SPA-STOCSY: an automated tool for identifying annotated and non-annotated metabolites in high-throughput NMR spectra Han, Xu Wang, Wanli Ma, Li-Hua AI-Ramahi, Ismael Botas, Juan MacKenzie, Kevin Allen, Genevera I Young, Damian W Liu, Zhandong Maletic-Savatic, Mirjana Bioinformatics Original Paper MOTIVATION: Nuclear magnetic resonance spectroscopy (NMR) is widely used to analyze metabolites in biological samples, but the analysis requires specific expertise, it is time-consuming, and can be inaccurate. Here, we present a powerful automate tool, SPatial clustering Algorithm-Statistical TOtal Correlation SpectroscopY (SPA-STOCSY), which overcomes challenges faced when analyzing NMR data and identifies metabolites in a sample with high accuracy. RESULTS: As a data-driven method, SPA-STOCSY estimates all parameters from the input dataset. It first investigates the covariance pattern among datapoints and then calculates the optimal threshold with which to cluster datapoints belonging to the same structural unit, i.e. the metabolite. Generated clusters are then automatically linked to a metabolite library to identify candidates. To assess SPA-STOCSY’s efficiency and accuracy, we applied it to synthesized spectra and spectra acquired on Drosophila melanogaster tissue and human embryonic stem cells. In the synthesized spectra, SPA outperformed Statistical Recoupling of Variables (SRV), an existing method for clustering spectral peaks, by capturing a higher percentage of the signal regions and the close-to-zero noise regions. In the biological data, SPA-STOCSY performed comparably to the operator-based Chenomx analysis while avoiding operator bias, and it required <7 min of total computation time. Overall, SPA-STOCSY is a fast, accurate, and unbiased tool for untargeted analysis of metabolites in the NMR spectra. It may thus accelerate the use of NMR for scientific discoveries, medical diagnostics, and patient-specific decision making. AVAILABILITY AND IMPLEMENTATION: The codes of SPA-STOCSY are available at https://github.com/LiuzLab/SPA-STOCSY. Oxford University Press 2023-10-04 /pmc/articles/PMC10568371/ /pubmed/37792497 http://dx.doi.org/10.1093/bioinformatics/btad593 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Paper Han, Xu Wang, Wanli Ma, Li-Hua AI-Ramahi, Ismael Botas, Juan MacKenzie, Kevin Allen, Genevera I Young, Damian W Liu, Zhandong Maletic-Savatic, Mirjana SPA-STOCSY: an automated tool for identifying annotated and non-annotated metabolites in high-throughput NMR spectra |
title | SPA-STOCSY: an automated tool for identifying annotated and non-annotated metabolites in high-throughput NMR spectra |
title_full | SPA-STOCSY: an automated tool for identifying annotated and non-annotated metabolites in high-throughput NMR spectra |
title_fullStr | SPA-STOCSY: an automated tool for identifying annotated and non-annotated metabolites in high-throughput NMR spectra |
title_full_unstemmed | SPA-STOCSY: an automated tool for identifying annotated and non-annotated metabolites in high-throughput NMR spectra |
title_short | SPA-STOCSY: an automated tool for identifying annotated and non-annotated metabolites in high-throughput NMR spectra |
title_sort | spa-stocsy: an automated tool for identifying annotated and non-annotated metabolites in high-throughput nmr spectra |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10568371/ https://www.ncbi.nlm.nih.gov/pubmed/37792497 http://dx.doi.org/10.1093/bioinformatics/btad593 |
work_keys_str_mv | AT hanxu spastocsyanautomatedtoolforidentifyingannotatedandnonannotatedmetabolitesinhighthroughputnmrspectra AT wangwanli spastocsyanautomatedtoolforidentifyingannotatedandnonannotatedmetabolitesinhighthroughputnmrspectra AT malihua spastocsyanautomatedtoolforidentifyingannotatedandnonannotatedmetabolitesinhighthroughputnmrspectra AT airamahiismael spastocsyanautomatedtoolforidentifyingannotatedandnonannotatedmetabolitesinhighthroughputnmrspectra AT botasjuan spastocsyanautomatedtoolforidentifyingannotatedandnonannotatedmetabolitesinhighthroughputnmrspectra AT mackenziekevin spastocsyanautomatedtoolforidentifyingannotatedandnonannotatedmetabolitesinhighthroughputnmrspectra AT allengeneverai spastocsyanautomatedtoolforidentifyingannotatedandnonannotatedmetabolitesinhighthroughputnmrspectra AT youngdamianw spastocsyanautomatedtoolforidentifyingannotatedandnonannotatedmetabolitesinhighthroughputnmrspectra AT liuzhandong spastocsyanautomatedtoolforidentifyingannotatedandnonannotatedmetabolitesinhighthroughputnmrspectra AT maleticsavaticmirjana spastocsyanautomatedtoolforidentifyingannotatedandnonannotatedmetabolitesinhighthroughputnmrspectra |