Cargando…

An integrated workflow for robust alignment and simplified quantitative analysis of NMR spectrometry data

BACKGROUND: Nuclear magnetic resonance spectroscopy (NMR) is a powerful technique to reveal and compare quantitative metabolic profiles of biological tissues. However, chemical and physical sample variations make the analysis of the data challenging, and typically require the application of a number...

Descripción completa

Detalles Bibliográficos
Autores principales: Vu, Trung N, Valkenborg, Dirk, Smets, Koen, Verwaest, Kim A, Dommisse, Roger, Lemière, Filip, Verschoren, Alain, Goethals, Bart, Laukens, Kris
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3217056/
https://www.ncbi.nlm.nih.gov/pubmed/22014236
http://dx.doi.org/10.1186/1471-2105-12-405
_version_ 1782216581242159104
author Vu, Trung N
Valkenborg, Dirk
Smets, Koen
Verwaest, Kim A
Dommisse, Roger
Lemière, Filip
Verschoren, Alain
Goethals, Bart
Laukens, Kris
author_facet Vu, Trung N
Valkenborg, Dirk
Smets, Koen
Verwaest, Kim A
Dommisse, Roger
Lemière, Filip
Verschoren, Alain
Goethals, Bart
Laukens, Kris
author_sort Vu, Trung N
collection PubMed
description BACKGROUND: Nuclear magnetic resonance spectroscopy (NMR) is a powerful technique to reveal and compare quantitative metabolic profiles of biological tissues. However, chemical and physical sample variations make the analysis of the data challenging, and typically require the application of a number of preprocessing steps prior to data interpretation. For example, noise reduction, normalization, baseline correction, peak picking, spectrum alignment and statistical analysis are indispensable components in any NMR analysis pipeline. RESULTS: We introduce a novel suite of informatics tools for the quantitative analysis of NMR metabolomic profile data. The core of the processing cascade is a novel peak alignment algorithm, called hierarchical Cluster-based Peak Alignment (CluPA). The algorithm aligns a target spectrum to the reference spectrum in a top-down fashion by building a hierarchical cluster tree from peak lists of reference and target spectra and then dividing the spectra into smaller segments based on the most distant clusters of the tree. To reduce the computational time to estimate the spectral misalignment, the method makes use of Fast Fourier Transformation (FFT) cross-correlation. Since the method returns a high-quality alignment, we can propose a simple methodology to study the variability of the NMR spectra. For each aligned NMR data point the ratio of the between-group and within-group sum of squares (BW-ratio) is calculated to quantify the difference in variability between and within predefined groups of NMR spectra. This differential analysis is related to the calculation of the F-statistic or a one-way ANOVA, but without distributional assumptions. Statistical inference based on the BW-ratio is achieved by bootstrapping the null distribution from the experimental data. CONCLUSIONS: The workflow performance was evaluated using a previously published dataset. Correlation maps, spectral and grey scale plots show clear improvements in comparison to other methods, and the down-to-earth quantitative analysis works well for the CluPA-aligned spectra. The whole workflow is embedded into a modular and statistically sound framework that is implemented as an R package called "speaq" ("spectrum alignment and quantitation"), which is freely available from http://code.google.com/p/speaq/.
format Online
Article
Text
id pubmed-3217056
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32170562011-11-16 An integrated workflow for robust alignment and simplified quantitative analysis of NMR spectrometry data Vu, Trung N Valkenborg, Dirk Smets, Koen Verwaest, Kim A Dommisse, Roger Lemière, Filip Verschoren, Alain Goethals, Bart Laukens, Kris BMC Bioinformatics Methodology Article BACKGROUND: Nuclear magnetic resonance spectroscopy (NMR) is a powerful technique to reveal and compare quantitative metabolic profiles of biological tissues. However, chemical and physical sample variations make the analysis of the data challenging, and typically require the application of a number of preprocessing steps prior to data interpretation. For example, noise reduction, normalization, baseline correction, peak picking, spectrum alignment and statistical analysis are indispensable components in any NMR analysis pipeline. RESULTS: We introduce a novel suite of informatics tools for the quantitative analysis of NMR metabolomic profile data. The core of the processing cascade is a novel peak alignment algorithm, called hierarchical Cluster-based Peak Alignment (CluPA). The algorithm aligns a target spectrum to the reference spectrum in a top-down fashion by building a hierarchical cluster tree from peak lists of reference and target spectra and then dividing the spectra into smaller segments based on the most distant clusters of the tree. To reduce the computational time to estimate the spectral misalignment, the method makes use of Fast Fourier Transformation (FFT) cross-correlation. Since the method returns a high-quality alignment, we can propose a simple methodology to study the variability of the NMR spectra. For each aligned NMR data point the ratio of the between-group and within-group sum of squares (BW-ratio) is calculated to quantify the difference in variability between and within predefined groups of NMR spectra. This differential analysis is related to the calculation of the F-statistic or a one-way ANOVA, but without distributional assumptions. Statistical inference based on the BW-ratio is achieved by bootstrapping the null distribution from the experimental data. CONCLUSIONS: The workflow performance was evaluated using a previously published dataset. Correlation maps, spectral and grey scale plots show clear improvements in comparison to other methods, and the down-to-earth quantitative analysis works well for the CluPA-aligned spectra. The whole workflow is embedded into a modular and statistically sound framework that is implemented as an R package called "speaq" ("spectrum alignment and quantitation"), which is freely available from http://code.google.com/p/speaq/. BioMed Central 2011-10-20 /pmc/articles/PMC3217056/ /pubmed/22014236 http://dx.doi.org/10.1186/1471-2105-12-405 Text en Copyright ©2011 Vu et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Vu, Trung N
Valkenborg, Dirk
Smets, Koen
Verwaest, Kim A
Dommisse, Roger
Lemière, Filip
Verschoren, Alain
Goethals, Bart
Laukens, Kris
An integrated workflow for robust alignment and simplified quantitative analysis of NMR spectrometry data
title An integrated workflow for robust alignment and simplified quantitative analysis of NMR spectrometry data
title_full An integrated workflow for robust alignment and simplified quantitative analysis of NMR spectrometry data
title_fullStr An integrated workflow for robust alignment and simplified quantitative analysis of NMR spectrometry data
title_full_unstemmed An integrated workflow for robust alignment and simplified quantitative analysis of NMR spectrometry data
title_short An integrated workflow for robust alignment and simplified quantitative analysis of NMR spectrometry data
title_sort integrated workflow for robust alignment and simplified quantitative analysis of nmr spectrometry data
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3217056/
https://www.ncbi.nlm.nih.gov/pubmed/22014236
http://dx.doi.org/10.1186/1471-2105-12-405
work_keys_str_mv AT vutrungn anintegratedworkflowforrobustalignmentandsimplifiedquantitativeanalysisofnmrspectrometrydata
AT valkenborgdirk anintegratedworkflowforrobustalignmentandsimplifiedquantitativeanalysisofnmrspectrometrydata
AT smetskoen anintegratedworkflowforrobustalignmentandsimplifiedquantitativeanalysisofnmrspectrometrydata
AT verwaestkima anintegratedworkflowforrobustalignmentandsimplifiedquantitativeanalysisofnmrspectrometrydata
AT dommisseroger anintegratedworkflowforrobustalignmentandsimplifiedquantitativeanalysisofnmrspectrometrydata
AT lemierefilip anintegratedworkflowforrobustalignmentandsimplifiedquantitativeanalysisofnmrspectrometrydata
AT verschorenalain anintegratedworkflowforrobustalignmentandsimplifiedquantitativeanalysisofnmrspectrometrydata
AT goethalsbart anintegratedworkflowforrobustalignmentandsimplifiedquantitativeanalysisofnmrspectrometrydata
AT laukenskris anintegratedworkflowforrobustalignmentandsimplifiedquantitativeanalysisofnmrspectrometrydata
AT vutrungn integratedworkflowforrobustalignmentandsimplifiedquantitativeanalysisofnmrspectrometrydata
AT valkenborgdirk integratedworkflowforrobustalignmentandsimplifiedquantitativeanalysisofnmrspectrometrydata
AT smetskoen integratedworkflowforrobustalignmentandsimplifiedquantitativeanalysisofnmrspectrometrydata
AT verwaestkima integratedworkflowforrobustalignmentandsimplifiedquantitativeanalysisofnmrspectrometrydata
AT dommisseroger integratedworkflowforrobustalignmentandsimplifiedquantitativeanalysisofnmrspectrometrydata
AT lemierefilip integratedworkflowforrobustalignmentandsimplifiedquantitativeanalysisofnmrspectrometrydata
AT verschorenalain integratedworkflowforrobustalignmentandsimplifiedquantitativeanalysisofnmrspectrometrydata
AT goethalsbart integratedworkflowforrobustalignmentandsimplifiedquantitativeanalysisofnmrspectrometrydata
AT laukenskris integratedworkflowforrobustalignmentandsimplifiedquantitativeanalysisofnmrspectrometrydata