Cargando…

State-of-the art data normalization methods improve NMR-based metabolomic analysis

Extracting biomedical information from large metabolomic datasets by multivariate data analysis is of considerable complexity. Common challenges include among others screening for differentially produced metabolites, estimation of fold changes, and sample classification. Prior to these analysis step...

Descripción completa

Detalles Bibliográficos
Autores principales: Kohl, Stefanie M., Klein, Matthias S., Hochrein, Jochen, Oefner, Peter J., Spang, Rainer, Gronwald, Wolfram
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer US 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3337420/
https://www.ncbi.nlm.nih.gov/pubmed/22593726
http://dx.doi.org/10.1007/s11306-011-0350-z
_version_ 1782231074623979520
author Kohl, Stefanie M.
Klein, Matthias S.
Hochrein, Jochen
Oefner, Peter J.
Spang, Rainer
Gronwald, Wolfram
author_facet Kohl, Stefanie M.
Klein, Matthias S.
Hochrein, Jochen
Oefner, Peter J.
Spang, Rainer
Gronwald, Wolfram
author_sort Kohl, Stefanie M.
collection PubMed
description Extracting biomedical information from large metabolomic datasets by multivariate data analysis is of considerable complexity. Common challenges include among others screening for differentially produced metabolites, estimation of fold changes, and sample classification. Prior to these analysis steps, it is important to minimize contributions from unwanted biases and experimental variance. This is the goal of data preprocessing. In this work, different data normalization methods were compared systematically employing two different datasets generated by means of nuclear magnetic resonance (NMR) spectroscopy. To this end, two different types of normalization methods were used, one aiming to remove unwanted sample-to-sample variation while the other adjusts the variance of the different metabolites by variable scaling and variance stabilization methods. The impact of all methods tested on sample classification was evaluated on urinary NMR fingerprints obtained from healthy volunteers and patients suffering from autosomal polycystic kidney disease (ADPKD). Performance in terms of screening for differentially produced metabolites was investigated on a dataset following a Latin-square design, where varied amounts of 8 different metabolites were spiked into a human urine matrix while keeping the total spike-in amount constant. In addition, specific tests were conducted to systematically investigate the influence of the different preprocessing methods on the structure of the analyzed data. In conclusion, preprocessing methods originally developed for DNA microarray analysis, in particular, Quantile and Cubic-Spline Normalization, performed best in reducing bias, accurately detecting fold changes, and classifying samples. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s11306-011-0350-z) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-3337420
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Springer US
record_format MEDLINE/PubMed
spelling pubmed-33374202012-05-14 State-of-the art data normalization methods improve NMR-based metabolomic analysis Kohl, Stefanie M. Klein, Matthias S. Hochrein, Jochen Oefner, Peter J. Spang, Rainer Gronwald, Wolfram Metabolomics Original Article Extracting biomedical information from large metabolomic datasets by multivariate data analysis is of considerable complexity. Common challenges include among others screening for differentially produced metabolites, estimation of fold changes, and sample classification. Prior to these analysis steps, it is important to minimize contributions from unwanted biases and experimental variance. This is the goal of data preprocessing. In this work, different data normalization methods were compared systematically employing two different datasets generated by means of nuclear magnetic resonance (NMR) spectroscopy. To this end, two different types of normalization methods were used, one aiming to remove unwanted sample-to-sample variation while the other adjusts the variance of the different metabolites by variable scaling and variance stabilization methods. The impact of all methods tested on sample classification was evaluated on urinary NMR fingerprints obtained from healthy volunteers and patients suffering from autosomal polycystic kidney disease (ADPKD). Performance in terms of screening for differentially produced metabolites was investigated on a dataset following a Latin-square design, where varied amounts of 8 different metabolites were spiked into a human urine matrix while keeping the total spike-in amount constant. In addition, specific tests were conducted to systematically investigate the influence of the different preprocessing methods on the structure of the analyzed data. In conclusion, preprocessing methods originally developed for DNA microarray analysis, in particular, Quantile and Cubic-Spline Normalization, performed best in reducing bias, accurately detecting fold changes, and classifying samples. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s11306-011-0350-z) contains supplementary material, which is available to authorized users. Springer US 2011-08-12 2012 /pmc/articles/PMC3337420/ /pubmed/22593726 http://dx.doi.org/10.1007/s11306-011-0350-z Text en © The Author(s) 2011 https://creativecommons.org/licenses/by-nc/4.0/ This article is distributed under the terms of the Creative Commons Attribution Noncommercial License which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.
spellingShingle Original Article
Kohl, Stefanie M.
Klein, Matthias S.
Hochrein, Jochen
Oefner, Peter J.
Spang, Rainer
Gronwald, Wolfram
State-of-the art data normalization methods improve NMR-based metabolomic analysis
title State-of-the art data normalization methods improve NMR-based metabolomic analysis
title_full State-of-the art data normalization methods improve NMR-based metabolomic analysis
title_fullStr State-of-the art data normalization methods improve NMR-based metabolomic analysis
title_full_unstemmed State-of-the art data normalization methods improve NMR-based metabolomic analysis
title_short State-of-the art data normalization methods improve NMR-based metabolomic analysis
title_sort state-of-the art data normalization methods improve nmr-based metabolomic analysis
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3337420/
https://www.ncbi.nlm.nih.gov/pubmed/22593726
http://dx.doi.org/10.1007/s11306-011-0350-z
work_keys_str_mv AT kohlstefaniem stateoftheartdatanormalizationmethodsimprovenmrbasedmetabolomicanalysis
AT kleinmatthiass stateoftheartdatanormalizationmethodsimprovenmrbasedmetabolomicanalysis
AT hochreinjochen stateoftheartdatanormalizationmethodsimprovenmrbasedmetabolomicanalysis
AT oefnerpeterj stateoftheartdatanormalizationmethodsimprovenmrbasedmetabolomicanalysis
AT spangrainer stateoftheartdatanormalizationmethodsimprovenmrbasedmetabolomicanalysis
AT gronwaldwolfram stateoftheartdatanormalizationmethodsimprovenmrbasedmetabolomicanalysis