Cargando…

Gaussian process regression model for normalization of LC-MS data using scan-level information

BACKGROUND: Differences in sample collection, biomolecule extraction, and instrument variability introduce bias to data generated by liquid chromatography coupled with mass spectrometry (LC-MS). Normalization is used to address these issues. In this paper, we introduce a new normalization method usi...

Descripción completa

Detalles Bibliográficos
Autores principales: Nezami Ranjbar, Mohammad R, Zhao, Yi, Tadesse, Mahlet G, Wang, Yue, Ressom, Habtom W
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3908948/
https://www.ncbi.nlm.nih.gov/pubmed/24564985
http://dx.doi.org/10.1186/1477-5956-11-S1-S13
_version_ 1782301767928643584
author Nezami Ranjbar, Mohammad R
Zhao, Yi
Tadesse, Mahlet G
Wang, Yue
Ressom, Habtom W
author_facet Nezami Ranjbar, Mohammad R
Zhao, Yi
Tadesse, Mahlet G
Wang, Yue
Ressom, Habtom W
author_sort Nezami Ranjbar, Mohammad R
collection PubMed
description BACKGROUND: Differences in sample collection, biomolecule extraction, and instrument variability introduce bias to data generated by liquid chromatography coupled with mass spectrometry (LC-MS). Normalization is used to address these issues. In this paper, we introduce a new normalization method using the Gaussian process regression model (GPRM) that utilizes information from individual scans within an extracted ion chromatogram (EIC) of a peak. The proposed method is particularly applicable for normalization based on analysis order of LC-MS runs. Our method uses measurement variabilities estimated through LC-MS data acquired from quality control samples to correct for bias caused by instrument drift. Maximum likelihood approach is used to find the optimal parameters for the fitted GPRM. We review several normalization methods and compare their performance with GPRM. RESULTS: To evaluate the performance of different normalization methods, we consider LC-MS data from a study where metabolomic approach is utilized to discover biomarkers for liver cancer. The LC-MS data were acquired by analysis of sera from liver cancer patients and cirrhotic controls. In addition, LC-MS runs from a quality control (QC) sample are included to assess the run to run variability and to evaluate the ability of various normalization method in reducing this undesired variability. Also, ANOVA models are applied to the normalized LC-MS data to identify ions with intensity measurements that are significantly different between cases and controls. CONCLUSIONS: One of the challenges in using label-free LC-MS for quantitation of biomolecules is systematic bias in measurements. Several normalization methods have been introduced to overcome this issue, but there is no universally applicable approach at the present time. Each data set should be carefully examined to determine the most appropriate normalization method. We review here several existing methods and introduce the GPRM for normalization of LC-MS data. Through our in-house data set, we show that the GPRM outperforms other normalization methods considered here, in terms of decreasing the variability of ion intensities among quality control runs.
format Online
Article
Text
id pubmed-3908948
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-39089482014-02-13 Gaussian process regression model for normalization of LC-MS data using scan-level information Nezami Ranjbar, Mohammad R Zhao, Yi Tadesse, Mahlet G Wang, Yue Ressom, Habtom W Proteome Sci Research BACKGROUND: Differences in sample collection, biomolecule extraction, and instrument variability introduce bias to data generated by liquid chromatography coupled with mass spectrometry (LC-MS). Normalization is used to address these issues. In this paper, we introduce a new normalization method using the Gaussian process regression model (GPRM) that utilizes information from individual scans within an extracted ion chromatogram (EIC) of a peak. The proposed method is particularly applicable for normalization based on analysis order of LC-MS runs. Our method uses measurement variabilities estimated through LC-MS data acquired from quality control samples to correct for bias caused by instrument drift. Maximum likelihood approach is used to find the optimal parameters for the fitted GPRM. We review several normalization methods and compare their performance with GPRM. RESULTS: To evaluate the performance of different normalization methods, we consider LC-MS data from a study where metabolomic approach is utilized to discover biomarkers for liver cancer. The LC-MS data were acquired by analysis of sera from liver cancer patients and cirrhotic controls. In addition, LC-MS runs from a quality control (QC) sample are included to assess the run to run variability and to evaluate the ability of various normalization method in reducing this undesired variability. Also, ANOVA models are applied to the normalized LC-MS data to identify ions with intensity measurements that are significantly different between cases and controls. CONCLUSIONS: One of the challenges in using label-free LC-MS for quantitation of biomolecules is systematic bias in measurements. Several normalization methods have been introduced to overcome this issue, but there is no universally applicable approach at the present time. Each data set should be carefully examined to determine the most appropriate normalization method. We review here several existing methods and introduce the GPRM for normalization of LC-MS data. Through our in-house data set, we show that the GPRM outperforms other normalization methods considered here, in terms of decreasing the variability of ion intensities among quality control runs. BioMed Central 2013-11-07 /pmc/articles/PMC3908948/ /pubmed/24564985 http://dx.doi.org/10.1186/1477-5956-11-S1-S13 Text en Copyright © 2013 Nezami Ranjbar et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Nezami Ranjbar, Mohammad R
Zhao, Yi
Tadesse, Mahlet G
Wang, Yue
Ressom, Habtom W
Gaussian process regression model for normalization of LC-MS data using scan-level information
title Gaussian process regression model for normalization of LC-MS data using scan-level information
title_full Gaussian process regression model for normalization of LC-MS data using scan-level information
title_fullStr Gaussian process regression model for normalization of LC-MS data using scan-level information
title_full_unstemmed Gaussian process regression model for normalization of LC-MS data using scan-level information
title_short Gaussian process regression model for normalization of LC-MS data using scan-level information
title_sort gaussian process regression model for normalization of lc-ms data using scan-level information
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3908948/
https://www.ncbi.nlm.nih.gov/pubmed/24564985
http://dx.doi.org/10.1186/1477-5956-11-S1-S13
work_keys_str_mv AT nezamiranjbarmohammadr gaussianprocessregressionmodelfornormalizationoflcmsdatausingscanlevelinformation
AT zhaoyi gaussianprocessregressionmodelfornormalizationoflcmsdatausingscanlevelinformation
AT tadessemahletg gaussianprocessregressionmodelfornormalizationoflcmsdatausingscanlevelinformation
AT wangyue gaussianprocessregressionmodelfornormalizationoflcmsdatausingscanlevelinformation
AT ressomhabtomw gaussianprocessregressionmodelfornormalizationoflcmsdatausingscanlevelinformation