Cargando…
Gaussian process regression model for normalization of LC-MS data using scan-level information
BACKGROUND: Differences in sample collection, biomolecule extraction, and instrument variability introduce bias to data generated by liquid chromatography coupled with mass spectrometry (LC-MS). Normalization is used to address these issues. In this paper, we introduce a new normalization method usi...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3908948/ https://www.ncbi.nlm.nih.gov/pubmed/24564985 http://dx.doi.org/10.1186/1477-5956-11-S1-S13 |
_version_ | 1782301767928643584 |
---|---|
author | Nezami Ranjbar, Mohammad R Zhao, Yi Tadesse, Mahlet G Wang, Yue Ressom, Habtom W |
author_facet | Nezami Ranjbar, Mohammad R Zhao, Yi Tadesse, Mahlet G Wang, Yue Ressom, Habtom W |
author_sort | Nezami Ranjbar, Mohammad R |
collection | PubMed |
description | BACKGROUND: Differences in sample collection, biomolecule extraction, and instrument variability introduce bias to data generated by liquid chromatography coupled with mass spectrometry (LC-MS). Normalization is used to address these issues. In this paper, we introduce a new normalization method using the Gaussian process regression model (GPRM) that utilizes information from individual scans within an extracted ion chromatogram (EIC) of a peak. The proposed method is particularly applicable for normalization based on analysis order of LC-MS runs. Our method uses measurement variabilities estimated through LC-MS data acquired from quality control samples to correct for bias caused by instrument drift. Maximum likelihood approach is used to find the optimal parameters for the fitted GPRM. We review several normalization methods and compare their performance with GPRM. RESULTS: To evaluate the performance of different normalization methods, we consider LC-MS data from a study where metabolomic approach is utilized to discover biomarkers for liver cancer. The LC-MS data were acquired by analysis of sera from liver cancer patients and cirrhotic controls. In addition, LC-MS runs from a quality control (QC) sample are included to assess the run to run variability and to evaluate the ability of various normalization method in reducing this undesired variability. Also, ANOVA models are applied to the normalized LC-MS data to identify ions with intensity measurements that are significantly different between cases and controls. CONCLUSIONS: One of the challenges in using label-free LC-MS for quantitation of biomolecules is systematic bias in measurements. Several normalization methods have been introduced to overcome this issue, but there is no universally applicable approach at the present time. Each data set should be carefully examined to determine the most appropriate normalization method. We review here several existing methods and introduce the GPRM for normalization of LC-MS data. Through our in-house data set, we show that the GPRM outperforms other normalization methods considered here, in terms of decreasing the variability of ion intensities among quality control runs. |
format | Online Article Text |
id | pubmed-3908948 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-39089482014-02-13 Gaussian process regression model for normalization of LC-MS data using scan-level information Nezami Ranjbar, Mohammad R Zhao, Yi Tadesse, Mahlet G Wang, Yue Ressom, Habtom W Proteome Sci Research BACKGROUND: Differences in sample collection, biomolecule extraction, and instrument variability introduce bias to data generated by liquid chromatography coupled with mass spectrometry (LC-MS). Normalization is used to address these issues. In this paper, we introduce a new normalization method using the Gaussian process regression model (GPRM) that utilizes information from individual scans within an extracted ion chromatogram (EIC) of a peak. The proposed method is particularly applicable for normalization based on analysis order of LC-MS runs. Our method uses measurement variabilities estimated through LC-MS data acquired from quality control samples to correct for bias caused by instrument drift. Maximum likelihood approach is used to find the optimal parameters for the fitted GPRM. We review several normalization methods and compare their performance with GPRM. RESULTS: To evaluate the performance of different normalization methods, we consider LC-MS data from a study where metabolomic approach is utilized to discover biomarkers for liver cancer. The LC-MS data were acquired by analysis of sera from liver cancer patients and cirrhotic controls. In addition, LC-MS runs from a quality control (QC) sample are included to assess the run to run variability and to evaluate the ability of various normalization method in reducing this undesired variability. Also, ANOVA models are applied to the normalized LC-MS data to identify ions with intensity measurements that are significantly different between cases and controls. CONCLUSIONS: One of the challenges in using label-free LC-MS for quantitation of biomolecules is systematic bias in measurements. Several normalization methods have been introduced to overcome this issue, but there is no universally applicable approach at the present time. Each data set should be carefully examined to determine the most appropriate normalization method. We review here several existing methods and introduce the GPRM for normalization of LC-MS data. Through our in-house data set, we show that the GPRM outperforms other normalization methods considered here, in terms of decreasing the variability of ion intensities among quality control runs. BioMed Central 2013-11-07 /pmc/articles/PMC3908948/ /pubmed/24564985 http://dx.doi.org/10.1186/1477-5956-11-S1-S13 Text en Copyright © 2013 Nezami Ranjbar et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Nezami Ranjbar, Mohammad R Zhao, Yi Tadesse, Mahlet G Wang, Yue Ressom, Habtom W Gaussian process regression model for normalization of LC-MS data using scan-level information |
title | Gaussian process regression model for normalization of LC-MS data using scan-level information |
title_full | Gaussian process regression model for normalization of LC-MS data using scan-level information |
title_fullStr | Gaussian process regression model for normalization of LC-MS data using scan-level information |
title_full_unstemmed | Gaussian process regression model for normalization of LC-MS data using scan-level information |
title_short | Gaussian process regression model for normalization of LC-MS data using scan-level information |
title_sort | gaussian process regression model for normalization of lc-ms data using scan-level information |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3908948/ https://www.ncbi.nlm.nih.gov/pubmed/24564985 http://dx.doi.org/10.1186/1477-5956-11-S1-S13 |
work_keys_str_mv | AT nezamiranjbarmohammadr gaussianprocessregressionmodelfornormalizationoflcmsdatausingscanlevelinformation AT zhaoyi gaussianprocessregressionmodelfornormalizationoflcmsdatausingscanlevelinformation AT tadessemahletg gaussianprocessregressionmodelfornormalizationoflcmsdatausingscanlevelinformation AT wangyue gaussianprocessregressionmodelfornormalizationoflcmsdatausingscanlevelinformation AT ressomhabtomw gaussianprocessregressionmodelfornormalizationoflcmsdatausingscanlevelinformation |