Cargando…

SCFIA: a statistical corresponding feature identification algorithm for LC/MS

BACKGROUND: Identifying corresponding features (LC peaks registered by identical peptides) in multiple Liquid Chromatography/Mass Spectrometry (LC-MS) datasets plays a crucial role in the analysis of complex peptide or protein mixtures. Warping functions are commonly used to correct the mean of elut...

Descripción completa

Detalles Bibliográficos
Autores principales: Cui, Jian, Ma, Xuepo, Chen, Long, Zhang, Jianqiu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3233610/
https://www.ncbi.nlm.nih.gov/pubmed/22078262
http://dx.doi.org/10.1186/1471-2105-12-439
_version_ 1782218443590729728
author Cui, Jian
Ma, Xuepo
Chen, Long
Zhang, Jianqiu
author_facet Cui, Jian
Ma, Xuepo
Chen, Long
Zhang, Jianqiu
author_sort Cui, Jian
collection PubMed
description BACKGROUND: Identifying corresponding features (LC peaks registered by identical peptides) in multiple Liquid Chromatography/Mass Spectrometry (LC-MS) datasets plays a crucial role in the analysis of complex peptide or protein mixtures. Warping functions are commonly used to correct the mean of elution time shifts among LC-MS datasets, which cannot resolve the ambiguity of corresponding feature identification since elution time shifts are random. We propose a Statistical Corresponding Feature Identification Algorithm(SCFIA) based on both elution time shifts and peak shape correlations between corresponding features. SCFIA first trains a set of statistical models, and then, all candidate corresponding features are scored by the statistical models to find the maximum likelihood solution. RESULTS: We test SCFIA on publicly available datasets. We first compare its performance with that of warping function based methods, and the results show significant improvements. The performance of SCFIA on replicates datasets and fractionated datasets is also evaluated. In both cases, the accuracy is above 90%, which is near optimal. Finally the coverage of SCFIA is evaluated, and it is shown that SCFIA can find corresponding features in multiple datasets for over 90% peptides identified by Tandem MS. CONCLUSIONS: SCFIA can be used for accurate corresponding feature identification in LC-MS. We have shown that peak shape correlation can be used effectively for improving the accuracy. SCFIA provides high coverage in corresponding feature identification in multiple datasets, which serves the basis for integrating multiple LC-MS measurements for accurate peptide quantification.
format Online
Article
Text
id pubmed-3233610
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32336102011-12-12 SCFIA: a statistical corresponding feature identification algorithm for LC/MS Cui, Jian Ma, Xuepo Chen, Long Zhang, Jianqiu BMC Bioinformatics Methodology Article BACKGROUND: Identifying corresponding features (LC peaks registered by identical peptides) in multiple Liquid Chromatography/Mass Spectrometry (LC-MS) datasets plays a crucial role in the analysis of complex peptide or protein mixtures. Warping functions are commonly used to correct the mean of elution time shifts among LC-MS datasets, which cannot resolve the ambiguity of corresponding feature identification since elution time shifts are random. We propose a Statistical Corresponding Feature Identification Algorithm(SCFIA) based on both elution time shifts and peak shape correlations between corresponding features. SCFIA first trains a set of statistical models, and then, all candidate corresponding features are scored by the statistical models to find the maximum likelihood solution. RESULTS: We test SCFIA on publicly available datasets. We first compare its performance with that of warping function based methods, and the results show significant improvements. The performance of SCFIA on replicates datasets and fractionated datasets is also evaluated. In both cases, the accuracy is above 90%, which is near optimal. Finally the coverage of SCFIA is evaluated, and it is shown that SCFIA can find corresponding features in multiple datasets for over 90% peptides identified by Tandem MS. CONCLUSIONS: SCFIA can be used for accurate corresponding feature identification in LC-MS. We have shown that peak shape correlation can be used effectively for improving the accuracy. SCFIA provides high coverage in corresponding feature identification in multiple datasets, which serves the basis for integrating multiple LC-MS measurements for accurate peptide quantification. BioMed Central 2011-11-11 /pmc/articles/PMC3233610/ /pubmed/22078262 http://dx.doi.org/10.1186/1471-2105-12-439 Text en Copyright ©2011 Cui et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Cui, Jian
Ma, Xuepo
Chen, Long
Zhang, Jianqiu
SCFIA: a statistical corresponding feature identification algorithm for LC/MS
title SCFIA: a statistical corresponding feature identification algorithm for LC/MS
title_full SCFIA: a statistical corresponding feature identification algorithm for LC/MS
title_fullStr SCFIA: a statistical corresponding feature identification algorithm for LC/MS
title_full_unstemmed SCFIA: a statistical corresponding feature identification algorithm for LC/MS
title_short SCFIA: a statistical corresponding feature identification algorithm for LC/MS
title_sort scfia: a statistical corresponding feature identification algorithm for lc/ms
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3233610/
https://www.ncbi.nlm.nih.gov/pubmed/22078262
http://dx.doi.org/10.1186/1471-2105-12-439
work_keys_str_mv AT cuijian scfiaastatisticalcorrespondingfeatureidentificationalgorithmforlcms
AT maxuepo scfiaastatisticalcorrespondingfeatureidentificationalgorithmforlcms
AT chenlong scfiaastatisticalcorrespondingfeatureidentificationalgorithmforlcms
AT zhangjianqiu scfiaastatisticalcorrespondingfeatureidentificationalgorithmforlcms