Cargando…
SCFIA: a statistical corresponding feature identification algorithm for LC/MS
BACKGROUND: Identifying corresponding features (LC peaks registered by identical peptides) in multiple Liquid Chromatography/Mass Spectrometry (LC-MS) datasets plays a crucial role in the analysis of complex peptide or protein mixtures. Warping functions are commonly used to correct the mean of elut...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3233610/ https://www.ncbi.nlm.nih.gov/pubmed/22078262 http://dx.doi.org/10.1186/1471-2105-12-439 |
_version_ | 1782218443590729728 |
---|---|
author | Cui, Jian Ma, Xuepo Chen, Long Zhang, Jianqiu |
author_facet | Cui, Jian Ma, Xuepo Chen, Long Zhang, Jianqiu |
author_sort | Cui, Jian |
collection | PubMed |
description | BACKGROUND: Identifying corresponding features (LC peaks registered by identical peptides) in multiple Liquid Chromatography/Mass Spectrometry (LC-MS) datasets plays a crucial role in the analysis of complex peptide or protein mixtures. Warping functions are commonly used to correct the mean of elution time shifts among LC-MS datasets, which cannot resolve the ambiguity of corresponding feature identification since elution time shifts are random. We propose a Statistical Corresponding Feature Identification Algorithm(SCFIA) based on both elution time shifts and peak shape correlations between corresponding features. SCFIA first trains a set of statistical models, and then, all candidate corresponding features are scored by the statistical models to find the maximum likelihood solution. RESULTS: We test SCFIA on publicly available datasets. We first compare its performance with that of warping function based methods, and the results show significant improvements. The performance of SCFIA on replicates datasets and fractionated datasets is also evaluated. In both cases, the accuracy is above 90%, which is near optimal. Finally the coverage of SCFIA is evaluated, and it is shown that SCFIA can find corresponding features in multiple datasets for over 90% peptides identified by Tandem MS. CONCLUSIONS: SCFIA can be used for accurate corresponding feature identification in LC-MS. We have shown that peak shape correlation can be used effectively for improving the accuracy. SCFIA provides high coverage in corresponding feature identification in multiple datasets, which serves the basis for integrating multiple LC-MS measurements for accurate peptide quantification. |
format | Online Article Text |
id | pubmed-3233610 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-32336102011-12-12 SCFIA: a statistical corresponding feature identification algorithm for LC/MS Cui, Jian Ma, Xuepo Chen, Long Zhang, Jianqiu BMC Bioinformatics Methodology Article BACKGROUND: Identifying corresponding features (LC peaks registered by identical peptides) in multiple Liquid Chromatography/Mass Spectrometry (LC-MS) datasets plays a crucial role in the analysis of complex peptide or protein mixtures. Warping functions are commonly used to correct the mean of elution time shifts among LC-MS datasets, which cannot resolve the ambiguity of corresponding feature identification since elution time shifts are random. We propose a Statistical Corresponding Feature Identification Algorithm(SCFIA) based on both elution time shifts and peak shape correlations between corresponding features. SCFIA first trains a set of statistical models, and then, all candidate corresponding features are scored by the statistical models to find the maximum likelihood solution. RESULTS: We test SCFIA on publicly available datasets. We first compare its performance with that of warping function based methods, and the results show significant improvements. The performance of SCFIA on replicates datasets and fractionated datasets is also evaluated. In both cases, the accuracy is above 90%, which is near optimal. Finally the coverage of SCFIA is evaluated, and it is shown that SCFIA can find corresponding features in multiple datasets for over 90% peptides identified by Tandem MS. CONCLUSIONS: SCFIA can be used for accurate corresponding feature identification in LC-MS. We have shown that peak shape correlation can be used effectively for improving the accuracy. SCFIA provides high coverage in corresponding feature identification in multiple datasets, which serves the basis for integrating multiple LC-MS measurements for accurate peptide quantification. BioMed Central 2011-11-11 /pmc/articles/PMC3233610/ /pubmed/22078262 http://dx.doi.org/10.1186/1471-2105-12-439 Text en Copyright ©2011 Cui et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methodology Article Cui, Jian Ma, Xuepo Chen, Long Zhang, Jianqiu SCFIA: a statistical corresponding feature identification algorithm for LC/MS |
title | SCFIA: a statistical corresponding feature identification algorithm for LC/MS |
title_full | SCFIA: a statistical corresponding feature identification algorithm for LC/MS |
title_fullStr | SCFIA: a statistical corresponding feature identification algorithm for LC/MS |
title_full_unstemmed | SCFIA: a statistical corresponding feature identification algorithm for LC/MS |
title_short | SCFIA: a statistical corresponding feature identification algorithm for LC/MS |
title_sort | scfia: a statistical corresponding feature identification algorithm for lc/ms |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3233610/ https://www.ncbi.nlm.nih.gov/pubmed/22078262 http://dx.doi.org/10.1186/1471-2105-12-439 |
work_keys_str_mv | AT cuijian scfiaastatisticalcorrespondingfeatureidentificationalgorithmforlcms AT maxuepo scfiaastatisticalcorrespondingfeatureidentificationalgorithmforlcms AT chenlong scfiaastatisticalcorrespondingfeatureidentificationalgorithmforlcms AT zhangjianqiu scfiaastatisticalcorrespondingfeatureidentificationalgorithmforlcms |