Cargando…

Efficient discovery of abundant post-translational modifications and spectral pairs using peptide mass and retention time differences

BACKGROUND: Peptide identification via tandem mass spectrometry is the basic task of current proteomics research. Due to the complexity of mass spectra, the majority of mass spectra cannot be interpreted at present. The existence of unexpected or unknown protein post-translational modifications is a...

Descripción completa

Detalles Bibliográficos
Autores principales: Fu, Yan, Jia, Wei, Lu, Zhuang, Wang, Haipeng, Yuan, Zuofei, Chi, Hao, Li, You, Xiu, Liyun, Wang, Wenping, Liu, Chao, Wang, Leheng, Sun, Ruixiang, Gao, Wen, Qian, Xiaohong, He, Si-Min
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2648780/
https://www.ncbi.nlm.nih.gov/pubmed/19208153
http://dx.doi.org/10.1186/1471-2105-10-S1-S50
Descripción
Sumario:BACKGROUND: Peptide identification via tandem mass spectrometry is the basic task of current proteomics research. Due to the complexity of mass spectra, the majority of mass spectra cannot be interpreted at present. The existence of unexpected or unknown protein post-translational modifications is a major reason. RESULTS: This paper describes an efficient and sequence database-independent approach to detecting abundant post-translational modifications in high-accuracy peptide mass spectra. The approach is based on the observation that the spectra of a modified peptide and its unmodified counterpart are correlated with each other in their peptide masses and retention time. Frequently occurring peptide mass differences in a data set imply possible modifications, while small and consistent retention time differences provide orthogonal supporting evidence. We propose to use a bivariate Gaussian mixture model to discriminate modification-related spectral pairs from random ones. Due to the use of two-dimensional information, accurate modification masses and confident spectral pairs can be determined as well as the quantitative influences of modifications on peptide retention time. CONCLUSION: Experiments on two glycoprotein data sets demonstrate that our method can effectively detect abundant modifications and spectral pairs. By including the discovered modifications into database search or by propagating peptide assignments between paired spectra, an average of 10% more spectra are interpreted.