Cargando…

SPEQ: quality assessment of peptide tandem mass spectra with deep learning

MOTIVATION: In proteomics, database search programs are routinely used for peptide identification from tandem mass spectrometry data. However, many low-quality spectra cannot be interpreted by any programs. Meanwhile, certain high-quality spectra may not be identified due to incompleteness of the da...

Descripción completa

Detalles Bibliográficos
Autores principales: Gholamizoj, Soroosh, Ma, Bin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8896601/
https://www.ncbi.nlm.nih.gov/pubmed/34978568
http://dx.doi.org/10.1093/bioinformatics/btab874
Descripción
Sumario:MOTIVATION: In proteomics, database search programs are routinely used for peptide identification from tandem mass spectrometry data. However, many low-quality spectra cannot be interpreted by any programs. Meanwhile, certain high-quality spectra may not be identified due to incompleteness of the database or failure of the software. Thus, spectrum quality (SPEQ) assessment tools are helpful programs that can eliminate poor-quality spectra before the database search and highlight the high-quality spectra that are not identified in the initial search. These spectra may be valuable candidates for further analyses. RESULTS: We propose SPEQ: a spectrum quality assessment tool that uses a deep neural network to classify spectra into high-quality, which are worthy candidates for interpretation, and low-quality, which lack sufficient information for identification. SPEQ was compared with a few other prediction models and demonstrated improved prediction accuracy. AVAILABILITY AND IMPLEMENTATION: Source code and scripts are freely available at github.com/sor8sh/SPEQ, implemented in Python.