Cargando…

Prediction of LC-MS/MS Properties of Peptides from Sequence by Deep Learning

Deep learning models for prediction of three key LC-MS/MS properties from peptide sequences were developed. The LC-MS/MS properties or behaviors are indexed retention times (iRT), MS1 or survey scan charge state distributions, and sequence ion intensities of HCD spectra. A common core deep supervise...

Descripción completa

Detalles Bibliográficos
Autores principales: Guan, Shenheng, Moran, Michael F., Ma, Bin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The American Society for Biochemistry and Molecular Biology 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6773555/
https://www.ncbi.nlm.nih.gov/pubmed/31249099
http://dx.doi.org/10.1074/mcp.TIR119.001412
_version_ 1783455906560737280
author Guan, Shenheng
Moran, Michael F.
Ma, Bin
author_facet Guan, Shenheng
Moran, Michael F.
Ma, Bin
author_sort Guan, Shenheng
collection PubMed
description Deep learning models for prediction of three key LC-MS/MS properties from peptide sequences were developed. The LC-MS/MS properties or behaviors are indexed retention times (iRT), MS1 or survey scan charge state distributions, and sequence ion intensities of HCD spectra. A common core deep supervised learning architecture, bidirectional long-short term memory (LSTM) recurrent neural networks was used to construct the three prediction models. Two featurization schemes were proposed and demonstrated to allow for efficient encoding of modifications. The iRT and charge state distribution models were trained with on order of 10(5) data points each. An HCD sequence ion prediction model was trained with 2 × 10(6) experimental spectra. The iRT prediction model and HCD sequence ion prediction model provide improved accuracies over the start-of-the-art models available in literature. The MS1 charge state distribution prediction model offers excellent performance. The prediction models can be used to enhance peptide identification and quantification in data-dependent acquisition and data-independent acquisition (DIA) experiments as well as to assist MRM (multiple reaction monitoring) and PRM (parallel reaction monitoring) experiment design.
format Online
Article
Text
id pubmed-6773555
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher The American Society for Biochemistry and Molecular Biology
record_format MEDLINE/PubMed
spelling pubmed-67735552019-10-02 Prediction of LC-MS/MS Properties of Peptides from Sequence by Deep Learning Guan, Shenheng Moran, Michael F. Ma, Bin Mol Cell Proteomics Technological Innovation and Resources Deep learning models for prediction of three key LC-MS/MS properties from peptide sequences were developed. The LC-MS/MS properties or behaviors are indexed retention times (iRT), MS1 or survey scan charge state distributions, and sequence ion intensities of HCD spectra. A common core deep supervised learning architecture, bidirectional long-short term memory (LSTM) recurrent neural networks was used to construct the three prediction models. Two featurization schemes were proposed and demonstrated to allow for efficient encoding of modifications. The iRT and charge state distribution models were trained with on order of 10(5) data points each. An HCD sequence ion prediction model was trained with 2 × 10(6) experimental spectra. The iRT prediction model and HCD sequence ion prediction model provide improved accuracies over the start-of-the-art models available in literature. The MS1 charge state distribution prediction model offers excellent performance. The prediction models can be used to enhance peptide identification and quantification in data-dependent acquisition and data-independent acquisition (DIA) experiments as well as to assist MRM (multiple reaction monitoring) and PRM (parallel reaction monitoring) experiment design. The American Society for Biochemistry and Molecular Biology 2019-10 2019-06-27 /pmc/articles/PMC6773555/ /pubmed/31249099 http://dx.doi.org/10.1074/mcp.TIR119.001412 Text en © 2019 Guan et al. Published by The American Society for Biochemistry and Molecular Biology, Inc. Author's Choice—Final version open access under the terms of the Creative Commons CC-BY license (http://creativecommons.org/licenses/by/4.0) .
spellingShingle Technological Innovation and Resources
Guan, Shenheng
Moran, Michael F.
Ma, Bin
Prediction of LC-MS/MS Properties of Peptides from Sequence by Deep Learning
title Prediction of LC-MS/MS Properties of Peptides from Sequence by Deep Learning
title_full Prediction of LC-MS/MS Properties of Peptides from Sequence by Deep Learning
title_fullStr Prediction of LC-MS/MS Properties of Peptides from Sequence by Deep Learning
title_full_unstemmed Prediction of LC-MS/MS Properties of Peptides from Sequence by Deep Learning
title_short Prediction of LC-MS/MS Properties of Peptides from Sequence by Deep Learning
title_sort prediction of lc-ms/ms properties of peptides from sequence by deep learning
topic Technological Innovation and Resources
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6773555/
https://www.ncbi.nlm.nih.gov/pubmed/31249099
http://dx.doi.org/10.1074/mcp.TIR119.001412
work_keys_str_mv AT guanshenheng predictionoflcmsmspropertiesofpeptidesfromsequencebydeeplearning
AT moranmichaelf predictionoflcmsmspropertiesofpeptidesfromsequencebydeeplearning
AT mabin predictionoflcmsmspropertiesofpeptidesfromsequencebydeeplearning