Cargando…
Prediction of LC-MS/MS Properties of Peptides from Sequence by Deep Learning
Deep learning models for prediction of three key LC-MS/MS properties from peptide sequences were developed. The LC-MS/MS properties or behaviors are indexed retention times (iRT), MS1 or survey scan charge state distributions, and sequence ion intensities of HCD spectra. A common core deep supervise...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
The American Society for Biochemistry and Molecular Biology
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6773555/ https://www.ncbi.nlm.nih.gov/pubmed/31249099 http://dx.doi.org/10.1074/mcp.TIR119.001412 |
_version_ | 1783455906560737280 |
---|---|
author | Guan, Shenheng Moran, Michael F. Ma, Bin |
author_facet | Guan, Shenheng Moran, Michael F. Ma, Bin |
author_sort | Guan, Shenheng |
collection | PubMed |
description | Deep learning models for prediction of three key LC-MS/MS properties from peptide sequences were developed. The LC-MS/MS properties or behaviors are indexed retention times (iRT), MS1 or survey scan charge state distributions, and sequence ion intensities of HCD spectra. A common core deep supervised learning architecture, bidirectional long-short term memory (LSTM) recurrent neural networks was used to construct the three prediction models. Two featurization schemes were proposed and demonstrated to allow for efficient encoding of modifications. The iRT and charge state distribution models were trained with on order of 10(5) data points each. An HCD sequence ion prediction model was trained with 2 × 10(6) experimental spectra. The iRT prediction model and HCD sequence ion prediction model provide improved accuracies over the start-of-the-art models available in literature. The MS1 charge state distribution prediction model offers excellent performance. The prediction models can be used to enhance peptide identification and quantification in data-dependent acquisition and data-independent acquisition (DIA) experiments as well as to assist MRM (multiple reaction monitoring) and PRM (parallel reaction monitoring) experiment design. |
format | Online Article Text |
id | pubmed-6773555 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | The American Society for Biochemistry and Molecular Biology |
record_format | MEDLINE/PubMed |
spelling | pubmed-67735552019-10-02 Prediction of LC-MS/MS Properties of Peptides from Sequence by Deep Learning Guan, Shenheng Moran, Michael F. Ma, Bin Mol Cell Proteomics Technological Innovation and Resources Deep learning models for prediction of three key LC-MS/MS properties from peptide sequences were developed. The LC-MS/MS properties or behaviors are indexed retention times (iRT), MS1 or survey scan charge state distributions, and sequence ion intensities of HCD spectra. A common core deep supervised learning architecture, bidirectional long-short term memory (LSTM) recurrent neural networks was used to construct the three prediction models. Two featurization schemes were proposed and demonstrated to allow for efficient encoding of modifications. The iRT and charge state distribution models were trained with on order of 10(5) data points each. An HCD sequence ion prediction model was trained with 2 × 10(6) experimental spectra. The iRT prediction model and HCD sequence ion prediction model provide improved accuracies over the start-of-the-art models available in literature. The MS1 charge state distribution prediction model offers excellent performance. The prediction models can be used to enhance peptide identification and quantification in data-dependent acquisition and data-independent acquisition (DIA) experiments as well as to assist MRM (multiple reaction monitoring) and PRM (parallel reaction monitoring) experiment design. The American Society for Biochemistry and Molecular Biology 2019-10 2019-06-27 /pmc/articles/PMC6773555/ /pubmed/31249099 http://dx.doi.org/10.1074/mcp.TIR119.001412 Text en © 2019 Guan et al. Published by The American Society for Biochemistry and Molecular Biology, Inc. Author's Choice—Final version open access under the terms of the Creative Commons CC-BY license (http://creativecommons.org/licenses/by/4.0) . |
spellingShingle | Technological Innovation and Resources Guan, Shenheng Moran, Michael F. Ma, Bin Prediction of LC-MS/MS Properties of Peptides from Sequence by Deep Learning |
title | Prediction of LC-MS/MS Properties of Peptides from Sequence by Deep Learning |
title_full | Prediction of LC-MS/MS Properties of Peptides from Sequence by Deep Learning |
title_fullStr | Prediction of LC-MS/MS Properties of Peptides from Sequence by Deep Learning |
title_full_unstemmed | Prediction of LC-MS/MS Properties of Peptides from Sequence by Deep Learning |
title_short | Prediction of LC-MS/MS Properties of Peptides from Sequence by Deep Learning |
title_sort | prediction of lc-ms/ms properties of peptides from sequence by deep learning |
topic | Technological Innovation and Resources |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6773555/ https://www.ncbi.nlm.nih.gov/pubmed/31249099 http://dx.doi.org/10.1074/mcp.TIR119.001412 |
work_keys_str_mv | AT guanshenheng predictionoflcmsmspropertiesofpeptidesfromsequencebydeeplearning AT moranmichaelf predictionoflcmsmspropertiesofpeptidesfromsequencebydeeplearning AT mabin predictionoflcmsmspropertiesofpeptidesfromsequencebydeeplearning |