Cargando…

AlphaPeptDeep: a modular deep learning framework to predict peptide properties for proteomics

Machine learning and in particular deep learning (DL) are increasingly important in mass spectrometry (MS)-based proteomics. Recent DL models can predict the retention time, ion mobility and fragment intensities of a peptide just from the amino acid sequence with good accuracy. However, DL is a very...

Descripción completa

Detalles Bibliográficos
Autores principales: Zeng, Wen-Feng, Zhou, Xie-Xuan, Willems, Sander, Ammar, Constantin, Wahle, Maria, Bludau, Isabell, Voytik, Eugenia, Strauss, Maximillian T., Mann, Matthias
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9700817/
https://www.ncbi.nlm.nih.gov/pubmed/36433986
http://dx.doi.org/10.1038/s41467-022-34904-3
_version_ 1784839395803660288
author Zeng, Wen-Feng
Zhou, Xie-Xuan
Willems, Sander
Ammar, Constantin
Wahle, Maria
Bludau, Isabell
Voytik, Eugenia
Strauss, Maximillian T.
Mann, Matthias
author_facet Zeng, Wen-Feng
Zhou, Xie-Xuan
Willems, Sander
Ammar, Constantin
Wahle, Maria
Bludau, Isabell
Voytik, Eugenia
Strauss, Maximillian T.
Mann, Matthias
author_sort Zeng, Wen-Feng
collection PubMed
description Machine learning and in particular deep learning (DL) are increasingly important in mass spectrometry (MS)-based proteomics. Recent DL models can predict the retention time, ion mobility and fragment intensities of a peptide just from the amino acid sequence with good accuracy. However, DL is a very rapidly developing field with new neural network architectures frequently appearing, which are challenging to incorporate for proteomics researchers. Here we introduce AlphaPeptDeep, a modular Python framework built on the PyTorch DL library that learns and predicts the properties of peptides (https://github.com/MannLabs/alphapeptdeep). It features a model shop that enables non-specialists to create models in just a few lines of code. AlphaPeptDeep represents post-translational modifications in a generic manner, even if only the chemical composition is known. Extensive use of transfer learning obviates the need for large data sets to refine models for particular experimental conditions. The AlphaPeptDeep models for predicting retention time, collisional cross sections and fragment intensities are at least on par with existing tools. Additional sequence-based properties can also be predicted by AlphaPeptDeep, as demonstrated with a HLA peptide prediction model to improve HLA peptide identification for data-independent acquisition (https://github.com/MannLabs/PeptDeep-HLA).
format Online
Article
Text
id pubmed-9700817
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-97008172022-11-27 AlphaPeptDeep: a modular deep learning framework to predict peptide properties for proteomics Zeng, Wen-Feng Zhou, Xie-Xuan Willems, Sander Ammar, Constantin Wahle, Maria Bludau, Isabell Voytik, Eugenia Strauss, Maximillian T. Mann, Matthias Nat Commun Article Machine learning and in particular deep learning (DL) are increasingly important in mass spectrometry (MS)-based proteomics. Recent DL models can predict the retention time, ion mobility and fragment intensities of a peptide just from the amino acid sequence with good accuracy. However, DL is a very rapidly developing field with new neural network architectures frequently appearing, which are challenging to incorporate for proteomics researchers. Here we introduce AlphaPeptDeep, a modular Python framework built on the PyTorch DL library that learns and predicts the properties of peptides (https://github.com/MannLabs/alphapeptdeep). It features a model shop that enables non-specialists to create models in just a few lines of code. AlphaPeptDeep represents post-translational modifications in a generic manner, even if only the chemical composition is known. Extensive use of transfer learning obviates the need for large data sets to refine models for particular experimental conditions. The AlphaPeptDeep models for predicting retention time, collisional cross sections and fragment intensities are at least on par with existing tools. Additional sequence-based properties can also be predicted by AlphaPeptDeep, as demonstrated with a HLA peptide prediction model to improve HLA peptide identification for data-independent acquisition (https://github.com/MannLabs/PeptDeep-HLA). Nature Publishing Group UK 2022-11-24 /pmc/articles/PMC9700817/ /pubmed/36433986 http://dx.doi.org/10.1038/s41467-022-34904-3 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Zeng, Wen-Feng
Zhou, Xie-Xuan
Willems, Sander
Ammar, Constantin
Wahle, Maria
Bludau, Isabell
Voytik, Eugenia
Strauss, Maximillian T.
Mann, Matthias
AlphaPeptDeep: a modular deep learning framework to predict peptide properties for proteomics
title AlphaPeptDeep: a modular deep learning framework to predict peptide properties for proteomics
title_full AlphaPeptDeep: a modular deep learning framework to predict peptide properties for proteomics
title_fullStr AlphaPeptDeep: a modular deep learning framework to predict peptide properties for proteomics
title_full_unstemmed AlphaPeptDeep: a modular deep learning framework to predict peptide properties for proteomics
title_short AlphaPeptDeep: a modular deep learning framework to predict peptide properties for proteomics
title_sort alphapeptdeep: a modular deep learning framework to predict peptide properties for proteomics
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9700817/
https://www.ncbi.nlm.nih.gov/pubmed/36433986
http://dx.doi.org/10.1038/s41467-022-34904-3
work_keys_str_mv AT zengwenfeng alphapeptdeepamodulardeeplearningframeworktopredictpeptidepropertiesforproteomics
AT zhouxiexuan alphapeptdeepamodulardeeplearningframeworktopredictpeptidepropertiesforproteomics
AT willemssander alphapeptdeepamodulardeeplearningframeworktopredictpeptidepropertiesforproteomics
AT ammarconstantin alphapeptdeepamodulardeeplearningframeworktopredictpeptidepropertiesforproteomics
AT wahlemaria alphapeptdeepamodulardeeplearningframeworktopredictpeptidepropertiesforproteomics
AT bludauisabell alphapeptdeepamodulardeeplearningframeworktopredictpeptidepropertiesforproteomics
AT voytikeugenia alphapeptdeepamodulardeeplearningframeworktopredictpeptidepropertiesforproteomics
AT straussmaximilliant alphapeptdeepamodulardeeplearningframeworktopredictpeptidepropertiesforproteomics
AT mannmatthias alphapeptdeepamodulardeeplearningframeworktopredictpeptidepropertiesforproteomics