Cargando…

Large database for the analysis and prediction of spliced and non-spliced peptide generation by proteasomes

Proteasomes are the main producers of antigenic peptides presented to CD8(+) T cells. They can cut proteins and release their fragments or recombine non-contiguous fragments thereby generating novel sequences, i.e. spliced peptides. Understanding which are the driving forces and the sequence prefere...

Descripción completa

Detalles Bibliográficos
Autores principales: Specht, Gerd, Roetschke, Hanna P., Mansurkhodzhaev, Artem, Henklein, Petra, Textoris-Taube, Kathrin, Urlaub, Henning, Mishto, Michele, Liepe, Juliane
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7228940/
https://www.ncbi.nlm.nih.gov/pubmed/32415162
http://dx.doi.org/10.1038/s41597-020-0487-6
_version_ 1783534664023015424
author Specht, Gerd
Roetschke, Hanna P.
Mansurkhodzhaev, Artem
Henklein, Petra
Textoris-Taube, Kathrin
Urlaub, Henning
Mishto, Michele
Liepe, Juliane
author_facet Specht, Gerd
Roetschke, Hanna P.
Mansurkhodzhaev, Artem
Henklein, Petra
Textoris-Taube, Kathrin
Urlaub, Henning
Mishto, Michele
Liepe, Juliane
author_sort Specht, Gerd
collection PubMed
description Proteasomes are the main producers of antigenic peptides presented to CD8(+) T cells. They can cut proteins and release their fragments or recombine non-contiguous fragments thereby generating novel sequences, i.e. spliced peptides. Understanding which are the driving forces and the sequence preferences of both reactions can streamline target discovery in immunotherapies against cancer, infection and autoimmunity. Here, we present a large database of spliced and non-spliced peptides generated by proteasomes in vitro, which is available as simple CSV file and as a MySQL database. To generate the database, we performed in vitro digestions of 55 unique synthetic polypeptide substrates with different proteasome isoforms and experimental conditions. We measured the samples using three mass spectrometers, filtered and validated putative peptides, identified 22,333 peptide product sequences (15,028 spliced and 7,305 non-spliced product sequences). Our database and datasets have been deposited to the Mendeley (doi:10.17632/nr7cs764rc.1) and PRIDE (PXD016782) repositories. We anticipate that this unique database can be a valuable source for predictors of proteasome-catalyzed peptide hydrolysis and splicing, with various future translational applications.
format Online
Article
Text
id pubmed-7228940
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-72289402020-05-20 Large database for the analysis and prediction of spliced and non-spliced peptide generation by proteasomes Specht, Gerd Roetschke, Hanna P. Mansurkhodzhaev, Artem Henklein, Petra Textoris-Taube, Kathrin Urlaub, Henning Mishto, Michele Liepe, Juliane Sci Data Data Descriptor Proteasomes are the main producers of antigenic peptides presented to CD8(+) T cells. They can cut proteins and release their fragments or recombine non-contiguous fragments thereby generating novel sequences, i.e. spliced peptides. Understanding which are the driving forces and the sequence preferences of both reactions can streamline target discovery in immunotherapies against cancer, infection and autoimmunity. Here, we present a large database of spliced and non-spliced peptides generated by proteasomes in vitro, which is available as simple CSV file and as a MySQL database. To generate the database, we performed in vitro digestions of 55 unique synthetic polypeptide substrates with different proteasome isoforms and experimental conditions. We measured the samples using three mass spectrometers, filtered and validated putative peptides, identified 22,333 peptide product sequences (15,028 spliced and 7,305 non-spliced product sequences). Our database and datasets have been deposited to the Mendeley (doi:10.17632/nr7cs764rc.1) and PRIDE (PXD016782) repositories. We anticipate that this unique database can be a valuable source for predictors of proteasome-catalyzed peptide hydrolysis and splicing, with various future translational applications. Nature Publishing Group UK 2020-05-15 /pmc/articles/PMC7228940/ /pubmed/32415162 http://dx.doi.org/10.1038/s41597-020-0487-6 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver http://creativecommons.org/publicdomain/zero/1.0/ applies to the metadata files associated with this article.
spellingShingle Data Descriptor
Specht, Gerd
Roetschke, Hanna P.
Mansurkhodzhaev, Artem
Henklein, Petra
Textoris-Taube, Kathrin
Urlaub, Henning
Mishto, Michele
Liepe, Juliane
Large database for the analysis and prediction of spliced and non-spliced peptide generation by proteasomes
title Large database for the analysis and prediction of spliced and non-spliced peptide generation by proteasomes
title_full Large database for the analysis and prediction of spliced and non-spliced peptide generation by proteasomes
title_fullStr Large database for the analysis and prediction of spliced and non-spliced peptide generation by proteasomes
title_full_unstemmed Large database for the analysis and prediction of spliced and non-spliced peptide generation by proteasomes
title_short Large database for the analysis and prediction of spliced and non-spliced peptide generation by proteasomes
title_sort large database for the analysis and prediction of spliced and non-spliced peptide generation by proteasomes
topic Data Descriptor
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7228940/
https://www.ncbi.nlm.nih.gov/pubmed/32415162
http://dx.doi.org/10.1038/s41597-020-0487-6
work_keys_str_mv AT spechtgerd largedatabasefortheanalysisandpredictionofsplicedandnonsplicedpeptidegenerationbyproteasomes
AT roetschkehannap largedatabasefortheanalysisandpredictionofsplicedandnonsplicedpeptidegenerationbyproteasomes
AT mansurkhodzhaevartem largedatabasefortheanalysisandpredictionofsplicedandnonsplicedpeptidegenerationbyproteasomes
AT henkleinpetra largedatabasefortheanalysisandpredictionofsplicedandnonsplicedpeptidegenerationbyproteasomes
AT textoristaubekathrin largedatabasefortheanalysisandpredictionofsplicedandnonsplicedpeptidegenerationbyproteasomes
AT urlaubhenning largedatabasefortheanalysisandpredictionofsplicedandnonsplicedpeptidegenerationbyproteasomes
AT mishtomichele largedatabasefortheanalysisandpredictionofsplicedandnonsplicedpeptidegenerationbyproteasomes
AT liepejuliane largedatabasefortheanalysisandpredictionofsplicedandnonsplicedpeptidegenerationbyproteasomes