Cargando…

The path of no return—Truncated protein N‐termini and current ignorance of their genesis

Almost all regulatory processes in biology ultimately lead to or originate from modifications of protein function. However, it is unclear to which extent each mechanism of regulation actually affects proteins and thus phenotypes. We assessed the extent of N‐terminal protein truncation in a global an...

Descripción completa

Detalles Bibliográficos
Autores principales: Fortelny, Nikolaus, Pavlidis, Paul, Overall, Christopher M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4736679/
https://www.ncbi.nlm.nih.gov/pubmed/26010509
http://dx.doi.org/10.1002/pmic.201500043
_version_ 1782413331281215488
author Fortelny, Nikolaus
Pavlidis, Paul
Overall, Christopher M.
author_facet Fortelny, Nikolaus
Pavlidis, Paul
Overall, Christopher M.
author_sort Fortelny, Nikolaus
collection PubMed
description Almost all regulatory processes in biology ultimately lead to or originate from modifications of protein function. However, it is unclear to which extent each mechanism of regulation actually affects proteins and thus phenotypes. We assessed the extent of N‐terminal protein truncation in a global analysis of N‐terminomics data and find that most proteins have N‐terminally truncated proteoforms. Because N‐terminomics analyses do not identify the process generating the identified N‐termini, we compared identified termini to the three N‐termini generating events: protein cleavage, alternative translation, and alternative splicing. Of these, we sought to identify the most likely cause of N‐terminal protein truncations in the human proteome. We found that protease cleavage and alternative protein translation are the likely cause for most shortened proteoforms. However, the vast majority (about 90%) of N‐termini remain unexplained by any of these processes identified to date, so revealing large gaps in our knowledge of protein termini and their genesis. Further analysis and annotation of terminomics data is required, to which end we have created the TopFIND database, a major systematic annotation effort for protein termini. We outline the new features in version 3.0 of the updated database and the new bioinformatics tools available and encourage submission of generated data to fill current knowledge gaps.
format Online
Article
Text
id pubmed-4736679
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-47366792016-02-11 The path of no return—Truncated protein N‐termini and current ignorance of their genesis Fortelny, Nikolaus Pavlidis, Paul Overall, Christopher M. Proteomics Viewpoint Almost all regulatory processes in biology ultimately lead to or originate from modifications of protein function. However, it is unclear to which extent each mechanism of regulation actually affects proteins and thus phenotypes. We assessed the extent of N‐terminal protein truncation in a global analysis of N‐terminomics data and find that most proteins have N‐terminally truncated proteoforms. Because N‐terminomics analyses do not identify the process generating the identified N‐termini, we compared identified termini to the three N‐termini generating events: protein cleavage, alternative translation, and alternative splicing. Of these, we sought to identify the most likely cause of N‐terminal protein truncations in the human proteome. We found that protease cleavage and alternative protein translation are the likely cause for most shortened proteoforms. However, the vast majority (about 90%) of N‐termini remain unexplained by any of these processes identified to date, so revealing large gaps in our knowledge of protein termini and their genesis. Further analysis and annotation of terminomics data is required, to which end we have created the TopFIND database, a major systematic annotation effort for protein termini. We outline the new features in version 3.0 of the updated database and the new bioinformatics tools available and encourage submission of generated data to fill current knowledge gaps. John Wiley and Sons Inc. 2015-06-15 2015-07 /pmc/articles/PMC4736679/ /pubmed/26010509 http://dx.doi.org/10.1002/pmic.201500043 Text en © 2015 The Authors. Proteomics published by WILEY‐VCH Verlag GmbH & Co. KGaA, Weinheim. This is an open access article under the terms of the Creative Commons Attribution (http://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Viewpoint
Fortelny, Nikolaus
Pavlidis, Paul
Overall, Christopher M.
The path of no return—Truncated protein N‐termini and current ignorance of their genesis
title The path of no return—Truncated protein N‐termini and current ignorance of their genesis
title_full The path of no return—Truncated protein N‐termini and current ignorance of their genesis
title_fullStr The path of no return—Truncated protein N‐termini and current ignorance of their genesis
title_full_unstemmed The path of no return—Truncated protein N‐termini and current ignorance of their genesis
title_short The path of no return—Truncated protein N‐termini and current ignorance of their genesis
title_sort path of no return—truncated protein n‐termini and current ignorance of their genesis
topic Viewpoint
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4736679/
https://www.ncbi.nlm.nih.gov/pubmed/26010509
http://dx.doi.org/10.1002/pmic.201500043
work_keys_str_mv AT fortelnynikolaus thepathofnoreturntruncatedproteinnterminiandcurrentignoranceoftheirgenesis
AT pavlidispaul thepathofnoreturntruncatedproteinnterminiandcurrentignoranceoftheirgenesis
AT overallchristopherm thepathofnoreturntruncatedproteinnterminiandcurrentignoranceoftheirgenesis
AT fortelnynikolaus pathofnoreturntruncatedproteinnterminiandcurrentignoranceoftheirgenesis
AT pavlidispaul pathofnoreturntruncatedproteinnterminiandcurrentignoranceoftheirgenesis
AT overallchristopherm pathofnoreturntruncatedproteinnterminiandcurrentignoranceoftheirgenesis