N-terminal Proteomics Assisted Profiling of the Unexplored Translation Initiation Landscape in Arabidopsis thaliana
Proteogenomics is an emerging research field yet lacking a uniform method of analysis. Proteogenomic studies in which N-terminal proteomics and ribosome profiling are combined, suggest that a high number of protein start sites are currently missing in genome annotations. We constructed a proteogenom...
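Main Authors: | Willems, Patrick, Ndah, Elvis, Jonckheere, Veronique, Stael, Simon, Sticker, Adriaan, Martens, Lennart, Van Breusegem, Frank, Gevaert, Kris, Van Damme, Petra |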
---|---|
Format: | Online Article Text |
Language: | English |
Published: | The American Society for Biochemistry and Molecular Biology 2017 |
Subjects: | Research |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5461538/ https://www.ncbi.nlm.nih.gov/pubmed/28432195 http://dx.doi.org/10.1074/mcp.M116.066662 |
_version_ | 1783242353175166976 |
---|---|
author | Willems, Patrick Ndah, Elvis Jonckheere, Veronique Stael, Simon Sticker, Adriaan Martens, Lennart Van Breusegem, Frank Gevaert, Kris Van Damme, Petra |
author_facet | Willems, Patrick Ndah, Elvis Jonckheere, Veronique Stael, Simon Sticker, Adriaan Martens, Lennart Van Breusegem, Frank Gevaert, Kris Van Damme, Petra |
author_sort | Willems, Patrick |
collection | PubMed |
description | Proteogenomics is an emerging research field yet lacking a uniform method of analysis. Proteogenomic studies in which N-terminal proteomics and ribosome profiling are combined, suggest that a high number of protein start sites are currently missing in genome annotations. We constructed a proteogenomic pipeline specific for the analysis of N-terminal proteomics data, with the aim of discovering novel translational start sites outside annotated protein coding regions. In summary, unidentified MS/MS spectra were matched to a specific N-terminal peptide library encompassing protein N termini encoded in the Arabidopsis thaliana genome. After a stringent false discovery rate filtering, 117 protein N termini compliant with N-terminal methionine excision specificity and indicative of translation initiation were found. These include N-terminal protein extensions and translation from transposable elements and pseudogenes. Gene prediction provided supporting protein-coding models for approximately half of the protein N termini. Besides the prediction of functional domains (partially) contained within the newly predicted ORFs, further supporting evidence of translation was found in the recently released Araport11 genome re-annotation of Arabidopsis and computational translations of sequences stored in public repositories. Most interestingly, complementary evidence by ribosome profiling was found for 23 protein N termini. Finally, by analyzing protein N-terminal peptides, an in silico analysis demonstrates the applicability of our N-terminal proteogenomics strategy in revealing protein-coding potential in species with well- and poorly-annotated genomes. |
format | Online Article Text |
id | pubmed-5461538 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | The American Society for Biochemistry and Molecular Biology |
record_format | MEDLINE/PubMed |
spelling | pubmed-54615382017-06-14 N-terminal Proteomics Assisted Profiling of the Unexplored Translation Initiation Landscape in Arabidopsis thaliana Willems, Patrick Ndah, Elvis Jonckheere, Veronique Stael, Simon Sticker, Adriaan Martens, Lennart Van Breusegem, Frank Gevaert, Kris Van Damme, Petra Mol Cell Proteomics Research Proteogenomics is an emerging research field yet lacking a uniform method of analysis. Proteogenomic studies in which N-terminal proteomics and ribosome profiling are combined, suggest that a high number of protein start sites are currently missing in genome annotations. We constructed a proteogenomic pipeline specific for the analysis of N-terminal proteomics data, with the aim of discovering novel translational start sites outside annotated protein coding regions. In summary, unidentified MS/MS spectra were matched to a specific N-terminal peptide library encompassing protein N termini encoded in the Arabidopsis thaliana genome. After a stringent false discovery rate filtering, 117 protein N termini compliant with N-terminal methionine excision specificity and indicative of translation initiation were found. These include N-terminal protein extensions and translation from transposable elements and pseudogenes. Gene prediction provided supporting protein-coding models for approximately half of the protein N termini. Besides the prediction of functional domains (partially) contained within the newly predicted ORFs, further supporting evidence of translation was found in the recently released Araport11 genome re-annotation of Arabidopsis and computational translations of sequences stored in public repositories. Most interestingly, complementary evidence by ribosome profiling was found for 23 protein N termini. Finally, by analyzing protein N-terminal peptides, an in silico analysis demonstrates the applicability of our N-terminal proteogenomics strategy in revealing protein-coding potential in species with well- and poorly-annotated genomes. The American Society for Biochemistry and Molecular Biology 2017-06 2017-04-21 /pmc/articles/PMC5461538/ /pubmed/28432195 http://dx.doi.org/10.1074/mcp.M116.066662 Text en © 2017 by The American Society for Biochemistry and Molecular Biology, Inc. Author's Choice—Final version free via Creative Commons CC-BY license (http://creativecommons.org/licenses/by/4.0) . |
spellingShingle | Research Willems, Patrick Ndah, Elvis Jonckheere, Veronique Stael, Simon Sticker, Adriaan Martens, Lennart Van Breusegem, Frank Gevaert, Kris Van Damme, Petra N-terminal Proteomics Assisted Profiling of the Unexplored Translation Initiation Landscape in Arabidopsis thaliana |
title | N-terminal Proteomics Assisted Profiling of the Unexplored Translation Initiation Landscape in Arabidopsis thaliana |
title_full | N-terminal Proteomics Assisted Profiling of the Unexplored Translation Initiation Landscape in Arabidopsis thaliana |
title_fullStr | N-terminal Proteomics Assisted Profiling of the Unexplored Translation Initiation Landscape in Arabidopsis thaliana |
title_full_unstemmed | N-terminal Proteomics Assisted Profiling of the Unexplored Translation Initiation Landscape in Arabidopsis thaliana |
title_short | N-terminal Proteomics Assisted Profiling of the Unexplored Translation Initiation Landscape in Arabidopsis thaliana |
title_sort | n-terminal proteomics assisted profiling of the unexplored translation initiation landscape in arabidopsis thaliana |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5461538/ https://www.ncbi.nlm.nih.gov/pubmed/28432195 http://dx.doi.org/10.1074/mcp.M116.066662 |
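
The description above mentions filtering for protein N termini "compliant with N-terminal methionine excision specificity". As a rough illustration of what such a check can look like, the sketch below applies the commonly cited rule of thumb that the initiator methionine is removed when the following residue has a small side chain (Ala, Cys, Gly, Pro, Ser, Thr, Val). The function name, its parameters, and the strict handling of retained methionine are assumptions made for this example; the record does not state the exact criteria used in the paper.

```python
# Residues after the initiator Met that commonly permit its removal.
# Rule of thumb only; the exact criteria used in the paper are not given in this record.
NME_PERMISSIVE = frozenset("ACGPSTV")


def is_nme_compliant(preceding_residue: str, peptide: str) -> bool:
    """Check whether an observed N-terminal peptide fits the excision rule.

    preceding_residue: amino acid encoded immediately upstream of the
        peptide in the translated sequence ("" if there is none).
    peptide: the observed N-terminal peptide sequence (one-letter codes).

    Hypothetical helper for illustration, deliberately strict: a retained
    Met followed by a permissive residue is reported as non-compliant,
    although partial removal does occur in practice.
    """
    if not peptide:
        return False
    first = peptide[0]
    if first == "M":
        # Initiator Met retained: expected when the next residue is bulky.
        return len(peptide) == 1 or peptide[1] not in NME_PERMISSIVE
    # Initiator Met removed: the upstream residue must be Met and the
    # new N-terminal residue must be one of the permissive residues.
    return preceding_residue == "M" and first in NME_PERMISSIVE
```

For instance, under these assumptions a peptide starting with Ala directly after an upstream Met counts as compliant, while the same peptide preceded by Leu does not.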