
N-terminal Proteomics Assisted Profiling of the Unexplored Translation Initiation Landscape in Arabidopsis thaliana

Proteogenomics is an emerging research field that still lacks a uniform method of analysis. Proteogenomic studies that combine N-terminal proteomics and ribosome profiling suggest that a high number of protein start sites are currently missing from genome annotations. We constructed a proteogenom...


Bibliographic Details
Main authors: Willems, Patrick, Ndah, Elvis, Jonckheere, Veronique, Stael, Simon, Sticker, Adriaan, Martens, Lennart, Van Breusegem, Frank, Gevaert, Kris, Van Damme, Petra
Format: Online Article Text
Language: English
Published: The American Society for Biochemistry and Molecular Biology 2017
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5461538/
https://www.ncbi.nlm.nih.gov/pubmed/28432195
http://dx.doi.org/10.1074/mcp.M116.066662
author Willems, Patrick
Ndah, Elvis
Jonckheere, Veronique
Stael, Simon
Sticker, Adriaan
Martens, Lennart
Van Breusegem, Frank
Gevaert, Kris
Van Damme, Petra
collection PubMed
description Proteogenomics is an emerging research field that still lacks a uniform method of analysis. Proteogenomic studies that combine N-terminal proteomics and ribosome profiling suggest that a high number of protein start sites are currently missing from genome annotations. We constructed a proteogenomic pipeline specific for the analysis of N-terminal proteomics data, with the aim of discovering novel translational start sites outside annotated protein-coding regions. In summary, unidentified MS/MS spectra were matched to a specific N-terminal peptide library encompassing protein N termini encoded in the Arabidopsis thaliana genome. After stringent false discovery rate filtering, 117 protein N termini compliant with N-terminal methionine excision specificity and indicative of translation initiation were found. These include N-terminal protein extensions and translation from transposable elements and pseudogenes. Gene prediction provided supporting protein-coding models for approximately half of the protein N termini. Besides the prediction of functional domains (partially) contained within the newly predicted ORFs, further supporting evidence of translation was found in the recently released Araport11 genome re-annotation of Arabidopsis and in computational translations of sequences stored in public repositories. Most interestingly, complementary evidence from ribosome profiling was found for 23 protein N termini. Finally, an in silico analysis of protein N-terminal peptides demonstrates the applicability of our N-terminal proteogenomics strategy in revealing protein-coding potential in species with well- and poorly-annotated genomes.
format Online
Article
Text
id pubmed-5461538
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher The American Society for Biochemistry and Molecular Biology
record_format MEDLINE/PubMed
spelling pubmed-5461538 2017-06-14 Mol Cell Proteomics Research
The American Society for Biochemistry and Molecular Biology 2017-06 2017-04-21 /pmc/articles/PMC5461538/ /pubmed/28432195 http://dx.doi.org/10.1074/mcp.M116.066662 Text en © 2017 by The American Society for Biochemistry and Molecular Biology, Inc. Author's Choice: final version free via Creative Commons CC-BY license (http://creativecommons.org/licenses/by/4.0).
title N-terminal Proteomics Assisted Profiling of the Unexplored Translation Initiation Landscape in Arabidopsis thaliana
topic Research