Cargando…

Non-canonical peroxisome targeting signals: identification of novel PTS1 tripeptides and characterization of enhancer elements by computational permutation analysis

BACKGROUND: High-accuracy prediction tools are essential in the post-genomic era to define organellar proteomes in their full complexity. We recently applied a discriminative machine learning approach to predict plant proteins carrying peroxisome targeting signals (PTS) type 1 from genome sequences....

Descripción completa

Detalles Bibliográficos
Autores principales: Chowdhary, Gopal, Kataya, Amr RA, Lingner, Thomas, Reumann, Sigrun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3487989/
https://www.ncbi.nlm.nih.gov/pubmed/22882975
http://dx.doi.org/10.1186/1471-2229-12-142
_version_ 1782248560575643648
author Chowdhary, Gopal
Kataya, Amr RA
Lingner, Thomas
Reumann, Sigrun
author_facet Chowdhary, Gopal
Kataya, Amr RA
Lingner, Thomas
Reumann, Sigrun
author_sort Chowdhary, Gopal
collection PubMed
description BACKGROUND: High-accuracy prediction tools are essential in the post-genomic era to define organellar proteomes in their full complexity. We recently applied a discriminative machine learning approach to predict plant proteins carrying peroxisome targeting signals (PTS) type 1 from genome sequences. For Arabidopsis thaliana 392 gene models were predicted to be peroxisome-targeted. The predictions were extensively tested in vivo, resulting in a high experimental verification rate of Arabidopsis proteins previously not known to be peroxisomal. RESULTS: In this study, we experimentally validated the predictions in greater depth by focusing on the most challenging Arabidopsis proteins with unknown non-canonical PTS1 tripeptides and prediction scores close to the threshold. By in vivo subcellular targeting analysis, three novel PTS1 tripeptides (QRL>, SQM>, and SDL>) and two novel tripeptide residues (Q at position −3 and D at pos. -2) were identified. To understand why, among many Arabidopsis proteins carrying the same C-terminal tripeptides, these proteins were specifically predicted as peroxisomal, the residues upstream of the PTS1 tripeptide were computationally permuted and the changes in prediction scores were analyzed. The newly identified Arabidopsis proteins were found to contain four to five amino acid residues of high predicted targeting enhancing properties at position −4 to −12 in front of the non-canonical PTS1 tripeptide. The identity of the predicted targeting enhancing residues was unexpectedly diverse, comprising besides basic residues also proline, hydroxylated (Ser, Thr), hydrophobic (Ala, Val), and even acidic residues. CONCLUSIONS: Our computational and experimental analyses demonstrate that the plant PTS1 tripeptide motif is more diverse than previously thought, including an increasing number of non-canonical sequences and allowed residues. Specific targeting enhancing elements can be predicted for particular sequences of interest and are far more diverse in amino acid composition and positioning than previously assumed. Machine learning methods become indispensable to predict which specific proteins, among numerous candidate proteins carrying the same non-canonical PTS1 tripeptide, contain sufficient enhancer elements in terms of number, positioning and total strength to cause peroxisome targeting.
format Online
Article
Text
id pubmed-3487989
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-34879892012-11-03 Non-canonical peroxisome targeting signals: identification of novel PTS1 tripeptides and characterization of enhancer elements by computational permutation analysis Chowdhary, Gopal Kataya, Amr RA Lingner, Thomas Reumann, Sigrun BMC Plant Biol Research Article BACKGROUND: High-accuracy prediction tools are essential in the post-genomic era to define organellar proteomes in their full complexity. We recently applied a discriminative machine learning approach to predict plant proteins carrying peroxisome targeting signals (PTS) type 1 from genome sequences. For Arabidopsis thaliana 392 gene models were predicted to be peroxisome-targeted. The predictions were extensively tested in vivo, resulting in a high experimental verification rate of Arabidopsis proteins previously not known to be peroxisomal. RESULTS: In this study, we experimentally validated the predictions in greater depth by focusing on the most challenging Arabidopsis proteins with unknown non-canonical PTS1 tripeptides and prediction scores close to the threshold. By in vivo subcellular targeting analysis, three novel PTS1 tripeptides (QRL>, SQM>, and SDL>) and two novel tripeptide residues (Q at position −3 and D at pos. -2) were identified. To understand why, among many Arabidopsis proteins carrying the same C-terminal tripeptides, these proteins were specifically predicted as peroxisomal, the residues upstream of the PTS1 tripeptide were computationally permuted and the changes in prediction scores were analyzed. The newly identified Arabidopsis proteins were found to contain four to five amino acid residues of high predicted targeting enhancing properties at position −4 to −12 in front of the non-canonical PTS1 tripeptide. The identity of the predicted targeting enhancing residues was unexpectedly diverse, comprising besides basic residues also proline, hydroxylated (Ser, Thr), hydrophobic (Ala, Val), and even acidic residues. CONCLUSIONS: Our computational and experimental analyses demonstrate that the plant PTS1 tripeptide motif is more diverse than previously thought, including an increasing number of non-canonical sequences and allowed residues. Specific targeting enhancing elements can be predicted for particular sequences of interest and are far more diverse in amino acid composition and positioning than previously assumed. Machine learning methods become indispensable to predict which specific proteins, among numerous candidate proteins carrying the same non-canonical PTS1 tripeptide, contain sufficient enhancer elements in terms of number, positioning and total strength to cause peroxisome targeting. BioMed Central 2012-08-11 /pmc/articles/PMC3487989/ /pubmed/22882975 http://dx.doi.org/10.1186/1471-2229-12-142 Text en Copyright ©2012 Chowdhary et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Chowdhary, Gopal
Kataya, Amr RA
Lingner, Thomas
Reumann, Sigrun
Non-canonical peroxisome targeting signals: identification of novel PTS1 tripeptides and characterization of enhancer elements by computational permutation analysis
title Non-canonical peroxisome targeting signals: identification of novel PTS1 tripeptides and characterization of enhancer elements by computational permutation analysis
title_full Non-canonical peroxisome targeting signals: identification of novel PTS1 tripeptides and characterization of enhancer elements by computational permutation analysis
title_fullStr Non-canonical peroxisome targeting signals: identification of novel PTS1 tripeptides and characterization of enhancer elements by computational permutation analysis
title_full_unstemmed Non-canonical peroxisome targeting signals: identification of novel PTS1 tripeptides and characterization of enhancer elements by computational permutation analysis
title_short Non-canonical peroxisome targeting signals: identification of novel PTS1 tripeptides and characterization of enhancer elements by computational permutation analysis
title_sort non-canonical peroxisome targeting signals: identification of novel pts1 tripeptides and characterization of enhancer elements by computational permutation analysis
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3487989/
https://www.ncbi.nlm.nih.gov/pubmed/22882975
http://dx.doi.org/10.1186/1471-2229-12-142
work_keys_str_mv AT chowdharygopal noncanonicalperoxisometargetingsignalsidentificationofnovelpts1tripeptidesandcharacterizationofenhancerelementsbycomputationalpermutationanalysis
AT katayaamrra noncanonicalperoxisometargetingsignalsidentificationofnovelpts1tripeptidesandcharacterizationofenhancerelementsbycomputationalpermutationanalysis
AT lingnerthomas noncanonicalperoxisometargetingsignalsidentificationofnovelpts1tripeptidesandcharacterizationofenhancerelementsbycomputationalpermutationanalysis
AT reumannsigrun noncanonicalperoxisometargetingsignalsidentificationofnovelpts1tripeptidesandcharacterizationofenhancerelementsbycomputationalpermutationanalysis