Cargando…

The oligodeoxynucleotide sequences corresponding to never-expressed peptide motifs are mainly located in the non-coding strand

BACKGROUND: We study the usage of specific peptide platforms in protein composition. Using the pentapeptide as a unit of length, we find that in the universal proteome many pentapeptides are heavily repeated (even thousands of times), whereas some are quite rare, and a small number do not appear at...

Descripción completa

Detalles Bibliográficos
Autores principales: Capone, Giovanni, Novello, Giuseppe, Fasano, Candida, Trost, Brett, Bickis, Mik, Kusalik, Anthony, Kanduc, Darja
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2919516/
https://www.ncbi.nlm.nih.gov/pubmed/20646284
http://dx.doi.org/10.1186/1471-2105-11-383
_version_ 1782185194811293696
author Capone, Giovanni
Novello, Giuseppe
Fasano, Candida
Trost, Brett
Bickis, Mik
Kusalik, Anthony
Kanduc, Darja
author_facet Capone, Giovanni
Novello, Giuseppe
Fasano, Candida
Trost, Brett
Bickis, Mik
Kusalik, Anthony
Kanduc, Darja
author_sort Capone, Giovanni
collection PubMed
description BACKGROUND: We study the usage of specific peptide platforms in protein composition. Using the pentapeptide as a unit of length, we find that in the universal proteome many pentapeptides are heavily repeated (even thousands of times), whereas some are quite rare, and a small number do not appear at all. To understand the physico-chemical-biological basis underlying peptide usage at the proteomic level, in this study we analyse the energetic costs for the synthesis of rare and never-expressed versus frequent pentapeptides. In addition, we explore residue bulkiness, hydrophobicity, and codon number as factors able to modulate specific peptide frequencies. Then, the possible influence of amino acid composition is investigated in zero- and high-frequency pentapeptide sets by analysing the frequencies of the corresponding inverse-sequence pentapeptides. As a final step, we analyse the pentadecamer oligodeoxynucleotide sequences corresponding to the never-expressed pentapeptides. RESULTS: We find that only DNA context-dependent constraints (such as oligodeoxynucleotide sequence location in the minus strand, introns, pseudogenes, frameshifts, etc.) provide a coherent mechanistic platform to explain the occurrence of never-expressed versus frequent pentapeptides in the protein world. CONCLUSIONS: This study is of importance in cell biology. Indeed, the rarity (or lack of expression) of specific 5-mer peptide modules implies the rarity (or lack of expression) of the corresponding n-mer peptide sequences (with n < 5), so possibly modulating protein compositional trends. Moreover the data might further our understanding of the role exerted by rare pentapeptide modules as critical biological effectors in protein-protein interactions.
format Text
id pubmed-2919516
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-29195162010-08-11 The oligodeoxynucleotide sequences corresponding to never-expressed peptide motifs are mainly located in the non-coding strand Capone, Giovanni Novello, Giuseppe Fasano, Candida Trost, Brett Bickis, Mik Kusalik, Anthony Kanduc, Darja BMC Bioinformatics Research Article BACKGROUND: We study the usage of specific peptide platforms in protein composition. Using the pentapeptide as a unit of length, we find that in the universal proteome many pentapeptides are heavily repeated (even thousands of times), whereas some are quite rare, and a small number do not appear at all. To understand the physico-chemical-biological basis underlying peptide usage at the proteomic level, in this study we analyse the energetic costs for the synthesis of rare and never-expressed versus frequent pentapeptides. In addition, we explore residue bulkiness, hydrophobicity, and codon number as factors able to modulate specific peptide frequencies. Then, the possible influence of amino acid composition is investigated in zero- and high-frequency pentapeptide sets by analysing the frequencies of the corresponding inverse-sequence pentapeptides. As a final step, we analyse the pentadecamer oligodeoxynucleotide sequences corresponding to the never-expressed pentapeptides. RESULTS: We find that only DNA context-dependent constraints (such as oligodeoxynucleotide sequence location in the minus strand, introns, pseudogenes, frameshifts, etc.) provide a coherent mechanistic platform to explain the occurrence of never-expressed versus frequent pentapeptides in the protein world. CONCLUSIONS: This study is of importance in cell biology. Indeed, the rarity (or lack of expression) of specific 5-mer peptide modules implies the rarity (or lack of expression) of the corresponding n-mer peptide sequences (with n < 5), so possibly modulating protein compositional trends. Moreover the data might further our understanding of the role exerted by rare pentapeptide modules as critical biological effectors in protein-protein interactions. BioMed Central 2010-07-20 /pmc/articles/PMC2919516/ /pubmed/20646284 http://dx.doi.org/10.1186/1471-2105-11-383 Text en Copyright ©2010 Capone et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Capone, Giovanni
Novello, Giuseppe
Fasano, Candida
Trost, Brett
Bickis, Mik
Kusalik, Anthony
Kanduc, Darja
The oligodeoxynucleotide sequences corresponding to never-expressed peptide motifs are mainly located in the non-coding strand
title The oligodeoxynucleotide sequences corresponding to never-expressed peptide motifs are mainly located in the non-coding strand
title_full The oligodeoxynucleotide sequences corresponding to never-expressed peptide motifs are mainly located in the non-coding strand
title_fullStr The oligodeoxynucleotide sequences corresponding to never-expressed peptide motifs are mainly located in the non-coding strand
title_full_unstemmed The oligodeoxynucleotide sequences corresponding to never-expressed peptide motifs are mainly located in the non-coding strand
title_short The oligodeoxynucleotide sequences corresponding to never-expressed peptide motifs are mainly located in the non-coding strand
title_sort oligodeoxynucleotide sequences corresponding to never-expressed peptide motifs are mainly located in the non-coding strand
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2919516/
https://www.ncbi.nlm.nih.gov/pubmed/20646284
http://dx.doi.org/10.1186/1471-2105-11-383
work_keys_str_mv AT caponegiovanni theoligodeoxynucleotidesequencescorrespondingtoneverexpressedpeptidemotifsaremainlylocatedinthenoncodingstrand
AT novellogiuseppe theoligodeoxynucleotidesequencescorrespondingtoneverexpressedpeptidemotifsaremainlylocatedinthenoncodingstrand
AT fasanocandida theoligodeoxynucleotidesequencescorrespondingtoneverexpressedpeptidemotifsaremainlylocatedinthenoncodingstrand
AT trostbrett theoligodeoxynucleotidesequencescorrespondingtoneverexpressedpeptidemotifsaremainlylocatedinthenoncodingstrand
AT bickismik theoligodeoxynucleotidesequencescorrespondingtoneverexpressedpeptidemotifsaremainlylocatedinthenoncodingstrand
AT kusalikanthony theoligodeoxynucleotidesequencescorrespondingtoneverexpressedpeptidemotifsaremainlylocatedinthenoncodingstrand
AT kanducdarja theoligodeoxynucleotidesequencescorrespondingtoneverexpressedpeptidemotifsaremainlylocatedinthenoncodingstrand
AT caponegiovanni oligodeoxynucleotidesequencescorrespondingtoneverexpressedpeptidemotifsaremainlylocatedinthenoncodingstrand
AT novellogiuseppe oligodeoxynucleotidesequencescorrespondingtoneverexpressedpeptidemotifsaremainlylocatedinthenoncodingstrand
AT fasanocandida oligodeoxynucleotidesequencescorrespondingtoneverexpressedpeptidemotifsaremainlylocatedinthenoncodingstrand
AT trostbrett oligodeoxynucleotidesequencescorrespondingtoneverexpressedpeptidemotifsaremainlylocatedinthenoncodingstrand
AT bickismik oligodeoxynucleotidesequencescorrespondingtoneverexpressedpeptidemotifsaremainlylocatedinthenoncodingstrand
AT kusalikanthony oligodeoxynucleotidesequencescorrespondingtoneverexpressedpeptidemotifsaremainlylocatedinthenoncodingstrand
AT kanducdarja oligodeoxynucleotidesequencescorrespondingtoneverexpressedpeptidemotifsaremainlylocatedinthenoncodingstrand