Cargando…

The Purine Bias of Coding Sequences is Determined by Physicochemical Constraints on Proteins

For this report, we analyzed protein secondary structures in relation to the statistics of three nucleotide codon positions. The purpose of this investigation was to find which properties of the ribosome, tRNA or protein level, could explain the purine bias (Rrr) as it is observed in coding DNA. We...

Descripción completa

Detalles Bibliográficos
Autores principales: de Leon, Miguel Ponce, de Miranda, Antonio Basilio, Alvarez-Valin, Fernando, Carels, Nicolas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Libertas Academica 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4039185/
https://www.ncbi.nlm.nih.gov/pubmed/24899802
http://dx.doi.org/10.4137/BBI.S13161
_version_ 1782318458520731648
author de Leon, Miguel Ponce
de Miranda, Antonio Basilio
Alvarez-Valin, Fernando
Carels, Nicolas
author_facet de Leon, Miguel Ponce
de Miranda, Antonio Basilio
Alvarez-Valin, Fernando
Carels, Nicolas
author_sort de Leon, Miguel Ponce
collection PubMed
description For this report, we analyzed protein secondary structures in relation to the statistics of three nucleotide codon positions. The purpose of this investigation was to find which properties of the ribosome, tRNA or protein level, could explain the purine bias (Rrr) as it is observed in coding DNA. We found that the Rrr pattern is the consequence of a regularity (the codon structure) resulting from physicochemical constraints on proteins and thermodynamic constraints on ribosomal machinery. The physicochemical constraints on proteins mainly come from the hydropathy and molecular weight (MW) of secondary structures as well as the energy cost of amino acid synthesis. These constraints appear through a network of statistical correlations, such as (i) the cost of amino acid synthesis, which is in favor of a higher level of guanine in the first codon position, (ii) the constructive contribution of hydropathy alternation in proteins, (iii) the spatial organization of secondary structure in proteins according to solvent accessibility, (iv) the spatial organization of secondary structure according to amino acid hydropathy, (v) the statistical correlation of MW with protein secondary structures and their overall hydropathy, (vi) the statistical correlation of thymine in the second codon position with hydropathy and the energy cost of amino acid synthesis, and (vii) the statistical correlation of adenine in the second codon position with amino acid complexity and the MW of secondary protein structures. Amino acid physicochemical properties and functional constraints on proteins constitute a code that is translated into a purine bias within the coding DNA via tRNAs. In that sense, the Rrr pattern within coding DNA is the effect of information transfer on nucleotide composition from protein to DNA by selection according to the codon positions. Thus, coding DNA structure and ribosomal machinery co-evolved to minimize the energy cost of protein coding given the functional constraints on proteins.
format Online
Article
Text
id pubmed-4039185
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Libertas Academica
record_format MEDLINE/PubMed
spelling pubmed-40391852014-06-04 The Purine Bias of Coding Sequences is Determined by Physicochemical Constraints on Proteins de Leon, Miguel Ponce de Miranda, Antonio Basilio Alvarez-Valin, Fernando Carels, Nicolas Bioinform Biol Insights Original Research For this report, we analyzed protein secondary structures in relation to the statistics of three nucleotide codon positions. The purpose of this investigation was to find which properties of the ribosome, tRNA or protein level, could explain the purine bias (Rrr) as it is observed in coding DNA. We found that the Rrr pattern is the consequence of a regularity (the codon structure) resulting from physicochemical constraints on proteins and thermodynamic constraints on ribosomal machinery. The physicochemical constraints on proteins mainly come from the hydropathy and molecular weight (MW) of secondary structures as well as the energy cost of amino acid synthesis. These constraints appear through a network of statistical correlations, such as (i) the cost of amino acid synthesis, which is in favor of a higher level of guanine in the first codon position, (ii) the constructive contribution of hydropathy alternation in proteins, (iii) the spatial organization of secondary structure in proteins according to solvent accessibility, (iv) the spatial organization of secondary structure according to amino acid hydropathy, (v) the statistical correlation of MW with protein secondary structures and their overall hydropathy, (vi) the statistical correlation of thymine in the second codon position with hydropathy and the energy cost of amino acid synthesis, and (vii) the statistical correlation of adenine in the second codon position with amino acid complexity and the MW of secondary protein structures. Amino acid physicochemical properties and functional constraints on proteins constitute a code that is translated into a purine bias within the coding DNA via tRNAs. In that sense, the Rrr pattern within coding DNA is the effect of information transfer on nucleotide composition from protein to DNA by selection according to the codon positions. Thus, coding DNA structure and ribosomal machinery co-evolved to minimize the energy cost of protein coding given the functional constraints on proteins. Libertas Academica 2014-05-20 /pmc/articles/PMC4039185/ /pubmed/24899802 http://dx.doi.org/10.4137/BBI.S13161 Text en © 2014 the author(s), publisher and licensee Libertas Academica Ltd. This is an open-access article distributed under the terms of the Creative Commons CC-BY-NC 3.0 license.
spellingShingle Original Research
de Leon, Miguel Ponce
de Miranda, Antonio Basilio
Alvarez-Valin, Fernando
Carels, Nicolas
The Purine Bias of Coding Sequences is Determined by Physicochemical Constraints on Proteins
title The Purine Bias of Coding Sequences is Determined by Physicochemical Constraints on Proteins
title_full The Purine Bias of Coding Sequences is Determined by Physicochemical Constraints on Proteins
title_fullStr The Purine Bias of Coding Sequences is Determined by Physicochemical Constraints on Proteins
title_full_unstemmed The Purine Bias of Coding Sequences is Determined by Physicochemical Constraints on Proteins
title_short The Purine Bias of Coding Sequences is Determined by Physicochemical Constraints on Proteins
title_sort purine bias of coding sequences is determined by physicochemical constraints on proteins
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4039185/
https://www.ncbi.nlm.nih.gov/pubmed/24899802
http://dx.doi.org/10.4137/BBI.S13161
work_keys_str_mv AT deleonmiguelponce thepurinebiasofcodingsequencesisdeterminedbyphysicochemicalconstraintsonproteins
AT demirandaantoniobasilio thepurinebiasofcodingsequencesisdeterminedbyphysicochemicalconstraintsonproteins
AT alvarezvalinfernando thepurinebiasofcodingsequencesisdeterminedbyphysicochemicalconstraintsonproteins
AT carelsnicolas thepurinebiasofcodingsequencesisdeterminedbyphysicochemicalconstraintsonproteins
AT deleonmiguelponce purinebiasofcodingsequencesisdeterminedbyphysicochemicalconstraintsonproteins
AT demirandaantoniobasilio purinebiasofcodingsequencesisdeterminedbyphysicochemicalconstraintsonproteins
AT alvarezvalinfernando purinebiasofcodingsequencesisdeterminedbyphysicochemicalconstraintsonproteins
AT carelsnicolas purinebiasofcodingsequencesisdeterminedbyphysicochemicalconstraintsonproteins