Cargando…

Indications that "codon boundaries" are physico-chemically defined and that protein-folding information is contained in the redundant exon bases

BACKGROUND: All the information necessary for protein folding is supposed to be present in the amino acid sequence. It is still not possible to provide specific ab initio structure predictions by bioinformatical methods. It is suspected that additional folding information is present in protein codin...

Descripción completa

Detalles Bibliográficos
Autor principal: Biro, Jan Charles
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1560374/
https://www.ncbi.nlm.nih.gov/pubmed/16893453
http://dx.doi.org/10.1186/1742-4682-3-28
_version_ 1782129481876504576
author Biro, Jan Charles
author_facet Biro, Jan Charles
author_sort Biro, Jan Charles
collection PubMed
description BACKGROUND: All the information necessary for protein folding is supposed to be present in the amino acid sequence. It is still not possible to provide specific ab initio structure predictions by bioinformatical methods. It is suspected that additional folding information is present in protein coding nucleic acid sequences, but this is not represented by the known genetic code. RESULTS: Nucleic acid subsequences comprising the 1st and/or 3rd codon residues in mRNAs express significantly higher free folding energy (FFE) than the subsequence containing only the 2nd residues (p < 0.0001, n = 81). This periodic FFE difference is not present in introns. It is therefore a specific physico-chemical characteristic of coding sequences and might contribute to unambiguous definition of codon boundaries during translation. The FFEs of the 1st and 3rd residues are additive, which suggests that these residues contain a significant number of complementary bases and that may contribute to selection for local RNA secondary structures in coding regions. This periodic, codon-related structure-formation of mRNAs indicates a connection between the structures of exons and the corresponding (translated) proteins. The folding energy dot plots of RNAs and the residue contact maps of the coded proteins are indeed similar. Residue contact statistics using 81 different protein structures confirmed that amino acids that are coded by partially reverse and complementary codons (Watson-Crick (WC) base pairs at the 1st and 3rd codon positions and translated in reverse orientation) are preferentially co-located in protein structures. CONCLUSION: Exons are distinguished from introns, and codon boundaries are physico-chemically defined, by periodically distributed FFE differences between codon positions. There is a selection for local RNA secondary structures in coding regions and this nucleic acid structure resembles the folding profiles of the coded proteins. The preferentially (specifically) interacting amino acids are coded by partially complementary codons, which strongly supports the connection between mRNA and the corresponding protein structures and indicates that there is protein folding information in nucleic acids that is not present in the genetic code. This might suggest an additional explanation of codon redundancy.
format Text
id pubmed-1560374
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-15603742006-09-07 Indications that "codon boundaries" are physico-chemically defined and that protein-folding information is contained in the redundant exon bases Biro, Jan Charles Theor Biol Med Model Research BACKGROUND: All the information necessary for protein folding is supposed to be present in the amino acid sequence. It is still not possible to provide specific ab initio structure predictions by bioinformatical methods. It is suspected that additional folding information is present in protein coding nucleic acid sequences, but this is not represented by the known genetic code. RESULTS: Nucleic acid subsequences comprising the 1st and/or 3rd codon residues in mRNAs express significantly higher free folding energy (FFE) than the subsequence containing only the 2nd residues (p < 0.0001, n = 81). This periodic FFE difference is not present in introns. It is therefore a specific physico-chemical characteristic of coding sequences and might contribute to unambiguous definition of codon boundaries during translation. The FFEs of the 1st and 3rd residues are additive, which suggests that these residues contain a significant number of complementary bases and that may contribute to selection for local RNA secondary structures in coding regions. This periodic, codon-related structure-formation of mRNAs indicates a connection between the structures of exons and the corresponding (translated) proteins. The folding energy dot plots of RNAs and the residue contact maps of the coded proteins are indeed similar. Residue contact statistics using 81 different protein structures confirmed that amino acids that are coded by partially reverse and complementary codons (Watson-Crick (WC) base pairs at the 1st and 3rd codon positions and translated in reverse orientation) are preferentially co-located in protein structures. CONCLUSION: Exons are distinguished from introns, and codon boundaries are physico-chemically defined, by periodically distributed FFE differences between codon positions. There is a selection for local RNA secondary structures in coding regions and this nucleic acid structure resembles the folding profiles of the coded proteins. The preferentially (specifically) interacting amino acids are coded by partially complementary codons, which strongly supports the connection between mRNA and the corresponding protein structures and indicates that there is protein folding information in nucleic acids that is not present in the genetic code. This might suggest an additional explanation of codon redundancy. BioMed Central 2006-08-07 /pmc/articles/PMC1560374/ /pubmed/16893453 http://dx.doi.org/10.1186/1742-4682-3-28 Text en Copyright © 2006 Biro; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Biro, Jan Charles
Indications that "codon boundaries" are physico-chemically defined and that protein-folding information is contained in the redundant exon bases
title Indications that "codon boundaries" are physico-chemically defined and that protein-folding information is contained in the redundant exon bases
title_full Indications that "codon boundaries" are physico-chemically defined and that protein-folding information is contained in the redundant exon bases
title_fullStr Indications that "codon boundaries" are physico-chemically defined and that protein-folding information is contained in the redundant exon bases
title_full_unstemmed Indications that "codon boundaries" are physico-chemically defined and that protein-folding information is contained in the redundant exon bases
title_short Indications that "codon boundaries" are physico-chemically defined and that protein-folding information is contained in the redundant exon bases
title_sort indications that "codon boundaries" are physico-chemically defined and that protein-folding information is contained in the redundant exon bases
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1560374/
https://www.ncbi.nlm.nih.gov/pubmed/16893453
http://dx.doi.org/10.1186/1742-4682-3-28
work_keys_str_mv AT birojancharles indicationsthatcodonboundariesarephysicochemicallydefinedandthatproteinfoldinginformationiscontainedintheredundantexonbases