Cargando…

General Designs Reveal Distinct Codes in Protein-Coding and Non-Coding Human DNA

This study seeks to investigate distinct signatures and codes within different genomic sequence locations of the human genome. The promoter and other non-coding regions contain sites for the binding of biological particles, for processes such as transcription regulation. The specific rules and seque...

Descripción completa

Detalles Bibliográficos
Autor principal: Cohen, Dana
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9690640/
https://www.ncbi.nlm.nih.gov/pubmed/36360206
http://dx.doi.org/10.3390/genes13111970
_version_ 1784836841243934720
author Cohen, Dana
author_facet Cohen, Dana
author_sort Cohen, Dana
collection PubMed
description This study seeks to investigate distinct signatures and codes within different genomic sequence locations of the human genome. The promoter and other non-coding regions contain sites for the binding of biological particles, for processes such as transcription regulation. The specific rules and sequence codes that govern this remain poorly understood. To derive these (codes), the general designs of sequence are investigated. Genomic signatures are a powerful tool for assessing the general designs of sequence, and cross-comparing different genomic regions for their distinct sequence properties. Through these genomic signatures, the relative non-random properties of sequences are also assessed. Furthermore, a binary components analysis is carried out making use of information theory ideas, to study the RY (purine/pyrimidine), WS (weak/strong) and KM (keto/amino) signatures in the sequences. From this comparison, it is possible to identify the relative importance of these properties within the various protein-coding and non-coding genomic locations. The results show that coding DNA has a strongly non-random WS signature, which reflects the genetic code, and the hydrogen-bond base pairing of codon–anti-codon interactions. In contrast, non-coding locations, such as the promoter, contain a distinct genomic signature. A prominent feature throughout non-coding DNA is a highly non-random RY signature, which is very different in nature to coding DNA, and suggests a structural-based RY code. This marks progress towards deciphering the unknown code(s) in non-protein-coding DNA, and a further understanding of the coding DNA. Additionally, it unravels how DNA carries information. These findings have implications for the most fundamental principles of biology, including knowledge of gene regulation, development and disease.
format Online
Article
Text
id pubmed-9690640
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-96906402022-11-25 General Designs Reveal Distinct Codes in Protein-Coding and Non-Coding Human DNA Cohen, Dana Genes (Basel) Article This study seeks to investigate distinct signatures and codes within different genomic sequence locations of the human genome. The promoter and other non-coding regions contain sites for the binding of biological particles, for processes such as transcription regulation. The specific rules and sequence codes that govern this remain poorly understood. To derive these (codes), the general designs of sequence are investigated. Genomic signatures are a powerful tool for assessing the general designs of sequence, and cross-comparing different genomic regions for their distinct sequence properties. Through these genomic signatures, the relative non-random properties of sequences are also assessed. Furthermore, a binary components analysis is carried out making use of information theory ideas, to study the RY (purine/pyrimidine), WS (weak/strong) and KM (keto/amino) signatures in the sequences. From this comparison, it is possible to identify the relative importance of these properties within the various protein-coding and non-coding genomic locations. The results show that coding DNA has a strongly non-random WS signature, which reflects the genetic code, and the hydrogen-bond base pairing of codon–anti-codon interactions. In contrast, non-coding locations, such as the promoter, contain a distinct genomic signature. A prominent feature throughout non-coding DNA is a highly non-random RY signature, which is very different in nature to coding DNA, and suggests a structural-based RY code. This marks progress towards deciphering the unknown code(s) in non-protein-coding DNA, and a further understanding of the coding DNA. Additionally, it unravels how DNA carries information. These findings have implications for the most fundamental principles of biology, including knowledge of gene regulation, development and disease. MDPI 2022-10-28 /pmc/articles/PMC9690640/ /pubmed/36360206 http://dx.doi.org/10.3390/genes13111970 Text en © 2022 by the author. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Cohen, Dana
General Designs Reveal Distinct Codes in Protein-Coding and Non-Coding Human DNA
title General Designs Reveal Distinct Codes in Protein-Coding and Non-Coding Human DNA
title_full General Designs Reveal Distinct Codes in Protein-Coding and Non-Coding Human DNA
title_fullStr General Designs Reveal Distinct Codes in Protein-Coding and Non-Coding Human DNA
title_full_unstemmed General Designs Reveal Distinct Codes in Protein-Coding and Non-Coding Human DNA
title_short General Designs Reveal Distinct Codes in Protein-Coding and Non-Coding Human DNA
title_sort general designs reveal distinct codes in protein-coding and non-coding human dna
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9690640/
https://www.ncbi.nlm.nih.gov/pubmed/36360206
http://dx.doi.org/10.3390/genes13111970
work_keys_str_mv AT cohendana generaldesignsrevealdistinctcodesinproteincodingandnoncodinghumandna