Cargando…

Characterizing exons and introns by regularity of nucleotide strings

BACKGROUND: Translation of nucleotides into a numeric form has been approached in many ways and has allowed researchers to investigate the properties of protein-coding sequences and noncoding sequences. Typically, more pronounced long-range correlations and increased regularity were found in intron-...

Descripción completa

Detalles Bibliográficos
Autores principales: Woods, Tonya, Preeprem, Thanawadee, Lee, Kichun, Chang, Woojin, Vidakovic, Brani
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4745173/
https://www.ncbi.nlm.nih.gov/pubmed/26857564
http://dx.doi.org/10.1186/s13062-016-0108-7
_version_ 1782414592332267520
author Woods, Tonya
Preeprem, Thanawadee
Lee, Kichun
Chang, Woojin
Vidakovic, Brani
author_facet Woods, Tonya
Preeprem, Thanawadee
Lee, Kichun
Chang, Woojin
Vidakovic, Brani
author_sort Woods, Tonya
collection PubMed
description BACKGROUND: Translation of nucleotides into a numeric form has been approached in many ways and has allowed researchers to investigate the properties of protein-coding sequences and noncoding sequences. Typically, more pronounced long-range correlations and increased regularity were found in intron-containing genes and in non-transcribed regulatory DNA sequences, compared to cDNA sequences or intron-less genes. The regularity is assessed by spectral tools defined on numerical translates. In most popular approaches of numerical translation the resulting spectra depend on the assignment of numerical values to nucleotides. Our contribution is to propose and illustrate a spectra which remains invariant to the translation rules used in traditional approaches. RESULTS: We outline a methodology for representing sequences of DNA nucleotides as numeric matrices in order to analytically investigate important structural characteristics of DNA. This representation allows us to compute the 2-dimensional wavelet transformation and assess regularity characteristics of the sequence via the slope of the wavelet spectra. In addition to computing a global slope measure for a sequence, we can apply our methodology for overlapping sections of nucleotides to obtain an “evolutionary slope.” To illustrate our methodology, we analyzed 376 gene sequences from the first chromosome of the honeybee. CONCLUSION: For the genes analyzed, we find that introns are significantly more regular (lead to more negative spectral slopes) than exons, which agrees with the results from the literature where regularity is measured on “DNA walks”. However, unlike DNA walks where the nucleotides are assigned numerical values depending on nucleotide characteristics (purine-pyrimidine, weak-strong hydrogen bonds, keto-amino, etc.) or other spatial assignments, the proposed spectral tool is invariant to the assignment of nucleotides. Thus, ambiguity in numerical translation of nucleotides is eliminated. REVIEWERS: This article was reviewed by Dr. Vladimir Kuznetsov, Professor Marek Kimmel and Dr. Natsuhiro Ichinose (nominated by Professor Masanori Arita). ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13062-016-0108-7) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4745173
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-47451732016-02-09 Characterizing exons and introns by regularity of nucleotide strings Woods, Tonya Preeprem, Thanawadee Lee, Kichun Chang, Woojin Vidakovic, Brani Biol Direct Research BACKGROUND: Translation of nucleotides into a numeric form has been approached in many ways and has allowed researchers to investigate the properties of protein-coding sequences and noncoding sequences. Typically, more pronounced long-range correlations and increased regularity were found in intron-containing genes and in non-transcribed regulatory DNA sequences, compared to cDNA sequences or intron-less genes. The regularity is assessed by spectral tools defined on numerical translates. In most popular approaches of numerical translation the resulting spectra depend on the assignment of numerical values to nucleotides. Our contribution is to propose and illustrate a spectra which remains invariant to the translation rules used in traditional approaches. RESULTS: We outline a methodology for representing sequences of DNA nucleotides as numeric matrices in order to analytically investigate important structural characteristics of DNA. This representation allows us to compute the 2-dimensional wavelet transformation and assess regularity characteristics of the sequence via the slope of the wavelet spectra. In addition to computing a global slope measure for a sequence, we can apply our methodology for overlapping sections of nucleotides to obtain an “evolutionary slope.” To illustrate our methodology, we analyzed 376 gene sequences from the first chromosome of the honeybee. CONCLUSION: For the genes analyzed, we find that introns are significantly more regular (lead to more negative spectral slopes) than exons, which agrees with the results from the literature where regularity is measured on “DNA walks”. However, unlike DNA walks where the nucleotides are assigned numerical values depending on nucleotide characteristics (purine-pyrimidine, weak-strong hydrogen bonds, keto-amino, etc.) or other spatial assignments, the proposed spectral tool is invariant to the assignment of nucleotides. Thus, ambiguity in numerical translation of nucleotides is eliminated. REVIEWERS: This article was reviewed by Dr. Vladimir Kuznetsov, Professor Marek Kimmel and Dr. Natsuhiro Ichinose (nominated by Professor Masanori Arita). ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13062-016-0108-7) contains supplementary material, which is available to authorized users. BioMed Central 2016-02-08 /pmc/articles/PMC4745173/ /pubmed/26857564 http://dx.doi.org/10.1186/s13062-016-0108-7 Text en © Woods et al. 2016 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Woods, Tonya
Preeprem, Thanawadee
Lee, Kichun
Chang, Woojin
Vidakovic, Brani
Characterizing exons and introns by regularity of nucleotide strings
title Characterizing exons and introns by regularity of nucleotide strings
title_full Characterizing exons and introns by regularity of nucleotide strings
title_fullStr Characterizing exons and introns by regularity of nucleotide strings
title_full_unstemmed Characterizing exons and introns by regularity of nucleotide strings
title_short Characterizing exons and introns by regularity of nucleotide strings
title_sort characterizing exons and introns by regularity of nucleotide strings
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4745173/
https://www.ncbi.nlm.nih.gov/pubmed/26857564
http://dx.doi.org/10.1186/s13062-016-0108-7
work_keys_str_mv AT woodstonya characterizingexonsandintronsbyregularityofnucleotidestrings
AT preepremthanawadee characterizingexonsandintronsbyregularityofnucleotidestrings
AT leekichun characterizingexonsandintronsbyregularityofnucleotidestrings
AT changwoojin characterizingexonsandintronsbyregularityofnucleotidestrings
AT vidakovicbrani characterizingexonsandintronsbyregularityofnucleotidestrings