Cargando…

Visualization of the protein-coding regions with a self adaptive spectral rotation approach

Identifying protein-coding regions in DNA sequences is an active issue in computational biology. In this study, we present a self adaptive spectral rotation (SASR) approach, which visualizes coding regions in DNA sequences, based on investigation of the Triplet Periodicity property, without any prec...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Bo, Ji, Ping
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3017620/
https://www.ncbi.nlm.nih.gov/pubmed/20947567
http://dx.doi.org/10.1093/nar/gkq891
_version_ 1782195929028231168
author Chen, Bo
Ji, Ping
author_facet Chen, Bo
Ji, Ping
author_sort Chen, Bo
collection PubMed
description Identifying protein-coding regions in DNA sequences is an active issue in computational biology. In this study, we present a self adaptive spectral rotation (SASR) approach, which visualizes coding regions in DNA sequences, based on investigation of the Triplet Periodicity property, without any preceding training process. It is proposed to help with the rough coding regions prediction when there is no extra information for the training required by other outstanding methods. In this approach, at each position in the DNA sequence, a Fourier spectrum is calculated from the posterior subsequence. Following the spectrums, a random walk in complex plane is generated as the SASR's graphic output. Applications of the SASR on real DNA data show that patterns in the graphic output reveal locations of the coding regions and the frame shifts between them: arcs indicate coding regions, stable points indicate non-coding regions and corners’ shapes reveal frame shifts. Tests on genomic data set from Saccharomyces Cerevisiae reveal that the graphic patterns for coding and non-coding regions differ to a great extent, so that the coding regions can be visually distinguished. Meanwhile, a time cost test shows that the SASR can be easily implemented with the computational complexity of O(N).
format Text
id pubmed-3017620
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-30176202011-01-10 Visualization of the protein-coding regions with a self adaptive spectral rotation approach Chen, Bo Ji, Ping Nucleic Acids Res Methods Online Identifying protein-coding regions in DNA sequences is an active issue in computational biology. In this study, we present a self adaptive spectral rotation (SASR) approach, which visualizes coding regions in DNA sequences, based on investigation of the Triplet Periodicity property, without any preceding training process. It is proposed to help with the rough coding regions prediction when there is no extra information for the training required by other outstanding methods. In this approach, at each position in the DNA sequence, a Fourier spectrum is calculated from the posterior subsequence. Following the spectrums, a random walk in complex plane is generated as the SASR's graphic output. Applications of the SASR on real DNA data show that patterns in the graphic output reveal locations of the coding regions and the frame shifts between them: arcs indicate coding regions, stable points indicate non-coding regions and corners’ shapes reveal frame shifts. Tests on genomic data set from Saccharomyces Cerevisiae reveal that the graphic patterns for coding and non-coding regions differ to a great extent, so that the coding regions can be visually distinguished. Meanwhile, a time cost test shows that the SASR can be easily implemented with the computational complexity of O(N). Oxford University Press 2011-01 2010-10-14 /pmc/articles/PMC3017620/ /pubmed/20947567 http://dx.doi.org/10.1093/nar/gkq891 Text en © The Author(s) 2010. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.5 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
Chen, Bo
Ji, Ping
Visualization of the protein-coding regions with a self adaptive spectral rotation approach
title Visualization of the protein-coding regions with a self adaptive spectral rotation approach
title_full Visualization of the protein-coding regions with a self adaptive spectral rotation approach
title_fullStr Visualization of the protein-coding regions with a self adaptive spectral rotation approach
title_full_unstemmed Visualization of the protein-coding regions with a self adaptive spectral rotation approach
title_short Visualization of the protein-coding regions with a self adaptive spectral rotation approach
title_sort visualization of the protein-coding regions with a self adaptive spectral rotation approach
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3017620/
https://www.ncbi.nlm.nih.gov/pubmed/20947567
http://dx.doi.org/10.1093/nar/gkq891
work_keys_str_mv AT chenbo visualizationoftheproteincodingregionswithaselfadaptivespectralrotationapproach
AT jiping visualizationoftheproteincodingregionswithaselfadaptivespectralrotationapproach