
A Convolutional Code-Based Sequence Analysis Model and Its Application

A new approach for encoding DNA sequences as input for DNA sequence analysis is proposed using the error correction coding theory of communication engineering. The encoder was designed as a convolutional code model whose generator matrix is designed based on the degeneracy of codons, with a codon tr...


Bibliographic Details
Main Authors: Liu, Xiao; Geng, Xiaoli
Format: Online Article Text
Language: English
Published: Molecular Diversity Preservation International (MDPI) 2013
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3645750/
https://www.ncbi.nlm.nih.gov/pubmed/23591850
http://dx.doi.org/10.3390/ijms14048393
author Liu, Xiao
Geng, Xiaoli
collection PubMed
description A new approach for encoding DNA sequences as input for DNA sequence analysis is proposed using the error correction coding theory of communication engineering. The encoder was designed as a convolutional code model whose generator matrix is designed based on the degeneracy of codons, with a codon treated in the model as an informational unit. The utility of the proposed model was demonstrated through the analysis of twelve prokaryote and nine eukaryote DNA sequences having different GC contents. Distinct differences in code distances were observed near the initiation and termination sites in the open reading frame, which provided a well-regulated characterization of the DNA sequences. Clearly distinguished period-3 features appeared in the coding regions, and the characteristic average code distances of the analyzed sequences were approximately proportional to their GC contents, particularly in the selected prokaryotic organisms, presenting the potential utility as an added taxonomic characteristic for use in studying the relationships of living organisms.
format Online
Article
Text
id pubmed-3645750
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Molecular Diversity Preservation International (MDPI)
record_format MEDLINE/PubMed
spelling pubmed-3645750 2013-05-13 A Convolutional Code-Based Sequence Analysis Model and Its Application Liu, Xiao Geng, Xiaoli Int J Mol Sci Article Molecular Diversity Preservation International (MDPI) 2013-04-16 /pmc/articles/PMC3645750/ /pubmed/23591850 http://dx.doi.org/10.3390/ijms14048393 Text en © 2013 by the authors; licensee MDPI, Basel, Switzerland http://creativecommons.org/licenses/by/3.0 This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).
title A Convolutional Code-Based Sequence Analysis Model and Its Application
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3645750/
https://www.ncbi.nlm.nih.gov/pubmed/23591850
http://dx.doi.org/10.3390/ijms14048393