
Protein Coding Sequence Identification by Simultaneously Characterizing the Periodic and Random Features of DNA Sequences

Most codon indices used today are based on highly biased nonrandom usage of codons in coding regions. The background of a coding or noncoding DNA sequence, however, is fairly random, and can be characterized as a random fractal. When a gene-finding algorithm incorporates multiple sources of informat...


Bibliographic Details
Main Authors: Gao, Jianbo; Qi, Yan; Cao, Yinhe; Tung, Wen-wen
Format: Text
Language: English
Published: Hindawi Publishing Corporation 2005
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1184046/
https://www.ncbi.nlm.nih.gov/pubmed/16046819
http://dx.doi.org/10.1155/JBB.2005.139
author Gao, Jianbo
Qi, Yan
Cao, Yinhe
Tung, Wen-wen
collection PubMed
description Most codon indices used today are based on highly biased nonrandom usage of codons in coding regions. The background of a coding or noncoding DNA sequence, however, is fairly random, and can be characterized as a random fractal. When a gene-finding algorithm incorporates multiple sources of information about coding regions, it becomes more successful. It is thus highly desirable to develop new and efficient codon indices by simultaneously characterizing the fractal and periodic features of a DNA sequence. In this paper, we describe a novel way of achieving this goal. The efficiency of the new codon index is evaluated by studying all of the 16 yeast chromosomes. In particular, we show that the method automatically and correctly identifies which of the three reading frames is the one that contains a gene.
format Text
id pubmed-1184046
institution National Center for Biotechnology Information
language English
publishDate 2005
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-1184046; 2005-09-07; J Biomed Biotechnol; Research Article; Text; en
title Protein Coding Sequence Identification by Simultaneously Characterizing the Periodic and Random Features of DNA Sequences
topic Research Article