Protein Coding Sequence Identification by Simultaneously Characterizing the Periodic and Random Features of DNA Sequences
Most codon indices used today are based on highly biased nonrandom usage of codons in coding regions. The background of a coding or noncoding DNA sequence, however, is fairly random, and can be characterized as a random fractal. When a gene-finding algorithm incorporates multiple sources of informat...
Main authors: | Gao, Jianbo; Qi, Yan; Cao, Yinhe; Tung, Wen-wen |
---|---|
Format: | Text |
Language: | English |
Published: | Hindawi Publishing Corporation 2005 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1184046/ https://www.ncbi.nlm.nih.gov/pubmed/16046819 http://dx.doi.org/10.1155/JBB.2005.139 |
_version_ | 1782124709123457024 |
---|---|
author | Gao, Jianbo Qi, Yan Cao, Yinhe Tung, Wen-wen |
author_facet | Gao, Jianbo Qi, Yan Cao, Yinhe Tung, Wen-wen |
author_sort | Gao, Jianbo |
collection | PubMed |
description | Most codon indices used today are based on highly biased nonrandom usage of codons in coding regions. The background of a coding or noncoding DNA sequence, however, is fairly random, and can be characterized as a random fractal. When a gene-finding algorithm incorporates multiple sources of information about coding regions, it becomes more successful. It is thus highly desirable to develop new and efficient codon indices by simultaneously characterizing the fractal and periodic features of a DNA sequence. In this paper, we describe a novel way of achieving this goal. The efficiency of the new codon index is evaluated by studying all of the 16 yeast chromosomes. In particular, we show that the method automatically and correctly identifies which of the three reading frames is the one that contains a gene. |
format | Text |
id | pubmed-1184046 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2005 |
publisher | Hindawi Publishing Corporation |
record_format | MEDLINE/PubMed |
spelling | pubmed-11840462005-09-07 Protein Coding Sequence Identification by Simultaneously Characterizing the Periodic and Random Features of DNA Sequences Gao, Jianbo Qi, Yan Cao, Yinhe Tung, Wen-wen J Biomed Biotechnol Research Article Most codon indices used today are based on highly biased nonrandom usage of codons in coding regions. The background of a coding or noncoding DNA sequence, however, is fairly random, and can be characterized as a random fractal. When a gene-finding algorithm incorporates multiple sources of information about coding regions, it becomes more successful. It is thus highly desirable to develop new and efficient codon indices by simultaneously characterizing the fractal and periodic features of a DNA sequence. In this paper, we describe a novel way of achieving this goal. The efficiency of the new codon index is evaluated by studying all of the 16 yeast chromosomes. In particular, we show that the method automatically and correctly identifies which of the three reading frames is the one that contains a gene. Hindawi Publishing Corporation 2005 /pmc/articles/PMC1184046/ /pubmed/16046819 http://dx.doi.org/10.1155/JBB.2005.139 Text en Hindawi Publishing Corporation |
spellingShingle | Research Article Gao, Jianbo Qi, Yan Cao, Yinhe Tung, Wen-wen Protein Coding Sequence Identification by Simultaneously Characterizing the Periodic and Random Features of DNA Sequences |
title | Protein Coding Sequence Identification by Simultaneously Characterizing the Periodic and Random Features of DNA Sequences |
title_full | Protein Coding Sequence Identification by Simultaneously Characterizing the Periodic and Random Features of DNA Sequences |
title_fullStr | Protein Coding Sequence Identification by Simultaneously Characterizing the Periodic and Random Features of DNA Sequences |
title_full_unstemmed | Protein Coding Sequence Identification by Simultaneously Characterizing the Periodic and Random Features of DNA Sequences |
title_short | Protein Coding Sequence Identification by Simultaneously Characterizing the Periodic and Random Features of DNA Sequences |
title_sort | protein coding sequence identification by simultaneously characterizing the periodic and random features of dna sequences |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1184046/ https://www.ncbi.nlm.nih.gov/pubmed/16046819 http://dx.doi.org/10.1155/JBB.2005.139 |
work_keys_str_mv | AT gaojianbo proteincodingsequenceidentificationbysimultaneouslycharacterizingtheperiodicandrandomfeaturesofdnasequences AT qiyan proteincodingsequenceidentificationbysimultaneouslycharacterizingtheperiodicandrandomfeaturesofdnasequences AT caoyinhe proteincodingsequenceidentificationbysimultaneouslycharacterizingtheperiodicandrandomfeaturesofdnasequences AT tungwenwen proteincodingsequenceidentificationbysimultaneouslycharacterizingtheperiodicandrandomfeaturesofdnasequences |