Cargando…

Categorical spectral analysis of periodicity in human and viral genomes

Periodicity in nucleotide sequences arises from regular repeating patterns which may reflect important structure and function. Although a three-base periodicity in coding regions has been known for some time and has provided the basis for powerful gene prediction algorithms, its origins are still no...

Descripción completa

Detalles Bibliográficos
Autores principales: Howe, Elizabeth D., Song, Jun S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3561982/
https://www.ncbi.nlm.nih.gov/pubmed/23241388
http://dx.doi.org/10.1093/nar/gks1261
_version_ 1782258027080974336
author Howe, Elizabeth D.
Song, Jun S.
author_facet Howe, Elizabeth D.
Song, Jun S.
author_sort Howe, Elizabeth D.
collection PubMed
description Periodicity in nucleotide sequences arises from regular repeating patterns which may reflect important structure and function. Although a three-base periodicity in coding regions has been known for some time and has provided the basis for powerful gene prediction algorithms, its origins are still not fully understood. Here, we show that, contrary to common belief, amino acid (AA) bias and codon usage bias are insufficient to create base-3 periodicity. This article applies the rigorous method of spectral envelope to systematically characterize the contributions of codon bias, AA bias and protein structural motifs to the three-base periodicity of coding sequences. The method is also used to classify CpG islands in the human genome. In addition, we show how spectral envelope can be used to trace the evolution of viral genomes and monitor global sequence changes without having to align to previously known genomes. This approach also detects reassortment events, such as those that led to the 2009 pandemic H1N1 virus.
format Online
Article
Text
id pubmed-3561982
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-35619822013-02-01 Categorical spectral analysis of periodicity in human and viral genomes Howe, Elizabeth D. Song, Jun S. Nucleic Acids Res Computational Biology Periodicity in nucleotide sequences arises from regular repeating patterns which may reflect important structure and function. Although a three-base periodicity in coding regions has been known for some time and has provided the basis for powerful gene prediction algorithms, its origins are still not fully understood. Here, we show that, contrary to common belief, amino acid (AA) bias and codon usage bias are insufficient to create base-3 periodicity. This article applies the rigorous method of spectral envelope to systematically characterize the contributions of codon bias, AA bias and protein structural motifs to the three-base periodicity of coding sequences. The method is also used to classify CpG islands in the human genome. In addition, we show how spectral envelope can be used to trace the evolution of viral genomes and monitor global sequence changes without having to align to previously known genomes. This approach also detects reassortment events, such as those that led to the 2009 pandemic H1N1 virus. Oxford University Press 2013-02 2012-12-14 /pmc/articles/PMC3561982/ /pubmed/23241388 http://dx.doi.org/10.1093/nar/gks1261 Text en © The Author(s) 2012. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial reuse, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com.
spellingShingle Computational Biology
Howe, Elizabeth D.
Song, Jun S.
Categorical spectral analysis of periodicity in human and viral genomes
title Categorical spectral analysis of periodicity in human and viral genomes
title_full Categorical spectral analysis of periodicity in human and viral genomes
title_fullStr Categorical spectral analysis of periodicity in human and viral genomes
title_full_unstemmed Categorical spectral analysis of periodicity in human and viral genomes
title_short Categorical spectral analysis of periodicity in human and viral genomes
title_sort categorical spectral analysis of periodicity in human and viral genomes
topic Computational Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3561982/
https://www.ncbi.nlm.nih.gov/pubmed/23241388
http://dx.doi.org/10.1093/nar/gks1261
work_keys_str_mv AT howeelizabethd categoricalspectralanalysisofperiodicityinhumanandviralgenomes
AT songjuns categoricalspectralanalysisofperiodicityinhumanandviralgenomes