Cargando…
A new method to cluster DNA sequences using Fourier power spectrum
A novel clustering method is proposed to classify genes and genomes. For a given DNA sequence, a binary indicator sequence of each nucleotide is constructed, and Discrete Fourier Transform is applied on these four sequences to attain respective power spectra. Mathematical moments are built from thes...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier Ltd.
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7094126/ https://www.ncbi.nlm.nih.gov/pubmed/25747773 http://dx.doi.org/10.1016/j.jtbi.2015.02.026 |
_version_ | 1783510405959647232 |
---|---|
author | Hoang, Tung Yin, Changchuan Zheng, Hui Yu, Chenglong Lucy He, Rong Yau, Stephen S.-T. |
author_facet | Hoang, Tung Yin, Changchuan Zheng, Hui Yu, Chenglong Lucy He, Rong Yau, Stephen S.-T. |
author_sort | Hoang, Tung |
collection | PubMed |
description | A novel clustering method is proposed to classify genes and genomes. For a given DNA sequence, a binary indicator sequence of each nucleotide is constructed, and Discrete Fourier Transform is applied on these four sequences to attain respective power spectra. Mathematical moments are built from these spectra, and multidimensional vectors of real numbers are constructed from these moments. Cluster analysis is then performed in order to determine the evolutionary relationship between DNA sequences. The novelty of this method is that sequences with different lengths can be compared easily via the use of power spectra and moments. Experimental results on various datasets show that the proposed method provides an efficient tool to classify genes and genomes. It not only gives comparable results but also is remarkably faster than other multiple sequence alignment and alignment-free methods. |
format | Online Article Text |
id | pubmed-7094126 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Elsevier Ltd. |
record_format | MEDLINE/PubMed |
spelling | pubmed-70941262020-03-25 A new method to cluster DNA sequences using Fourier power spectrum Hoang, Tung Yin, Changchuan Zheng, Hui Yu, Chenglong Lucy He, Rong Yau, Stephen S.-T. J Theor Biol Article A novel clustering method is proposed to classify genes and genomes. For a given DNA sequence, a binary indicator sequence of each nucleotide is constructed, and Discrete Fourier Transform is applied on these four sequences to attain respective power spectra. Mathematical moments are built from these spectra, and multidimensional vectors of real numbers are constructed from these moments. Cluster analysis is then performed in order to determine the evolutionary relationship between DNA sequences. The novelty of this method is that sequences with different lengths can be compared easily via the use of power spectra and moments. Experimental results on various datasets show that the proposed method provides an efficient tool to classify genes and genomes. It not only gives comparable results but also is remarkably faster than other multiple sequence alignment and alignment-free methods. Elsevier Ltd. 2015-05-07 2015-03-05 /pmc/articles/PMC7094126/ /pubmed/25747773 http://dx.doi.org/10.1016/j.jtbi.2015.02.026 Text en Copyright © 2015 Elsevier Ltd. All rights reserved. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active. |
spellingShingle | Article Hoang, Tung Yin, Changchuan Zheng, Hui Yu, Chenglong Lucy He, Rong Yau, Stephen S.-T. A new method to cluster DNA sequences using Fourier power spectrum |
title | A new method to cluster DNA sequences using Fourier power spectrum |
title_full | A new method to cluster DNA sequences using Fourier power spectrum |
title_fullStr | A new method to cluster DNA sequences using Fourier power spectrum |
title_full_unstemmed | A new method to cluster DNA sequences using Fourier power spectrum |
title_short | A new method to cluster DNA sequences using Fourier power spectrum |
title_sort | new method to cluster dna sequences using fourier power spectrum |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7094126/ https://www.ncbi.nlm.nih.gov/pubmed/25747773 http://dx.doi.org/10.1016/j.jtbi.2015.02.026 |
work_keys_str_mv | AT hoangtung anewmethodtoclusterdnasequencesusingfourierpowerspectrum AT yinchangchuan anewmethodtoclusterdnasequencesusingfourierpowerspectrum AT zhenghui anewmethodtoclusterdnasequencesusingfourierpowerspectrum AT yuchenglong anewmethodtoclusterdnasequencesusingfourierpowerspectrum AT lucyherong anewmethodtoclusterdnasequencesusingfourierpowerspectrum AT yaustephenst anewmethodtoclusterdnasequencesusingfourierpowerspectrum AT hoangtung newmethodtoclusterdnasequencesusingfourierpowerspectrum AT yinchangchuan newmethodtoclusterdnasequencesusingfourierpowerspectrum AT zhenghui newmethodtoclusterdnasequencesusingfourierpowerspectrum AT yuchenglong newmethodtoclusterdnasequencesusingfourierpowerspectrum AT lucyherong newmethodtoclusterdnasequencesusingfourierpowerspectrum AT yaustephenst newmethodtoclusterdnasequencesusingfourierpowerspectrum |