Cargando…

A new method to cluster DNA sequences using Fourier power spectrum

A novel clustering method is proposed to classify genes and genomes. For a given DNA sequence, a binary indicator sequence of each nucleotide is constructed, and Discrete Fourier Transform is applied on these four sequences to attain respective power spectra. Mathematical moments are built from thes...

Descripción completa

Detalles Bibliográficos
Autores principales: Hoang, Tung, Yin, Changchuan, Zheng, Hui, Yu, Chenglong, Lucy He, Rong, Yau, Stephen S.-T.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier Ltd. 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7094126/
https://www.ncbi.nlm.nih.gov/pubmed/25747773
http://dx.doi.org/10.1016/j.jtbi.2015.02.026
_version_ 1783510405959647232
author Hoang, Tung
Yin, Changchuan
Zheng, Hui
Yu, Chenglong
Lucy He, Rong
Yau, Stephen S.-T.
author_facet Hoang, Tung
Yin, Changchuan
Zheng, Hui
Yu, Chenglong
Lucy He, Rong
Yau, Stephen S.-T.
author_sort Hoang, Tung
collection PubMed
description A novel clustering method is proposed to classify genes and genomes. For a given DNA sequence, a binary indicator sequence of each nucleotide is constructed, and Discrete Fourier Transform is applied on these four sequences to attain respective power spectra. Mathematical moments are built from these spectra, and multidimensional vectors of real numbers are constructed from these moments. Cluster analysis is then performed in order to determine the evolutionary relationship between DNA sequences. The novelty of this method is that sequences with different lengths can be compared easily via the use of power spectra and moments. Experimental results on various datasets show that the proposed method provides an efficient tool to classify genes and genomes. It not only gives comparable results but also is remarkably faster than other multiple sequence alignment and alignment-free methods.
format Online
Article
Text
id pubmed-7094126
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Elsevier Ltd.
record_format MEDLINE/PubMed
spelling pubmed-70941262020-03-25 A new method to cluster DNA sequences using Fourier power spectrum Hoang, Tung Yin, Changchuan Zheng, Hui Yu, Chenglong Lucy He, Rong Yau, Stephen S.-T. J Theor Biol Article A novel clustering method is proposed to classify genes and genomes. For a given DNA sequence, a binary indicator sequence of each nucleotide is constructed, and Discrete Fourier Transform is applied on these four sequences to attain respective power spectra. Mathematical moments are built from these spectra, and multidimensional vectors of real numbers are constructed from these moments. Cluster analysis is then performed in order to determine the evolutionary relationship between DNA sequences. The novelty of this method is that sequences with different lengths can be compared easily via the use of power spectra and moments. Experimental results on various datasets show that the proposed method provides an efficient tool to classify genes and genomes. It not only gives comparable results but also is remarkably faster than other multiple sequence alignment and alignment-free methods. Elsevier Ltd. 2015-05-07 2015-03-05 /pmc/articles/PMC7094126/ /pubmed/25747773 http://dx.doi.org/10.1016/j.jtbi.2015.02.026 Text en Copyright © 2015 Elsevier Ltd. All rights reserved. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active.
spellingShingle Article
Hoang, Tung
Yin, Changchuan
Zheng, Hui
Yu, Chenglong
Lucy He, Rong
Yau, Stephen S.-T.
A new method to cluster DNA sequences using Fourier power spectrum
title A new method to cluster DNA sequences using Fourier power spectrum
title_full A new method to cluster DNA sequences using Fourier power spectrum
title_fullStr A new method to cluster DNA sequences using Fourier power spectrum
title_full_unstemmed A new method to cluster DNA sequences using Fourier power spectrum
title_short A new method to cluster DNA sequences using Fourier power spectrum
title_sort new method to cluster dna sequences using fourier power spectrum
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7094126/
https://www.ncbi.nlm.nih.gov/pubmed/25747773
http://dx.doi.org/10.1016/j.jtbi.2015.02.026
work_keys_str_mv AT hoangtung anewmethodtoclusterdnasequencesusingfourierpowerspectrum
AT yinchangchuan anewmethodtoclusterdnasequencesusingfourierpowerspectrum
AT zhenghui anewmethodtoclusterdnasequencesusingfourierpowerspectrum
AT yuchenglong anewmethodtoclusterdnasequencesusingfourierpowerspectrum
AT lucyherong anewmethodtoclusterdnasequencesusingfourierpowerspectrum
AT yaustephenst anewmethodtoclusterdnasequencesusingfourierpowerspectrum
AT hoangtung newmethodtoclusterdnasequencesusingfourierpowerspectrum
AT yinchangchuan newmethodtoclusterdnasequencesusingfourierpowerspectrum
AT zhenghui newmethodtoclusterdnasequencesusingfourierpowerspectrum
AT yuchenglong newmethodtoclusterdnasequencesusingfourierpowerspectrum
AT lucyherong newmethodtoclusterdnasequencesusingfourierpowerspectrum
AT yaustephenst newmethodtoclusterdnasequencesusingfourierpowerspectrum