Cargando…

Application of discrete Fourier inter-coefficient difference for assessing genetic sequence similarity

Digital signal processing (DSP) techniques for biological sequence analysis continue to grow in popularity due to the inherent digital nature of these sequences. DSP methods have demonstrated early success for detection of coding regions in a gene. Recently, these methods are being used to establish...

Descripción completa

Detalles Bibliográficos
Autores principales: King, Brian R, Aburdene, Maurice, Thompson, Alex, Warres, Zach
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4077688/
https://www.ncbi.nlm.nih.gov/pubmed/24991213
http://dx.doi.org/10.1186/1687-4153-2014-8
_version_ 1782323634860195840
author King, Brian R
Aburdene, Maurice
Thompson, Alex
Warres, Zach
author_facet King, Brian R
Aburdene, Maurice
Thompson, Alex
Warres, Zach
author_sort King, Brian R
collection PubMed
description Digital signal processing (DSP) techniques for biological sequence analysis continue to grow in popularity due to the inherent digital nature of these sequences. DSP methods have demonstrated early success for detection of coding regions in a gene. Recently, these methods are being used to establish DNA gene similarity. We present the inter-coefficient difference (ICD) transformation, a novel extension of the discrete Fourier transformation, which can be applied to any DNA sequence. The ICD method is a mathematical, alignment-free DNA comparison method that generates a genetic signature for any DNA sequence that is used to generate relative measures of similarity among DNA sequences. We demonstrate our method on a set of insulin genes obtained from an evolutionarily wide range of species, and on a set of avian influenza viral sequences, which represents a set of highly similar sequences. We compare phylogenetic trees generated using our technique against trees generated using traditional alignment techniques for similarity and demonstrate that the ICD method produces a highly accurate tree without requiring an alignment prior to establishing sequence similarity.
format Online
Article
Text
id pubmed-4077688
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-40776882014-07-02 Application of discrete Fourier inter-coefficient difference for assessing genetic sequence similarity King, Brian R Aburdene, Maurice Thompson, Alex Warres, Zach EURASIP J Bioinform Syst Biol Research Digital signal processing (DSP) techniques for biological sequence analysis continue to grow in popularity due to the inherent digital nature of these sequences. DSP methods have demonstrated early success for detection of coding regions in a gene. Recently, these methods are being used to establish DNA gene similarity. We present the inter-coefficient difference (ICD) transformation, a novel extension of the discrete Fourier transformation, which can be applied to any DNA sequence. The ICD method is a mathematical, alignment-free DNA comparison method that generates a genetic signature for any DNA sequence that is used to generate relative measures of similarity among DNA sequences. We demonstrate our method on a set of insulin genes obtained from an evolutionarily wide range of species, and on a set of avian influenza viral sequences, which represents a set of highly similar sequences. We compare phylogenetic trees generated using our technique against trees generated using traditional alignment techniques for similarity and demonstrate that the ICD method produces a highly accurate tree without requiring an alignment prior to establishing sequence similarity. BioMed Central 2014 2014-05-28 /pmc/articles/PMC4077688/ /pubmed/24991213 http://dx.doi.org/10.1186/1687-4153-2014-8 Text en Copyright © 2014 King et al.; licensee Springer. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited.
spellingShingle Research
King, Brian R
Aburdene, Maurice
Thompson, Alex
Warres, Zach
Application of discrete Fourier inter-coefficient difference for assessing genetic sequence similarity
title Application of discrete Fourier inter-coefficient difference for assessing genetic sequence similarity
title_full Application of discrete Fourier inter-coefficient difference for assessing genetic sequence similarity
title_fullStr Application of discrete Fourier inter-coefficient difference for assessing genetic sequence similarity
title_full_unstemmed Application of discrete Fourier inter-coefficient difference for assessing genetic sequence similarity
title_short Application of discrete Fourier inter-coefficient difference for assessing genetic sequence similarity
title_sort application of discrete fourier inter-coefficient difference for assessing genetic sequence similarity
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4077688/
https://www.ncbi.nlm.nih.gov/pubmed/24991213
http://dx.doi.org/10.1186/1687-4153-2014-8
work_keys_str_mv AT kingbrianr applicationofdiscretefourierintercoefficientdifferenceforassessinggeneticsequencesimilarity
AT aburdenemaurice applicationofdiscretefourierintercoefficientdifferenceforassessinggeneticsequencesimilarity
AT thompsonalex applicationofdiscretefourierintercoefficientdifferenceforassessinggeneticsequencesimilarity
AT warreszach applicationofdiscretefourierintercoefficientdifferenceforassessinggeneticsequencesimilarity