Cargando…
Application of discrete Fourier inter-coefficient difference for assessing genetic sequence similarity
Digital signal processing (DSP) techniques for biological sequence analysis continue to grow in popularity due to the inherent digital nature of these sequences. DSP methods have demonstrated early success for detection of coding regions in a gene. Recently, these methods are being used to establish...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4077688/ https://www.ncbi.nlm.nih.gov/pubmed/24991213 http://dx.doi.org/10.1186/1687-4153-2014-8 |
_version_ | 1782323634860195840 |
---|---|
author | King, Brian R Aburdene, Maurice Thompson, Alex Warres, Zach |
author_facet | King, Brian R Aburdene, Maurice Thompson, Alex Warres, Zach |
author_sort | King, Brian R |
collection | PubMed |
description | Digital signal processing (DSP) techniques for biological sequence analysis continue to grow in popularity due to the inherent digital nature of these sequences. DSP methods have demonstrated early success for detection of coding regions in a gene. Recently, these methods are being used to establish DNA gene similarity. We present the inter-coefficient difference (ICD) transformation, a novel extension of the discrete Fourier transformation, which can be applied to any DNA sequence. The ICD method is a mathematical, alignment-free DNA comparison method that generates a genetic signature for any DNA sequence that is used to generate relative measures of similarity among DNA sequences. We demonstrate our method on a set of insulin genes obtained from an evolutionarily wide range of species, and on a set of avian influenza viral sequences, which represents a set of highly similar sequences. We compare phylogenetic trees generated using our technique against trees generated using traditional alignment techniques for similarity and demonstrate that the ICD method produces a highly accurate tree without requiring an alignment prior to establishing sequence similarity. |
format | Online Article Text |
id | pubmed-4077688 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-40776882014-07-02 Application of discrete Fourier inter-coefficient difference for assessing genetic sequence similarity King, Brian R Aburdene, Maurice Thompson, Alex Warres, Zach EURASIP J Bioinform Syst Biol Research Digital signal processing (DSP) techniques for biological sequence analysis continue to grow in popularity due to the inherent digital nature of these sequences. DSP methods have demonstrated early success for detection of coding regions in a gene. Recently, these methods are being used to establish DNA gene similarity. We present the inter-coefficient difference (ICD) transformation, a novel extension of the discrete Fourier transformation, which can be applied to any DNA sequence. The ICD method is a mathematical, alignment-free DNA comparison method that generates a genetic signature for any DNA sequence that is used to generate relative measures of similarity among DNA sequences. We demonstrate our method on a set of insulin genes obtained from an evolutionarily wide range of species, and on a set of avian influenza viral sequences, which represents a set of highly similar sequences. We compare phylogenetic trees generated using our technique against trees generated using traditional alignment techniques for similarity and demonstrate that the ICD method produces a highly accurate tree without requiring an alignment prior to establishing sequence similarity. BioMed Central 2014 2014-05-28 /pmc/articles/PMC4077688/ /pubmed/24991213 http://dx.doi.org/10.1186/1687-4153-2014-8 Text en Copyright © 2014 King et al.; licensee Springer. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. |
spellingShingle | Research King, Brian R Aburdene, Maurice Thompson, Alex Warres, Zach Application of discrete Fourier inter-coefficient difference for assessing genetic sequence similarity |
title | Application of discrete Fourier inter-coefficient difference for assessing genetic sequence similarity |
title_full | Application of discrete Fourier inter-coefficient difference for assessing genetic sequence similarity |
title_fullStr | Application of discrete Fourier inter-coefficient difference for assessing genetic sequence similarity |
title_full_unstemmed | Application of discrete Fourier inter-coefficient difference for assessing genetic sequence similarity |
title_short | Application of discrete Fourier inter-coefficient difference for assessing genetic sequence similarity |
title_sort | application of discrete fourier inter-coefficient difference for assessing genetic sequence similarity |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4077688/ https://www.ncbi.nlm.nih.gov/pubmed/24991213 http://dx.doi.org/10.1186/1687-4153-2014-8 |
work_keys_str_mv | AT kingbrianr applicationofdiscretefourierintercoefficientdifferenceforassessinggeneticsequencesimilarity AT aburdenemaurice applicationofdiscretefourierintercoefficientdifferenceforassessinggeneticsequencesimilarity AT thompsonalex applicationofdiscretefourierintercoefficientdifferenceforassessinggeneticsequencesimilarity AT warreszach applicationofdiscretefourierintercoefficientdifferenceforassessinggeneticsequencesimilarity |