
SWSPM: A Novel Alignment-Free DNA Comparison Method Based on Signal Processing Approaches

Computing similarity between 2 nucleotide sequences is one of the fundamental problems in bioinformatics. Current methods are based mainly on 2 major approaches: (1) sequence alignment, which is computationally expensive, and (2) faster, but less accurate, alignment-free methods based on various sta...


Bibliographic Details
Main authors: Farkaš, Tomáš, Sitarčík, Jozef, Brejová, Broňa, Lucká, Mária
Format: Online Article Text
Language: English
Published: SAGE Publications 2019
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6545658/
https://www.ncbi.nlm.nih.gov/pubmed/31210725
http://dx.doi.org/10.1177/1176934319849071
_version_ 1783423419718565888
author Farkaš, Tomáš
Sitarčík, Jozef
Brejová, Broňa
Lucká, Mária
author_facet Farkaš, Tomáš
Sitarčík, Jozef
Brejová, Broňa
Lucká, Mária
author_sort Farkaš, Tomáš
collection PubMed
description Computing similarity between 2 nucleotide sequences is one of the fundamental problems in bioinformatics. Current methods are based mainly on 2 major approaches: (1) sequence alignment, which is computationally expensive, and (2) faster, but less accurate, alignment-free methods based on various statistical summaries, for example, short word counts. We propose a new distance measure based on mathematical transforms from the domain of signal processing. To tolerate large-scale rearrangements in the sequences, the transform is computed across sliding windows. We compare our method on several data sets with current state-of-art alignment-free methods. Our method compares favorably in terms of accuracy and outperforms other methods in running time and memory requirements. In addition, it is massively scalable up to dozens of processing units without the loss of performance due to communication overhead. Source files and sample data are available at https://bitbucket.org/fiitstubioinfo/swspm/src
format Online
Article
Text
id pubmed-6545658
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher SAGE Publications
record_format MEDLINE/PubMed
spelling pubmed-65456582019-06-17 SWSPM: A Novel Alignment-Free DNA Comparison Method Based on Signal Processing Approaches Farkaš, Tomáš Sitarčík, Jozef Brejová, Broňa Lucká, Mária Evol Bioinform Online Original Research Computing similarity between 2 nucleotide sequences is one of the fundamental problems in bioinformatics. Current methods are based mainly on 2 major approaches: (1) sequence alignment, which is computationally expensive, and (2) faster, but less accurate, alignment-free methods based on various statistical summaries, for example, short word counts. We propose a new distance measure based on mathematical transforms from the domain of signal processing. To tolerate large-scale rearrangements in the sequences, the transform is computed across sliding windows. We compare our method on several data sets with current state-of-art alignment-free methods. Our method compares favorably in terms of accuracy and outperforms other methods in running time and memory requirements. In addition, it is massively scalable up to dozens of processing units without the loss of performance due to communication overhead. Source files and sample data are available at https://bitbucket.org/fiitstubioinfo/swspm/src SAGE Publications 2019-05-30 /pmc/articles/PMC6545658/ /pubmed/31210725 http://dx.doi.org/10.1177/1176934319849071 Text en © The Author(s) 2019 http://www.creativecommons.org/licenses/by-nc/4.0/ This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 License (http://www.creativecommons.org/licenses/by-nc/4.0/) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access pages (https://us.sagepub.com/en-us/nam/open-access-at-sage).
spellingShingle Original Research
Farkaš, Tomáš
Sitarčík, Jozef
Brejová, Broňa
Lucká, Mária
SWSPM: A Novel Alignment-Free DNA Comparison Method Based on Signal Processing Approaches
title SWSPM: A Novel Alignment-Free DNA Comparison Method Based on Signal Processing Approaches
title_full SWSPM: A Novel Alignment-Free DNA Comparison Method Based on Signal Processing Approaches
title_fullStr SWSPM: A Novel Alignment-Free DNA Comparison Method Based on Signal Processing Approaches
title_full_unstemmed SWSPM: A Novel Alignment-Free DNA Comparison Method Based on Signal Processing Approaches
title_short SWSPM: A Novel Alignment-Free DNA Comparison Method Based on Signal Processing Approaches
title_sort swspm: a novel alignment-free dna comparison method based on signal processing approaches
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6545658/
https://www.ncbi.nlm.nih.gov/pubmed/31210725
http://dx.doi.org/10.1177/1176934319849071
work_keys_str_mv AT farkastomas swspmanovelalignmentfreednacomparisonmethodbasedonsignalprocessingapproaches
AT sitarcikjozef swspmanovelalignmentfreednacomparisonmethodbasedonsignalprocessingapproaches
AT brejovabrona swspmanovelalignmentfreednacomparisonmethodbasedonsignalprocessingapproaches
AT luckamaria swspmanovelalignmentfreednacomparisonmethodbasedonsignalprocessingapproaches