Cargando…

Genomic signal processing for DNA sequence clustering

Genomic signal processing (GSP) methods which convert DNA data to numerical values have recently been proposed, which would offer the opportunity of employing existing digital signal processing methods for genomic data. One of the most used methods for exploring data is cluster analysis which refers...

Descripción completa

Detalles Bibliográficos
Autores principales: Mendizabal-Ruiz, Gerardo, Román-Godínez, Israel, Torres-Ramos, Sulema, Salido-Ruiz, Ricardo A., Vélez-Pérez, Hugo, Morales, J. Alejandro
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5786891/
https://www.ncbi.nlm.nih.gov/pubmed/29379686
http://dx.doi.org/10.7717/peerj.4264
_version_ 1783295841104035840
author Mendizabal-Ruiz, Gerardo
Román-Godínez, Israel
Torres-Ramos, Sulema
Salido-Ruiz, Ricardo A.
Vélez-Pérez, Hugo
Morales, J. Alejandro
author_facet Mendizabal-Ruiz, Gerardo
Román-Godínez, Israel
Torres-Ramos, Sulema
Salido-Ruiz, Ricardo A.
Vélez-Pérez, Hugo
Morales, J. Alejandro
author_sort Mendizabal-Ruiz, Gerardo
collection PubMed
description Genomic signal processing (GSP) methods which convert DNA data to numerical values have recently been proposed, which would offer the opportunity of employing existing digital signal processing methods for genomic data. One of the most used methods for exploring data is cluster analysis which refers to the unsupervised classification of patterns in data. In this paper, we propose a novel approach for performing cluster analysis of DNA sequences that is based on the use of GSP methods and the K-means algorithm. We also propose a visualization method that facilitates the easy inspection and analysis of the results and possible hidden behaviors. Our results support the feasibility of employing the proposed method to find and easily visualize interesting features of sets of DNA data.
format Online
Article
Text
id pubmed-5786891
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-57868912018-01-29 Genomic signal processing for DNA sequence clustering Mendizabal-Ruiz, Gerardo Román-Godínez, Israel Torres-Ramos, Sulema Salido-Ruiz, Ricardo A. Vélez-Pérez, Hugo Morales, J. Alejandro PeerJ Bioinformatics Genomic signal processing (GSP) methods which convert DNA data to numerical values have recently been proposed, which would offer the opportunity of employing existing digital signal processing methods for genomic data. One of the most used methods for exploring data is cluster analysis which refers to the unsupervised classification of patterns in data. In this paper, we propose a novel approach for performing cluster analysis of DNA sequences that is based on the use of GSP methods and the K-means algorithm. We also propose a visualization method that facilitates the easy inspection and analysis of the results and possible hidden behaviors. Our results support the feasibility of employing the proposed method to find and easily visualize interesting features of sets of DNA data. PeerJ Inc. 2018-01-24 /pmc/articles/PMC5786891/ /pubmed/29379686 http://dx.doi.org/10.7717/peerj.4264 Text en ©2018 Mendizabal-Ruiz et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
spellingShingle Bioinformatics
Mendizabal-Ruiz, Gerardo
Román-Godínez, Israel
Torres-Ramos, Sulema
Salido-Ruiz, Ricardo A.
Vélez-Pérez, Hugo
Morales, J. Alejandro
Genomic signal processing for DNA sequence clustering
title Genomic signal processing for DNA sequence clustering
title_full Genomic signal processing for DNA sequence clustering
title_fullStr Genomic signal processing for DNA sequence clustering
title_full_unstemmed Genomic signal processing for DNA sequence clustering
title_short Genomic signal processing for DNA sequence clustering
title_sort genomic signal processing for dna sequence clustering
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5786891/
https://www.ncbi.nlm.nih.gov/pubmed/29379686
http://dx.doi.org/10.7717/peerj.4264
work_keys_str_mv AT mendizabalruizgerardo genomicsignalprocessingfordnasequenceclustering
AT romangodinezisrael genomicsignalprocessingfordnasequenceclustering
AT torresramossulema genomicsignalprocessingfordnasequenceclustering
AT salidoruizricardoa genomicsignalprocessingfordnasequenceclustering
AT velezperezhugo genomicsignalprocessingfordnasequenceclustering
AT moralesjalejandro genomicsignalprocessingfordnasequenceclustering