Genomic signal processing for DNA sequence clustering

Bibliographic Details
Main authors: Mendizabal-Ruiz, Gerardo; Román-Godínez, Israel; Torres-Ramos, Sulema; Salido-Ruiz, Ricardo A.; Vélez-Pérez, Hugo; Morales, J. Alejandro
Format: Online Article Text
Language: English
Published: PeerJ Inc. 2018
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5786891/
https://www.ncbi.nlm.nih.gov/pubmed/29379686
http://dx.doi.org/10.7717/peerj.4264
Description
Summary: Genomic signal processing (GSP) methods, which convert DNA data to numerical values, have recently been proposed, offering the opportunity to apply existing digital signal processing methods to genomic data. One of the most widely used methods for exploring data is cluster analysis, which refers to the unsupervised classification of patterns in data. In this paper, we propose a novel approach to cluster analysis of DNA sequences based on GSP methods and the K-means algorithm. We also propose a visualization method that facilitates inspection and analysis of the results and of possible hidden behaviors. Our results support the feasibility of using the proposed method to find and easily visualize interesting features of sets of DNA data.
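
The general idea in the summary (map DNA to numbers, extract signal features, cluster with K-means) can be illustrated with a minimal sketch. This record does not say which numerical mapping or features the authors use, so everything specific here is an assumption: a Voss-style binary indicator mapping, a summed power spectrum sampled at a few normalized frequencies, and a small hand-written K-means with farthest-point initialization. The function names (indicator_signals, power_spectrum, features, kmeans) and the toy sequences are hypothetical and are not taken from the paper.

```python
import cmath

NUCLEOTIDES = "ACGT"


def indicator_signals(seq):
    """Assumed mapping: one binary indicator signal per nucleotide (Voss-style)."""
    seq = seq.upper()
    return {n: [1.0 if base == n else 0.0 for base in seq] for n in NUCLEOTIDES}


def power_spectrum(signal, n_bins):
    """Power at n_bins normalized frequencies evenly spaced in (0, 0.5]."""
    n = len(signal)
    powers = []
    for k in range(1, n_bins + 1):
        freq = 0.5 * k / n_bins  # cycles per base; the DC term is skipped
        coeff = sum(x * cmath.exp(-2j * cmath.pi * freq * t)
                    for t, x in enumerate(signal))
        powers.append(abs(coeff) ** 2 / n)
    return powers


def features(seq, n_bins=6):
    """Feature vector: power spectra of the four indicator signals, summed."""
    total = [0.0] * n_bins
    for signal in indicator_signals(seq).values():
        for i, p in enumerate(power_spectrum(signal, n_bins)):
            total[i] += p
    return total


def squared_distance(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(points, k, n_iter=50):
    """Plain K-means; deterministic farthest-point initialization (an assumption)."""
    centroids = [list(points[0])]
    while len(centroids) < k:
        farthest = max(points, key=lambda p: min(squared_distance(p, c)
                                                 for c in centroids))
        centroids.append(list(farthest))
    labels = None
    for _ in range(n_iter):
        # Assignment step: each point goes to its nearest centroid.
        new_labels = [min(range(k), key=lambda c: squared_distance(p, centroids[c]))
                      for p in points]
        # Update step: each centroid becomes the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, new_labels) if lab == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
        if new_labels == labels:
            break
        labels = new_labels
    return labels, centroids


# Toy data: three sequences with period-2 structure, three with period-3 structure.
sequences = ["AC" * 12, "CA" * 12, "GT" * 12, "ATG" * 8, "TGA" * 8, "CCG" * 8]
labels, _ = kmeans([features(s) for s in sequences], k=2)
print(labels)
```

Because the features are sampled at normalized frequencies, sequences of different lengths produce vectors of the same size. For real data, a tested library implementation of K-means and an FFT-based spectrum would normally replace these hand-written loops.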