Genomic signal processing for DNA sequence clustering
Genomic signal processing (GSP) methods which convert DNA data to numerical values have recently been proposed, which would offer the opportunity of employing existing digital signal processing methods for genomic data. One of the most used methods for exploring data is cluster analysis which refers...
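Main authors: | Mendizabal-Ruiz, Gerardo; Román-Godínez, Israel; Torres-Ramos, Sulema; Salido-Ruiz, Ricardo A.; Vélez-Pérez, Hugo; Morales, J. Alejandro |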
---|---|
Format: | Online Article Text |
Language: | English |
Published: | PeerJ Inc. 2018 |
Subjects: | Bioinformatics |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5786891/ https://www.ncbi.nlm.nih.gov/pubmed/29379686 http://dx.doi.org/10.7717/peerj.4264 |
_version_ | 1783295841104035840 |
---|---|
author | Mendizabal-Ruiz, Gerardo Román-Godínez, Israel Torres-Ramos, Sulema Salido-Ruiz, Ricardo A. Vélez-Pérez, Hugo Morales, J. Alejandro |
author_facet | Mendizabal-Ruiz, Gerardo Román-Godínez, Israel Torres-Ramos, Sulema Salido-Ruiz, Ricardo A. Vélez-Pérez, Hugo Morales, J. Alejandro |
author_sort | Mendizabal-Ruiz, Gerardo |
collection | PubMed |
description | Genomic signal processing (GSP) methods which convert DNA data to numerical values have recently been proposed, which would offer the opportunity of employing existing digital signal processing methods for genomic data. One of the most used methods for exploring data is cluster analysis which refers to the unsupervised classification of patterns in data. In this paper, we propose a novel approach for performing cluster analysis of DNA sequences that is based on the use of GSP methods and the K-means algorithm. We also propose a visualization method that facilitates the easy inspection and analysis of the results and possible hidden behaviors. Our results support the feasibility of employing the proposed method to find and easily visualize interesting features of sets of DNA data. |
format | Online Article Text |
id | pubmed-5786891 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | PeerJ Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-57868912018-01-29 Genomic signal processing for DNA sequence clustering Mendizabal-Ruiz, Gerardo Román-Godínez, Israel Torres-Ramos, Sulema Salido-Ruiz, Ricardo A. Vélez-Pérez, Hugo Morales, J. Alejandro PeerJ Bioinformatics Genomic signal processing (GSP) methods which convert DNA data to numerical values have recently been proposed, which would offer the opportunity of employing existing digital signal processing methods for genomic data. One of the most used methods for exploring data is cluster analysis which refers to the unsupervised classification of patterns in data. In this paper, we propose a novel approach for performing cluster analysis of DNA sequences that is based on the use of GSP methods and the K-means algorithm. We also propose a visualization method that facilitates the easy inspection and analysis of the results and possible hidden behaviors. Our results support the feasibility of employing the proposed method to find and easily visualize interesting features of sets of DNA data. PeerJ Inc. 2018-01-24 /pmc/articles/PMC5786891/ /pubmed/29379686 http://dx.doi.org/10.7717/peerj.4264 Text en ©2018 Mendizabal-Ruiz et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited. |
spellingShingle | Bioinformatics Mendizabal-Ruiz, Gerardo Román-Godínez, Israel Torres-Ramos, Sulema Salido-Ruiz, Ricardo A. Vélez-Pérez, Hugo Morales, J. Alejandro Genomic signal processing for DNA sequence clustering |
title | Genomic signal processing for DNA sequence clustering |
title_full | Genomic signal processing for DNA sequence clustering |
title_fullStr | Genomic signal processing for DNA sequence clustering |
title_full_unstemmed | Genomic signal processing for DNA sequence clustering |
title_short | Genomic signal processing for DNA sequence clustering |
title_sort | genomic signal processing for dna sequence clustering |
topic | Bioinformatics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5786891/ https://www.ncbi.nlm.nih.gov/pubmed/29379686 http://dx.doi.org/10.7717/peerj.4264 |
work_keys_str_mv | AT mendizabalruizgerardo genomicsignalprocessingfordnasequenceclustering AT romangodinezisrael genomicsignalprocessingfordnasequenceclustering AT torresramossulema genomicsignalprocessingfordnasequenceclustering AT salidoruizricardoa genomicsignalprocessingfordnasequenceclustering AT velezperezhugo genomicsignalprocessingfordnasequenceclustering AT moralesjalejandro genomicsignalprocessingfordnasequenceclustering |