Cargando…

On DNA numerical representations for genomic similarity computation

Genomic signal processing (GSP) refers to the use of signal processing for the analysis of genomic data. GSP methods require the transformation or mapping of the genomic data to a numeric representation. To date, several DNA numeric representations (DNR) have been proposed; however, it is not clear...

Descripción completa

Detalles Bibliográficos
Autores principales: Mendizabal-Ruiz, Gerardo, Román-Godínez, Israel, Torres-Ramos, Sulema, Salido-Ruiz, Ricardo A., Morales, J. Alejandro
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5360225/
https://www.ncbi.nlm.nih.gov/pubmed/28323839
http://dx.doi.org/10.1371/journal.pone.0173288
Descripción
Sumario:Genomic signal processing (GSP) refers to the use of signal processing for the analysis of genomic data. GSP methods require the transformation or mapping of the genomic data to a numeric representation. To date, several DNA numeric representations (DNR) have been proposed; however, it is not clear what the properties of each DNR are and how the selection of one will affect the results when using a signal processing technique to analyze them. In this paper, we present an experimental study of the characteristics of nine of the most frequently-used DNR. The objective of this paper is to evaluate the behavior of each representation when used to measure the similarity of a given pair of DNA sequences.