Cargando…

Segmentation of DNA using simple recurrent neural network

We report the discovery of strong correlations between protein coding regions and the prediction errors when using the simple recurrent network to segment genome sequences. We are going to use SARS genome to demonstrate how we conduct training and derive corresponding results. The distribution of pr...

Descripción completa

Detalles Bibliográficos
Autores principales: Cheng, Wei-Chen, Huang, Jau-Chi, Liou, Cheng-Yuan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier B.V. 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7126336/
https://www.ncbi.nlm.nih.gov/pubmed/32288315
http://dx.doi.org/10.1016/j.knosys.2011.09.001
Descripción
Sumario:We report the discovery of strong correlations between protein coding regions and the prediction errors when using the simple recurrent network to segment genome sequences. We are going to use SARS genome to demonstrate how we conduct training and derive corresponding results. The distribution of prediction error indicates how the underlying hidden regularity of the genome sequences and the results are consistent with the finding of biologists: predicated protein coding features of SARS genome. This implies that the simple recurrent network is capable of providing new features for further biological studies when applied on genome studies. The HA gene of influenza A subtype H1N1 is also analyzed in a similar way.