Cargando…

iDNA-MS: An Integrated Computational Tool for Detecting DNA Modification Sites in Multiple Genomes

5hmC, 6mA, and 4mC are three common DNA modifications and are involved in various of biological processes. Accurate genome-wide identification of these sites is invaluable for better understanding their biological functions. Owing to the labor-intensive and expensive nature of experimental methods,...

Descripción completa

Detalles Bibliográficos
Autores principales: Lv, Hao, Dao, Fu-Ying, Zhang, Dan, Guan, Zheng-Xing, Yang, Hui, Su, Wei, Liu, Meng-Lu, Ding, Hui, Chen, Wei, Lin, Hao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7115099/
https://www.ncbi.nlm.nih.gov/pubmed/32240948
http://dx.doi.org/10.1016/j.isci.2020.100991
Descripción
Sumario:5hmC, 6mA, and 4mC are three common DNA modifications and are involved in various of biological processes. Accurate genome-wide identification of these sites is invaluable for better understanding their biological functions. Owing to the labor-intensive and expensive nature of experimental methods, it is urgent to develop computational methods for the genome-wide detection of these sites. Keeping this in mind, the current study was devoted to construct a computational method to identify 5hmC, 6mA, and 4mC. We initially used K-tuple nucleotide component, nucleotide chemical property and nucleotide frequency, and mono-nucleotide binary encoding scheme to formulate samples. Subsequently, random forest was utilized to identify 5hmC, 6mA, and 4mC sites. Cross-validated results showed that the proposed method could produce the excellent generalization ability in the identification of the three modification sites. Based on the proposed model, a web-server called iDNA-MS was established and is freely accessible at http://lin-group.cn/server/iDNA-MS.