Cargando…

iDNA6mA-Rice: A Computational Tool for Detecting N6-Methyladenine Sites in Rice

DNA N6-methyladenine (6mA) is a dominant DNA modification form and involved in many biological functions. The accurate genome-wide identification of 6mA sites may increase understanding of its biological functions. Experimental methods for 6mA detection in eukaryotes genome are laborious and expensi...

Descripción completa

Detalles Bibliográficos
Autores principales: Lv, Hao, Dao, Fu-Ying, Guan, Zheng-Xing, Zhang, Dan, Tan, Jiu-Xin, Zhang, Yong, Chen, Wei, Lin, Hao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6746913/
https://www.ncbi.nlm.nih.gov/pubmed/31552096
http://dx.doi.org/10.3389/fgene.2019.00793
_version_ 1783451779985309696
author Lv, Hao
Dao, Fu-Ying
Guan, Zheng-Xing
Zhang, Dan
Tan, Jiu-Xin
Zhang, Yong
Chen, Wei
Lin, Hao
author_facet Lv, Hao
Dao, Fu-Ying
Guan, Zheng-Xing
Zhang, Dan
Tan, Jiu-Xin
Zhang, Yong
Chen, Wei
Lin, Hao
author_sort Lv, Hao
collection PubMed
description DNA N6-methyladenine (6mA) is a dominant DNA modification form and involved in many biological functions. The accurate genome-wide identification of 6mA sites may increase understanding of its biological functions. Experimental methods for 6mA detection in eukaryotes genome are laborious and expensive. Therefore, it is necessary to develop computational methods to identify 6mA sites on a genomic scale, especially for plant genomes. Based on this consideration, the study aims to develop a machine learning-based method of predicting 6mA sites in the rice genome. We initially used mono-nucleotide binary encoding to formulate positive and negative samples. Subsequently, the machine learning algorithm named Random Forest was utilized to perform the classification for identifying 6mA sites. Our proposed method could produce an area under the receiver operating characteristic curve of 0.964 with an overall accuracy of 0.917, as indicated by the fivefold cross-validation test. Furthermore, an independent dataset was established to assess the generalization ability of our method. Finally, an area under the receiver operating characteristic curve of 0.981 was obtained, suggesting that the proposed method had good performance of predicting 6mA sites in the rice genome. For the convenience of retrieving 6mA sites, on the basis of the computational method, we built a freely accessible web server named iDNA6mA-Rice at http://lin-group.cn/server/iDNA6mA-Rice.
format Online
Article
Text
id pubmed-6746913
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-67469132019-09-24 iDNA6mA-Rice: A Computational Tool for Detecting N6-Methyladenine Sites in Rice Lv, Hao Dao, Fu-Ying Guan, Zheng-Xing Zhang, Dan Tan, Jiu-Xin Zhang, Yong Chen, Wei Lin, Hao Front Genet Genetics DNA N6-methyladenine (6mA) is a dominant DNA modification form and involved in many biological functions. The accurate genome-wide identification of 6mA sites may increase understanding of its biological functions. Experimental methods for 6mA detection in eukaryotes genome are laborious and expensive. Therefore, it is necessary to develop computational methods to identify 6mA sites on a genomic scale, especially for plant genomes. Based on this consideration, the study aims to develop a machine learning-based method of predicting 6mA sites in the rice genome. We initially used mono-nucleotide binary encoding to formulate positive and negative samples. Subsequently, the machine learning algorithm named Random Forest was utilized to perform the classification for identifying 6mA sites. Our proposed method could produce an area under the receiver operating characteristic curve of 0.964 with an overall accuracy of 0.917, as indicated by the fivefold cross-validation test. Furthermore, an independent dataset was established to assess the generalization ability of our method. Finally, an area under the receiver operating characteristic curve of 0.981 was obtained, suggesting that the proposed method had good performance of predicting 6mA sites in the rice genome. For the convenience of retrieving 6mA sites, on the basis of the computational method, we built a freely accessible web server named iDNA6mA-Rice at http://lin-group.cn/server/iDNA6mA-Rice. Frontiers Media S.A. 2019-09-10 /pmc/articles/PMC6746913/ /pubmed/31552096 http://dx.doi.org/10.3389/fgene.2019.00793 Text en Copyright © 2019 Lv, Dao, Guan, Zhang, Tan, Zhang, Chen and Lin http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Lv, Hao
Dao, Fu-Ying
Guan, Zheng-Xing
Zhang, Dan
Tan, Jiu-Xin
Zhang, Yong
Chen, Wei
Lin, Hao
iDNA6mA-Rice: A Computational Tool for Detecting N6-Methyladenine Sites in Rice
title iDNA6mA-Rice: A Computational Tool for Detecting N6-Methyladenine Sites in Rice
title_full iDNA6mA-Rice: A Computational Tool for Detecting N6-Methyladenine Sites in Rice
title_fullStr iDNA6mA-Rice: A Computational Tool for Detecting N6-Methyladenine Sites in Rice
title_full_unstemmed iDNA6mA-Rice: A Computational Tool for Detecting N6-Methyladenine Sites in Rice
title_short iDNA6mA-Rice: A Computational Tool for Detecting N6-Methyladenine Sites in Rice
title_sort idna6ma-rice: a computational tool for detecting n6-methyladenine sites in rice
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6746913/
https://www.ncbi.nlm.nih.gov/pubmed/31552096
http://dx.doi.org/10.3389/fgene.2019.00793
work_keys_str_mv AT lvhao idna6mariceacomputationaltoolfordetectingn6methyladeninesitesinrice
AT daofuying idna6mariceacomputationaltoolfordetectingn6methyladeninesitesinrice
AT guanzhengxing idna6mariceacomputationaltoolfordetectingn6methyladeninesitesinrice
AT zhangdan idna6mariceacomputationaltoolfordetectingn6methyladeninesitesinrice
AT tanjiuxin idna6mariceacomputationaltoolfordetectingn6methyladeninesitesinrice
AT zhangyong idna6mariceacomputationaltoolfordetectingn6methyladeninesitesinrice
AT chenwei idna6mariceacomputationaltoolfordetectingn6methyladeninesitesinrice
AT linhao idna6mariceacomputationaltoolfordetectingn6methyladeninesitesinrice