Cargando…
iDNA6mA-Rice: A Computational Tool for Detecting N6-Methyladenine Sites in Rice
DNA N6-methyladenine (6mA) is a dominant DNA modification form and involved in many biological functions. The accurate genome-wide identification of 6mA sites may increase understanding of its biological functions. Experimental methods for 6mA detection in eukaryotes genome are laborious and expensi...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6746913/ https://www.ncbi.nlm.nih.gov/pubmed/31552096 http://dx.doi.org/10.3389/fgene.2019.00793 |
_version_ | 1783451779985309696 |
---|---|
author | Lv, Hao Dao, Fu-Ying Guan, Zheng-Xing Zhang, Dan Tan, Jiu-Xin Zhang, Yong Chen, Wei Lin, Hao |
author_facet | Lv, Hao Dao, Fu-Ying Guan, Zheng-Xing Zhang, Dan Tan, Jiu-Xin Zhang, Yong Chen, Wei Lin, Hao |
author_sort | Lv, Hao |
collection | PubMed |
description | DNA N6-methyladenine (6mA) is a dominant DNA modification form and involved in many biological functions. The accurate genome-wide identification of 6mA sites may increase understanding of its biological functions. Experimental methods for 6mA detection in eukaryotes genome are laborious and expensive. Therefore, it is necessary to develop computational methods to identify 6mA sites on a genomic scale, especially for plant genomes. Based on this consideration, the study aims to develop a machine learning-based method of predicting 6mA sites in the rice genome. We initially used mono-nucleotide binary encoding to formulate positive and negative samples. Subsequently, the machine learning algorithm named Random Forest was utilized to perform the classification for identifying 6mA sites. Our proposed method could produce an area under the receiver operating characteristic curve of 0.964 with an overall accuracy of 0.917, as indicated by the fivefold cross-validation test. Furthermore, an independent dataset was established to assess the generalization ability of our method. Finally, an area under the receiver operating characteristic curve of 0.981 was obtained, suggesting that the proposed method had good performance of predicting 6mA sites in the rice genome. For the convenience of retrieving 6mA sites, on the basis of the computational method, we built a freely accessible web server named iDNA6mA-Rice at http://lin-group.cn/server/iDNA6mA-Rice. |
format | Online Article Text |
id | pubmed-6746913 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-67469132019-09-24 iDNA6mA-Rice: A Computational Tool for Detecting N6-Methyladenine Sites in Rice Lv, Hao Dao, Fu-Ying Guan, Zheng-Xing Zhang, Dan Tan, Jiu-Xin Zhang, Yong Chen, Wei Lin, Hao Front Genet Genetics DNA N6-methyladenine (6mA) is a dominant DNA modification form and involved in many biological functions. The accurate genome-wide identification of 6mA sites may increase understanding of its biological functions. Experimental methods for 6mA detection in eukaryotes genome are laborious and expensive. Therefore, it is necessary to develop computational methods to identify 6mA sites on a genomic scale, especially for plant genomes. Based on this consideration, the study aims to develop a machine learning-based method of predicting 6mA sites in the rice genome. We initially used mono-nucleotide binary encoding to formulate positive and negative samples. Subsequently, the machine learning algorithm named Random Forest was utilized to perform the classification for identifying 6mA sites. Our proposed method could produce an area under the receiver operating characteristic curve of 0.964 with an overall accuracy of 0.917, as indicated by the fivefold cross-validation test. Furthermore, an independent dataset was established to assess the generalization ability of our method. Finally, an area under the receiver operating characteristic curve of 0.981 was obtained, suggesting that the proposed method had good performance of predicting 6mA sites in the rice genome. For the convenience of retrieving 6mA sites, on the basis of the computational method, we built a freely accessible web server named iDNA6mA-Rice at http://lin-group.cn/server/iDNA6mA-Rice. Frontiers Media S.A. 2019-09-10 /pmc/articles/PMC6746913/ /pubmed/31552096 http://dx.doi.org/10.3389/fgene.2019.00793 Text en Copyright © 2019 Lv, Dao, Guan, Zhang, Tan, Zhang, Chen and Lin http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Lv, Hao Dao, Fu-Ying Guan, Zheng-Xing Zhang, Dan Tan, Jiu-Xin Zhang, Yong Chen, Wei Lin, Hao iDNA6mA-Rice: A Computational Tool for Detecting N6-Methyladenine Sites in Rice |
title | iDNA6mA-Rice: A Computational Tool for Detecting N6-Methyladenine Sites in Rice |
title_full | iDNA6mA-Rice: A Computational Tool for Detecting N6-Methyladenine Sites in Rice |
title_fullStr | iDNA6mA-Rice: A Computational Tool for Detecting N6-Methyladenine Sites in Rice |
title_full_unstemmed | iDNA6mA-Rice: A Computational Tool for Detecting N6-Methyladenine Sites in Rice |
title_short | iDNA6mA-Rice: A Computational Tool for Detecting N6-Methyladenine Sites in Rice |
title_sort | idna6ma-rice: a computational tool for detecting n6-methyladenine sites in rice |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6746913/ https://www.ncbi.nlm.nih.gov/pubmed/31552096 http://dx.doi.org/10.3389/fgene.2019.00793 |
work_keys_str_mv | AT lvhao idna6mariceacomputationaltoolfordetectingn6methyladeninesitesinrice AT daofuying idna6mariceacomputationaltoolfordetectingn6methyladeninesitesinrice AT guanzhengxing idna6mariceacomputationaltoolfordetectingn6methyladeninesitesinrice AT zhangdan idna6mariceacomputationaltoolfordetectingn6methyladeninesitesinrice AT tanjiuxin idna6mariceacomputationaltoolfordetectingn6methyladeninesitesinrice AT zhangyong idna6mariceacomputationaltoolfordetectingn6methyladeninesitesinrice AT chenwei idna6mariceacomputationaltoolfordetectingn6methyladeninesitesinrice AT linhao idna6mariceacomputationaltoolfordetectingn6methyladeninesitesinrice |