Cargando…
iDNA-MT: Identification DNA Modification Sites in Multiple Species by Using Multi-Task Learning Based a Neural Network Tool
MOTIVATION: DNA N4-methylcytosine (4mC) and N6-methyladenine (6mA) are two important DNA modifications and play crucial roles in a variety of biological processes. Accurate identification of the modifications is essential to better understand their biological functions and mechanisms. However, exist...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8044371/ https://www.ncbi.nlm.nih.gov/pubmed/33868390 http://dx.doi.org/10.3389/fgene.2021.663572 |
_version_ | 1783678469552472064 |
---|---|
author | Yang, Xiao Ye, Xiucai Li, Xuehong Wei, Lesong |
author_facet | Yang, Xiao Ye, Xiucai Li, Xuehong Wei, Lesong |
author_sort | Yang, Xiao |
collection | PubMed |
description | MOTIVATION: DNA N4-methylcytosine (4mC) and N6-methyladenine (6mA) are two important DNA modifications and play crucial roles in a variety of biological processes. Accurate identification of the modifications is essential to better understand their biological functions and mechanisms. However, existing methods to identify 4mA or 6mC sites are all single tasks, which demonstrates that they can identify only a certain modification in one species. Therefore, it is desirable to develop a novel computational method to identify the modification sites in multiple species simultaneously. RESULTS: In this study, we proposed a computational method, called iDNA-MT, to identify 4mC sites and 6mA sites in multiple species, respectively. The proposed iDNA-MT mainly employed multi-task learning coupled with the bidirectional gated recurrent units (BGRU) to capture the sharing information among different species directly from DNA primary sequences. Experimental comparative results on two benchmark datasets, containing different species respectively, show that either for identifying 4mA or for 6mC site in multiple species, the proposed iDNA-MT outperforms other state-of-the-art single-task methods. The promising results have demonstrated that iDNA-MT has great potential to be a powerful and practically useful tool to accurately identify DNA modifications. |
format | Online Article Text |
id | pubmed-8044371 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-80443712021-04-15 iDNA-MT: Identification DNA Modification Sites in Multiple Species by Using Multi-Task Learning Based a Neural Network Tool Yang, Xiao Ye, Xiucai Li, Xuehong Wei, Lesong Front Genet Genetics MOTIVATION: DNA N4-methylcytosine (4mC) and N6-methyladenine (6mA) are two important DNA modifications and play crucial roles in a variety of biological processes. Accurate identification of the modifications is essential to better understand their biological functions and mechanisms. However, existing methods to identify 4mA or 6mC sites are all single tasks, which demonstrates that they can identify only a certain modification in one species. Therefore, it is desirable to develop a novel computational method to identify the modification sites in multiple species simultaneously. RESULTS: In this study, we proposed a computational method, called iDNA-MT, to identify 4mC sites and 6mA sites in multiple species, respectively. The proposed iDNA-MT mainly employed multi-task learning coupled with the bidirectional gated recurrent units (BGRU) to capture the sharing information among different species directly from DNA primary sequences. Experimental comparative results on two benchmark datasets, containing different species respectively, show that either for identifying 4mA or for 6mC site in multiple species, the proposed iDNA-MT outperforms other state-of-the-art single-task methods. The promising results have demonstrated that iDNA-MT has great potential to be a powerful and practically useful tool to accurately identify DNA modifications. Frontiers Media S.A. 2021-03-31 /pmc/articles/PMC8044371/ /pubmed/33868390 http://dx.doi.org/10.3389/fgene.2021.663572 Text en Copyright © 2021 Yang, Ye, Li and Wei. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Yang, Xiao Ye, Xiucai Li, Xuehong Wei, Lesong iDNA-MT: Identification DNA Modification Sites in Multiple Species by Using Multi-Task Learning Based a Neural Network Tool |
title | iDNA-MT: Identification DNA Modification Sites in Multiple Species by Using Multi-Task Learning Based a Neural Network Tool |
title_full | iDNA-MT: Identification DNA Modification Sites in Multiple Species by Using Multi-Task Learning Based a Neural Network Tool |
title_fullStr | iDNA-MT: Identification DNA Modification Sites in Multiple Species by Using Multi-Task Learning Based a Neural Network Tool |
title_full_unstemmed | iDNA-MT: Identification DNA Modification Sites in Multiple Species by Using Multi-Task Learning Based a Neural Network Tool |
title_short | iDNA-MT: Identification DNA Modification Sites in Multiple Species by Using Multi-Task Learning Based a Neural Network Tool |
title_sort | idna-mt: identification dna modification sites in multiple species by using multi-task learning based a neural network tool |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8044371/ https://www.ncbi.nlm.nih.gov/pubmed/33868390 http://dx.doi.org/10.3389/fgene.2021.663572 |
work_keys_str_mv | AT yangxiao idnamtidentificationdnamodificationsitesinmultiplespeciesbyusingmultitasklearningbasedaneuralnetworktool AT yexiucai idnamtidentificationdnamodificationsitesinmultiplespeciesbyusingmultitasklearningbasedaneuralnetworktool AT lixuehong idnamtidentificationdnamodificationsitesinmultiplespeciesbyusingmultitasklearningbasedaneuralnetworktool AT weilesong idnamtidentificationdnamodificationsitesinmultiplespeciesbyusingmultitasklearningbasedaneuralnetworktool |