Cargando…

iDNA-MT: Identification DNA Modification Sites in Multiple Species by Using Multi-Task Learning Based a Neural Network Tool

MOTIVATION: DNA N4-methylcytosine (4mC) and N6-methyladenine (6mA) are two important DNA modifications and play crucial roles in a variety of biological processes. Accurate identification of the modifications is essential to better understand their biological functions and mechanisms. However, exist...

Descripción completa

Detalles Bibliográficos
Autores principales: Yang, Xiao, Ye, Xiucai, Li, Xuehong, Wei, Lesong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8044371/
https://www.ncbi.nlm.nih.gov/pubmed/33868390
http://dx.doi.org/10.3389/fgene.2021.663572
_version_ 1783678469552472064
author Yang, Xiao
Ye, Xiucai
Li, Xuehong
Wei, Lesong
author_facet Yang, Xiao
Ye, Xiucai
Li, Xuehong
Wei, Lesong
author_sort Yang, Xiao
collection PubMed
description MOTIVATION: DNA N4-methylcytosine (4mC) and N6-methyladenine (6mA) are two important DNA modifications and play crucial roles in a variety of biological processes. Accurate identification of the modifications is essential to better understand their biological functions and mechanisms. However, existing methods to identify 4mA or 6mC sites are all single tasks, which demonstrates that they can identify only a certain modification in one species. Therefore, it is desirable to develop a novel computational method to identify the modification sites in multiple species simultaneously. RESULTS: In this study, we proposed a computational method, called iDNA-MT, to identify 4mC sites and 6mA sites in multiple species, respectively. The proposed iDNA-MT mainly employed multi-task learning coupled with the bidirectional gated recurrent units (BGRU) to capture the sharing information among different species directly from DNA primary sequences. Experimental comparative results on two benchmark datasets, containing different species respectively, show that either for identifying 4mA or for 6mC site in multiple species, the proposed iDNA-MT outperforms other state-of-the-art single-task methods. The promising results have demonstrated that iDNA-MT has great potential to be a powerful and practically useful tool to accurately identify DNA modifications.
format Online
Article
Text
id pubmed-8044371
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-80443712021-04-15 iDNA-MT: Identification DNA Modification Sites in Multiple Species by Using Multi-Task Learning Based a Neural Network Tool Yang, Xiao Ye, Xiucai Li, Xuehong Wei, Lesong Front Genet Genetics MOTIVATION: DNA N4-methylcytosine (4mC) and N6-methyladenine (6mA) are two important DNA modifications and play crucial roles in a variety of biological processes. Accurate identification of the modifications is essential to better understand their biological functions and mechanisms. However, existing methods to identify 4mA or 6mC sites are all single tasks, which demonstrates that they can identify only a certain modification in one species. Therefore, it is desirable to develop a novel computational method to identify the modification sites in multiple species simultaneously. RESULTS: In this study, we proposed a computational method, called iDNA-MT, to identify 4mC sites and 6mA sites in multiple species, respectively. The proposed iDNA-MT mainly employed multi-task learning coupled with the bidirectional gated recurrent units (BGRU) to capture the sharing information among different species directly from DNA primary sequences. Experimental comparative results on two benchmark datasets, containing different species respectively, show that either for identifying 4mA or for 6mC site in multiple species, the proposed iDNA-MT outperforms other state-of-the-art single-task methods. The promising results have demonstrated that iDNA-MT has great potential to be a powerful and practically useful tool to accurately identify DNA modifications. Frontiers Media S.A. 2021-03-31 /pmc/articles/PMC8044371/ /pubmed/33868390 http://dx.doi.org/10.3389/fgene.2021.663572 Text en Copyright © 2021 Yang, Ye, Li and Wei. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Yang, Xiao
Ye, Xiucai
Li, Xuehong
Wei, Lesong
iDNA-MT: Identification DNA Modification Sites in Multiple Species by Using Multi-Task Learning Based a Neural Network Tool
title iDNA-MT: Identification DNA Modification Sites in Multiple Species by Using Multi-Task Learning Based a Neural Network Tool
title_full iDNA-MT: Identification DNA Modification Sites in Multiple Species by Using Multi-Task Learning Based a Neural Network Tool
title_fullStr iDNA-MT: Identification DNA Modification Sites in Multiple Species by Using Multi-Task Learning Based a Neural Network Tool
title_full_unstemmed iDNA-MT: Identification DNA Modification Sites in Multiple Species by Using Multi-Task Learning Based a Neural Network Tool
title_short iDNA-MT: Identification DNA Modification Sites in Multiple Species by Using Multi-Task Learning Based a Neural Network Tool
title_sort idna-mt: identification dna modification sites in multiple species by using multi-task learning based a neural network tool
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8044371/
https://www.ncbi.nlm.nih.gov/pubmed/33868390
http://dx.doi.org/10.3389/fgene.2021.663572
work_keys_str_mv AT yangxiao idnamtidentificationdnamodificationsitesinmultiplespeciesbyusingmultitasklearningbasedaneuralnetworktool
AT yexiucai idnamtidentificationdnamodificationsitesinmultiplespeciesbyusingmultitasklearningbasedaneuralnetworktool
AT lixuehong idnamtidentificationdnamodificationsitesinmultiplespeciesbyusingmultitasklearningbasedaneuralnetworktool
AT weilesong idnamtidentificationdnamodificationsitesinmultiplespeciesbyusingmultitasklearningbasedaneuralnetworktool