Cargando…

4mCPred-MTL: Accurate Identification of DNA 4mC Sites in Multiple Species Using Multi-Task Deep Learning Based on Multi-Head Attention Mechanism

DNA methylation is one of the most extensive epigenetic modifications. DNA 4mC modification plays a key role in regulating chromatin structure and gene expression. In this study, we proposed a generic 4mC computational predictor, namely, 4mCPred-MTL using multi-task learning coupled with Transformer...

Descripción completa

Detalles Bibliográficos
Autores principales: Zeng, Rao, Cheng, Song, Liao, Minghong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8141656/
https://www.ncbi.nlm.nih.gov/pubmed/34041243
http://dx.doi.org/10.3389/fcell.2021.664669
_version_ 1783696409740967936
author Zeng, Rao
Cheng, Song
Liao, Minghong
author_facet Zeng, Rao
Cheng, Song
Liao, Minghong
author_sort Zeng, Rao
collection PubMed
description DNA methylation is one of the most extensive epigenetic modifications. DNA 4mC modification plays a key role in regulating chromatin structure and gene expression. In this study, we proposed a generic 4mC computational predictor, namely, 4mCPred-MTL using multi-task learning coupled with Transformer to predict 4mC sites in multiple species. In this predictor, we utilize a multi-task learning framework, in which each task is to train species-specific data based on Transformer. Extensive experimental results show that our multi-task predictive model can significantly improve the performance of the model based on single task and outperform existing methods on benchmarking comparison. Moreover, we found that our model can sufficiently capture better characteristics of 4mC sites as compared to existing commonly used feature descriptors, demonstrating the strong feature learning ability of our model. Therefore, based on the above results, it can be expected that our 4mCPred-MTL can be a useful tool for research communities of interest.
format Online
Article
Text
id pubmed-8141656
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-81416562021-05-25 4mCPred-MTL: Accurate Identification of DNA 4mC Sites in Multiple Species Using Multi-Task Deep Learning Based on Multi-Head Attention Mechanism Zeng, Rao Cheng, Song Liao, Minghong Front Cell Dev Biol Cell and Developmental Biology DNA methylation is one of the most extensive epigenetic modifications. DNA 4mC modification plays a key role in regulating chromatin structure and gene expression. In this study, we proposed a generic 4mC computational predictor, namely, 4mCPred-MTL using multi-task learning coupled with Transformer to predict 4mC sites in multiple species. In this predictor, we utilize a multi-task learning framework, in which each task is to train species-specific data based on Transformer. Extensive experimental results show that our multi-task predictive model can significantly improve the performance of the model based on single task and outperform existing methods on benchmarking comparison. Moreover, we found that our model can sufficiently capture better characteristics of 4mC sites as compared to existing commonly used feature descriptors, demonstrating the strong feature learning ability of our model. Therefore, based on the above results, it can be expected that our 4mCPred-MTL can be a useful tool for research communities of interest. Frontiers Media S.A. 2021-05-10 /pmc/articles/PMC8141656/ /pubmed/34041243 http://dx.doi.org/10.3389/fcell.2021.664669 Text en Copyright © 2021 Zeng, Cheng and Liao. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Cell and Developmental Biology
Zeng, Rao
Cheng, Song
Liao, Minghong
4mCPred-MTL: Accurate Identification of DNA 4mC Sites in Multiple Species Using Multi-Task Deep Learning Based on Multi-Head Attention Mechanism
title 4mCPred-MTL: Accurate Identification of DNA 4mC Sites in Multiple Species Using Multi-Task Deep Learning Based on Multi-Head Attention Mechanism
title_full 4mCPred-MTL: Accurate Identification of DNA 4mC Sites in Multiple Species Using Multi-Task Deep Learning Based on Multi-Head Attention Mechanism
title_fullStr 4mCPred-MTL: Accurate Identification of DNA 4mC Sites in Multiple Species Using Multi-Task Deep Learning Based on Multi-Head Attention Mechanism
title_full_unstemmed 4mCPred-MTL: Accurate Identification of DNA 4mC Sites in Multiple Species Using Multi-Task Deep Learning Based on Multi-Head Attention Mechanism
title_short 4mCPred-MTL: Accurate Identification of DNA 4mC Sites in Multiple Species Using Multi-Task Deep Learning Based on Multi-Head Attention Mechanism
title_sort 4mcpred-mtl: accurate identification of dna 4mc sites in multiple species using multi-task deep learning based on multi-head attention mechanism
topic Cell and Developmental Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8141656/
https://www.ncbi.nlm.nih.gov/pubmed/34041243
http://dx.doi.org/10.3389/fcell.2021.664669
work_keys_str_mv AT zengrao 4mcpredmtlaccurateidentificationofdna4mcsitesinmultiplespeciesusingmultitaskdeeplearningbasedonmultiheadattentionmechanism
AT chengsong 4mcpredmtlaccurateidentificationofdna4mcsitesinmultiplespeciesusingmultitaskdeeplearningbasedonmultiheadattentionmechanism
AT liaominghong 4mcpredmtlaccurateidentificationofdna4mcsitesinmultiplespeciesusingmultitaskdeeplearningbasedonmultiheadattentionmechanism