Cargando…

Hyb4mC: a hybrid DNA2vec-based model for DNA N4-methylcytosine sites prediction

BACKGROUND: DNA N4-methylcytosine is part of the restrictive modification system, which works by regulating some biological processes, for example, the initiation of DNA replication, mismatch repair and inactivation of transposon. However, using experimental methods to detect 4mC sites is time-consu...

Descripción completa

Detalles Bibliográficos
Autores principales: Liang, Ying, Wu, Yanan, Zhang, Zequn, Liu, Niannian, Peng, Jun, Tang, Jianjun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9241225/
https://www.ncbi.nlm.nih.gov/pubmed/35768759
http://dx.doi.org/10.1186/s12859-022-04789-6
_version_ 1784737754857340928
author Liang, Ying
Wu, Yanan
Zhang, Zequn
Liu, Niannian
Peng, Jun
Tang, Jianjun
author_facet Liang, Ying
Wu, Yanan
Zhang, Zequn
Liu, Niannian
Peng, Jun
Tang, Jianjun
author_sort Liang, Ying
collection PubMed
description BACKGROUND: DNA N4-methylcytosine is part of the restrictive modification system, which works by regulating some biological processes, for example, the initiation of DNA replication, mismatch repair and inactivation of transposon. However, using experimental methods to detect 4mC sites is time-consuming and expensive. Besides, considering the huge differences in the number of 4mC samples among different species, it is challenging to achieve a robust multi-species 4mC site prediction performance. Hence, it is of great significance to develop effective computational tools to identify 4mC sites. RESULTS: This work proposes a flexible deep learning-based framework to predict 4mC sites, called Hyb4mC. Hyb4mC adopts the DNA2vec method for sequence embedding, which captures more efficient and comprehensive information compared with the sequence-based feature method. Then, two different subnets are used for further analysis: Hyb_Caps and Hyb_Conv. Hyb_Caps is composed of a capsule neural network and can generalize from fewer samples. Hyb_Conv combines the attention mechanism with a text convolutional neural network for further feature learning. CONCLUSIONS: Extensive benchmark tests have shown that Hyb4mC can significantly enhance the performance of predicting 4mC sites compared with the recently proposed methods.
format Online
Article
Text
id pubmed-9241225
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-92412252022-06-30 Hyb4mC: a hybrid DNA2vec-based model for DNA N4-methylcytosine sites prediction Liang, Ying Wu, Yanan Zhang, Zequn Liu, Niannian Peng, Jun Tang, Jianjun BMC Bioinformatics Research BACKGROUND: DNA N4-methylcytosine is part of the restrictive modification system, which works by regulating some biological processes, for example, the initiation of DNA replication, mismatch repair and inactivation of transposon. However, using experimental methods to detect 4mC sites is time-consuming and expensive. Besides, considering the huge differences in the number of 4mC samples among different species, it is challenging to achieve a robust multi-species 4mC site prediction performance. Hence, it is of great significance to develop effective computational tools to identify 4mC sites. RESULTS: This work proposes a flexible deep learning-based framework to predict 4mC sites, called Hyb4mC. Hyb4mC adopts the DNA2vec method for sequence embedding, which captures more efficient and comprehensive information compared with the sequence-based feature method. Then, two different subnets are used for further analysis: Hyb_Caps and Hyb_Conv. Hyb_Caps is composed of a capsule neural network and can generalize from fewer samples. Hyb_Conv combines the attention mechanism with a text convolutional neural network for further feature learning. CONCLUSIONS: Extensive benchmark tests have shown that Hyb4mC can significantly enhance the performance of predicting 4mC sites compared with the recently proposed methods. BioMed Central 2022-06-29 /pmc/articles/PMC9241225/ /pubmed/35768759 http://dx.doi.org/10.1186/s12859-022-04789-6 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Liang, Ying
Wu, Yanan
Zhang, Zequn
Liu, Niannian
Peng, Jun
Tang, Jianjun
Hyb4mC: a hybrid DNA2vec-based model for DNA N4-methylcytosine sites prediction
title Hyb4mC: a hybrid DNA2vec-based model for DNA N4-methylcytosine sites prediction
title_full Hyb4mC: a hybrid DNA2vec-based model for DNA N4-methylcytosine sites prediction
title_fullStr Hyb4mC: a hybrid DNA2vec-based model for DNA N4-methylcytosine sites prediction
title_full_unstemmed Hyb4mC: a hybrid DNA2vec-based model for DNA N4-methylcytosine sites prediction
title_short Hyb4mC: a hybrid DNA2vec-based model for DNA N4-methylcytosine sites prediction
title_sort hyb4mc: a hybrid dna2vec-based model for dna n4-methylcytosine sites prediction
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9241225/
https://www.ncbi.nlm.nih.gov/pubmed/35768759
http://dx.doi.org/10.1186/s12859-022-04789-6
work_keys_str_mv AT liangying hyb4mcahybriddna2vecbasedmodelfordnan4methylcytosinesitesprediction
AT wuyanan hyb4mcahybriddna2vecbasedmodelfordnan4methylcytosinesitesprediction
AT zhangzequn hyb4mcahybriddna2vecbasedmodelfordnan4methylcytosinesitesprediction
AT liuniannian hyb4mcahybriddna2vecbasedmodelfordnan4methylcytosinesitesprediction
AT pengjun hyb4mcahybriddna2vecbasedmodelfordnan4methylcytosinesitesprediction
AT tangjianjun hyb4mcahybriddna2vecbasedmodelfordnan4methylcytosinesitesprediction