Cargando…
Hyb4mC: a hybrid DNA2vec-based model for DNA N4-methylcytosine sites prediction
BACKGROUND: DNA N4-methylcytosine is part of the restrictive modification system, which works by regulating some biological processes, for example, the initiation of DNA replication, mismatch repair and inactivation of transposon. However, using experimental methods to detect 4mC sites is time-consu...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9241225/ https://www.ncbi.nlm.nih.gov/pubmed/35768759 http://dx.doi.org/10.1186/s12859-022-04789-6 |
_version_ | 1784737754857340928 |
---|---|
author | Liang, Ying Wu, Yanan Zhang, Zequn Liu, Niannian Peng, Jun Tang, Jianjun |
author_facet | Liang, Ying Wu, Yanan Zhang, Zequn Liu, Niannian Peng, Jun Tang, Jianjun |
author_sort | Liang, Ying |
collection | PubMed |
description | BACKGROUND: DNA N4-methylcytosine is part of the restrictive modification system, which works by regulating some biological processes, for example, the initiation of DNA replication, mismatch repair and inactivation of transposon. However, using experimental methods to detect 4mC sites is time-consuming and expensive. Besides, considering the huge differences in the number of 4mC samples among different species, it is challenging to achieve a robust multi-species 4mC site prediction performance. Hence, it is of great significance to develop effective computational tools to identify 4mC sites. RESULTS: This work proposes a flexible deep learning-based framework to predict 4mC sites, called Hyb4mC. Hyb4mC adopts the DNA2vec method for sequence embedding, which captures more efficient and comprehensive information compared with the sequence-based feature method. Then, two different subnets are used for further analysis: Hyb_Caps and Hyb_Conv. Hyb_Caps is composed of a capsule neural network and can generalize from fewer samples. Hyb_Conv combines the attention mechanism with a text convolutional neural network for further feature learning. CONCLUSIONS: Extensive benchmark tests have shown that Hyb4mC can significantly enhance the performance of predicting 4mC sites compared with the recently proposed methods. |
format | Online Article Text |
id | pubmed-9241225 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-92412252022-06-30 Hyb4mC: a hybrid DNA2vec-based model for DNA N4-methylcytosine sites prediction Liang, Ying Wu, Yanan Zhang, Zequn Liu, Niannian Peng, Jun Tang, Jianjun BMC Bioinformatics Research BACKGROUND: DNA N4-methylcytosine is part of the restrictive modification system, which works by regulating some biological processes, for example, the initiation of DNA replication, mismatch repair and inactivation of transposon. However, using experimental methods to detect 4mC sites is time-consuming and expensive. Besides, considering the huge differences in the number of 4mC samples among different species, it is challenging to achieve a robust multi-species 4mC site prediction performance. Hence, it is of great significance to develop effective computational tools to identify 4mC sites. RESULTS: This work proposes a flexible deep learning-based framework to predict 4mC sites, called Hyb4mC. Hyb4mC adopts the DNA2vec method for sequence embedding, which captures more efficient and comprehensive information compared with the sequence-based feature method. Then, two different subnets are used for further analysis: Hyb_Caps and Hyb_Conv. Hyb_Caps is composed of a capsule neural network and can generalize from fewer samples. Hyb_Conv combines the attention mechanism with a text convolutional neural network for further feature learning. CONCLUSIONS: Extensive benchmark tests have shown that Hyb4mC can significantly enhance the performance of predicting 4mC sites compared with the recently proposed methods. BioMed Central 2022-06-29 /pmc/articles/PMC9241225/ /pubmed/35768759 http://dx.doi.org/10.1186/s12859-022-04789-6 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Liang, Ying Wu, Yanan Zhang, Zequn Liu, Niannian Peng, Jun Tang, Jianjun Hyb4mC: a hybrid DNA2vec-based model for DNA N4-methylcytosine sites prediction |
title | Hyb4mC: a hybrid DNA2vec-based model for DNA N4-methylcytosine sites prediction |
title_full | Hyb4mC: a hybrid DNA2vec-based model for DNA N4-methylcytosine sites prediction |
title_fullStr | Hyb4mC: a hybrid DNA2vec-based model for DNA N4-methylcytosine sites prediction |
title_full_unstemmed | Hyb4mC: a hybrid DNA2vec-based model for DNA N4-methylcytosine sites prediction |
title_short | Hyb4mC: a hybrid DNA2vec-based model for DNA N4-methylcytosine sites prediction |
title_sort | hyb4mc: a hybrid dna2vec-based model for dna n4-methylcytosine sites prediction |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9241225/ https://www.ncbi.nlm.nih.gov/pubmed/35768759 http://dx.doi.org/10.1186/s12859-022-04789-6 |
work_keys_str_mv | AT liangying hyb4mcahybriddna2vecbasedmodelfordnan4methylcytosinesitesprediction AT wuyanan hyb4mcahybriddna2vecbasedmodelfordnan4methylcytosinesitesprediction AT zhangzequn hyb4mcahybriddna2vecbasedmodelfordnan4methylcytosinesitesprediction AT liuniannian hyb4mcahybriddna2vecbasedmodelfordnan4methylcytosinesitesprediction AT pengjun hyb4mcahybriddna2vecbasedmodelfordnan4methylcytosinesitesprediction AT tangjianjun hyb4mcahybriddna2vecbasedmodelfordnan4methylcytosinesitesprediction |