Cargando…

i4mC-GRU: Identifying DNA N(4)-Methylcytosine sites in mouse genomes using bidirectional gated recurrent unit and sequence-embedded features

N(4)-methylcytosine (4mC) is one of the most common DNA methylation modifications found in both prokaryotic and eukaryotic genomes. Since the 4mC has various essential biological roles, determining its location helps reveal unexplored physiological and pathological pathways. In this study, we propos...

Descripción completa

Detalles Bibliográficos
Autores principales: Nguyen-Vo, Thanh-Hoang, Trinh, Quang H., Nguyen, Loc, Nguyen-Hoang, Phuong-Uyen, Rahardja, Susanto, Nguyen, Binh P.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Research Network of Computational and Structural Biotechnology 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10238585/
https://www.ncbi.nlm.nih.gov/pubmed/37273848
http://dx.doi.org/10.1016/j.csbj.2023.05.014
_version_ 1785053324021596160
author Nguyen-Vo, Thanh-Hoang
Trinh, Quang H.
Nguyen, Loc
Nguyen-Hoang, Phuong-Uyen
Rahardja, Susanto
Nguyen, Binh P.
author_facet Nguyen-Vo, Thanh-Hoang
Trinh, Quang H.
Nguyen, Loc
Nguyen-Hoang, Phuong-Uyen
Rahardja, Susanto
Nguyen, Binh P.
author_sort Nguyen-Vo, Thanh-Hoang
collection PubMed
description N(4)-methylcytosine (4mC) is one of the most common DNA methylation modifications found in both prokaryotic and eukaryotic genomes. Since the 4mC has various essential biological roles, determining its location helps reveal unexplored physiological and pathological pathways. In this study, we propose an effective computational method called i4mC-GRU using a gated recurrent unit and duplet sequence-embedded features to predict potential 4mC sites in mouse (Mus musculus) genomes. To fairly assess the performance of the model, we compared our method with several state-of-the-art methods using two different benchmark datasets. Our results showed that i4mC-GRU achieved area under the receiver operating characteristic curve values of 0.97 and 0.89 and area under the precision-recall curve values of 0.98 and 0.90 on the first and second benchmark datasets, respectively. Briefly, our method outperformed existing methods in predicting 4mC sites in mouse genomes. Also, we deployed i4mC-GRU as an online web server, supporting users in genomics studies.
format Online
Article
Text
id pubmed-10238585
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Research Network of Computational and Structural Biotechnology
record_format MEDLINE/PubMed
spelling pubmed-102385852023-06-04 i4mC-GRU: Identifying DNA N(4)-Methylcytosine sites in mouse genomes using bidirectional gated recurrent unit and sequence-embedded features Nguyen-Vo, Thanh-Hoang Trinh, Quang H. Nguyen, Loc Nguyen-Hoang, Phuong-Uyen Rahardja, Susanto Nguyen, Binh P. Comput Struct Biotechnol J Software/Web server Article N(4)-methylcytosine (4mC) is one of the most common DNA methylation modifications found in both prokaryotic and eukaryotic genomes. Since the 4mC has various essential biological roles, determining its location helps reveal unexplored physiological and pathological pathways. In this study, we propose an effective computational method called i4mC-GRU using a gated recurrent unit and duplet sequence-embedded features to predict potential 4mC sites in mouse (Mus musculus) genomes. To fairly assess the performance of the model, we compared our method with several state-of-the-art methods using two different benchmark datasets. Our results showed that i4mC-GRU achieved area under the receiver operating characteristic curve values of 0.97 and 0.89 and area under the precision-recall curve values of 0.98 and 0.90 on the first and second benchmark datasets, respectively. Briefly, our method outperformed existing methods in predicting 4mC sites in mouse genomes. Also, we deployed i4mC-GRU as an online web server, supporting users in genomics studies. Research Network of Computational and Structural Biotechnology 2023-05-16 /pmc/articles/PMC10238585/ /pubmed/37273848 http://dx.doi.org/10.1016/j.csbj.2023.05.014 Text en © 2023 The Authors https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Software/Web server Article
Nguyen-Vo, Thanh-Hoang
Trinh, Quang H.
Nguyen, Loc
Nguyen-Hoang, Phuong-Uyen
Rahardja, Susanto
Nguyen, Binh P.
i4mC-GRU: Identifying DNA N(4)-Methylcytosine sites in mouse genomes using bidirectional gated recurrent unit and sequence-embedded features
title i4mC-GRU: Identifying DNA N(4)-Methylcytosine sites in mouse genomes using bidirectional gated recurrent unit and sequence-embedded features
title_full i4mC-GRU: Identifying DNA N(4)-Methylcytosine sites in mouse genomes using bidirectional gated recurrent unit and sequence-embedded features
title_fullStr i4mC-GRU: Identifying DNA N(4)-Methylcytosine sites in mouse genomes using bidirectional gated recurrent unit and sequence-embedded features
title_full_unstemmed i4mC-GRU: Identifying DNA N(4)-Methylcytosine sites in mouse genomes using bidirectional gated recurrent unit and sequence-embedded features
title_short i4mC-GRU: Identifying DNA N(4)-Methylcytosine sites in mouse genomes using bidirectional gated recurrent unit and sequence-embedded features
title_sort i4mc-gru: identifying dna n(4)-methylcytosine sites in mouse genomes using bidirectional gated recurrent unit and sequence-embedded features
topic Software/Web server Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10238585/
https://www.ncbi.nlm.nih.gov/pubmed/37273848
http://dx.doi.org/10.1016/j.csbj.2023.05.014
work_keys_str_mv AT nguyenvothanhhoang i4mcgruidentifyingdnan4methylcytosinesitesinmousegenomesusingbidirectionalgatedrecurrentunitandsequenceembeddedfeatures
AT trinhquangh i4mcgruidentifyingdnan4methylcytosinesitesinmousegenomesusingbidirectionalgatedrecurrentunitandsequenceembeddedfeatures
AT nguyenloc i4mcgruidentifyingdnan4methylcytosinesitesinmousegenomesusingbidirectionalgatedrecurrentunitandsequenceembeddedfeatures
AT nguyenhoangphuonguyen i4mcgruidentifyingdnan4methylcytosinesitesinmousegenomesusingbidirectionalgatedrecurrentunitandsequenceembeddedfeatures
AT rahardjasusanto i4mcgruidentifyingdnan4methylcytosinesitesinmousegenomesusingbidirectionalgatedrecurrentunitandsequenceembeddedfeatures
AT nguyenbinhp i4mcgruidentifyingdnan4methylcytosinesitesinmousegenomesusingbidirectionalgatedrecurrentunitandsequenceembeddedfeatures