Cargando…
i4mC-GRU: Identifying DNA N(4)-Methylcytosine sites in mouse genomes using bidirectional gated recurrent unit and sequence-embedded features
N(4)-methylcytosine (4mC) is one of the most common DNA methylation modifications found in both prokaryotic and eukaryotic genomes. Since the 4mC has various essential biological roles, determining its location helps reveal unexplored physiological and pathological pathways. In this study, we propos...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Research Network of Computational and Structural Biotechnology
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10238585/ https://www.ncbi.nlm.nih.gov/pubmed/37273848 http://dx.doi.org/10.1016/j.csbj.2023.05.014 |
_version_ | 1785053324021596160 |
---|---|
author | Nguyen-Vo, Thanh-Hoang Trinh, Quang H. Nguyen, Loc Nguyen-Hoang, Phuong-Uyen Rahardja, Susanto Nguyen, Binh P. |
author_facet | Nguyen-Vo, Thanh-Hoang Trinh, Quang H. Nguyen, Loc Nguyen-Hoang, Phuong-Uyen Rahardja, Susanto Nguyen, Binh P. |
author_sort | Nguyen-Vo, Thanh-Hoang |
collection | PubMed |
description | N(4)-methylcytosine (4mC) is one of the most common DNA methylation modifications found in both prokaryotic and eukaryotic genomes. Since the 4mC has various essential biological roles, determining its location helps reveal unexplored physiological and pathological pathways. In this study, we propose an effective computational method called i4mC-GRU using a gated recurrent unit and duplet sequence-embedded features to predict potential 4mC sites in mouse (Mus musculus) genomes. To fairly assess the performance of the model, we compared our method with several state-of-the-art methods using two different benchmark datasets. Our results showed that i4mC-GRU achieved area under the receiver operating characteristic curve values of 0.97 and 0.89 and area under the precision-recall curve values of 0.98 and 0.90 on the first and second benchmark datasets, respectively. Briefly, our method outperformed existing methods in predicting 4mC sites in mouse genomes. Also, we deployed i4mC-GRU as an online web server, supporting users in genomics studies. |
format | Online Article Text |
id | pubmed-10238585 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Research Network of Computational and Structural Biotechnology |
record_format | MEDLINE/PubMed |
spelling | pubmed-102385852023-06-04 i4mC-GRU: Identifying DNA N(4)-Methylcytosine sites in mouse genomes using bidirectional gated recurrent unit and sequence-embedded features Nguyen-Vo, Thanh-Hoang Trinh, Quang H. Nguyen, Loc Nguyen-Hoang, Phuong-Uyen Rahardja, Susanto Nguyen, Binh P. Comput Struct Biotechnol J Software/Web server Article N(4)-methylcytosine (4mC) is one of the most common DNA methylation modifications found in both prokaryotic and eukaryotic genomes. Since the 4mC has various essential biological roles, determining its location helps reveal unexplored physiological and pathological pathways. In this study, we propose an effective computational method called i4mC-GRU using a gated recurrent unit and duplet sequence-embedded features to predict potential 4mC sites in mouse (Mus musculus) genomes. To fairly assess the performance of the model, we compared our method with several state-of-the-art methods using two different benchmark datasets. Our results showed that i4mC-GRU achieved area under the receiver operating characteristic curve values of 0.97 and 0.89 and area under the precision-recall curve values of 0.98 and 0.90 on the first and second benchmark datasets, respectively. Briefly, our method outperformed existing methods in predicting 4mC sites in mouse genomes. Also, we deployed i4mC-GRU as an online web server, supporting users in genomics studies. Research Network of Computational and Structural Biotechnology 2023-05-16 /pmc/articles/PMC10238585/ /pubmed/37273848 http://dx.doi.org/10.1016/j.csbj.2023.05.014 Text en © 2023 The Authors https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Software/Web server Article Nguyen-Vo, Thanh-Hoang Trinh, Quang H. Nguyen, Loc Nguyen-Hoang, Phuong-Uyen Rahardja, Susanto Nguyen, Binh P. i4mC-GRU: Identifying DNA N(4)-Methylcytosine sites in mouse genomes using bidirectional gated recurrent unit and sequence-embedded features |
title | i4mC-GRU: Identifying DNA N(4)-Methylcytosine sites in mouse genomes using bidirectional gated recurrent unit and sequence-embedded features |
title_full | i4mC-GRU: Identifying DNA N(4)-Methylcytosine sites in mouse genomes using bidirectional gated recurrent unit and sequence-embedded features |
title_fullStr | i4mC-GRU: Identifying DNA N(4)-Methylcytosine sites in mouse genomes using bidirectional gated recurrent unit and sequence-embedded features |
title_full_unstemmed | i4mC-GRU: Identifying DNA N(4)-Methylcytosine sites in mouse genomes using bidirectional gated recurrent unit and sequence-embedded features |
title_short | i4mC-GRU: Identifying DNA N(4)-Methylcytosine sites in mouse genomes using bidirectional gated recurrent unit and sequence-embedded features |
title_sort | i4mc-gru: identifying dna n(4)-methylcytosine sites in mouse genomes using bidirectional gated recurrent unit and sequence-embedded features |
topic | Software/Web server Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10238585/ https://www.ncbi.nlm.nih.gov/pubmed/37273848 http://dx.doi.org/10.1016/j.csbj.2023.05.014 |
work_keys_str_mv | AT nguyenvothanhhoang i4mcgruidentifyingdnan4methylcytosinesitesinmousegenomesusingbidirectionalgatedrecurrentunitandsequenceembeddedfeatures AT trinhquangh i4mcgruidentifyingdnan4methylcytosinesitesinmousegenomesusingbidirectionalgatedrecurrentunitandsequenceembeddedfeatures AT nguyenloc i4mcgruidentifyingdnan4methylcytosinesitesinmousegenomesusingbidirectionalgatedrecurrentunitandsequenceembeddedfeatures AT nguyenhoangphuonguyen i4mcgruidentifyingdnan4methylcytosinesitesinmousegenomesusingbidirectionalgatedrecurrentunitandsequenceembeddedfeatures AT rahardjasusanto i4mcgruidentifyingdnan4methylcytosinesitesinmousegenomesusingbidirectionalgatedrecurrentunitandsequenceembeddedfeatures AT nguyenbinhp i4mcgruidentifyingdnan4methylcytosinesitesinmousegenomesusingbidirectionalgatedrecurrentunitandsequenceembeddedfeatures |