Cargando…
m6AGE: A Predictor for N6-Methyladenosine Sites Identification Utilizing Sequence Characteristics and Graph Embedding-Based Geometrical Information
N(6)-methyladenosine (m(6)A) is one of the most prevalent RNA post-transcriptional modifications and is involved in various vital biological processes such as mRNA splicing, exporting, stability, and so on. Identifying m(6)A sites contributes to understanding the functional mechanism and biological...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8191635/ https://www.ncbi.nlm.nih.gov/pubmed/34122525 http://dx.doi.org/10.3389/fgene.2021.670852 |
_version_ | 1783705904040902656 |
---|---|
author | Wang, Yan Guo, Rui Huang, Lan Yang, Sen Hu, Xuemei He, Kai |
author_facet | Wang, Yan Guo, Rui Huang, Lan Yang, Sen Hu, Xuemei He, Kai |
author_sort | Wang, Yan |
collection | PubMed |
description | N(6)-methyladenosine (m(6)A) is one of the most prevalent RNA post-transcriptional modifications and is involved in various vital biological processes such as mRNA splicing, exporting, stability, and so on. Identifying m(6)A sites contributes to understanding the functional mechanism and biological significance of m(6)A. The existing biological experimental methods for identifying m(6)A sites are time-consuming and costly. Thus, developing a high confidence computational method is significant to explore m(6)A intrinsic characters. In this study, we propose a predictor called m6AGE which utilizes sequence-derived and graph embedding features. To the best of our knowledge, our predictor is the first to combine sequence-derived features and graph embeddings for m(6)A site prediction. Comparison results show that our proposed predictor achieved the best performance compared with other predictors on four public datasets across three species. On the A101 dataset, our predictor outperformed 1.34% (accuracy), 0.0227 (Matthew’s correlation coefficient), 5.63% (specificity), and 0.0081 (AUC) than comparing predictors, which indicates that m6AGE is a useful tool for m(6)A site prediction. The source code of m6AGE is available at https://github.com/bokunoBike/m6AGE. |
format | Online Article Text |
id | pubmed-8191635 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-81916352021-06-11 m6AGE: A Predictor for N6-Methyladenosine Sites Identification Utilizing Sequence Characteristics and Graph Embedding-Based Geometrical Information Wang, Yan Guo, Rui Huang, Lan Yang, Sen Hu, Xuemei He, Kai Front Genet Genetics N(6)-methyladenosine (m(6)A) is one of the most prevalent RNA post-transcriptional modifications and is involved in various vital biological processes such as mRNA splicing, exporting, stability, and so on. Identifying m(6)A sites contributes to understanding the functional mechanism and biological significance of m(6)A. The existing biological experimental methods for identifying m(6)A sites are time-consuming and costly. Thus, developing a high confidence computational method is significant to explore m(6)A intrinsic characters. In this study, we propose a predictor called m6AGE which utilizes sequence-derived and graph embedding features. To the best of our knowledge, our predictor is the first to combine sequence-derived features and graph embeddings for m(6)A site prediction. Comparison results show that our proposed predictor achieved the best performance compared with other predictors on four public datasets across three species. On the A101 dataset, our predictor outperformed 1.34% (accuracy), 0.0227 (Matthew’s correlation coefficient), 5.63% (specificity), and 0.0081 (AUC) than comparing predictors, which indicates that m6AGE is a useful tool for m(6)A site prediction. The source code of m6AGE is available at https://github.com/bokunoBike/m6AGE. Frontiers Media S.A. 2021-05-27 /pmc/articles/PMC8191635/ /pubmed/34122525 http://dx.doi.org/10.3389/fgene.2021.670852 Text en Copyright © 2021 Wang, Guo, Huang, Yang, Hu and He. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Wang, Yan Guo, Rui Huang, Lan Yang, Sen Hu, Xuemei He, Kai m6AGE: A Predictor for N6-Methyladenosine Sites Identification Utilizing Sequence Characteristics and Graph Embedding-Based Geometrical Information |
title | m6AGE: A Predictor for N6-Methyladenosine Sites Identification Utilizing Sequence Characteristics and Graph Embedding-Based Geometrical Information |
title_full | m6AGE: A Predictor for N6-Methyladenosine Sites Identification Utilizing Sequence Characteristics and Graph Embedding-Based Geometrical Information |
title_fullStr | m6AGE: A Predictor for N6-Methyladenosine Sites Identification Utilizing Sequence Characteristics and Graph Embedding-Based Geometrical Information |
title_full_unstemmed | m6AGE: A Predictor for N6-Methyladenosine Sites Identification Utilizing Sequence Characteristics and Graph Embedding-Based Geometrical Information |
title_short | m6AGE: A Predictor for N6-Methyladenosine Sites Identification Utilizing Sequence Characteristics and Graph Embedding-Based Geometrical Information |
title_sort | m6age: a predictor for n6-methyladenosine sites identification utilizing sequence characteristics and graph embedding-based geometrical information |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8191635/ https://www.ncbi.nlm.nih.gov/pubmed/34122525 http://dx.doi.org/10.3389/fgene.2021.670852 |
work_keys_str_mv | AT wangyan m6ageapredictorforn6methyladenosinesitesidentificationutilizingsequencecharacteristicsandgraphembeddingbasedgeometricalinformation AT guorui m6ageapredictorforn6methyladenosinesitesidentificationutilizingsequencecharacteristicsandgraphembeddingbasedgeometricalinformation AT huanglan m6ageapredictorforn6methyladenosinesitesidentificationutilizingsequencecharacteristicsandgraphembeddingbasedgeometricalinformation AT yangsen m6ageapredictorforn6methyladenosinesitesidentificationutilizingsequencecharacteristicsandgraphembeddingbasedgeometricalinformation AT huxuemei m6ageapredictorforn6methyladenosinesitesidentificationutilizingsequencecharacteristicsandgraphembeddingbasedgeometricalinformation AT hekai m6ageapredictorforn6methyladenosinesitesidentificationutilizingsequencecharacteristicsandgraphembeddingbasedgeometricalinformation |