Cargando…

m6AGE: A Predictor for N6-Methyladenosine Sites Identification Utilizing Sequence Characteristics and Graph Embedding-Based Geometrical Information

N(6)-methyladenosine (m(6)A) is one of the most prevalent RNA post-transcriptional modifications and is involved in various vital biological processes such as mRNA splicing, exporting, stability, and so on. Identifying m(6)A sites contributes to understanding the functional mechanism and biological...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Yan, Guo, Rui, Huang, Lan, Yang, Sen, Hu, Xuemei, He, Kai
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8191635/
https://www.ncbi.nlm.nih.gov/pubmed/34122525
http://dx.doi.org/10.3389/fgene.2021.670852
_version_ 1783705904040902656
author Wang, Yan
Guo, Rui
Huang, Lan
Yang, Sen
Hu, Xuemei
He, Kai
author_facet Wang, Yan
Guo, Rui
Huang, Lan
Yang, Sen
Hu, Xuemei
He, Kai
author_sort Wang, Yan
collection PubMed
description N(6)-methyladenosine (m(6)A) is one of the most prevalent RNA post-transcriptional modifications and is involved in various vital biological processes such as mRNA splicing, exporting, stability, and so on. Identifying m(6)A sites contributes to understanding the functional mechanism and biological significance of m(6)A. The existing biological experimental methods for identifying m(6)A sites are time-consuming and costly. Thus, developing a high confidence computational method is significant to explore m(6)A intrinsic characters. In this study, we propose a predictor called m6AGE which utilizes sequence-derived and graph embedding features. To the best of our knowledge, our predictor is the first to combine sequence-derived features and graph embeddings for m(6)A site prediction. Comparison results show that our proposed predictor achieved the best performance compared with other predictors on four public datasets across three species. On the A101 dataset, our predictor outperformed 1.34% (accuracy), 0.0227 (Matthew’s correlation coefficient), 5.63% (specificity), and 0.0081 (AUC) than comparing predictors, which indicates that m6AGE is a useful tool for m(6)A site prediction. The source code of m6AGE is available at https://github.com/bokunoBike/m6AGE.
format Online
Article
Text
id pubmed-8191635
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-81916352021-06-11 m6AGE: A Predictor for N6-Methyladenosine Sites Identification Utilizing Sequence Characteristics and Graph Embedding-Based Geometrical Information Wang, Yan Guo, Rui Huang, Lan Yang, Sen Hu, Xuemei He, Kai Front Genet Genetics N(6)-methyladenosine (m(6)A) is one of the most prevalent RNA post-transcriptional modifications and is involved in various vital biological processes such as mRNA splicing, exporting, stability, and so on. Identifying m(6)A sites contributes to understanding the functional mechanism and biological significance of m(6)A. The existing biological experimental methods for identifying m(6)A sites are time-consuming and costly. Thus, developing a high confidence computational method is significant to explore m(6)A intrinsic characters. In this study, we propose a predictor called m6AGE which utilizes sequence-derived and graph embedding features. To the best of our knowledge, our predictor is the first to combine sequence-derived features and graph embeddings for m(6)A site prediction. Comparison results show that our proposed predictor achieved the best performance compared with other predictors on four public datasets across three species. On the A101 dataset, our predictor outperformed 1.34% (accuracy), 0.0227 (Matthew’s correlation coefficient), 5.63% (specificity), and 0.0081 (AUC) than comparing predictors, which indicates that m6AGE is a useful tool for m(6)A site prediction. The source code of m6AGE is available at https://github.com/bokunoBike/m6AGE. Frontiers Media S.A. 2021-05-27 /pmc/articles/PMC8191635/ /pubmed/34122525 http://dx.doi.org/10.3389/fgene.2021.670852 Text en Copyright © 2021 Wang, Guo, Huang, Yang, Hu and He. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Wang, Yan
Guo, Rui
Huang, Lan
Yang, Sen
Hu, Xuemei
He, Kai
m6AGE: A Predictor for N6-Methyladenosine Sites Identification Utilizing Sequence Characteristics and Graph Embedding-Based Geometrical Information
title m6AGE: A Predictor for N6-Methyladenosine Sites Identification Utilizing Sequence Characteristics and Graph Embedding-Based Geometrical Information
title_full m6AGE: A Predictor for N6-Methyladenosine Sites Identification Utilizing Sequence Characteristics and Graph Embedding-Based Geometrical Information
title_fullStr m6AGE: A Predictor for N6-Methyladenosine Sites Identification Utilizing Sequence Characteristics and Graph Embedding-Based Geometrical Information
title_full_unstemmed m6AGE: A Predictor for N6-Methyladenosine Sites Identification Utilizing Sequence Characteristics and Graph Embedding-Based Geometrical Information
title_short m6AGE: A Predictor for N6-Methyladenosine Sites Identification Utilizing Sequence Characteristics and Graph Embedding-Based Geometrical Information
title_sort m6age: a predictor for n6-methyladenosine sites identification utilizing sequence characteristics and graph embedding-based geometrical information
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8191635/
https://www.ncbi.nlm.nih.gov/pubmed/34122525
http://dx.doi.org/10.3389/fgene.2021.670852
work_keys_str_mv AT wangyan m6ageapredictorforn6methyladenosinesitesidentificationutilizingsequencecharacteristicsandgraphembeddingbasedgeometricalinformation
AT guorui m6ageapredictorforn6methyladenosinesitesidentificationutilizingsequencecharacteristicsandgraphembeddingbasedgeometricalinformation
AT huanglan m6ageapredictorforn6methyladenosinesitesidentificationutilizingsequencecharacteristicsandgraphembeddingbasedgeometricalinformation
AT yangsen m6ageapredictorforn6methyladenosinesitesidentificationutilizingsequencecharacteristicsandgraphembeddingbasedgeometricalinformation
AT huxuemei m6ageapredictorforn6methyladenosinesitesidentificationutilizingsequencecharacteristicsandgraphembeddingbasedgeometricalinformation
AT hekai m6ageapredictorforn6methyladenosinesitesidentificationutilizingsequencecharacteristicsandgraphembeddingbasedgeometricalinformation