Cargando…

m5U-GEPred: prediction of RNA 5-methyluridine sites based on sequence-derived and graph embedding features

5-Methyluridine (m(5)U) is one of the most common post-transcriptional RNA modifications, which is involved in a variety of important biological processes and disease development. The precise identification of the m(5)U sites allows for a better understanding of the biological processes of RNA and c...

Descripción completa

Detalles Bibliográficos
Autores principales: Xu, Zhongxing, Wang, Xuan, Meng, Jia, Zhang, Lin, Song, Bowen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10627201/
https://www.ncbi.nlm.nih.gov/pubmed/37937221
http://dx.doi.org/10.3389/fmicb.2023.1277099
_version_ 1785131489852129280
author Xu, Zhongxing
Wang, Xuan
Meng, Jia
Zhang, Lin
Song, Bowen
author_facet Xu, Zhongxing
Wang, Xuan
Meng, Jia
Zhang, Lin
Song, Bowen
author_sort Xu, Zhongxing
collection PubMed
description 5-Methyluridine (m(5)U) is one of the most common post-transcriptional RNA modifications, which is involved in a variety of important biological processes and disease development. The precise identification of the m(5)U sites allows for a better understanding of the biological processes of RNA and contributes to the discovery of new RNA functional and therapeutic targets. Here, we present m5U-GEPred, a prediction framework, to combine sequence characteristics and graph embedding-based information for m(5)U identification. The graph embedding approach was introduced to extract the global information of training data that complemented the local information represented by conventional sequence features, thereby enhancing the prediction performance of m(5)U identification. m5U-GEPred outperformed the state-of-the-art m(5)U predictors built on two independent species, with an average AUROC of 0.984 and 0.985 tested on human and yeast transcriptomes, respectively. To further validate the performance of our newly proposed framework, the experimentally validated m(5)U sites identified from Oxford Nanopore Technology (ONT) were collected as independent testing data, and in this project, m5U-GEPred achieved reasonable prediction performance with ACC of 91.84%. We hope that m5U-GEPred should make a useful computational alternative for m(5)U identification.
format Online
Article
Text
id pubmed-10627201
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-106272012023-11-07 m5U-GEPred: prediction of RNA 5-methyluridine sites based on sequence-derived and graph embedding features Xu, Zhongxing Wang, Xuan Meng, Jia Zhang, Lin Song, Bowen Front Microbiol Microbiology 5-Methyluridine (m(5)U) is one of the most common post-transcriptional RNA modifications, which is involved in a variety of important biological processes and disease development. The precise identification of the m(5)U sites allows for a better understanding of the biological processes of RNA and contributes to the discovery of new RNA functional and therapeutic targets. Here, we present m5U-GEPred, a prediction framework, to combine sequence characteristics and graph embedding-based information for m(5)U identification. The graph embedding approach was introduced to extract the global information of training data that complemented the local information represented by conventional sequence features, thereby enhancing the prediction performance of m(5)U identification. m5U-GEPred outperformed the state-of-the-art m(5)U predictors built on two independent species, with an average AUROC of 0.984 and 0.985 tested on human and yeast transcriptomes, respectively. To further validate the performance of our newly proposed framework, the experimentally validated m(5)U sites identified from Oxford Nanopore Technology (ONT) were collected as independent testing data, and in this project, m5U-GEPred achieved reasonable prediction performance with ACC of 91.84%. We hope that m5U-GEPred should make a useful computational alternative for m(5)U identification. Frontiers Media S.A. 2023-10-23 /pmc/articles/PMC10627201/ /pubmed/37937221 http://dx.doi.org/10.3389/fmicb.2023.1277099 Text en Copyright © 2023 Xu, Wang, Meng, Zhang and Song. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Microbiology
Xu, Zhongxing
Wang, Xuan
Meng, Jia
Zhang, Lin
Song, Bowen
m5U-GEPred: prediction of RNA 5-methyluridine sites based on sequence-derived and graph embedding features
title m5U-GEPred: prediction of RNA 5-methyluridine sites based on sequence-derived and graph embedding features
title_full m5U-GEPred: prediction of RNA 5-methyluridine sites based on sequence-derived and graph embedding features
title_fullStr m5U-GEPred: prediction of RNA 5-methyluridine sites based on sequence-derived and graph embedding features
title_full_unstemmed m5U-GEPred: prediction of RNA 5-methyluridine sites based on sequence-derived and graph embedding features
title_short m5U-GEPred: prediction of RNA 5-methyluridine sites based on sequence-derived and graph embedding features
title_sort m5u-gepred: prediction of rna 5-methyluridine sites based on sequence-derived and graph embedding features
topic Microbiology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10627201/
https://www.ncbi.nlm.nih.gov/pubmed/37937221
http://dx.doi.org/10.3389/fmicb.2023.1277099
work_keys_str_mv AT xuzhongxing m5ugepredpredictionofrna5methyluridinesitesbasedonsequencederivedandgraphembeddingfeatures
AT wangxuan m5ugepredpredictionofrna5methyluridinesitesbasedonsequencederivedandgraphembeddingfeatures
AT mengjia m5ugepredpredictionofrna5methyluridinesitesbasedonsequencederivedandgraphembeddingfeatures
AT zhanglin m5ugepredpredictionofrna5methyluridinesitesbasedonsequencederivedandgraphembeddingfeatures
AT songbowen m5ugepredpredictionofrna5methyluridinesitesbasedonsequencederivedandgraphembeddingfeatures