Cargando…

m5U-SVM: identification of RNA 5-methyluridine modification sites based on multi-view features of physicochemical features and distributed representation

BACKGROUND: RNA 5-methyluridine (m5U) modifications are obtained by methylation at the C(5) position of uridine catalyzed by pyrimidine methylation transferase, which is related to the development of human diseases. Accurate identification of m5U modification sites from RNA sequences can contribute...

Descripción completa

Detalles Bibliográficos
Autores principales: Ao, Chunyan, Ye, Xiucai, Sakurai, Tetsuya, Zou, Quan, Yu, Liang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10127088/
https://www.ncbi.nlm.nih.gov/pubmed/37095510
http://dx.doi.org/10.1186/s12915-023-01596-0
_version_ 1785030389885042688
author Ao, Chunyan
Ye, Xiucai
Sakurai, Tetsuya
Zou, Quan
Yu, Liang
author_facet Ao, Chunyan
Ye, Xiucai
Sakurai, Tetsuya
Zou, Quan
Yu, Liang
author_sort Ao, Chunyan
collection PubMed
description BACKGROUND: RNA 5-methyluridine (m5U) modifications are obtained by methylation at the C(5) position of uridine catalyzed by pyrimidine methylation transferase, which is related to the development of human diseases. Accurate identification of m5U modification sites from RNA sequences can contribute to the understanding of their biological functions and the pathogenesis of related diseases. Compared to traditional experimental methods, computational methods developed based on machine learning with ease of use can identify modification sites from RNA sequences in an efficient and time-saving manner. Despite the good performance of these computational methods, there are some drawbacks and limitations. RESULTS: In this study, we have developed a novel predictor, m5U-SVM, based on multi-view features and machine learning algorithms to construct predictive models for identifying m5U modification sites from RNA sequences. In this method, we used four traditional physicochemical features and distributed representation features. The optimized multi-view features were obtained from the four fused traditional physicochemical features by using the two-step LightGBM and IFS methods, and then the distributed representation features were fused with the optimized physicochemical features to obtain the new multi-view features. The best performing classifier, support vector machine, was identified by screening different machine learning algorithms. Compared with the results, the performance of the proposed model is better than that of the existing state-of-the-art tool. CONCLUSIONS: m5U-SVM provides an effective tool that successfully captures sequence-related attributes of modifications and can accurately predict m5U modification sites from RNA sequences. The identification of m5U modification sites helps to understand and delve into the related biological processes and functions. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12915-023-01596-0.
format Online
Article
Text
id pubmed-10127088
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-101270882023-04-26 m5U-SVM: identification of RNA 5-methyluridine modification sites based on multi-view features of physicochemical features and distributed representation Ao, Chunyan Ye, Xiucai Sakurai, Tetsuya Zou, Quan Yu, Liang BMC Biol Methodology Article BACKGROUND: RNA 5-methyluridine (m5U) modifications are obtained by methylation at the C(5) position of uridine catalyzed by pyrimidine methylation transferase, which is related to the development of human diseases. Accurate identification of m5U modification sites from RNA sequences can contribute to the understanding of their biological functions and the pathogenesis of related diseases. Compared to traditional experimental methods, computational methods developed based on machine learning with ease of use can identify modification sites from RNA sequences in an efficient and time-saving manner. Despite the good performance of these computational methods, there are some drawbacks and limitations. RESULTS: In this study, we have developed a novel predictor, m5U-SVM, based on multi-view features and machine learning algorithms to construct predictive models for identifying m5U modification sites from RNA sequences. In this method, we used four traditional physicochemical features and distributed representation features. The optimized multi-view features were obtained from the four fused traditional physicochemical features by using the two-step LightGBM and IFS methods, and then the distributed representation features were fused with the optimized physicochemical features to obtain the new multi-view features. The best performing classifier, support vector machine, was identified by screening different machine learning algorithms. Compared with the results, the performance of the proposed model is better than that of the existing state-of-the-art tool. CONCLUSIONS: m5U-SVM provides an effective tool that successfully captures sequence-related attributes of modifications and can accurately predict m5U modification sites from RNA sequences. The identification of m5U modification sites helps to understand and delve into the related biological processes and functions. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12915-023-01596-0. BioMed Central 2023-04-24 /pmc/articles/PMC10127088/ /pubmed/37095510 http://dx.doi.org/10.1186/s12915-023-01596-0 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Methodology Article
Ao, Chunyan
Ye, Xiucai
Sakurai, Tetsuya
Zou, Quan
Yu, Liang
m5U-SVM: identification of RNA 5-methyluridine modification sites based on multi-view features of physicochemical features and distributed representation
title m5U-SVM: identification of RNA 5-methyluridine modification sites based on multi-view features of physicochemical features and distributed representation
title_full m5U-SVM: identification of RNA 5-methyluridine modification sites based on multi-view features of physicochemical features and distributed representation
title_fullStr m5U-SVM: identification of RNA 5-methyluridine modification sites based on multi-view features of physicochemical features and distributed representation
title_full_unstemmed m5U-SVM: identification of RNA 5-methyluridine modification sites based on multi-view features of physicochemical features and distributed representation
title_short m5U-SVM: identification of RNA 5-methyluridine modification sites based on multi-view features of physicochemical features and distributed representation
title_sort m5u-svm: identification of rna 5-methyluridine modification sites based on multi-view features of physicochemical features and distributed representation
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10127088/
https://www.ncbi.nlm.nih.gov/pubmed/37095510
http://dx.doi.org/10.1186/s12915-023-01596-0
work_keys_str_mv AT aochunyan m5usvmidentificationofrna5methyluridinemodificationsitesbasedonmultiviewfeaturesofphysicochemicalfeaturesanddistributedrepresentation
AT yexiucai m5usvmidentificationofrna5methyluridinemodificationsitesbasedonmultiviewfeaturesofphysicochemicalfeaturesanddistributedrepresentation
AT sakuraitetsuya m5usvmidentificationofrna5methyluridinemodificationsitesbasedonmultiviewfeaturesofphysicochemicalfeaturesanddistributedrepresentation
AT zouquan m5usvmidentificationofrna5methyluridinemodificationsitesbasedonmultiviewfeaturesofphysicochemicalfeaturesanddistributedrepresentation
AT yuliang m5usvmidentificationofrna5methyluridinemodificationsitesbasedonmultiviewfeaturesofphysicochemicalfeaturesanddistributedrepresentation