Cargando…
m5U-SVM: identification of RNA 5-methyluridine modification sites based on multi-view features of physicochemical features and distributed representation
BACKGROUND: RNA 5-methyluridine (m5U) modifications are obtained by methylation at the C(5) position of uridine catalyzed by pyrimidine methylation transferase, which is related to the development of human diseases. Accurate identification of m5U modification sites from RNA sequences can contribute...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10127088/ https://www.ncbi.nlm.nih.gov/pubmed/37095510 http://dx.doi.org/10.1186/s12915-023-01596-0 |
_version_ | 1785030389885042688 |
---|---|
author | Ao, Chunyan Ye, Xiucai Sakurai, Tetsuya Zou, Quan Yu, Liang |
author_facet | Ao, Chunyan Ye, Xiucai Sakurai, Tetsuya Zou, Quan Yu, Liang |
author_sort | Ao, Chunyan |
collection | PubMed |
description | BACKGROUND: RNA 5-methyluridine (m5U) modifications are obtained by methylation at the C(5) position of uridine catalyzed by pyrimidine methylation transferase, which is related to the development of human diseases. Accurate identification of m5U modification sites from RNA sequences can contribute to the understanding of their biological functions and the pathogenesis of related diseases. Compared to traditional experimental methods, computational methods developed based on machine learning with ease of use can identify modification sites from RNA sequences in an efficient and time-saving manner. Despite the good performance of these computational methods, there are some drawbacks and limitations. RESULTS: In this study, we have developed a novel predictor, m5U-SVM, based on multi-view features and machine learning algorithms to construct predictive models for identifying m5U modification sites from RNA sequences. In this method, we used four traditional physicochemical features and distributed representation features. The optimized multi-view features were obtained from the four fused traditional physicochemical features by using the two-step LightGBM and IFS methods, and then the distributed representation features were fused with the optimized physicochemical features to obtain the new multi-view features. The best performing classifier, support vector machine, was identified by screening different machine learning algorithms. Compared with the results, the performance of the proposed model is better than that of the existing state-of-the-art tool. CONCLUSIONS: m5U-SVM provides an effective tool that successfully captures sequence-related attributes of modifications and can accurately predict m5U modification sites from RNA sequences. The identification of m5U modification sites helps to understand and delve into the related biological processes and functions. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12915-023-01596-0. |
format | Online Article Text |
id | pubmed-10127088 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-101270882023-04-26 m5U-SVM: identification of RNA 5-methyluridine modification sites based on multi-view features of physicochemical features and distributed representation Ao, Chunyan Ye, Xiucai Sakurai, Tetsuya Zou, Quan Yu, Liang BMC Biol Methodology Article BACKGROUND: RNA 5-methyluridine (m5U) modifications are obtained by methylation at the C(5) position of uridine catalyzed by pyrimidine methylation transferase, which is related to the development of human diseases. Accurate identification of m5U modification sites from RNA sequences can contribute to the understanding of their biological functions and the pathogenesis of related diseases. Compared to traditional experimental methods, computational methods developed based on machine learning with ease of use can identify modification sites from RNA sequences in an efficient and time-saving manner. Despite the good performance of these computational methods, there are some drawbacks and limitations. RESULTS: In this study, we have developed a novel predictor, m5U-SVM, based on multi-view features and machine learning algorithms to construct predictive models for identifying m5U modification sites from RNA sequences. In this method, we used four traditional physicochemical features and distributed representation features. The optimized multi-view features were obtained from the four fused traditional physicochemical features by using the two-step LightGBM and IFS methods, and then the distributed representation features were fused with the optimized physicochemical features to obtain the new multi-view features. The best performing classifier, support vector machine, was identified by screening different machine learning algorithms. Compared with the results, the performance of the proposed model is better than that of the existing state-of-the-art tool. CONCLUSIONS: m5U-SVM provides an effective tool that successfully captures sequence-related attributes of modifications and can accurately predict m5U modification sites from RNA sequences. The identification of m5U modification sites helps to understand and delve into the related biological processes and functions. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12915-023-01596-0. BioMed Central 2023-04-24 /pmc/articles/PMC10127088/ /pubmed/37095510 http://dx.doi.org/10.1186/s12915-023-01596-0 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Methodology Article Ao, Chunyan Ye, Xiucai Sakurai, Tetsuya Zou, Quan Yu, Liang m5U-SVM: identification of RNA 5-methyluridine modification sites based on multi-view features of physicochemical features and distributed representation |
title | m5U-SVM: identification of RNA 5-methyluridine modification sites based on multi-view features of physicochemical features and distributed representation |
title_full | m5U-SVM: identification of RNA 5-methyluridine modification sites based on multi-view features of physicochemical features and distributed representation |
title_fullStr | m5U-SVM: identification of RNA 5-methyluridine modification sites based on multi-view features of physicochemical features and distributed representation |
title_full_unstemmed | m5U-SVM: identification of RNA 5-methyluridine modification sites based on multi-view features of physicochemical features and distributed representation |
title_short | m5U-SVM: identification of RNA 5-methyluridine modification sites based on multi-view features of physicochemical features and distributed representation |
title_sort | m5u-svm: identification of rna 5-methyluridine modification sites based on multi-view features of physicochemical features and distributed representation |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10127088/ https://www.ncbi.nlm.nih.gov/pubmed/37095510 http://dx.doi.org/10.1186/s12915-023-01596-0 |
work_keys_str_mv | AT aochunyan m5usvmidentificationofrna5methyluridinemodificationsitesbasedonmultiviewfeaturesofphysicochemicalfeaturesanddistributedrepresentation AT yexiucai m5usvmidentificationofrna5methyluridinemodificationsitesbasedonmultiviewfeaturesofphysicochemicalfeaturesanddistributedrepresentation AT sakuraitetsuya m5usvmidentificationofrna5methyluridinemodificationsitesbasedonmultiviewfeaturesofphysicochemicalfeaturesanddistributedrepresentation AT zouquan m5usvmidentificationofrna5methyluridinemodificationsitesbasedonmultiviewfeaturesofphysicochemicalfeaturesanddistributedrepresentation AT yuliang m5usvmidentificationofrna5methyluridinemodificationsitesbasedonmultiviewfeaturesofphysicochemicalfeaturesanddistributedrepresentation |