Cargando…

Predicting RNA 5-Methylcytosine Sites by Using Essential Sequence Features and Distributions

Methylation is one of the most common and considerable modifications in biological systems mediated by multiple enzymes. Recent studies have shown that methylation has been widely identified in different RNA molecules. RNA methylation modifications have various kinds, such as 5-methylcytosine (m(5)C...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Lei, Li, ZhanDong, Zhang, ShiQi, Zhang, Yu-Hang, Huang, Tao, Cai, Yu-Dong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8776474/
https://www.ncbi.nlm.nih.gov/pubmed/35071593
http://dx.doi.org/10.1155/2022/4035462
_version_ 1784636844881739776
author Chen, Lei
Li, ZhanDong
Zhang, ShiQi
Zhang, Yu-Hang
Huang, Tao
Cai, Yu-Dong
author_facet Chen, Lei
Li, ZhanDong
Zhang, ShiQi
Zhang, Yu-Hang
Huang, Tao
Cai, Yu-Dong
author_sort Chen, Lei
collection PubMed
description Methylation is one of the most common and considerable modifications in biological systems mediated by multiple enzymes. Recent studies have shown that methylation has been widely identified in different RNA molecules. RNA methylation modifications have various kinds, such as 5-methylcytosine (m(5)C). However, for individual methylation sites, their functions still remain to be elucidated. Testing of all methylation sites relies heavily on high-throughput sequencing technology, which is expensive and labor consuming. Thus, computational prediction approaches could serve as a substitute. In this study, multiple machine learning models were used to predict possible RNA m(5)C sites on the basis of mRNA sequences in human and mouse. Each site was represented by several features derived from k-mers of an RNA subsequence containing such site as center. The powerful max-relevance and min-redundancy (mRMR) feature selection method was employed to analyse these features. The outcome feature list was fed into incremental feature selection method, incorporating four classification algorithms, to build efficient models. Furthermore, the sites related to features used in the models were also investigated.
format Online
Article
Text
id pubmed-8776474
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-87764742022-01-21 Predicting RNA 5-Methylcytosine Sites by Using Essential Sequence Features and Distributions Chen, Lei Li, ZhanDong Zhang, ShiQi Zhang, Yu-Hang Huang, Tao Cai, Yu-Dong Biomed Res Int Research Article Methylation is one of the most common and considerable modifications in biological systems mediated by multiple enzymes. Recent studies have shown that methylation has been widely identified in different RNA molecules. RNA methylation modifications have various kinds, such as 5-methylcytosine (m(5)C). However, for individual methylation sites, their functions still remain to be elucidated. Testing of all methylation sites relies heavily on high-throughput sequencing technology, which is expensive and labor consuming. Thus, computational prediction approaches could serve as a substitute. In this study, multiple machine learning models were used to predict possible RNA m(5)C sites on the basis of mRNA sequences in human and mouse. Each site was represented by several features derived from k-mers of an RNA subsequence containing such site as center. The powerful max-relevance and min-redundancy (mRMR) feature selection method was employed to analyse these features. The outcome feature list was fed into incremental feature selection method, incorporating four classification algorithms, to build efficient models. Furthermore, the sites related to features used in the models were also investigated. Hindawi 2022-01-13 /pmc/articles/PMC8776474/ /pubmed/35071593 http://dx.doi.org/10.1155/2022/4035462 Text en Copyright © 2022 Lei Chen et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Chen, Lei
Li, ZhanDong
Zhang, ShiQi
Zhang, Yu-Hang
Huang, Tao
Cai, Yu-Dong
Predicting RNA 5-Methylcytosine Sites by Using Essential Sequence Features and Distributions
title Predicting RNA 5-Methylcytosine Sites by Using Essential Sequence Features and Distributions
title_full Predicting RNA 5-Methylcytosine Sites by Using Essential Sequence Features and Distributions
title_fullStr Predicting RNA 5-Methylcytosine Sites by Using Essential Sequence Features and Distributions
title_full_unstemmed Predicting RNA 5-Methylcytosine Sites by Using Essential Sequence Features and Distributions
title_short Predicting RNA 5-Methylcytosine Sites by Using Essential Sequence Features and Distributions
title_sort predicting rna 5-methylcytosine sites by using essential sequence features and distributions
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8776474/
https://www.ncbi.nlm.nih.gov/pubmed/35071593
http://dx.doi.org/10.1155/2022/4035462
work_keys_str_mv AT chenlei predictingrna5methylcytosinesitesbyusingessentialsequencefeaturesanddistributions
AT lizhandong predictingrna5methylcytosinesitesbyusingessentialsequencefeaturesanddistributions
AT zhangshiqi predictingrna5methylcytosinesitesbyusingessentialsequencefeaturesanddistributions
AT zhangyuhang predictingrna5methylcytosinesitesbyusingessentialsequencefeaturesanddistributions
AT huangtao predictingrna5methylcytosinesitesbyusingessentialsequencefeaturesanddistributions
AT caiyudong predictingrna5methylcytosinesitesbyusingessentialsequencefeaturesanddistributions