Cargando…
Predicting RNA 5-Methylcytosine Sites by Using Essential Sequence Features and Distributions
Methylation is one of the most common and considerable modifications in biological systems mediated by multiple enzymes. Recent studies have shown that methylation has been widely identified in different RNA molecules. RNA methylation modifications have various kinds, such as 5-methylcytosine (m(5)C...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8776474/ https://www.ncbi.nlm.nih.gov/pubmed/35071593 http://dx.doi.org/10.1155/2022/4035462 |
_version_ | 1784636844881739776 |
---|---|
author | Chen, Lei Li, ZhanDong Zhang, ShiQi Zhang, Yu-Hang Huang, Tao Cai, Yu-Dong |
author_facet | Chen, Lei Li, ZhanDong Zhang, ShiQi Zhang, Yu-Hang Huang, Tao Cai, Yu-Dong |
author_sort | Chen, Lei |
collection | PubMed |
description | Methylation is one of the most common and considerable modifications in biological systems mediated by multiple enzymes. Recent studies have shown that methylation has been widely identified in different RNA molecules. RNA methylation modifications have various kinds, such as 5-methylcytosine (m(5)C). However, for individual methylation sites, their functions still remain to be elucidated. Testing of all methylation sites relies heavily on high-throughput sequencing technology, which is expensive and labor consuming. Thus, computational prediction approaches could serve as a substitute. In this study, multiple machine learning models were used to predict possible RNA m(5)C sites on the basis of mRNA sequences in human and mouse. Each site was represented by several features derived from k-mers of an RNA subsequence containing such site as center. The powerful max-relevance and min-redundancy (mRMR) feature selection method was employed to analyse these features. The outcome feature list was fed into incremental feature selection method, incorporating four classification algorithms, to build efficient models. Furthermore, the sites related to features used in the models were also investigated. |
format | Online Article Text |
id | pubmed-8776474 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Hindawi |
record_format | MEDLINE/PubMed |
spelling | pubmed-87764742022-01-21 Predicting RNA 5-Methylcytosine Sites by Using Essential Sequence Features and Distributions Chen, Lei Li, ZhanDong Zhang, ShiQi Zhang, Yu-Hang Huang, Tao Cai, Yu-Dong Biomed Res Int Research Article Methylation is one of the most common and considerable modifications in biological systems mediated by multiple enzymes. Recent studies have shown that methylation has been widely identified in different RNA molecules. RNA methylation modifications have various kinds, such as 5-methylcytosine (m(5)C). However, for individual methylation sites, their functions still remain to be elucidated. Testing of all methylation sites relies heavily on high-throughput sequencing technology, which is expensive and labor consuming. Thus, computational prediction approaches could serve as a substitute. In this study, multiple machine learning models were used to predict possible RNA m(5)C sites on the basis of mRNA sequences in human and mouse. Each site was represented by several features derived from k-mers of an RNA subsequence containing such site as center. The powerful max-relevance and min-redundancy (mRMR) feature selection method was employed to analyse these features. The outcome feature list was fed into incremental feature selection method, incorporating four classification algorithms, to build efficient models. Furthermore, the sites related to features used in the models were also investigated. Hindawi 2022-01-13 /pmc/articles/PMC8776474/ /pubmed/35071593 http://dx.doi.org/10.1155/2022/4035462 Text en Copyright © 2022 Lei Chen et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Chen, Lei Li, ZhanDong Zhang, ShiQi Zhang, Yu-Hang Huang, Tao Cai, Yu-Dong Predicting RNA 5-Methylcytosine Sites by Using Essential Sequence Features and Distributions |
title | Predicting RNA 5-Methylcytosine Sites by Using Essential Sequence Features and Distributions |
title_full | Predicting RNA 5-Methylcytosine Sites by Using Essential Sequence Features and Distributions |
title_fullStr | Predicting RNA 5-Methylcytosine Sites by Using Essential Sequence Features and Distributions |
title_full_unstemmed | Predicting RNA 5-Methylcytosine Sites by Using Essential Sequence Features and Distributions |
title_short | Predicting RNA 5-Methylcytosine Sites by Using Essential Sequence Features and Distributions |
title_sort | predicting rna 5-methylcytosine sites by using essential sequence features and distributions |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8776474/ https://www.ncbi.nlm.nih.gov/pubmed/35071593 http://dx.doi.org/10.1155/2022/4035462 |
work_keys_str_mv | AT chenlei predictingrna5methylcytosinesitesbyusingessentialsequencefeaturesanddistributions AT lizhandong predictingrna5methylcytosinesitesbyusingessentialsequencefeaturesanddistributions AT zhangshiqi predictingrna5methylcytosinesitesbyusingessentialsequencefeaturesanddistributions AT zhangyuhang predictingrna5methylcytosinesitesbyusingessentialsequencefeaturesanddistributions AT huangtao predictingrna5methylcytosinesitesbyusingessentialsequencefeaturesanddistributions AT caiyudong predictingrna5methylcytosinesitesbyusingessentialsequencefeaturesanddistributions |