Cargando…

ATTIC is an integrated approach for predicting A-to-I RNA editing sites in three species

A-to-I editing is the most prevalent RNA editing event, which refers to the change of adenosine (A) bases to inosine (I) bases in double-stranded RNAs. Several studies have revealed that A-to-I editing can regulate cellular processes and is associated with various human diseases. Therefore, accurate...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Ruyi, Li, Fuyi, Guo, Xudong, Bi, Yue, Li, Chen, Pan, Shirui, Coin, Lachlan J M, Song, Jiangning
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10565902/
https://www.ncbi.nlm.nih.gov/pubmed/37150785
http://dx.doi.org/10.1093/bib/bbad170
_version_ 1785118796145491968
author Chen, Ruyi
Li, Fuyi
Guo, Xudong
Bi, Yue
Li, Chen
Pan, Shirui
Coin, Lachlan J M
Song, Jiangning
author_facet Chen, Ruyi
Li, Fuyi
Guo, Xudong
Bi, Yue
Li, Chen
Pan, Shirui
Coin, Lachlan J M
Song, Jiangning
author_sort Chen, Ruyi
collection PubMed
description A-to-I editing is the most prevalent RNA editing event, which refers to the change of adenosine (A) bases to inosine (I) bases in double-stranded RNAs. Several studies have revealed that A-to-I editing can regulate cellular processes and is associated with various human diseases. Therefore, accurate identification of A-to-I editing sites is crucial for understanding RNA-level (i.e. transcriptional) modifications and their potential roles in molecular functions. To date, various computational approaches for A-to-I editing site identification have been developed; however, their performance is still unsatisfactory and needs further improvement. In this study, we developed a novel stacked-ensemble learning model, ATTIC (A-To-I ediTing predICtor), to accurately identify A-to-I editing sites across three species, including Homo sapiens, Mus musculus and Drosophila melanogaster. We first comprehensively evaluated 37 RNA sequence-derived features combined with 14 popular machine learning algorithms. Then, we selected the optimal base models to build a series of stacked ensemble models. The final ATTIC framework was developed based on the optimal models improved by the feature selection strategy for specific species. Extensive cross-validation and independent tests illustrate that ATTIC outperforms state-of-the-art tools for predicting A-to-I editing sites. We also developed a web server for ATTIC, which is publicly available at http://web.unimelb-bioinfortools.cloud.edu.au/ATTIC/. We anticipate that ATTIC can be utilized as a useful tool to accelerate the identification of A-to-I RNA editing events and help characterize their roles in post-transcriptional regulation.
format Online
Article
Text
id pubmed-10565902
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-105659022023-10-12 ATTIC is an integrated approach for predicting A-to-I RNA editing sites in three species Chen, Ruyi Li, Fuyi Guo, Xudong Bi, Yue Li, Chen Pan, Shirui Coin, Lachlan J M Song, Jiangning Brief Bioinform Problem Solving Protocol A-to-I editing is the most prevalent RNA editing event, which refers to the change of adenosine (A) bases to inosine (I) bases in double-stranded RNAs. Several studies have revealed that A-to-I editing can regulate cellular processes and is associated with various human diseases. Therefore, accurate identification of A-to-I editing sites is crucial for understanding RNA-level (i.e. transcriptional) modifications and their potential roles in molecular functions. To date, various computational approaches for A-to-I editing site identification have been developed; however, their performance is still unsatisfactory and needs further improvement. In this study, we developed a novel stacked-ensemble learning model, ATTIC (A-To-I ediTing predICtor), to accurately identify A-to-I editing sites across three species, including Homo sapiens, Mus musculus and Drosophila melanogaster. We first comprehensively evaluated 37 RNA sequence-derived features combined with 14 popular machine learning algorithms. Then, we selected the optimal base models to build a series of stacked ensemble models. The final ATTIC framework was developed based on the optimal models improved by the feature selection strategy for specific species. Extensive cross-validation and independent tests illustrate that ATTIC outperforms state-of-the-art tools for predicting A-to-I editing sites. We also developed a web server for ATTIC, which is publicly available at http://web.unimelb-bioinfortools.cloud.edu.au/ATTIC/. We anticipate that ATTIC can be utilized as a useful tool to accelerate the identification of A-to-I RNA editing events and help characterize their roles in post-transcriptional regulation. Oxford University Press 2023-05-06 /pmc/articles/PMC10565902/ /pubmed/37150785 http://dx.doi.org/10.1093/bib/bbad170 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Problem Solving Protocol
Chen, Ruyi
Li, Fuyi
Guo, Xudong
Bi, Yue
Li, Chen
Pan, Shirui
Coin, Lachlan J M
Song, Jiangning
ATTIC is an integrated approach for predicting A-to-I RNA editing sites in three species
title ATTIC is an integrated approach for predicting A-to-I RNA editing sites in three species
title_full ATTIC is an integrated approach for predicting A-to-I RNA editing sites in three species
title_fullStr ATTIC is an integrated approach for predicting A-to-I RNA editing sites in three species
title_full_unstemmed ATTIC is an integrated approach for predicting A-to-I RNA editing sites in three species
title_short ATTIC is an integrated approach for predicting A-to-I RNA editing sites in three species
title_sort attic is an integrated approach for predicting a-to-i rna editing sites in three species
topic Problem Solving Protocol
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10565902/
https://www.ncbi.nlm.nih.gov/pubmed/37150785
http://dx.doi.org/10.1093/bib/bbad170
work_keys_str_mv AT chenruyi atticisanintegratedapproachforpredictingatoirnaeditingsitesinthreespecies
AT lifuyi atticisanintegratedapproachforpredictingatoirnaeditingsitesinthreespecies
AT guoxudong atticisanintegratedapproachforpredictingatoirnaeditingsitesinthreespecies
AT biyue atticisanintegratedapproachforpredictingatoirnaeditingsitesinthreespecies
AT lichen atticisanintegratedapproachforpredictingatoirnaeditingsitesinthreespecies
AT panshirui atticisanintegratedapproachforpredictingatoirnaeditingsitesinthreespecies
AT coinlachlanjm atticisanintegratedapproachforpredictingatoirnaeditingsitesinthreespecies
AT songjiangning atticisanintegratedapproachforpredictingatoirnaeditingsitesinthreespecies