Cargando…
WHISTLE: a high-accuracy map of the human N(6)-methyladenosine (m(6)A) epitranscriptome predicted using a machine learning approach
N (6)-methyladenosine (m(6)A) is the most prevalent post-transcriptional modification in eukaryotes, and plays a pivotal role in various biological processes, such as splicing, RNA degradation and RNA–protein interaction. We report here a prediction framework WHISTLE for transcriptome-wide m(6)A RNA...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6468314/ https://www.ncbi.nlm.nih.gov/pubmed/30993345 http://dx.doi.org/10.1093/nar/gkz074 |
_version_ | 1783411407068332032 |
---|---|
author | Chen, Kunqi Wei, Zhen Zhang, Qing Wu, Xiangyu Rong, Rong Lu, Zhiliang Su, Jionglong de Magalhães, João Pedro Rigden, Daniel J Meng, Jia |
author_facet | Chen, Kunqi Wei, Zhen Zhang, Qing Wu, Xiangyu Rong, Rong Lu, Zhiliang Su, Jionglong de Magalhães, João Pedro Rigden, Daniel J Meng, Jia |
author_sort | Chen, Kunqi |
collection | PubMed |
description | N (6)-methyladenosine (m(6)A) is the most prevalent post-transcriptional modification in eukaryotes, and plays a pivotal role in various biological processes, such as splicing, RNA degradation and RNA–protein interaction. We report here a prediction framework WHISTLE for transcriptome-wide m(6)A RNA-methylation site prediction. When tested on six independent datasets, our approach, which integrated 35 additional genomic features besides the conventional sequence features, achieved a major improvement in the accuracy of m(6)A site prediction (average AUC: 0.948 and 0.880 under the full transcript or mature messenger RNA models, respectively) compared to the state-of-the-art computational approaches MethyRNA (AUC: 0.790 and 0.732) and SRAMP (AUC: 0.761 and 0.706). It also out-performed the existing epitranscriptome databases MeT-DB (AUC: 0.798 and 0.744) and RMBase (AUC: 0.786 and 0.736), which were built upon hundreds of epitranscriptome high-throughput sequencing samples. To probe the putative biological processes impacted by changes in an individual m(6)A site, a network-based approach was implemented according to the ‘guilt-by-association’ principle by integrating RNA methylation profiles, gene expression profiles and protein–protein interaction data. Finally, the WHISTLE web server was built to facilitate the query of our high-accuracy map of the human m(6)A epitranscriptome, and the server is freely available at: www.xjtlu.edu.cn/biologicalsciences/whistle and http://whistle-epitranscriptome.com. |
format | Online Article Text |
id | pubmed-6468314 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-64683142019-04-22 WHISTLE: a high-accuracy map of the human N(6)-methyladenosine (m(6)A) epitranscriptome predicted using a machine learning approach Chen, Kunqi Wei, Zhen Zhang, Qing Wu, Xiangyu Rong, Rong Lu, Zhiliang Su, Jionglong de Magalhães, João Pedro Rigden, Daniel J Meng, Jia Nucleic Acids Res Methods Online N (6)-methyladenosine (m(6)A) is the most prevalent post-transcriptional modification in eukaryotes, and plays a pivotal role in various biological processes, such as splicing, RNA degradation and RNA–protein interaction. We report here a prediction framework WHISTLE for transcriptome-wide m(6)A RNA-methylation site prediction. When tested on six independent datasets, our approach, which integrated 35 additional genomic features besides the conventional sequence features, achieved a major improvement in the accuracy of m(6)A site prediction (average AUC: 0.948 and 0.880 under the full transcript or mature messenger RNA models, respectively) compared to the state-of-the-art computational approaches MethyRNA (AUC: 0.790 and 0.732) and SRAMP (AUC: 0.761 and 0.706). It also out-performed the existing epitranscriptome databases MeT-DB (AUC: 0.798 and 0.744) and RMBase (AUC: 0.786 and 0.736), which were built upon hundreds of epitranscriptome high-throughput sequencing samples. To probe the putative biological processes impacted by changes in an individual m(6)A site, a network-based approach was implemented according to the ‘guilt-by-association’ principle by integrating RNA methylation profiles, gene expression profiles and protein–protein interaction data. Finally, the WHISTLE web server was built to facilitate the query of our high-accuracy map of the human m(6)A epitranscriptome, and the server is freely available at: www.xjtlu.edu.cn/biologicalsciences/whistle and http://whistle-epitranscriptome.com. Oxford University Press 2019-04-23 2019-02-14 /pmc/articles/PMC6468314/ /pubmed/30993345 http://dx.doi.org/10.1093/nar/gkz074 Text en © The Author(s) 2019. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Methods Online Chen, Kunqi Wei, Zhen Zhang, Qing Wu, Xiangyu Rong, Rong Lu, Zhiliang Su, Jionglong de Magalhães, João Pedro Rigden, Daniel J Meng, Jia WHISTLE: a high-accuracy map of the human N(6)-methyladenosine (m(6)A) epitranscriptome predicted using a machine learning approach |
title | WHISTLE: a high-accuracy map of the human N(6)-methyladenosine (m(6)A) epitranscriptome predicted using a machine learning approach |
title_full | WHISTLE: a high-accuracy map of the human N(6)-methyladenosine (m(6)A) epitranscriptome predicted using a machine learning approach |
title_fullStr | WHISTLE: a high-accuracy map of the human N(6)-methyladenosine (m(6)A) epitranscriptome predicted using a machine learning approach |
title_full_unstemmed | WHISTLE: a high-accuracy map of the human N(6)-methyladenosine (m(6)A) epitranscriptome predicted using a machine learning approach |
title_short | WHISTLE: a high-accuracy map of the human N(6)-methyladenosine (m(6)A) epitranscriptome predicted using a machine learning approach |
title_sort | whistle: a high-accuracy map of the human n(6)-methyladenosine (m(6)a) epitranscriptome predicted using a machine learning approach |
topic | Methods Online |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6468314/ https://www.ncbi.nlm.nih.gov/pubmed/30993345 http://dx.doi.org/10.1093/nar/gkz074 |
work_keys_str_mv | AT chenkunqi whistleahighaccuracymapofthehumann6methyladenosinem6aepitranscriptomepredictedusingamachinelearningapproach AT weizhen whistleahighaccuracymapofthehumann6methyladenosinem6aepitranscriptomepredictedusingamachinelearningapproach AT zhangqing whistleahighaccuracymapofthehumann6methyladenosinem6aepitranscriptomepredictedusingamachinelearningapproach AT wuxiangyu whistleahighaccuracymapofthehumann6methyladenosinem6aepitranscriptomepredictedusingamachinelearningapproach AT rongrong whistleahighaccuracymapofthehumann6methyladenosinem6aepitranscriptomepredictedusingamachinelearningapproach AT luzhiliang whistleahighaccuracymapofthehumann6methyladenosinem6aepitranscriptomepredictedusingamachinelearningapproach AT sujionglong whistleahighaccuracymapofthehumann6methyladenosinem6aepitranscriptomepredictedusingamachinelearningapproach AT demagalhaesjoaopedro whistleahighaccuracymapofthehumann6methyladenosinem6aepitranscriptomepredictedusingamachinelearningapproach AT rigdendanielj whistleahighaccuracymapofthehumann6methyladenosinem6aepitranscriptomepredictedusingamachinelearningapproach AT mengjia whistleahighaccuracymapofthehumann6methyladenosinem6aepitranscriptomepredictedusingamachinelearningapproach |