Cargando…

HEPeak: an HMM-based exome peak-finding package for RNA epigenome sequencing data

BACKGROUND: Methylated RNA Immunoprecipatation combined with RNA sequencing (MeRIP-seq) is revolutionizing the de novo study of RNA epigenomics at a higher resolution. However, this new technology poses unique bioinformatics problems that call for novel and sophisticated statistical computational so...

Descripción completa

Detalles Bibliográficos
Autores principales: Cui, Xiaodong, Meng, Jia, Rao, Manjeet K, Chen, Yidong, Huang, Yufei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4416174/
https://www.ncbi.nlm.nih.gov/pubmed/25917296
http://dx.doi.org/10.1186/1471-2164-16-S4-S2
_version_ 1782369190570622976
author Cui, Xiaodong
Meng, Jia
Rao, Manjeet K
Chen, Yidong
Huang, Yufei
author_facet Cui, Xiaodong
Meng, Jia
Rao, Manjeet K
Chen, Yidong
Huang, Yufei
author_sort Cui, Xiaodong
collection PubMed
description BACKGROUND: Methylated RNA Immunoprecipatation combined with RNA sequencing (MeRIP-seq) is revolutionizing the de novo study of RNA epigenomics at a higher resolution. However, this new technology poses unique bioinformatics problems that call for novel and sophisticated statistical computational solutions, aiming at identifying and characterizing transcriptome-wide methyltranscriptome. RESULTS: We developed HEP, a Hidden Markov Model (HMM)-based Exome Peak-finding algorithm for predicting transcriptome methylation sites using MeRIP-seq data. In contrast to exomePeak, our previously developed MeRIP-seq peak calling algorithm, HEPeak models the correlation between continuous bins in an m(6)A peak region and it is a model-based approach, which admits rigorous statistical inference. HEPeak was evaluated on a simulated MeRIP-seq dataset and achieved higher sensitivity and specificity than exomePeak. HEPeak was also applied to real MeRIP-seq datasets from human HEK293T cell line and mouse midbrain cells and was shown to be able to recapitulate known m(6)A distribution in transcripts and identify novel m(6)A sites in long non-coding RNAs. CONCLUSIONS: In this paper, a novel HMM-based peak calling algorithm, HEPeak, was developed for peak calling for MeRIP-seq data. HEPeak is written in R and is publicly available.
format Online
Article
Text
id pubmed-4416174
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-44161742015-05-07 HEPeak: an HMM-based exome peak-finding package for RNA epigenome sequencing data Cui, Xiaodong Meng, Jia Rao, Manjeet K Chen, Yidong Huang, Yufei BMC Genomics Research BACKGROUND: Methylated RNA Immunoprecipatation combined with RNA sequencing (MeRIP-seq) is revolutionizing the de novo study of RNA epigenomics at a higher resolution. However, this new technology poses unique bioinformatics problems that call for novel and sophisticated statistical computational solutions, aiming at identifying and characterizing transcriptome-wide methyltranscriptome. RESULTS: We developed HEP, a Hidden Markov Model (HMM)-based Exome Peak-finding algorithm for predicting transcriptome methylation sites using MeRIP-seq data. In contrast to exomePeak, our previously developed MeRIP-seq peak calling algorithm, HEPeak models the correlation between continuous bins in an m(6)A peak region and it is a model-based approach, which admits rigorous statistical inference. HEPeak was evaluated on a simulated MeRIP-seq dataset and achieved higher sensitivity and specificity than exomePeak. HEPeak was also applied to real MeRIP-seq datasets from human HEK293T cell line and mouse midbrain cells and was shown to be able to recapitulate known m(6)A distribution in transcripts and identify novel m(6)A sites in long non-coding RNAs. CONCLUSIONS: In this paper, a novel HMM-based peak calling algorithm, HEPeak, was developed for peak calling for MeRIP-seq data. HEPeak is written in R and is publicly available. BioMed Central 2015-04-21 /pmc/articles/PMC4416174/ /pubmed/25917296 http://dx.doi.org/10.1186/1471-2164-16-S4-S2 Text en Copyright © 2015 Cui et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Cui, Xiaodong
Meng, Jia
Rao, Manjeet K
Chen, Yidong
Huang, Yufei
HEPeak: an HMM-based exome peak-finding package for RNA epigenome sequencing data
title HEPeak: an HMM-based exome peak-finding package for RNA epigenome sequencing data
title_full HEPeak: an HMM-based exome peak-finding package for RNA epigenome sequencing data
title_fullStr HEPeak: an HMM-based exome peak-finding package for RNA epigenome sequencing data
title_full_unstemmed HEPeak: an HMM-based exome peak-finding package for RNA epigenome sequencing data
title_short HEPeak: an HMM-based exome peak-finding package for RNA epigenome sequencing data
title_sort hepeak: an hmm-based exome peak-finding package for rna epigenome sequencing data
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4416174/
https://www.ncbi.nlm.nih.gov/pubmed/25917296
http://dx.doi.org/10.1186/1471-2164-16-S4-S2
work_keys_str_mv AT cuixiaodong hepeakanhmmbasedexomepeakfindingpackageforrnaepigenomesequencingdata
AT mengjia hepeakanhmmbasedexomepeakfindingpackageforrnaepigenomesequencingdata
AT raomanjeetk hepeakanhmmbasedexomepeakfindingpackageforrnaepigenomesequencingdata
AT chenyidong hepeakanhmmbasedexomepeakfindingpackageforrnaepigenomesequencingdata
AT huangyufei hepeakanhmmbasedexomepeakfindingpackageforrnaepigenomesequencingdata