Cargando…

QNB: differential RNA methylation analysis for count-based small-sample sequencing data with a quad-negative binomial model

BACKGROUND: As a newly emerged research area, RNA epigenetics has drawn increasing attention recently for the participation of RNA methylation and other modifications in a number of crucial biological processes. Thanks to high throughput sequencing techniques, such as, MeRIP-Seq, transcriptome-wide...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Lian, Zhang, Shao-Wu, Huang, Yufei, Meng, Jia
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5667504/
https://www.ncbi.nlm.nih.gov/pubmed/28859631
http://dx.doi.org/10.1186/s12859-017-1808-4
_version_ 1783275501149749248
author Liu, Lian
Zhang, Shao-Wu
Huang, Yufei
Meng, Jia
author_facet Liu, Lian
Zhang, Shao-Wu
Huang, Yufei
Meng, Jia
author_sort Liu, Lian
collection PubMed
description BACKGROUND: As a newly emerged research area, RNA epigenetics has drawn increasing attention recently for the participation of RNA methylation and other modifications in a number of crucial biological processes. Thanks to high throughput sequencing techniques, such as, MeRIP-Seq, transcriptome-wide RNA methylation profile is now available in the form of count-based data, with which it is often of interests to study the dynamics at epitranscriptomic layer. However, the sample size of RNA methylation experiment is usually very small due to its costs; and additionally, there usually exist a large number of genes whose methylation level cannot be accurately estimated due to their low expression level, making differential RNA methylation analysis a difficult task. RESULTS: We present QNB, a statistical approach for differential RNA methylation analysis with count-based small-sample sequencing data. Compared with previous approaches such as DRME model based on a statistical test covering the IP samples only with 2 negative binomial distributions, QNB is based on 4 independent negative binomial distributions with their variances and means linked by local regressions, and in the way, the input control samples are also properly taken care of. In addition, different from DRME approach, which relies only the input control sample only for estimating the background, QNB uses a more robust estimator for gene expression by combining information from both input and IP samples, which could largely improve the testing performance for very lowly expressed genes. CONCLUSION: QNB showed improved performance on both simulated and real MeRIP-Seq datasets when compared with competing algorithms. And the QNB model is also applicable to other datasets related RNA modifications, including but not limited to RNA bisulfite sequencing, m(1)A-Seq, Par-CLIP, RIP-Seq, etc. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-017-1808-4) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5667504
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-56675042017-11-08 QNB: differential RNA methylation analysis for count-based small-sample sequencing data with a quad-negative binomial model Liu, Lian Zhang, Shao-Wu Huang, Yufei Meng, Jia BMC Bioinformatics Methodology Article BACKGROUND: As a newly emerged research area, RNA epigenetics has drawn increasing attention recently for the participation of RNA methylation and other modifications in a number of crucial biological processes. Thanks to high throughput sequencing techniques, such as, MeRIP-Seq, transcriptome-wide RNA methylation profile is now available in the form of count-based data, with which it is often of interests to study the dynamics at epitranscriptomic layer. However, the sample size of RNA methylation experiment is usually very small due to its costs; and additionally, there usually exist a large number of genes whose methylation level cannot be accurately estimated due to their low expression level, making differential RNA methylation analysis a difficult task. RESULTS: We present QNB, a statistical approach for differential RNA methylation analysis with count-based small-sample sequencing data. Compared with previous approaches such as DRME model based on a statistical test covering the IP samples only with 2 negative binomial distributions, QNB is based on 4 independent negative binomial distributions with their variances and means linked by local regressions, and in the way, the input control samples are also properly taken care of. In addition, different from DRME approach, which relies only the input control sample only for estimating the background, QNB uses a more robust estimator for gene expression by combining information from both input and IP samples, which could largely improve the testing performance for very lowly expressed genes. CONCLUSION: QNB showed improved performance on both simulated and real MeRIP-Seq datasets when compared with competing algorithms. And the QNB model is also applicable to other datasets related RNA modifications, including but not limited to RNA bisulfite sequencing, m(1)A-Seq, Par-CLIP, RIP-Seq, etc. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-017-1808-4) contains supplementary material, which is available to authorized users. BioMed Central 2017-08-31 /pmc/articles/PMC5667504/ /pubmed/28859631 http://dx.doi.org/10.1186/s12859-017-1808-4 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Liu, Lian
Zhang, Shao-Wu
Huang, Yufei
Meng, Jia
QNB: differential RNA methylation analysis for count-based small-sample sequencing data with a quad-negative binomial model
title QNB: differential RNA methylation analysis for count-based small-sample sequencing data with a quad-negative binomial model
title_full QNB: differential RNA methylation analysis for count-based small-sample sequencing data with a quad-negative binomial model
title_fullStr QNB: differential RNA methylation analysis for count-based small-sample sequencing data with a quad-negative binomial model
title_full_unstemmed QNB: differential RNA methylation analysis for count-based small-sample sequencing data with a quad-negative binomial model
title_short QNB: differential RNA methylation analysis for count-based small-sample sequencing data with a quad-negative binomial model
title_sort qnb: differential rna methylation analysis for count-based small-sample sequencing data with a quad-negative binomial model
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5667504/
https://www.ncbi.nlm.nih.gov/pubmed/28859631
http://dx.doi.org/10.1186/s12859-017-1808-4
work_keys_str_mv AT liulian qnbdifferentialrnamethylationanalysisforcountbasedsmallsamplesequencingdatawithaquadnegativebinomialmodel
AT zhangshaowu qnbdifferentialrnamethylationanalysisforcountbasedsmallsamplesequencingdatawithaquadnegativebinomialmodel
AT huangyufei qnbdifferentialrnamethylationanalysisforcountbasedsmallsamplesequencingdatawithaquadnegativebinomialmodel
AT mengjia qnbdifferentialrnamethylationanalysisforcountbasedsmallsamplesequencingdatawithaquadnegativebinomialmodel