Cargando…

Local sequence and sequencing depth dependent accuracy of RNA-seq reads

BACKGROUND: Many biases and spurious effects are inherent in RNA-seq technology, resulting in a non-uniform distribution of sequencing read counts for each base position in a gene. Therefore, a base-level strategy is required to model the non-uniformity. Also, the properties of sequencing read count...

Descripción completa

Detalles Bibliográficos
Autores principales: Cai, Guoshuai, Liang, Shoudan, Zheng, Xiaofeng, Xiao, Feifei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5550947/
https://www.ncbi.nlm.nih.gov/pubmed/28793860
http://dx.doi.org/10.1186/s12859-017-1780-z
_version_ 1783256216739250176
author Cai, Guoshuai
Liang, Shoudan
Zheng, Xiaofeng
Xiao, Feifei
author_facet Cai, Guoshuai
Liang, Shoudan
Zheng, Xiaofeng
Xiao, Feifei
author_sort Cai, Guoshuai
collection PubMed
description BACKGROUND: Many biases and spurious effects are inherent in RNA-seq technology, resulting in a non-uniform distribution of sequencing read counts for each base position in a gene. Therefore, a base-level strategy is required to model the non-uniformity. Also, the properties of sequencing read counts can be leveraged to achieve a more precise estimation of the mean and variance of measurement. RESULTS: In this study, we aimed to unveil the effects on RNA-seq accuracy from multiple factors and develop accurate modeling of RNA-seq reads in comparison. We found that the overdispersion rate decreased when sequencing depth increased on the base level. Moreover, the influence of local sequence(s) on the overdispersion rate was notable but no longer significant after adjusting the effect from sequencing depth. Based on these findings, we propose a desirable beta-binomial model with a dynamic overdispersion rate on the base-level proportion of sequencing read counts from two samples. CONCLUSIONS: The current study provides thorough insights into the impact of overdispersion at the position level and especially into its relationship with sequencing depth, local sequence, and preparation protocol. These properties of RNA-seq will aid in improvement of the quality control procedure and development of statistical methods for RNA-seq downstream analyses. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1780-z) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5550947
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-55509472017-08-14 Local sequence and sequencing depth dependent accuracy of RNA-seq reads Cai, Guoshuai Liang, Shoudan Zheng, Xiaofeng Xiao, Feifei BMC Bioinformatics Research Article BACKGROUND: Many biases and spurious effects are inherent in RNA-seq technology, resulting in a non-uniform distribution of sequencing read counts for each base position in a gene. Therefore, a base-level strategy is required to model the non-uniformity. Also, the properties of sequencing read counts can be leveraged to achieve a more precise estimation of the mean and variance of measurement. RESULTS: In this study, we aimed to unveil the effects on RNA-seq accuracy from multiple factors and develop accurate modeling of RNA-seq reads in comparison. We found that the overdispersion rate decreased when sequencing depth increased on the base level. Moreover, the influence of local sequence(s) on the overdispersion rate was notable but no longer significant after adjusting the effect from sequencing depth. Based on these findings, we propose a desirable beta-binomial model with a dynamic overdispersion rate on the base-level proportion of sequencing read counts from two samples. CONCLUSIONS: The current study provides thorough insights into the impact of overdispersion at the position level and especially into its relationship with sequencing depth, local sequence, and preparation protocol. These properties of RNA-seq will aid in improvement of the quality control procedure and development of statistical methods for RNA-seq downstream analyses. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1780-z) contains supplementary material, which is available to authorized users. BioMed Central 2017-08-09 /pmc/articles/PMC5550947/ /pubmed/28793860 http://dx.doi.org/10.1186/s12859-017-1780-z Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Cai, Guoshuai
Liang, Shoudan
Zheng, Xiaofeng
Xiao, Feifei
Local sequence and sequencing depth dependent accuracy of RNA-seq reads
title Local sequence and sequencing depth dependent accuracy of RNA-seq reads
title_full Local sequence and sequencing depth dependent accuracy of RNA-seq reads
title_fullStr Local sequence and sequencing depth dependent accuracy of RNA-seq reads
title_full_unstemmed Local sequence and sequencing depth dependent accuracy of RNA-seq reads
title_short Local sequence and sequencing depth dependent accuracy of RNA-seq reads
title_sort local sequence and sequencing depth dependent accuracy of rna-seq reads
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5550947/
https://www.ncbi.nlm.nih.gov/pubmed/28793860
http://dx.doi.org/10.1186/s12859-017-1780-z
work_keys_str_mv AT caiguoshuai localsequenceandsequencingdepthdependentaccuracyofrnaseqreads
AT liangshoudan localsequenceandsequencingdepthdependentaccuracyofrnaseqreads
AT zhengxiaofeng localsequenceandsequencingdepthdependentaccuracyofrnaseqreads
AT xiaofeifei localsequenceandsequencingdepthdependentaccuracyofrnaseqreads