Cargando…
Internal oligo(dT) priming introduces systematic bias in bulk and single-cell RNA sequencing count data
Significant advances in RNA sequencing have been recently made possible by using oligo(dT) primers for simultaneous mRNA enrichment and reverse transcription priming. The associated increase in efficiency has enabled more economical bulk RNA sequencing methods and the advent of high-throughput singl...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9142200/ https://www.ncbi.nlm.nih.gov/pubmed/35651651 http://dx.doi.org/10.1093/nargab/lqac035 |
_version_ | 1784715520523632640 |
---|---|
author | Svoboda, Marek Frost, H Robert Bosco, Giovanni |
author_facet | Svoboda, Marek Frost, H Robert Bosco, Giovanni |
author_sort | Svoboda, Marek |
collection | PubMed |
description | Significant advances in RNA sequencing have been recently made possible by using oligo(dT) primers for simultaneous mRNA enrichment and reverse transcription priming. The associated increase in efficiency has enabled more economical bulk RNA sequencing methods and the advent of high-throughput single-cell RNA sequencing, already one of the most widely adopted methods in transcriptomics. However, the effects of off-target oligo(dT) priming on gene expression quantification have not been appreciated. In the present study, we describe the extent, the possible causes, and the consequences of internal oligo(dT) priming across multiple public datasets obtained from various bulk and single-cell RNA sequencing platforms. To explore and address this issue, we developed a computational algorithm for RNA counting methods, which identifies the sequencing read alignments that likely resulted from internal oligo(dT) priming and removes them from the data. Directly comparing filtered datasets to those obtained by an alternative method reveals significant improvements in gene expression measurement. Finally, we infer a list of human genes whose expression quantification is most likely to be affected by internal oligo(dT) priming and predict that when measured using these methods, the expression of most genes may be inflated by at least 10% whereby some genes are affected more than others. |
format | Online Article Text |
id | pubmed-9142200 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-91422002022-05-31 Internal oligo(dT) priming introduces systematic bias in bulk and single-cell RNA sequencing count data Svoboda, Marek Frost, H Robert Bosco, Giovanni NAR Genom Bioinform Methods Article Significant advances in RNA sequencing have been recently made possible by using oligo(dT) primers for simultaneous mRNA enrichment and reverse transcription priming. The associated increase in efficiency has enabled more economical bulk RNA sequencing methods and the advent of high-throughput single-cell RNA sequencing, already one of the most widely adopted methods in transcriptomics. However, the effects of off-target oligo(dT) priming on gene expression quantification have not been appreciated. In the present study, we describe the extent, the possible causes, and the consequences of internal oligo(dT) priming across multiple public datasets obtained from various bulk and single-cell RNA sequencing platforms. To explore and address this issue, we developed a computational algorithm for RNA counting methods, which identifies the sequencing read alignments that likely resulted from internal oligo(dT) priming and removes them from the data. Directly comparing filtered datasets to those obtained by an alternative method reveals significant improvements in gene expression measurement. Finally, we infer a list of human genes whose expression quantification is most likely to be affected by internal oligo(dT) priming and predict that when measured using these methods, the expression of most genes may be inflated by at least 10% whereby some genes are affected more than others. Oxford University Press 2022-05-25 /pmc/articles/PMC9142200/ /pubmed/35651651 http://dx.doi.org/10.1093/nargab/lqac035 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methods Article Svoboda, Marek Frost, H Robert Bosco, Giovanni Internal oligo(dT) priming introduces systematic bias in bulk and single-cell RNA sequencing count data |
title | Internal oligo(dT) priming introduces systematic bias in bulk and single-cell RNA sequencing count data |
title_full | Internal oligo(dT) priming introduces systematic bias in bulk and single-cell RNA sequencing count data |
title_fullStr | Internal oligo(dT) priming introduces systematic bias in bulk and single-cell RNA sequencing count data |
title_full_unstemmed | Internal oligo(dT) priming introduces systematic bias in bulk and single-cell RNA sequencing count data |
title_short | Internal oligo(dT) priming introduces systematic bias in bulk and single-cell RNA sequencing count data |
title_sort | internal oligo(dt) priming introduces systematic bias in bulk and single-cell rna sequencing count data |
topic | Methods Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9142200/ https://www.ncbi.nlm.nih.gov/pubmed/35651651 http://dx.doi.org/10.1093/nargab/lqac035 |
work_keys_str_mv | AT svobodamarek internaloligodtprimingintroducessystematicbiasinbulkandsinglecellrnasequencingcountdata AT frosthrobert internaloligodtprimingintroducessystematicbiasinbulkandsinglecellrnasequencingcountdata AT boscogiovanni internaloligodtprimingintroducessystematicbiasinbulkandsinglecellrnasequencingcountdata |