Cargando…

3’Pool-seq: an optimized cost-efficient and scalable method of whole-transcriptome gene expression profiling

BACKGROUND: The advent of Next Generation Sequencing has allowed transcriptomes to be profiled with unprecedented accuracy, but the high costs of full-length mRNA sequencing have posed a limit on the accessibility and scalability of the technology. To address this, we developed 3’Pool-seq: a simple,...

Descripción completa

Detalles Bibliográficos
Autores principales: Sholder, Gabriel, Lanz, Thomas A., Moccia, Robert, Quan, Jie, Aparicio-Prat, Estel, Stanton, Robert, Xi, Hualin S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6971924/
https://www.ncbi.nlm.nih.gov/pubmed/31959126
http://dx.doi.org/10.1186/s12864-020-6478-3
_version_ 1783489813821784064
author Sholder, Gabriel
Lanz, Thomas A.
Moccia, Robert
Quan, Jie
Aparicio-Prat, Estel
Stanton, Robert
Xi, Hualin S.
author_facet Sholder, Gabriel
Lanz, Thomas A.
Moccia, Robert
Quan, Jie
Aparicio-Prat, Estel
Stanton, Robert
Xi, Hualin S.
author_sort Sholder, Gabriel
collection PubMed
description BACKGROUND: The advent of Next Generation Sequencing has allowed transcriptomes to be profiled with unprecedented accuracy, but the high costs of full-length mRNA sequencing have posed a limit on the accessibility and scalability of the technology. To address this, we developed 3’Pool-seq: a simple, cost-effective, and scalable RNA-seq method that focuses sequencing to the 3′-end of mRNA. We drew from aspects of SMART-seq, Drop-seq, and TruSeq to implement an easy workflow, and optimized parameters such as input RNA concentrations, tagmentation conditions, and read depth specifically for bulk-RNA. RESULTS: Thorough optimization resulted in a protocol that takes less than 12 h to perform, does not require custom sequencing primers or instrumentation, and cuts over 90% of the costs associated with TruSeq, while still achieving accurate gene expression quantification (Pearson’s correlation coefficient with ERCC theoretical concentration r = 0.96) and differential gene detection (ROC analysis of 3’Pool-seq compared to TruSeq AUC = 0.921). The 3’Pool-seq dual indexing scheme was further adapted for a 96-well plate format, and ERCC spike-ins were used to correct for potential row or column pooling effects. Transcriptional profiling of troglitazone and pioglitazone treatments at multiple doses and time points in HepG2 cells was then used to show how 3’Pool-seq could distinguish the two molecules based on their molecular signatures. CONCLUSIONS: 3’Pool-seq can accurately detect gene expression at a level that is on par with TruSeq, at one tenth of the total cost. Furthermore, its unprecedented TruSeq/Nextera hybrid indexing scheme and streamlined workflow can be applied in several different formats, including 96-well plates, which allows users to thoroughly evaluate biological systems under several conditions and timepoints. Care must be taken regarding experimental design and plate layout such that potential pooling effects can be accounted for and corrected. Lastly, further studies using multiple sets of ERCC spike-ins may be used to simulate differential gene expression in a system with known ground-state values.
format Online
Article
Text
id pubmed-6971924
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-69719242020-01-27 3’Pool-seq: an optimized cost-efficient and scalable method of whole-transcriptome gene expression profiling Sholder, Gabriel Lanz, Thomas A. Moccia, Robert Quan, Jie Aparicio-Prat, Estel Stanton, Robert Xi, Hualin S. BMC Genomics Methodology Article BACKGROUND: The advent of Next Generation Sequencing has allowed transcriptomes to be profiled with unprecedented accuracy, but the high costs of full-length mRNA sequencing have posed a limit on the accessibility and scalability of the technology. To address this, we developed 3’Pool-seq: a simple, cost-effective, and scalable RNA-seq method that focuses sequencing to the 3′-end of mRNA. We drew from aspects of SMART-seq, Drop-seq, and TruSeq to implement an easy workflow, and optimized parameters such as input RNA concentrations, tagmentation conditions, and read depth specifically for bulk-RNA. RESULTS: Thorough optimization resulted in a protocol that takes less than 12 h to perform, does not require custom sequencing primers or instrumentation, and cuts over 90% of the costs associated with TruSeq, while still achieving accurate gene expression quantification (Pearson’s correlation coefficient with ERCC theoretical concentration r = 0.96) and differential gene detection (ROC analysis of 3’Pool-seq compared to TruSeq AUC = 0.921). The 3’Pool-seq dual indexing scheme was further adapted for a 96-well plate format, and ERCC spike-ins were used to correct for potential row or column pooling effects. Transcriptional profiling of troglitazone and pioglitazone treatments at multiple doses and time points in HepG2 cells was then used to show how 3’Pool-seq could distinguish the two molecules based on their molecular signatures. CONCLUSIONS: 3’Pool-seq can accurately detect gene expression at a level that is on par with TruSeq, at one tenth of the total cost. Furthermore, its unprecedented TruSeq/Nextera hybrid indexing scheme and streamlined workflow can be applied in several different formats, including 96-well plates, which allows users to thoroughly evaluate biological systems under several conditions and timepoints. Care must be taken regarding experimental design and plate layout such that potential pooling effects can be accounted for and corrected. Lastly, further studies using multiple sets of ERCC spike-ins may be used to simulate differential gene expression in a system with known ground-state values. BioMed Central 2020-01-20 /pmc/articles/PMC6971924/ /pubmed/31959126 http://dx.doi.org/10.1186/s12864-020-6478-3 Text en © The Author(s). 2020 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Sholder, Gabriel
Lanz, Thomas A.
Moccia, Robert
Quan, Jie
Aparicio-Prat, Estel
Stanton, Robert
Xi, Hualin S.
3’Pool-seq: an optimized cost-efficient and scalable method of whole-transcriptome gene expression profiling
title 3’Pool-seq: an optimized cost-efficient and scalable method of whole-transcriptome gene expression profiling
title_full 3’Pool-seq: an optimized cost-efficient and scalable method of whole-transcriptome gene expression profiling
title_fullStr 3’Pool-seq: an optimized cost-efficient and scalable method of whole-transcriptome gene expression profiling
title_full_unstemmed 3’Pool-seq: an optimized cost-efficient and scalable method of whole-transcriptome gene expression profiling
title_short 3’Pool-seq: an optimized cost-efficient and scalable method of whole-transcriptome gene expression profiling
title_sort 3’pool-seq: an optimized cost-efficient and scalable method of whole-transcriptome gene expression profiling
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6971924/
https://www.ncbi.nlm.nih.gov/pubmed/31959126
http://dx.doi.org/10.1186/s12864-020-6478-3
work_keys_str_mv AT sholdergabriel 3poolseqanoptimizedcostefficientandscalablemethodofwholetranscriptomegeneexpressionprofiling
AT lanzthomasa 3poolseqanoptimizedcostefficientandscalablemethodofwholetranscriptomegeneexpressionprofiling
AT mocciarobert 3poolseqanoptimizedcostefficientandscalablemethodofwholetranscriptomegeneexpressionprofiling
AT quanjie 3poolseqanoptimizedcostefficientandscalablemethodofwholetranscriptomegeneexpressionprofiling
AT apariciopratestel 3poolseqanoptimizedcostefficientandscalablemethodofwholetranscriptomegeneexpressionprofiling
AT stantonrobert 3poolseqanoptimizedcostefficientandscalablemethodofwholetranscriptomegeneexpressionprofiling
AT xihualins 3poolseqanoptimizedcostefficientandscalablemethodofwholetranscriptomegeneexpressionprofiling