Cargando…
Depletion of Hemoglobin Transcripts and Long-Read Sequencing Improves the Transcriptome Annotation of the Polar Bear (Ursus maritimus)
Transcriptome studies evaluating whole blood and tissues are often confounded by overrepresentation of highly abundant transcripts. These abundant transcripts are problematic, as they compete with and prevent the detection of rare RNA transcripts, obscuring their biological importance. This issue is...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6658610/ https://www.ncbi.nlm.nih.gov/pubmed/31379921 http://dx.doi.org/10.3389/fgene.2019.00643 |
_version_ | 1783438991955066880 |
---|---|
author | Byrne, Ashley Supple, Megan A. Volden, Roger Laidre, Kristin L. Shapiro, Beth Vollmers, Christopher |
author_facet | Byrne, Ashley Supple, Megan A. Volden, Roger Laidre, Kristin L. Shapiro, Beth Vollmers, Christopher |
author_sort | Byrne, Ashley |
collection | PubMed |
description | Transcriptome studies evaluating whole blood and tissues are often confounded by overrepresentation of highly abundant transcripts. These abundant transcripts are problematic, as they compete with and prevent the detection of rare RNA transcripts, obscuring their biological importance. This issue is more pronounced when using long-read sequencing technologies for isoform-level transcriptome analysis, as they have relatively lower throughput compared to short-read sequencers. As a result, long-read based transcriptome analysis is prohibitively expensive for non-model organisms. While there are off-the-shelf kits available for select model organisms capable of depleting highly abundant transcripts for alpha (HBA) and beta (HBB) hemoglobin, they are unsuitable for non-model organisms. To address this, we have adapted the recent CRISPR/Cas9-based depletion method (depletion of abundant sequences by hybridization) for long-read full-length cDNA sequencing approaches that we call Long-DASH. Using a recombinant Cas9 protein with appropriate guide RNAs, full-length hemoglobin transcripts can be depleted in vitro prior to performing any short- and long-read sequencing library preparations. Using this method, we sequenced depleted full-length cDNA in parallel using both our Oxford Nanopore Technology (ONT) based R2C2 long-read approach, as well as the Illumina short-read based Smart-seq2 approach. To showcase this, we have applied our methods to create an isoform-level transcriptome from whole blood samples derived from three polar bears (Ursus maritimus). Using Long-DASH, we succeeded in depleting hemoglobin transcripts and generated deep Smart-seq2 Illumina datasets and 3.8 million R2C2 full-length cDNA consensus reads. Applying Long-DASH with our isoform identification pipeline, Mandalorion, we discovered ∼6,000 high-confidence isoforms and a number of novel genes. This indicates that there is a high diversity of gene isoforms within U. maritimus not yet reported. This reproducible and straightforward approach has not only improved the polar bear transcriptome annotations but will serve as the foundation for future efforts to investigate transcriptional dynamics within the 19 polar bear subpopulations around the Arctic. |
format | Online Article Text |
id | pubmed-6658610 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-66586102019-08-02 Depletion of Hemoglobin Transcripts and Long-Read Sequencing Improves the Transcriptome Annotation of the Polar Bear (Ursus maritimus) Byrne, Ashley Supple, Megan A. Volden, Roger Laidre, Kristin L. Shapiro, Beth Vollmers, Christopher Front Genet Genetics Transcriptome studies evaluating whole blood and tissues are often confounded by overrepresentation of highly abundant transcripts. These abundant transcripts are problematic, as they compete with and prevent the detection of rare RNA transcripts, obscuring their biological importance. This issue is more pronounced when using long-read sequencing technologies for isoform-level transcriptome analysis, as they have relatively lower throughput compared to short-read sequencers. As a result, long-read based transcriptome analysis is prohibitively expensive for non-model organisms. While there are off-the-shelf kits available for select model organisms capable of depleting highly abundant transcripts for alpha (HBA) and beta (HBB) hemoglobin, they are unsuitable for non-model organisms. To address this, we have adapted the recent CRISPR/Cas9-based depletion method (depletion of abundant sequences by hybridization) for long-read full-length cDNA sequencing approaches that we call Long-DASH. Using a recombinant Cas9 protein with appropriate guide RNAs, full-length hemoglobin transcripts can be depleted in vitro prior to performing any short- and long-read sequencing library preparations. Using this method, we sequenced depleted full-length cDNA in parallel using both our Oxford Nanopore Technology (ONT) based R2C2 long-read approach, as well as the Illumina short-read based Smart-seq2 approach. To showcase this, we have applied our methods to create an isoform-level transcriptome from whole blood samples derived from three polar bears (Ursus maritimus). Using Long-DASH, we succeeded in depleting hemoglobin transcripts and generated deep Smart-seq2 Illumina datasets and 3.8 million R2C2 full-length cDNA consensus reads. Applying Long-DASH with our isoform identification pipeline, Mandalorion, we discovered ∼6,000 high-confidence isoforms and a number of novel genes. This indicates that there is a high diversity of gene isoforms within U. maritimus not yet reported. This reproducible and straightforward approach has not only improved the polar bear transcriptome annotations but will serve as the foundation for future efforts to investigate transcriptional dynamics within the 19 polar bear subpopulations around the Arctic. Frontiers Media S.A. 2019-07-19 /pmc/articles/PMC6658610/ /pubmed/31379921 http://dx.doi.org/10.3389/fgene.2019.00643 Text en Copyright © 2019 Byrne, Supple, Volden, Laidre, Shapiro and Vollmers http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Byrne, Ashley Supple, Megan A. Volden, Roger Laidre, Kristin L. Shapiro, Beth Vollmers, Christopher Depletion of Hemoglobin Transcripts and Long-Read Sequencing Improves the Transcriptome Annotation of the Polar Bear (Ursus maritimus) |
title | Depletion of Hemoglobin Transcripts and Long-Read Sequencing Improves the Transcriptome Annotation of the Polar Bear (Ursus maritimus) |
title_full | Depletion of Hemoglobin Transcripts and Long-Read Sequencing Improves the Transcriptome Annotation of the Polar Bear (Ursus maritimus) |
title_fullStr | Depletion of Hemoglobin Transcripts and Long-Read Sequencing Improves the Transcriptome Annotation of the Polar Bear (Ursus maritimus) |
title_full_unstemmed | Depletion of Hemoglobin Transcripts and Long-Read Sequencing Improves the Transcriptome Annotation of the Polar Bear (Ursus maritimus) |
title_short | Depletion of Hemoglobin Transcripts and Long-Read Sequencing Improves the Transcriptome Annotation of the Polar Bear (Ursus maritimus) |
title_sort | depletion of hemoglobin transcripts and long-read sequencing improves the transcriptome annotation of the polar bear (ursus maritimus) |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6658610/ https://www.ncbi.nlm.nih.gov/pubmed/31379921 http://dx.doi.org/10.3389/fgene.2019.00643 |
work_keys_str_mv | AT byrneashley depletionofhemoglobintranscriptsandlongreadsequencingimprovesthetranscriptomeannotationofthepolarbearursusmaritimus AT supplemegana depletionofhemoglobintranscriptsandlongreadsequencingimprovesthetranscriptomeannotationofthepolarbearursusmaritimus AT voldenroger depletionofhemoglobintranscriptsandlongreadsequencingimprovesthetranscriptomeannotationofthepolarbearursusmaritimus AT laidrekristinl depletionofhemoglobintranscriptsandlongreadsequencingimprovesthetranscriptomeannotationofthepolarbearursusmaritimus AT shapirobeth depletionofhemoglobintranscriptsandlongreadsequencingimprovesthetranscriptomeannotationofthepolarbearursusmaritimus AT vollmerschristopher depletionofhemoglobintranscriptsandlongreadsequencingimprovesthetranscriptomeannotationofthepolarbearursusmaritimus |