Cargando…

Depletion of Hemoglobin Transcripts and Long-Read Sequencing Improves the Transcriptome Annotation of the Polar Bear (Ursus maritimus)

Transcriptome studies evaluating whole blood and tissues are often confounded by overrepresentation of highly abundant transcripts. These abundant transcripts are problematic, as they compete with and prevent the detection of rare RNA transcripts, obscuring their biological importance. This issue is...

Descripción completa

Detalles Bibliográficos
Autores principales: Byrne, Ashley, Supple, Megan A., Volden, Roger, Laidre, Kristin L., Shapiro, Beth, Vollmers, Christopher
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6658610/
https://www.ncbi.nlm.nih.gov/pubmed/31379921
http://dx.doi.org/10.3389/fgene.2019.00643
_version_ 1783438991955066880
author Byrne, Ashley
Supple, Megan A.
Volden, Roger
Laidre, Kristin L.
Shapiro, Beth
Vollmers, Christopher
author_facet Byrne, Ashley
Supple, Megan A.
Volden, Roger
Laidre, Kristin L.
Shapiro, Beth
Vollmers, Christopher
author_sort Byrne, Ashley
collection PubMed
description Transcriptome studies evaluating whole blood and tissues are often confounded by overrepresentation of highly abundant transcripts. These abundant transcripts are problematic, as they compete with and prevent the detection of rare RNA transcripts, obscuring their biological importance. This issue is more pronounced when using long-read sequencing technologies for isoform-level transcriptome analysis, as they have relatively lower throughput compared to short-read sequencers. As a result, long-read based transcriptome analysis is prohibitively expensive for non-model organisms. While there are off-the-shelf kits available for select model organisms capable of depleting highly abundant transcripts for alpha (HBA) and beta (HBB) hemoglobin, they are unsuitable for non-model organisms. To address this, we have adapted the recent CRISPR/Cas9-based depletion method (depletion of abundant sequences by hybridization) for long-read full-length cDNA sequencing approaches that we call Long-DASH. Using a recombinant Cas9 protein with appropriate guide RNAs, full-length hemoglobin transcripts can be depleted in vitro prior to performing any short- and long-read sequencing library preparations. Using this method, we sequenced depleted full-length cDNA in parallel using both our Oxford Nanopore Technology (ONT) based R2C2 long-read approach, as well as the Illumina short-read based Smart-seq2 approach. To showcase this, we have applied our methods to create an isoform-level transcriptome from whole blood samples derived from three polar bears (Ursus maritimus). Using Long-DASH, we succeeded in depleting hemoglobin transcripts and generated deep Smart-seq2 Illumina datasets and 3.8 million R2C2 full-length cDNA consensus reads. Applying Long-DASH with our isoform identification pipeline, Mandalorion, we discovered ∼6,000 high-confidence isoforms and a number of novel genes. This indicates that there is a high diversity of gene isoforms within U. maritimus not yet reported. This reproducible and straightforward approach has not only improved the polar bear transcriptome annotations but will serve as the foundation for future efforts to investigate transcriptional dynamics within the 19 polar bear subpopulations around the Arctic.
format Online
Article
Text
id pubmed-6658610
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-66586102019-08-02 Depletion of Hemoglobin Transcripts and Long-Read Sequencing Improves the Transcriptome Annotation of the Polar Bear (Ursus maritimus) Byrne, Ashley Supple, Megan A. Volden, Roger Laidre, Kristin L. Shapiro, Beth Vollmers, Christopher Front Genet Genetics Transcriptome studies evaluating whole blood and tissues are often confounded by overrepresentation of highly abundant transcripts. These abundant transcripts are problematic, as they compete with and prevent the detection of rare RNA transcripts, obscuring their biological importance. This issue is more pronounced when using long-read sequencing technologies for isoform-level transcriptome analysis, as they have relatively lower throughput compared to short-read sequencers. As a result, long-read based transcriptome analysis is prohibitively expensive for non-model organisms. While there are off-the-shelf kits available for select model organisms capable of depleting highly abundant transcripts for alpha (HBA) and beta (HBB) hemoglobin, they are unsuitable for non-model organisms. To address this, we have adapted the recent CRISPR/Cas9-based depletion method (depletion of abundant sequences by hybridization) for long-read full-length cDNA sequencing approaches that we call Long-DASH. Using a recombinant Cas9 protein with appropriate guide RNAs, full-length hemoglobin transcripts can be depleted in vitro prior to performing any short- and long-read sequencing library preparations. Using this method, we sequenced depleted full-length cDNA in parallel using both our Oxford Nanopore Technology (ONT) based R2C2 long-read approach, as well as the Illumina short-read based Smart-seq2 approach. To showcase this, we have applied our methods to create an isoform-level transcriptome from whole blood samples derived from three polar bears (Ursus maritimus). Using Long-DASH, we succeeded in depleting hemoglobin transcripts and generated deep Smart-seq2 Illumina datasets and 3.8 million R2C2 full-length cDNA consensus reads. Applying Long-DASH with our isoform identification pipeline, Mandalorion, we discovered ∼6,000 high-confidence isoforms and a number of novel genes. This indicates that there is a high diversity of gene isoforms within U. maritimus not yet reported. This reproducible and straightforward approach has not only improved the polar bear transcriptome annotations but will serve as the foundation for future efforts to investigate transcriptional dynamics within the 19 polar bear subpopulations around the Arctic. Frontiers Media S.A. 2019-07-19 /pmc/articles/PMC6658610/ /pubmed/31379921 http://dx.doi.org/10.3389/fgene.2019.00643 Text en Copyright © 2019 Byrne, Supple, Volden, Laidre, Shapiro and Vollmers http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Byrne, Ashley
Supple, Megan A.
Volden, Roger
Laidre, Kristin L.
Shapiro, Beth
Vollmers, Christopher
Depletion of Hemoglobin Transcripts and Long-Read Sequencing Improves the Transcriptome Annotation of the Polar Bear (Ursus maritimus)
title Depletion of Hemoglobin Transcripts and Long-Read Sequencing Improves the Transcriptome Annotation of the Polar Bear (Ursus maritimus)
title_full Depletion of Hemoglobin Transcripts and Long-Read Sequencing Improves the Transcriptome Annotation of the Polar Bear (Ursus maritimus)
title_fullStr Depletion of Hemoglobin Transcripts and Long-Read Sequencing Improves the Transcriptome Annotation of the Polar Bear (Ursus maritimus)
title_full_unstemmed Depletion of Hemoglobin Transcripts and Long-Read Sequencing Improves the Transcriptome Annotation of the Polar Bear (Ursus maritimus)
title_short Depletion of Hemoglobin Transcripts and Long-Read Sequencing Improves the Transcriptome Annotation of the Polar Bear (Ursus maritimus)
title_sort depletion of hemoglobin transcripts and long-read sequencing improves the transcriptome annotation of the polar bear (ursus maritimus)
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6658610/
https://www.ncbi.nlm.nih.gov/pubmed/31379921
http://dx.doi.org/10.3389/fgene.2019.00643
work_keys_str_mv AT byrneashley depletionofhemoglobintranscriptsandlongreadsequencingimprovesthetranscriptomeannotationofthepolarbearursusmaritimus
AT supplemegana depletionofhemoglobintranscriptsandlongreadsequencingimprovesthetranscriptomeannotationofthepolarbearursusmaritimus
AT voldenroger depletionofhemoglobintranscriptsandlongreadsequencingimprovesthetranscriptomeannotationofthepolarbearursusmaritimus
AT laidrekristinl depletionofhemoglobintranscriptsandlongreadsequencingimprovesthetranscriptomeannotationofthepolarbearursusmaritimus
AT shapirobeth depletionofhemoglobintranscriptsandlongreadsequencingimprovesthetranscriptomeannotationofthepolarbearursusmaritimus
AT vollmerschristopher depletionofhemoglobintranscriptsandlongreadsequencingimprovesthetranscriptomeannotationofthepolarbearursusmaritimus