Cargando…

Depletion of Hemoglobin Transcripts and Long-Read Sequencing Improves the Transcriptome Annotation of the Polar Bear (Ursus maritimus)

Transcriptome studies evaluating whole blood and tissues are often confounded by overrepresentation of highly abundant transcripts. These abundant transcripts are problematic, as they compete with and prevent the detection of rare RNA transcripts, obscuring their biological importance. This issue is...

Descripción completa

Detalles Bibliográficos
Autores principales:	Byrne, Ashley, Supple, Megan A., Volden, Roger, Laidre, Kristin L., Shapiro, Beth, Vollmers, Christopher
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2019
Materias:	Genetics
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6658610/ https://www.ncbi.nlm.nih.gov/pubmed/31379921 http://dx.doi.org/10.3389/fgene.2019.00643

_version_	1783438991955066880
author	Byrne, Ashley Supple, Megan A. Volden, Roger Laidre, Kristin L. Shapiro, Beth Vollmers, Christopher
author_facet	Byrne, Ashley Supple, Megan A. Volden, Roger Laidre, Kristin L. Shapiro, Beth Vollmers, Christopher
author_sort	Byrne, Ashley
collection	PubMed
description	Transcriptome studies evaluating whole blood and tissues are often confounded by overrepresentation of highly abundant transcripts. These abundant transcripts are problematic, as they compete with and prevent the detection of rare RNA transcripts, obscuring their biological importance. This issue is more pronounced when using long-read sequencing technologies for isoform-level transcriptome analysis, as they have relatively lower throughput compared to short-read sequencers. As a result, long-read based transcriptome analysis is prohibitively expensive for non-model organisms. While there are off-the-shelf kits available for select model organisms capable of depleting highly abundant transcripts for alpha (HBA) and beta (HBB) hemoglobin, they are unsuitable for non-model organisms. To address this, we have adapted the recent CRISPR/Cas9-based depletion method (depletion of abundant sequences by hybridization) for long-read full-length cDNA sequencing approaches that we call Long-DASH. Using a recombinant Cas9 protein with appropriate guide RNAs, full-length hemoglobin transcripts can be depleted in vitro prior to performing any short- and long-read sequencing library preparations. Using this method, we sequenced depleted full-length cDNA in parallel using both our Oxford Nanopore Technology (ONT) based R2C2 long-read approach, as well as the Illumina short-read based Smart-seq2 approach. To showcase this, we have applied our methods to create an isoform-level transcriptome from whole blood samples derived from three polar bears (Ursus maritimus). Using Long-DASH, we succeeded in depleting hemoglobin transcripts and generated deep Smart-seq2 Illumina datasets and 3.8 million R2C2 full-length cDNA consensus reads. Applying Long-DASH with our isoform identification pipeline, Mandalorion, we discovered ∼6,000 high-confidence isoforms and a number of novel genes. This indicates that there is a high diversity of gene isoforms within U. maritimus not yet reported. This reproducible and straightforward approach has not only improved the polar bear transcriptome annotations but will serve as the foundation for future efforts to investigate transcriptional dynamics within the 19 polar bear subpopulations around the Arctic.
format	Online Article Text
id	pubmed-6658610
institution	National Center for Biotechnology Information
language	English
publishDate	2019
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-66586102019-08-02 Depletion of Hemoglobin Transcripts and Long-Read Sequencing Improves the Transcriptome Annotation of the Polar Bear (Ursus maritimus) Byrne, Ashley Supple, Megan A. Volden, Roger Laidre, Kristin L. Shapiro, Beth Vollmers, Christopher Front Genet Genetics Transcriptome studies evaluating whole blood and tissues are often confounded by overrepresentation of highly abundant transcripts. These abundant transcripts are problematic, as they compete with and prevent the detection of rare RNA transcripts, obscuring their biological importance. This issue is more pronounced when using long-read sequencing technologies for isoform-level transcriptome analysis, as they have relatively lower throughput compared to short-read sequencers. As a result, long-read based transcriptome analysis is prohibitively expensive for non-model organisms. While there are off-the-shelf kits available for select model organisms capable of depleting highly abundant transcripts for alpha (HBA) and beta (HBB) hemoglobin, they are unsuitable for non-model organisms. To address this, we have adapted the recent CRISPR/Cas9-based depletion method (depletion of abundant sequences by hybridization) for long-read full-length cDNA sequencing approaches that we call Long-DASH. Using a recombinant Cas9 protein with appropriate guide RNAs, full-length hemoglobin transcripts can be depleted in vitro prior to performing any short- and long-read sequencing library preparations. Using this method, we sequenced depleted full-length cDNA in parallel using both our Oxford Nanopore Technology (ONT) based R2C2 long-read approach, as well as the Illumina short-read based Smart-seq2 approach. To showcase this, we have applied our methods to create an isoform-level transcriptome from whole blood samples derived from three polar bears (Ursus maritimus). Using Long-DASH, we succeeded in depleting hemoglobin transcripts and generated deep Smart-seq2 Illumina datasets and 3.8 million R2C2 full-length cDNA consensus reads. Applying Long-DASH with our isoform identification pipeline, Mandalorion, we discovered ∼6,000 high-confidence isoforms and a number of novel genes. This indicates that there is a high diversity of gene isoforms within U. maritimus not yet reported. This reproducible and straightforward approach has not only improved the polar bear transcriptome annotations but will serve as the foundation for future efforts to investigate transcriptional dynamics within the 19 polar bear subpopulations around the Arctic. Frontiers Media S.A. 2019-07-19 /pmc/articles/PMC6658610/ /pubmed/31379921 http://dx.doi.org/10.3389/fgene.2019.00643 Text en Copyright © 2019 Byrne, Supple, Volden, Laidre, Shapiro and Vollmers http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Genetics Byrne, Ashley Supple, Megan A. Volden, Roger Laidre, Kristin L. Shapiro, Beth Vollmers, Christopher Depletion of Hemoglobin Transcripts and Long-Read Sequencing Improves the Transcriptome Annotation of the Polar Bear (Ursus maritimus)
title	Depletion of Hemoglobin Transcripts and Long-Read Sequencing Improves the Transcriptome Annotation of the Polar Bear (Ursus maritimus)
title_full	Depletion of Hemoglobin Transcripts and Long-Read Sequencing Improves the Transcriptome Annotation of the Polar Bear (Ursus maritimus)
title_fullStr	Depletion of Hemoglobin Transcripts and Long-Read Sequencing Improves the Transcriptome Annotation of the Polar Bear (Ursus maritimus)
title_full_unstemmed	Depletion of Hemoglobin Transcripts and Long-Read Sequencing Improves the Transcriptome Annotation of the Polar Bear (Ursus maritimus)
title_short	Depletion of Hemoglobin Transcripts and Long-Read Sequencing Improves the Transcriptome Annotation of the Polar Bear (Ursus maritimus)
title_sort	depletion of hemoglobin transcripts and long-read sequencing improves the transcriptome annotation of the polar bear (ursus maritimus)
topic	Genetics
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6658610/ https://www.ncbi.nlm.nih.gov/pubmed/31379921 http://dx.doi.org/10.3389/fgene.2019.00643
work_keys_str_mv	AT byrneashley depletionofhemoglobintranscriptsandlongreadsequencingimprovesthetranscriptomeannotationofthepolarbearursusmaritimus AT supplemegana depletionofhemoglobintranscriptsandlongreadsequencingimprovesthetranscriptomeannotationofthepolarbearursusmaritimus AT voldenroger depletionofhemoglobintranscriptsandlongreadsequencingimprovesthetranscriptomeannotationofthepolarbearursusmaritimus AT laidrekristinl depletionofhemoglobintranscriptsandlongreadsequencingimprovesthetranscriptomeannotationofthepolarbearursusmaritimus AT shapirobeth depletionofhemoglobintranscriptsandlongreadsequencingimprovesthetranscriptomeannotationofthepolarbearursusmaritimus AT vollmerschristopher depletionofhemoglobintranscriptsandlongreadsequencingimprovesthetranscriptomeannotationofthepolarbearursusmaritimus

Depletion of Hemoglobin Transcripts and Long-Read Sequencing Improves the Transcriptome Annotation of the Polar Bear (Ursus maritimus)

Ejemplares similares