Cargando…

High resolution annotation of zebrafish transcriptome using long-read sequencing

With the emergence of zebrafish as an important model organism, a concerted effort has been made to study its transcriptome. This effort is limited, however, by gaps in zebrafish annotation, which are especially pronounced concerning transcripts dynamically expressed during zygotic genome activation...

Descripción completa

Detalles Bibliográficos
Autores principales: Nudelman, German, Frasca, Antonio, Kent, Brandon, Sadler, Kirsten C., Sealfon, Stuart C., Walsh, Martin J., Zaslavsky, Elena
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6120630/
https://www.ncbi.nlm.nih.gov/pubmed/30061115
http://dx.doi.org/10.1101/gr.223586.117
_version_ 1783352310111404032
author Nudelman, German
Frasca, Antonio
Kent, Brandon
Sadler, Kirsten C.
Sealfon, Stuart C.
Walsh, Martin J.
Zaslavsky, Elena
author_facet Nudelman, German
Frasca, Antonio
Kent, Brandon
Sadler, Kirsten C.
Sealfon, Stuart C.
Walsh, Martin J.
Zaslavsky, Elena
author_sort Nudelman, German
collection PubMed
description With the emergence of zebrafish as an important model organism, a concerted effort has been made to study its transcriptome. This effort is limited, however, by gaps in zebrafish annotation, which are especially pronounced concerning transcripts dynamically expressed during zygotic genome activation (ZGA). To date, short-read sequencing has been the principal technology for zebrafish transcriptome annotation. In part because these sequence reads are too short for assembly methods to resolve the full complexity of the transcriptome, the current annotation is rudimentary. By providing direct observation of full-length transcripts, recently refined long-read sequencing platforms can dramatically improve annotation coverage and accuracy. Here, we leveraged the SMRT platform to study the transcriptome of zebrafish embryos before and after ZGA. Our analysis revealed additional novelty and complexity in the zebrafish transcriptome, identifying 2539 high-confidence novel transcripts that originated from previously unannotated loci and 1835 high-confidence new isoforms in previously annotated genes. We validated these findings using a suite of computational approaches including structural prediction, sequence homology, and functional conservation analyses, as well as by confirmatory transcript quantification with short-read sequencing data. Our analyses provided insight into new homologs and paralogs of functionally important proteins and noncoding RNAs, isoform switching occurrences, and different classes of novel splicing events. Several novel isoforms representing distinct splicing events were validated through PCR experiments, including the discovery and validation of a novel 8-kb transcript spanning multiple mir-430 elements, an important driver of early development. Our study provides a significantly improved zebrafish transcriptome annotation resource.
format Online
Article
Text
id pubmed-6120630
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-61206302019-03-01 High resolution annotation of zebrafish transcriptome using long-read sequencing Nudelman, German Frasca, Antonio Kent, Brandon Sadler, Kirsten C. Sealfon, Stuart C. Walsh, Martin J. Zaslavsky, Elena Genome Res Resource With the emergence of zebrafish as an important model organism, a concerted effort has been made to study its transcriptome. This effort is limited, however, by gaps in zebrafish annotation, which are especially pronounced concerning transcripts dynamically expressed during zygotic genome activation (ZGA). To date, short-read sequencing has been the principal technology for zebrafish transcriptome annotation. In part because these sequence reads are too short for assembly methods to resolve the full complexity of the transcriptome, the current annotation is rudimentary. By providing direct observation of full-length transcripts, recently refined long-read sequencing platforms can dramatically improve annotation coverage and accuracy. Here, we leveraged the SMRT platform to study the transcriptome of zebrafish embryos before and after ZGA. Our analysis revealed additional novelty and complexity in the zebrafish transcriptome, identifying 2539 high-confidence novel transcripts that originated from previously unannotated loci and 1835 high-confidence new isoforms in previously annotated genes. We validated these findings using a suite of computational approaches including structural prediction, sequence homology, and functional conservation analyses, as well as by confirmatory transcript quantification with short-read sequencing data. Our analyses provided insight into new homologs and paralogs of functionally important proteins and noncoding RNAs, isoform switching occurrences, and different classes of novel splicing events. Several novel isoforms representing distinct splicing events were validated through PCR experiments, including the discovery and validation of a novel 8-kb transcript spanning multiple mir-430 elements, an important driver of early development. Our study provides a significantly improved zebrafish transcriptome annotation resource. Cold Spring Harbor Laboratory Press 2018-09 /pmc/articles/PMC6120630/ /pubmed/30061115 http://dx.doi.org/10.1101/gr.223586.117 Text en © 2018 Nudelman et al.; Published by Cold Spring Harbor Laboratory Press http://creativecommons.org/licenses/by-nc/4.0/ This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.
spellingShingle Resource
Nudelman, German
Frasca, Antonio
Kent, Brandon
Sadler, Kirsten C.
Sealfon, Stuart C.
Walsh, Martin J.
Zaslavsky, Elena
High resolution annotation of zebrafish transcriptome using long-read sequencing
title High resolution annotation of zebrafish transcriptome using long-read sequencing
title_full High resolution annotation of zebrafish transcriptome using long-read sequencing
title_fullStr High resolution annotation of zebrafish transcriptome using long-read sequencing
title_full_unstemmed High resolution annotation of zebrafish transcriptome using long-read sequencing
title_short High resolution annotation of zebrafish transcriptome using long-read sequencing
title_sort high resolution annotation of zebrafish transcriptome using long-read sequencing
topic Resource
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6120630/
https://www.ncbi.nlm.nih.gov/pubmed/30061115
http://dx.doi.org/10.1101/gr.223586.117
work_keys_str_mv AT nudelmangerman highresolutionannotationofzebrafishtranscriptomeusinglongreadsequencing
AT frascaantonio highresolutionannotationofzebrafishtranscriptomeusinglongreadsequencing
AT kentbrandon highresolutionannotationofzebrafishtranscriptomeusinglongreadsequencing
AT sadlerkirstenc highresolutionannotationofzebrafishtranscriptomeusinglongreadsequencing
AT sealfonstuartc highresolutionannotationofzebrafishtranscriptomeusinglongreadsequencing
AT walshmartinj highresolutionannotationofzebrafishtranscriptomeusinglongreadsequencing
AT zaslavskyelena highresolutionannotationofzebrafishtranscriptomeusinglongreadsequencing