Cargando…

Genome annotation with long RNA reads reveals new patterns of gene expression and improves single-cell analyses in an ant brain

BACKGROUND: Functional genomic analyses rely on high-quality genome assemblies and annotations. Highly contiguous genome assemblies have become available for a variety of species, but accurate and complete annotation of gene models, inclusive of alternative splice isoforms and transcription start an...

Descripción completa

Detalles Bibliográficos
Autores principales: Shields, Emily J., Sorida, Masato, Sheng, Lihong, Sieriebriennikov, Bogdan, Ding, Long, Bonasio, Roberto
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8626913/
https://www.ncbi.nlm.nih.gov/pubmed/34838024
http://dx.doi.org/10.1186/s12915-021-01188-w
_version_ 1784606750126637056
author Shields, Emily J.
Sorida, Masato
Sheng, Lihong
Sieriebriennikov, Bogdan
Ding, Long
Bonasio, Roberto
author_facet Shields, Emily J.
Sorida, Masato
Sheng, Lihong
Sieriebriennikov, Bogdan
Ding, Long
Bonasio, Roberto
author_sort Shields, Emily J.
collection PubMed
description BACKGROUND: Functional genomic analyses rely on high-quality genome assemblies and annotations. Highly contiguous genome assemblies have become available for a variety of species, but accurate and complete annotation of gene models, inclusive of alternative splice isoforms and transcription start and termination sites, remains difficult with traditional approaches. RESULTS: Here, we utilized full-length isoform sequencing (Iso-Seq), a long-read RNA sequencing technology, to obtain a comprehensive annotation of the transcriptome of the ant Harpegnathos saltator. The improved genome annotations include additional splice isoforms and extended 3′ untranslated regions for more than 4000 genes. Reanalysis of RNA-seq experiments using these annotations revealed several genes with caste-specific differential expression and tissue- or caste-specific splicing patterns that were missed in previous analyses. The extended 3′ untranslated regions afforded great improvements in the analysis of existing single-cell RNA-seq data, resulting in the recovery of the transcriptomes of 18% more cells. The deeper single-cell transcriptomes obtained with these new annotations allowed us to identify additional markers for several cell types in the ant brain, as well as genes differentially expressed across castes in specific cell types. CONCLUSIONS: Our results demonstrate that Iso-Seq is an efficient and effective approach to improve genome annotations and maximize the amount of information that can be obtained from existing and future genomic datasets in Harpegnathos and other organisms. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12915-021-01188-w.
format Online
Article
Text
id pubmed-8626913
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-86269132021-11-29 Genome annotation with long RNA reads reveals new patterns of gene expression and improves single-cell analyses in an ant brain Shields, Emily J. Sorida, Masato Sheng, Lihong Sieriebriennikov, Bogdan Ding, Long Bonasio, Roberto BMC Biol Research Article BACKGROUND: Functional genomic analyses rely on high-quality genome assemblies and annotations. Highly contiguous genome assemblies have become available for a variety of species, but accurate and complete annotation of gene models, inclusive of alternative splice isoforms and transcription start and termination sites, remains difficult with traditional approaches. RESULTS: Here, we utilized full-length isoform sequencing (Iso-Seq), a long-read RNA sequencing technology, to obtain a comprehensive annotation of the transcriptome of the ant Harpegnathos saltator. The improved genome annotations include additional splice isoforms and extended 3′ untranslated regions for more than 4000 genes. Reanalysis of RNA-seq experiments using these annotations revealed several genes with caste-specific differential expression and tissue- or caste-specific splicing patterns that were missed in previous analyses. The extended 3′ untranslated regions afforded great improvements in the analysis of existing single-cell RNA-seq data, resulting in the recovery of the transcriptomes of 18% more cells. The deeper single-cell transcriptomes obtained with these new annotations allowed us to identify additional markers for several cell types in the ant brain, as well as genes differentially expressed across castes in specific cell types. CONCLUSIONS: Our results demonstrate that Iso-Seq is an efficient and effective approach to improve genome annotations and maximize the amount of information that can be obtained from existing and future genomic datasets in Harpegnathos and other organisms. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12915-021-01188-w. BioMed Central 2021-11-27 /pmc/articles/PMC8626913/ /pubmed/34838024 http://dx.doi.org/10.1186/s12915-021-01188-w Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Shields, Emily J.
Sorida, Masato
Sheng, Lihong
Sieriebriennikov, Bogdan
Ding, Long
Bonasio, Roberto
Genome annotation with long RNA reads reveals new patterns of gene expression and improves single-cell analyses in an ant brain
title Genome annotation with long RNA reads reveals new patterns of gene expression and improves single-cell analyses in an ant brain
title_full Genome annotation with long RNA reads reveals new patterns of gene expression and improves single-cell analyses in an ant brain
title_fullStr Genome annotation with long RNA reads reveals new patterns of gene expression and improves single-cell analyses in an ant brain
title_full_unstemmed Genome annotation with long RNA reads reveals new patterns of gene expression and improves single-cell analyses in an ant brain
title_short Genome annotation with long RNA reads reveals new patterns of gene expression and improves single-cell analyses in an ant brain
title_sort genome annotation with long rna reads reveals new patterns of gene expression and improves single-cell analyses in an ant brain
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8626913/
https://www.ncbi.nlm.nih.gov/pubmed/34838024
http://dx.doi.org/10.1186/s12915-021-01188-w
work_keys_str_mv AT shieldsemilyj genomeannotationwithlongrnareadsrevealsnewpatternsofgeneexpressionandimprovessinglecellanalysesinanantbrain
AT soridamasato genomeannotationwithlongrnareadsrevealsnewpatternsofgeneexpressionandimprovessinglecellanalysesinanantbrain
AT shenglihong genomeannotationwithlongrnareadsrevealsnewpatternsofgeneexpressionandimprovessinglecellanalysesinanantbrain
AT sieriebriennikovbogdan genomeannotationwithlongrnareadsrevealsnewpatternsofgeneexpressionandimprovessinglecellanalysesinanantbrain
AT dinglong genomeannotationwithlongrnareadsrevealsnewpatternsofgeneexpressionandimprovessinglecellanalysesinanantbrain
AT bonasioroberto genomeannotationwithlongrnareadsrevealsnewpatternsofgeneexpressionandimprovessinglecellanalysesinanantbrain