Cargando…

Integrative analysis of large scale transcriptome data draws a comprehensive landscape of Phaeodactylum tricornutum genome and evolutionary origin of diatoms

Diatoms are one of the most successful and ecologically important groups of eukaryotic phytoplankton in the modern ocean. Deciphering their genomes is a key step towards better understanding of their biological innovations, evolutionary origins, and ecological underpinnings. Here, we have used 90 RN...

Descripción completa

Detalles Bibliográficos
Autores principales: Rastogi, Achal, Maheswari, Uma, Dorrell, Richard G., Vieira, Fabio Rocha Jimenez, Maumus, Florian, Kustka, Adam, McCarthy, James, Allen, Andy E., Kersey, Paul, Bowler, Chris, Tirichine, Leila
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5859163/
https://www.ncbi.nlm.nih.gov/pubmed/29556065
http://dx.doi.org/10.1038/s41598-018-23106-x
_version_ 1783307764674592768
author Rastogi, Achal
Maheswari, Uma
Dorrell, Richard G.
Vieira, Fabio Rocha Jimenez
Maumus, Florian
Kustka, Adam
McCarthy, James
Allen, Andy E.
Kersey, Paul
Bowler, Chris
Tirichine, Leila
author_facet Rastogi, Achal
Maheswari, Uma
Dorrell, Richard G.
Vieira, Fabio Rocha Jimenez
Maumus, Florian
Kustka, Adam
McCarthy, James
Allen, Andy E.
Kersey, Paul
Bowler, Chris
Tirichine, Leila
author_sort Rastogi, Achal
collection PubMed
description Diatoms are one of the most successful and ecologically important groups of eukaryotic phytoplankton in the modern ocean. Deciphering their genomes is a key step towards better understanding of their biological innovations, evolutionary origins, and ecological underpinnings. Here, we have used 90 RNA-Seq datasets from different growth conditions combined with published expressed sequence tags and protein sequences from multiple taxa to explore the genome of the model diatom Phaeodactylum tricornutum, and introduce 1,489 novel genes. The new annotation additionally permitted the discovery of extensive alternative splicing in diatoms, including intron retention and exon skipping, which increase the diversity of transcripts generated in changing environments. In addition, we have used up-to-date reference sequence libraries to dissect the taxonomic origins of diatom genes. We show that the P. tricornutum genome is enriched in lineage-specific genes, with up to 47% of the gene models present only possessing orthologues in other stramenopile groups. Finally, we have performed a comprehensive de novo annotation of repetitive elements showing novel classes of transposable elements such as SINE, MITE and TRIM/LARD. This work provides a solid foundation for future studies of diatom gene function, evolution and ecology.
format Online
Article
Text
id pubmed-5859163
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-58591632018-03-20 Integrative analysis of large scale transcriptome data draws a comprehensive landscape of Phaeodactylum tricornutum genome and evolutionary origin of diatoms Rastogi, Achal Maheswari, Uma Dorrell, Richard G. Vieira, Fabio Rocha Jimenez Maumus, Florian Kustka, Adam McCarthy, James Allen, Andy E. Kersey, Paul Bowler, Chris Tirichine, Leila Sci Rep Article Diatoms are one of the most successful and ecologically important groups of eukaryotic phytoplankton in the modern ocean. Deciphering their genomes is a key step towards better understanding of their biological innovations, evolutionary origins, and ecological underpinnings. Here, we have used 90 RNA-Seq datasets from different growth conditions combined with published expressed sequence tags and protein sequences from multiple taxa to explore the genome of the model diatom Phaeodactylum tricornutum, and introduce 1,489 novel genes. The new annotation additionally permitted the discovery of extensive alternative splicing in diatoms, including intron retention and exon skipping, which increase the diversity of transcripts generated in changing environments. In addition, we have used up-to-date reference sequence libraries to dissect the taxonomic origins of diatom genes. We show that the P. tricornutum genome is enriched in lineage-specific genes, with up to 47% of the gene models present only possessing orthologues in other stramenopile groups. Finally, we have performed a comprehensive de novo annotation of repetitive elements showing novel classes of transposable elements such as SINE, MITE and TRIM/LARD. This work provides a solid foundation for future studies of diatom gene function, evolution and ecology. Nature Publishing Group UK 2018-03-19 /pmc/articles/PMC5859163/ /pubmed/29556065 http://dx.doi.org/10.1038/s41598-018-23106-x Text en © The Author(s) 2018 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Rastogi, Achal
Maheswari, Uma
Dorrell, Richard G.
Vieira, Fabio Rocha Jimenez
Maumus, Florian
Kustka, Adam
McCarthy, James
Allen, Andy E.
Kersey, Paul
Bowler, Chris
Tirichine, Leila
Integrative analysis of large scale transcriptome data draws a comprehensive landscape of Phaeodactylum tricornutum genome and evolutionary origin of diatoms
title Integrative analysis of large scale transcriptome data draws a comprehensive landscape of Phaeodactylum tricornutum genome and evolutionary origin of diatoms
title_full Integrative analysis of large scale transcriptome data draws a comprehensive landscape of Phaeodactylum tricornutum genome and evolutionary origin of diatoms
title_fullStr Integrative analysis of large scale transcriptome data draws a comprehensive landscape of Phaeodactylum tricornutum genome and evolutionary origin of diatoms
title_full_unstemmed Integrative analysis of large scale transcriptome data draws a comprehensive landscape of Phaeodactylum tricornutum genome and evolutionary origin of diatoms
title_short Integrative analysis of large scale transcriptome data draws a comprehensive landscape of Phaeodactylum tricornutum genome and evolutionary origin of diatoms
title_sort integrative analysis of large scale transcriptome data draws a comprehensive landscape of phaeodactylum tricornutum genome and evolutionary origin of diatoms
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5859163/
https://www.ncbi.nlm.nih.gov/pubmed/29556065
http://dx.doi.org/10.1038/s41598-018-23106-x
work_keys_str_mv AT rastogiachal integrativeanalysisoflargescaletranscriptomedatadrawsacomprehensivelandscapeofphaeodactylumtricornutumgenomeandevolutionaryoriginofdiatoms
AT maheswariuma integrativeanalysisoflargescaletranscriptomedatadrawsacomprehensivelandscapeofphaeodactylumtricornutumgenomeandevolutionaryoriginofdiatoms
AT dorrellrichardg integrativeanalysisoflargescaletranscriptomedatadrawsacomprehensivelandscapeofphaeodactylumtricornutumgenomeandevolutionaryoriginofdiatoms
AT vieirafabiorochajimenez integrativeanalysisoflargescaletranscriptomedatadrawsacomprehensivelandscapeofphaeodactylumtricornutumgenomeandevolutionaryoriginofdiatoms
AT maumusflorian integrativeanalysisoflargescaletranscriptomedatadrawsacomprehensivelandscapeofphaeodactylumtricornutumgenomeandevolutionaryoriginofdiatoms
AT kustkaadam integrativeanalysisoflargescaletranscriptomedatadrawsacomprehensivelandscapeofphaeodactylumtricornutumgenomeandevolutionaryoriginofdiatoms
AT mccarthyjames integrativeanalysisoflargescaletranscriptomedatadrawsacomprehensivelandscapeofphaeodactylumtricornutumgenomeandevolutionaryoriginofdiatoms
AT allenandye integrativeanalysisoflargescaletranscriptomedatadrawsacomprehensivelandscapeofphaeodactylumtricornutumgenomeandevolutionaryoriginofdiatoms
AT kerseypaul integrativeanalysisoflargescaletranscriptomedatadrawsacomprehensivelandscapeofphaeodactylumtricornutumgenomeandevolutionaryoriginofdiatoms
AT bowlerchris integrativeanalysisoflargescaletranscriptomedatadrawsacomprehensivelandscapeofphaeodactylumtricornutumgenomeandevolutionaryoriginofdiatoms
AT tirichineleila integrativeanalysisoflargescaletranscriptomedatadrawsacomprehensivelandscapeofphaeodactylumtricornutumgenomeandevolutionaryoriginofdiatoms