Cargando…

Exploring the transcriptome of non-model oleaginous microalga Dunaliella tertiolecta through high-throughput sequencing and high performance computing

BACKGROUND: RNA-Seq technology has received a lot of attention in recent years for microalgal global transcriptomic profiling. It is widely used in transcriptome-wide analysis of gene expression., particularly for microalgal strains with potential as biofuel sources. However, insufficient genomic or...

Descripción completa

Detalles Bibliográficos
Autores principales: Yao, Lina, Tan, Kenneth Wei Min, Tan, Tin Wee, Lee, Yuan Kun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5322580/
https://www.ncbi.nlm.nih.gov/pubmed/28228091
http://dx.doi.org/10.1186/s12859-017-1551-x
_version_ 1782509874051022848
author Yao, Lina
Tan, Kenneth Wei Min
Tan, Tin Wee
Lee, Yuan Kun
author_facet Yao, Lina
Tan, Kenneth Wei Min
Tan, Tin Wee
Lee, Yuan Kun
author_sort Yao, Lina
collection PubMed
description BACKGROUND: RNA-Seq technology has received a lot of attention in recent years for microalgal global transcriptomic profiling. It is widely used in transcriptome-wide analysis of gene expression., particularly for microalgal strains with potential as biofuel sources. However, insufficient genomic or transcriptomic information of non-model microalgae has limited the understanding of their regulatory mechanisms and hampered genetic manipulation to enhance biofuel production. As such, an optimal microalgal transcriptomic database construction is a subject of urgent investigation. RESULTS: Dunaliella tertiolecta, a non-model oleaginous microalgal species, was sequenced via Illumina MISEQ and HISEQ 4000 in RNA-Seq studies. The high quality high-throughout sequencing data were explored using high performance computing (HPC) in a petascale data center and subjected to de novo assembly and parallelized mpiBLASTX search with multiple species. As a result, a transcriptome database of 17,845 was constructed (~95% completeness). This enlarged database constructed fueled the RNA-Seq data analysis, which was validated by a nitrogen deprivation (ND) study that induces triacylglycerol (TAG) production. CONCLUSIONS: The new paralleled assembly and annotation method under HPC presented here allows the solution of large-scale data processing problems in acceptable computation time. There is significant increase in the number of transcriptomic data achieved and observable heterogeneity in the performance to identify differentially expressed genes in the ND treatment paradigm. The results provide new insights as to how response to ND treatment in microalgae is regulated. ND analyses highlight the advantages of this database generated in this study that could also serve as a useful resource for future gene manipulation and transcriptome-wide analysis. We thus demonstrate the usefulness of exploring the transcriptome as an informative platform for functional studies and genetic manipulations in similar species. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1551-x) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5322580
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-53225802017-03-01 Exploring the transcriptome of non-model oleaginous microalga Dunaliella tertiolecta through high-throughput sequencing and high performance computing Yao, Lina Tan, Kenneth Wei Min Tan, Tin Wee Lee, Yuan Kun BMC Bioinformatics Research Article BACKGROUND: RNA-Seq technology has received a lot of attention in recent years for microalgal global transcriptomic profiling. It is widely used in transcriptome-wide analysis of gene expression., particularly for microalgal strains with potential as biofuel sources. However, insufficient genomic or transcriptomic information of non-model microalgae has limited the understanding of their regulatory mechanisms and hampered genetic manipulation to enhance biofuel production. As such, an optimal microalgal transcriptomic database construction is a subject of urgent investigation. RESULTS: Dunaliella tertiolecta, a non-model oleaginous microalgal species, was sequenced via Illumina MISEQ and HISEQ 4000 in RNA-Seq studies. The high quality high-throughout sequencing data were explored using high performance computing (HPC) in a petascale data center and subjected to de novo assembly and parallelized mpiBLASTX search with multiple species. As a result, a transcriptome database of 17,845 was constructed (~95% completeness). This enlarged database constructed fueled the RNA-Seq data analysis, which was validated by a nitrogen deprivation (ND) study that induces triacylglycerol (TAG) production. CONCLUSIONS: The new paralleled assembly and annotation method under HPC presented here allows the solution of large-scale data processing problems in acceptable computation time. There is significant increase in the number of transcriptomic data achieved and observable heterogeneity in the performance to identify differentially expressed genes in the ND treatment paradigm. The results provide new insights as to how response to ND treatment in microalgae is regulated. ND analyses highlight the advantages of this database generated in this study that could also serve as a useful resource for future gene manipulation and transcriptome-wide analysis. We thus demonstrate the usefulness of exploring the transcriptome as an informative platform for functional studies and genetic manipulations in similar species. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1551-x) contains supplementary material, which is available to authorized users. BioMed Central 2017-02-22 /pmc/articles/PMC5322580/ /pubmed/28228091 http://dx.doi.org/10.1186/s12859-017-1551-x Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Yao, Lina
Tan, Kenneth Wei Min
Tan, Tin Wee
Lee, Yuan Kun
Exploring the transcriptome of non-model oleaginous microalga Dunaliella tertiolecta through high-throughput sequencing and high performance computing
title Exploring the transcriptome of non-model oleaginous microalga Dunaliella tertiolecta through high-throughput sequencing and high performance computing
title_full Exploring the transcriptome of non-model oleaginous microalga Dunaliella tertiolecta through high-throughput sequencing and high performance computing
title_fullStr Exploring the transcriptome of non-model oleaginous microalga Dunaliella tertiolecta through high-throughput sequencing and high performance computing
title_full_unstemmed Exploring the transcriptome of non-model oleaginous microalga Dunaliella tertiolecta through high-throughput sequencing and high performance computing
title_short Exploring the transcriptome of non-model oleaginous microalga Dunaliella tertiolecta through high-throughput sequencing and high performance computing
title_sort exploring the transcriptome of non-model oleaginous microalga dunaliella tertiolecta through high-throughput sequencing and high performance computing
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5322580/
https://www.ncbi.nlm.nih.gov/pubmed/28228091
http://dx.doi.org/10.1186/s12859-017-1551-x
work_keys_str_mv AT yaolina exploringthetranscriptomeofnonmodeloleaginousmicroalgadunaliellatertiolectathroughhighthroughputsequencingandhighperformancecomputing
AT tankennethweimin exploringthetranscriptomeofnonmodeloleaginousmicroalgadunaliellatertiolectathroughhighthroughputsequencingandhighperformancecomputing
AT tantinwee exploringthetranscriptomeofnonmodeloleaginousmicroalgadunaliellatertiolectathroughhighthroughputsequencingandhighperformancecomputing
AT leeyuankun exploringthetranscriptomeofnonmodeloleaginousmicroalgadunaliellatertiolectathroughhighthroughputsequencingandhighperformancecomputing