Cargando…
Exploring the transcriptome of non-model oleaginous microalga Dunaliella tertiolecta through high-throughput sequencing and high performance computing
BACKGROUND: RNA-Seq technology has received a lot of attention in recent years for microalgal global transcriptomic profiling. It is widely used in transcriptome-wide analysis of gene expression., particularly for microalgal strains with potential as biofuel sources. However, insufficient genomic or...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5322580/ https://www.ncbi.nlm.nih.gov/pubmed/28228091 http://dx.doi.org/10.1186/s12859-017-1551-x |
_version_ | 1782509874051022848 |
---|---|
author | Yao, Lina Tan, Kenneth Wei Min Tan, Tin Wee Lee, Yuan Kun |
author_facet | Yao, Lina Tan, Kenneth Wei Min Tan, Tin Wee Lee, Yuan Kun |
author_sort | Yao, Lina |
collection | PubMed |
description | BACKGROUND: RNA-Seq technology has received a lot of attention in recent years for microalgal global transcriptomic profiling. It is widely used in transcriptome-wide analysis of gene expression., particularly for microalgal strains with potential as biofuel sources. However, insufficient genomic or transcriptomic information of non-model microalgae has limited the understanding of their regulatory mechanisms and hampered genetic manipulation to enhance biofuel production. As such, an optimal microalgal transcriptomic database construction is a subject of urgent investigation. RESULTS: Dunaliella tertiolecta, a non-model oleaginous microalgal species, was sequenced via Illumina MISEQ and HISEQ 4000 in RNA-Seq studies. The high quality high-throughout sequencing data were explored using high performance computing (HPC) in a petascale data center and subjected to de novo assembly and parallelized mpiBLASTX search with multiple species. As a result, a transcriptome database of 17,845 was constructed (~95% completeness). This enlarged database constructed fueled the RNA-Seq data analysis, which was validated by a nitrogen deprivation (ND) study that induces triacylglycerol (TAG) production. CONCLUSIONS: The new paralleled assembly and annotation method under HPC presented here allows the solution of large-scale data processing problems in acceptable computation time. There is significant increase in the number of transcriptomic data achieved and observable heterogeneity in the performance to identify differentially expressed genes in the ND treatment paradigm. The results provide new insights as to how response to ND treatment in microalgae is regulated. ND analyses highlight the advantages of this database generated in this study that could also serve as a useful resource for future gene manipulation and transcriptome-wide analysis. We thus demonstrate the usefulness of exploring the transcriptome as an informative platform for functional studies and genetic manipulations in similar species. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1551-x) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5322580 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-53225802017-03-01 Exploring the transcriptome of non-model oleaginous microalga Dunaliella tertiolecta through high-throughput sequencing and high performance computing Yao, Lina Tan, Kenneth Wei Min Tan, Tin Wee Lee, Yuan Kun BMC Bioinformatics Research Article BACKGROUND: RNA-Seq technology has received a lot of attention in recent years for microalgal global transcriptomic profiling. It is widely used in transcriptome-wide analysis of gene expression., particularly for microalgal strains with potential as biofuel sources. However, insufficient genomic or transcriptomic information of non-model microalgae has limited the understanding of their regulatory mechanisms and hampered genetic manipulation to enhance biofuel production. As such, an optimal microalgal transcriptomic database construction is a subject of urgent investigation. RESULTS: Dunaliella tertiolecta, a non-model oleaginous microalgal species, was sequenced via Illumina MISEQ and HISEQ 4000 in RNA-Seq studies. The high quality high-throughout sequencing data were explored using high performance computing (HPC) in a petascale data center and subjected to de novo assembly and parallelized mpiBLASTX search with multiple species. As a result, a transcriptome database of 17,845 was constructed (~95% completeness). This enlarged database constructed fueled the RNA-Seq data analysis, which was validated by a nitrogen deprivation (ND) study that induces triacylglycerol (TAG) production. CONCLUSIONS: The new paralleled assembly and annotation method under HPC presented here allows the solution of large-scale data processing problems in acceptable computation time. There is significant increase in the number of transcriptomic data achieved and observable heterogeneity in the performance to identify differentially expressed genes in the ND treatment paradigm. The results provide new insights as to how response to ND treatment in microalgae is regulated. ND analyses highlight the advantages of this database generated in this study that could also serve as a useful resource for future gene manipulation and transcriptome-wide analysis. We thus demonstrate the usefulness of exploring the transcriptome as an informative platform for functional studies and genetic manipulations in similar species. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1551-x) contains supplementary material, which is available to authorized users. BioMed Central 2017-02-22 /pmc/articles/PMC5322580/ /pubmed/28228091 http://dx.doi.org/10.1186/s12859-017-1551-x Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Yao, Lina Tan, Kenneth Wei Min Tan, Tin Wee Lee, Yuan Kun Exploring the transcriptome of non-model oleaginous microalga Dunaliella tertiolecta through high-throughput sequencing and high performance computing |
title | Exploring the transcriptome of non-model oleaginous microalga Dunaliella tertiolecta through high-throughput sequencing and high performance computing |
title_full | Exploring the transcriptome of non-model oleaginous microalga Dunaliella tertiolecta through high-throughput sequencing and high performance computing |
title_fullStr | Exploring the transcriptome of non-model oleaginous microalga Dunaliella tertiolecta through high-throughput sequencing and high performance computing |
title_full_unstemmed | Exploring the transcriptome of non-model oleaginous microalga Dunaliella tertiolecta through high-throughput sequencing and high performance computing |
title_short | Exploring the transcriptome of non-model oleaginous microalga Dunaliella tertiolecta through high-throughput sequencing and high performance computing |
title_sort | exploring the transcriptome of non-model oleaginous microalga dunaliella tertiolecta through high-throughput sequencing and high performance computing |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5322580/ https://www.ncbi.nlm.nih.gov/pubmed/28228091 http://dx.doi.org/10.1186/s12859-017-1551-x |
work_keys_str_mv | AT yaolina exploringthetranscriptomeofnonmodeloleaginousmicroalgadunaliellatertiolectathroughhighthroughputsequencingandhighperformancecomputing AT tankennethweimin exploringthetranscriptomeofnonmodeloleaginousmicroalgadunaliellatertiolectathroughhighthroughputsequencingandhighperformancecomputing AT tantinwee exploringthetranscriptomeofnonmodeloleaginousmicroalgadunaliellatertiolectathroughhighthroughputsequencingandhighperformancecomputing AT leeyuankun exploringthetranscriptomeofnonmodeloleaginousmicroalgadunaliellatertiolectathroughhighthroughputsequencingandhighperformancecomputing |