Optimization of De Novo Short Read Assembly of Seabuckthorn (Hippophae rhamnoides L.) Transcriptome
Seabuckthorn (Hippophae rhamnoides L.) has been known for its medicinal, nutritional and environmental importance since ancient times. However, very limited efforts have been made to characterize the genome and transcriptome of this wonder plant. Here, we report the use of next-generation massive paralle...
Main authors: | Ghangal, Rajesh; Chaudhary, Saurabh; Jain, Mukesh; Purty, Ram Singh; Chand Sharma, Prakash |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Public Library of Science 2013 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3749127/ https://www.ncbi.nlm.nih.gov/pubmed/23991119 http://dx.doi.org/10.1371/journal.pone.0072516 |
_version_ | 1782281150196088832 |
---|---|
author | Ghangal, Rajesh; Chaudhary, Saurabh; Jain, Mukesh; Purty, Ram Singh; Chand Sharma, Prakash |
author_facet | Ghangal, Rajesh; Chaudhary, Saurabh; Jain, Mukesh; Purty, Ram Singh; Chand Sharma, Prakash |
author_sort | Ghangal, Rajesh |
collection | PubMed |
description | Seabuckthorn (Hippophae rhamnoides L.) has been known for its medicinal, nutritional and environmental importance since ancient times. However, very limited efforts have been made to characterize the genome and transcriptome of this wonder plant. Here, we report the use of next-generation massively parallel sequencing technology (Illumina platform) and de novo assembly to gain a comprehensive view of the seabuckthorn transcriptome. We assembled 86,253,874 high-quality short reads using six assembly tools. In our hands, assembly of non-redundant short reads following a two-step procedure was found to be the best across the assembly quality parameters considered. First, the ABySS tool was used following an additive k-mer approach. The assembled transcripts were then processed with the TGICL suite. The final de novo short read assembly yielded 88,297 transcripts (> 100 bp), representing about 53 Mb of the seabuckthorn transcriptome. The average transcript length was 610 bp, the N50 length was 1,198 bp, and 91% of the short reads mapped back uniquely to the seabuckthorn transcriptome. A total of 41,340 (46.8%) transcripts showed significant similarity to sequences in the NCBI nr protein database (E-value < 1E-06). We also screened the assembled transcripts for the presence of transcription factors and simple sequence repeats. Our strategy of using a short read assembler (ABySS) followed by TGICL will be useful to researchers working with the transcriptome of a non-model organism, as it saves time and reduces the complexity of data management. The seabuckthorn transcriptome data generated here provide a valuable resource for gene discovery and the development of functional molecular markers. |
format | Online Article Text |
id | pubmed-3749127 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-3749127 2013-08-29 Optimization of De Novo Short Read Assembly of Seabuckthorn (Hippophae rhamnoides L.) Transcriptome Ghangal, Rajesh Chaudhary, Saurabh Jain, Mukesh Purty, Ram Singh Chand Sharma, Prakash PLoS One Research Article
Public Library of Science 2013-08-21 /pmc/articles/PMC3749127/ /pubmed/23991119 http://dx.doi.org/10.1371/journal.pone.0072516 Text en © 2013 Ghangal et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Ghangal, Rajesh Chaudhary, Saurabh Jain, Mukesh Purty, Ram Singh Chand Sharma, Prakash Optimization of De Novo Short Read Assembly of Seabuckthorn (Hippophae rhamnoides L.) Transcriptome |
title | Optimization of De Novo Short Read Assembly of Seabuckthorn (Hippophae
rhamnoides L.) Transcriptome |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3749127/ https://www.ncbi.nlm.nih.gov/pubmed/23991119 http://dx.doi.org/10.1371/journal.pone.0072516 |
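
The description above reports an average transcript length and an N50 length for the assembly. As a minimal sketch of how such summary statistics are commonly computed from a list of transcript lengths (not the authors' own pipeline; the function names `n50`, `mean_length` and `percent`, and the toy length list, are illustrative assumptions), one might write:

```python
from typing import Sequence


def n50(lengths: Sequence[int]) -> int:
    """Return the N50 of a set of sequence lengths.

    N50 is the length L such that sequences of length >= L together
    contain at least half of all assembled bases.
    """
    if not lengths:
        raise ValueError("at least one length is required")
    half_total = sum(lengths) / 2
    running = 0
    # Walk from the longest sequence down, accumulating bases.
    for length in sorted(lengths, reverse=True):
        running += length
        if running >= half_total:
            return length
    raise AssertionError("unreachable for non-empty input")


def mean_length(lengths: Sequence[int]) -> float:
    """Return the arithmetic mean of the sequence lengths."""
    if not lengths:
        raise ValueError("at least one length is required")
    return sum(lengths) / len(lengths)


def percent(part: int, whole: int) -> float:
    """Return part/whole as a percentage rounded to one decimal place."""
    return round(100 * part / whole, 1)


if __name__ == "__main__":
    # Hypothetical toy lengths, not data from the record.
    toy_lengths = [100, 200, 300, 400, 500]
    print("N50:", n50(toy_lengths))
    print("mean length:", mean_length(toy_lengths))
    # Counts taken from the description: annotated vs. total transcripts.
    print("annotated (%):", percent(41_340, 88_297))
```

In practice the lengths would be read from the assembled FASTA file; that step is omitted here to keep the sketch self-contained.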