Cargando…
Single-Molecule Real-Time Transcript Sequencing of Turnips Unveiling the Complexity of the Turnip Transcriptome
To generate the full-length transcriptome of Xinjiang green and purple turnips, Brassica rapa var. Rapa, using single-molecule real-time (SMRT) sequencing. The samples of two varieties of Brassica rapa var. Rapa at five developmental stages were collected and combined to perform SMRT sequencing. Mea...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Genetics Society of America
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7534443/ https://www.ncbi.nlm.nih.gov/pubmed/32769136 http://dx.doi.org/10.1534/g3.120.401434 |
_version_ | 1783590314658758656 |
---|---|
author | Zhuang, Hongmei Wang, Qiang Han, Hongwei Liu, Huifang Wang, Hao |
author_facet | Zhuang, Hongmei Wang, Qiang Han, Hongwei Liu, Huifang Wang, Hao |
author_sort | Zhuang, Hongmei |
collection | PubMed |
description | To generate the full-length transcriptome of Xinjiang green and purple turnips, Brassica rapa var. Rapa, using single-molecule real-time (SMRT) sequencing. The samples of two varieties of Brassica rapa var. Rapa at five developmental stages were collected and combined to perform SMRT sequencing. Meanwhile, next generation sequencing was performed to correct SMRT sequencing data. A series of analyses were performed to investigate the transcript structure. Finally, the obtained transcripts were mapped to the genome of Brassica rapa ssp. pekinesis Chiifu to identify potential novel transcripts. For green turnip (F01), a total of 19.54 Gb clean data were obtained from 8 cells. The number of reads of insert (ROI) and full-length non-chimeric (FLNC) reads were 510,137 and 267,666. In addition, 82,640 consensus isoforms were obtained in the isoform sequences clustering, of which 69,480 were high-quality, and 13,160 low-quality sequences were corrected using Illumina RNA seq data. For purple turnip (F02), there were 20.41 Gb clean data, 552,829 ROIs, and 274,915 FLNC sequences. A total of 93,775 consensus isoforms were obtained, of which 78,798 were high-quality, and the 14,977 low-quality sequences were corrected. Following the removal of redundant sequences, there were 46,516 and 49,429 non-redundant transcripts for F01 and F02, respectively; 7,774 and 9,385 alternative splicing events were predicted for F01 and F02; 63,890 simple sequence repeats, 59,460 complete coding sequences, and 535 long-non coding RNAs were predicted. Moreover, 5,194 and 5,369 novel transcripts were identified by mapping to Brassica rapa ssp. pekinesis Chiifu. The obtained transcriptome data may improve turnip genome annotation and facilitate further study of the Brassica rapa var. Rapa genome and transcriptome. |
format | Online Article Text |
id | pubmed-7534443 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Genetics Society of America |
record_format | MEDLINE/PubMed |
spelling | pubmed-75344432020-10-13 Single-Molecule Real-Time Transcript Sequencing of Turnips Unveiling the Complexity of the Turnip Transcriptome Zhuang, Hongmei Wang, Qiang Han, Hongwei Liu, Huifang Wang, Hao G3 (Bethesda) Genome Report To generate the full-length transcriptome of Xinjiang green and purple turnips, Brassica rapa var. Rapa, using single-molecule real-time (SMRT) sequencing. The samples of two varieties of Brassica rapa var. Rapa at five developmental stages were collected and combined to perform SMRT sequencing. Meanwhile, next generation sequencing was performed to correct SMRT sequencing data. A series of analyses were performed to investigate the transcript structure. Finally, the obtained transcripts were mapped to the genome of Brassica rapa ssp. pekinesis Chiifu to identify potential novel transcripts. For green turnip (F01), a total of 19.54 Gb clean data were obtained from 8 cells. The number of reads of insert (ROI) and full-length non-chimeric (FLNC) reads were 510,137 and 267,666. In addition, 82,640 consensus isoforms were obtained in the isoform sequences clustering, of which 69,480 were high-quality, and 13,160 low-quality sequences were corrected using Illumina RNA seq data. For purple turnip (F02), there were 20.41 Gb clean data, 552,829 ROIs, and 274,915 FLNC sequences. A total of 93,775 consensus isoforms were obtained, of which 78,798 were high-quality, and the 14,977 low-quality sequences were corrected. Following the removal of redundant sequences, there were 46,516 and 49,429 non-redundant transcripts for F01 and F02, respectively; 7,774 and 9,385 alternative splicing events were predicted for F01 and F02; 63,890 simple sequence repeats, 59,460 complete coding sequences, and 535 long-non coding RNAs were predicted. Moreover, 5,194 and 5,369 novel transcripts were identified by mapping to Brassica rapa ssp. pekinesis Chiifu. The obtained transcriptome data may improve turnip genome annotation and facilitate further study of the Brassica rapa var. Rapa genome and transcriptome. Genetics Society of America 2020-08-07 /pmc/articles/PMC7534443/ /pubmed/32769136 http://dx.doi.org/10.1534/g3.120.401434 Text en Copyright © 2020 Zhuang et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Genome Report Zhuang, Hongmei Wang, Qiang Han, Hongwei Liu, Huifang Wang, Hao Single-Molecule Real-Time Transcript Sequencing of Turnips Unveiling the Complexity of the Turnip Transcriptome |
title | Single-Molecule Real-Time Transcript Sequencing of Turnips Unveiling the Complexity of the Turnip Transcriptome |
title_full | Single-Molecule Real-Time Transcript Sequencing of Turnips Unveiling the Complexity of the Turnip Transcriptome |
title_fullStr | Single-Molecule Real-Time Transcript Sequencing of Turnips Unveiling the Complexity of the Turnip Transcriptome |
title_full_unstemmed | Single-Molecule Real-Time Transcript Sequencing of Turnips Unveiling the Complexity of the Turnip Transcriptome |
title_short | Single-Molecule Real-Time Transcript Sequencing of Turnips Unveiling the Complexity of the Turnip Transcriptome |
title_sort | single-molecule real-time transcript sequencing of turnips unveiling the complexity of the turnip transcriptome |
topic | Genome Report |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7534443/ https://www.ncbi.nlm.nih.gov/pubmed/32769136 http://dx.doi.org/10.1534/g3.120.401434 |
work_keys_str_mv | AT zhuanghongmei singlemoleculerealtimetranscriptsequencingofturnipsunveilingthecomplexityoftheturniptranscriptome AT wangqiang singlemoleculerealtimetranscriptsequencingofturnipsunveilingthecomplexityoftheturniptranscriptome AT hanhongwei singlemoleculerealtimetranscriptsequencingofturnipsunveilingthecomplexityoftheturniptranscriptome AT liuhuifang singlemoleculerealtimetranscriptsequencingofturnipsunveilingthecomplexityoftheturniptranscriptome AT wanghao singlemoleculerealtimetranscriptsequencingofturnipsunveilingthecomplexityoftheturniptranscriptome |