Cargando…

Single-Molecule Real-Time Transcript Sequencing of Turnips Unveiling the Complexity of the Turnip Transcriptome

To generate the full-length transcriptome of Xinjiang green and purple turnips, Brassica rapa var. Rapa, using single-molecule real-time (SMRT) sequencing. The samples of two varieties of Brassica rapa var. Rapa at five developmental stages were collected and combined to perform SMRT sequencing. Mea...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhuang, Hongmei, Wang, Qiang, Han, Hongwei, Liu, Huifang, Wang, Hao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Genetics Society of America 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7534443/
https://www.ncbi.nlm.nih.gov/pubmed/32769136
http://dx.doi.org/10.1534/g3.120.401434
_version_ 1783590314658758656
author Zhuang, Hongmei
Wang, Qiang
Han, Hongwei
Liu, Huifang
Wang, Hao
author_facet Zhuang, Hongmei
Wang, Qiang
Han, Hongwei
Liu, Huifang
Wang, Hao
author_sort Zhuang, Hongmei
collection PubMed
description To generate the full-length transcriptome of Xinjiang green and purple turnips, Brassica rapa var. Rapa, using single-molecule real-time (SMRT) sequencing. The samples of two varieties of Brassica rapa var. Rapa at five developmental stages were collected and combined to perform SMRT sequencing. Meanwhile, next generation sequencing was performed to correct SMRT sequencing data. A series of analyses were performed to investigate the transcript structure. Finally, the obtained transcripts were mapped to the genome of Brassica rapa ssp. pekinesis Chiifu to identify potential novel transcripts. For green turnip (F01), a total of 19.54 Gb clean data were obtained from 8 cells. The number of reads of insert (ROI) and full-length non-chimeric (FLNC) reads were 510,137 and 267,666. In addition, 82,640 consensus isoforms were obtained in the isoform sequences clustering, of which 69,480 were high-quality, and 13,160 low-quality sequences were corrected using Illumina RNA seq data. For purple turnip (F02), there were 20.41 Gb clean data, 552,829 ROIs, and 274,915 FLNC sequences. A total of 93,775 consensus isoforms were obtained, of which 78,798 were high-quality, and the 14,977 low-quality sequences were corrected. Following the removal of redundant sequences, there were 46,516 and 49,429 non-redundant transcripts for F01 and F02, respectively; 7,774 and 9,385 alternative splicing events were predicted for F01 and F02; 63,890 simple sequence repeats, 59,460 complete coding sequences, and 535 long-non coding RNAs were predicted. Moreover, 5,194 and 5,369 novel transcripts were identified by mapping to Brassica rapa ssp. pekinesis Chiifu. The obtained transcriptome data may improve turnip genome annotation and facilitate further study of the Brassica rapa var. Rapa genome and transcriptome.
format Online
Article
Text
id pubmed-7534443
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Genetics Society of America
record_format MEDLINE/PubMed
spelling pubmed-75344432020-10-13 Single-Molecule Real-Time Transcript Sequencing of Turnips Unveiling the Complexity of the Turnip Transcriptome Zhuang, Hongmei Wang, Qiang Han, Hongwei Liu, Huifang Wang, Hao G3 (Bethesda) Genome Report To generate the full-length transcriptome of Xinjiang green and purple turnips, Brassica rapa var. Rapa, using single-molecule real-time (SMRT) sequencing. The samples of two varieties of Brassica rapa var. Rapa at five developmental stages were collected and combined to perform SMRT sequencing. Meanwhile, next generation sequencing was performed to correct SMRT sequencing data. A series of analyses were performed to investigate the transcript structure. Finally, the obtained transcripts were mapped to the genome of Brassica rapa ssp. pekinesis Chiifu to identify potential novel transcripts. For green turnip (F01), a total of 19.54 Gb clean data were obtained from 8 cells. The number of reads of insert (ROI) and full-length non-chimeric (FLNC) reads were 510,137 and 267,666. In addition, 82,640 consensus isoforms were obtained in the isoform sequences clustering, of which 69,480 were high-quality, and 13,160 low-quality sequences were corrected using Illumina RNA seq data. For purple turnip (F02), there were 20.41 Gb clean data, 552,829 ROIs, and 274,915 FLNC sequences. A total of 93,775 consensus isoforms were obtained, of which 78,798 were high-quality, and the 14,977 low-quality sequences were corrected. Following the removal of redundant sequences, there were 46,516 and 49,429 non-redundant transcripts for F01 and F02, respectively; 7,774 and 9,385 alternative splicing events were predicted for F01 and F02; 63,890 simple sequence repeats, 59,460 complete coding sequences, and 535 long-non coding RNAs were predicted. Moreover, 5,194 and 5,369 novel transcripts were identified by mapping to Brassica rapa ssp. pekinesis Chiifu. The obtained transcriptome data may improve turnip genome annotation and facilitate further study of the Brassica rapa var. Rapa genome and transcriptome. Genetics Society of America 2020-08-07 /pmc/articles/PMC7534443/ /pubmed/32769136 http://dx.doi.org/10.1534/g3.120.401434 Text en Copyright © 2020 Zhuang et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Genome Report
Zhuang, Hongmei
Wang, Qiang
Han, Hongwei
Liu, Huifang
Wang, Hao
Single-Molecule Real-Time Transcript Sequencing of Turnips Unveiling the Complexity of the Turnip Transcriptome
title Single-Molecule Real-Time Transcript Sequencing of Turnips Unveiling the Complexity of the Turnip Transcriptome
title_full Single-Molecule Real-Time Transcript Sequencing of Turnips Unveiling the Complexity of the Turnip Transcriptome
title_fullStr Single-Molecule Real-Time Transcript Sequencing of Turnips Unveiling the Complexity of the Turnip Transcriptome
title_full_unstemmed Single-Molecule Real-Time Transcript Sequencing of Turnips Unveiling the Complexity of the Turnip Transcriptome
title_short Single-Molecule Real-Time Transcript Sequencing of Turnips Unveiling the Complexity of the Turnip Transcriptome
title_sort single-molecule real-time transcript sequencing of turnips unveiling the complexity of the turnip transcriptome
topic Genome Report
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7534443/
https://www.ncbi.nlm.nih.gov/pubmed/32769136
http://dx.doi.org/10.1534/g3.120.401434
work_keys_str_mv AT zhuanghongmei singlemoleculerealtimetranscriptsequencingofturnipsunveilingthecomplexityoftheturniptranscriptome
AT wangqiang singlemoleculerealtimetranscriptsequencingofturnipsunveilingthecomplexityoftheturniptranscriptome
AT hanhongwei singlemoleculerealtimetranscriptsequencingofturnipsunveilingthecomplexityoftheturniptranscriptome
AT liuhuifang singlemoleculerealtimetranscriptsequencingofturnipsunveilingthecomplexityoftheturniptranscriptome
AT wanghao singlemoleculerealtimetranscriptsequencingofturnipsunveilingthecomplexityoftheturniptranscriptome