Cargando…

PacBio single molecule long-read sequencing provides insight into the complexity and diversity of the Pinctada fucata martensii transcriptome

BACKGROUND: The pearl oyster Pinctada fucata martensii is an economically valuable shellfish for seawater pearl production, and production of pearls depends on its growth. To date, the molecular mechanisms of the growth of this species remain poorly understood. The transcriptome sequencing has been...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Hua, Xu, Hanzhi, Liu, Huiru, Pan, Xiaolan, Xu, Meng, Zhang, Gege, He, Maoxian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7359550/
https://www.ncbi.nlm.nih.gov/pubmed/32660426
http://dx.doi.org/10.1186/s12864-020-06894-3
_version_ 1783559071795773440
author Zhang, Hua
Xu, Hanzhi
Liu, Huiru
Pan, Xiaolan
Xu, Meng
Zhang, Gege
He, Maoxian
author_facet Zhang, Hua
Xu, Hanzhi
Liu, Huiru
Pan, Xiaolan
Xu, Meng
Zhang, Gege
He, Maoxian
author_sort Zhang, Hua
collection PubMed
description BACKGROUND: The pearl oyster Pinctada fucata martensii is an economically valuable shellfish for seawater pearl production, and production of pearls depends on its growth. To date, the molecular mechanisms of the growth of this species remain poorly understood. The transcriptome sequencing has been considered to understanding of the complexity of mechanisms of the growth of P. f. martensii. The recently released genome sequences of P. f. martensii, as well as emerging Pacific Bioscience (PacBio) single-molecular sequencing technologies, provide an opportunity to thoroughly investigate these molecular mechanisms. RESULTS: Herein, the full-length transcriptome was analysed by combining PacBio single-molecule long-read sequencing (PacBio sequencing) and Illumina sequencing. A total of 20.65 Gb of clean data were generated, including 574,561 circular consensus reads, among which 443,944 full-length non-chimeric (FLNC) sequences were identified. Through transcript clustering analysis of FLNC reads, 32,755 consensus isoforms were identified, including 32,095 high-quality consensus sequences. After removing redundant reads, 16,388 transcripts were obtained, and 641 fusion transcripts were derived by performing fusion transcript prediction of consensus sequences. Alternative splicing analysis of the 16,388 transcripts was performed after accounting for redundancy, and 9097 gene loci were detected, including 1607 new gene loci and 14,946 newly discovered transcripts. The original boundary of 11,235 genes on the chromosomes was corrected, 12,025 complete open reading frame sequences and 635 long non-coding RNAs (LncRNAs) were predicted, and functional annotation of 13,482 new transcripts was achieved. Two thousand three hundred eighteen alternative splicing events were detected. A total of 228 differentially expressed transcripts (DETs) were identified between the largest (L) and smallest (S) pearl oysters. Compared with the S, the L showed 99 and 129 significantly up-and down-regulated DETs, respectively. Six of these DETs were further confirmed by quantitative real-time RT-PCR (RT-qPCR) in independent experiment. CONCLUSIONS: Our results significantly improve existing gene models and genome annotations, optimise the genome structure, and in-depth understanding of the complexity and diversity of the differential growth patterns of P. f. martensii.
format Online
Article
Text
id pubmed-7359550
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-73595502020-07-17 PacBio single molecule long-read sequencing provides insight into the complexity and diversity of the Pinctada fucata martensii transcriptome Zhang, Hua Xu, Hanzhi Liu, Huiru Pan, Xiaolan Xu, Meng Zhang, Gege He, Maoxian BMC Genomics Research Article BACKGROUND: The pearl oyster Pinctada fucata martensii is an economically valuable shellfish for seawater pearl production, and production of pearls depends on its growth. To date, the molecular mechanisms of the growth of this species remain poorly understood. The transcriptome sequencing has been considered to understanding of the complexity of mechanisms of the growth of P. f. martensii. The recently released genome sequences of P. f. martensii, as well as emerging Pacific Bioscience (PacBio) single-molecular sequencing technologies, provide an opportunity to thoroughly investigate these molecular mechanisms. RESULTS: Herein, the full-length transcriptome was analysed by combining PacBio single-molecule long-read sequencing (PacBio sequencing) and Illumina sequencing. A total of 20.65 Gb of clean data were generated, including 574,561 circular consensus reads, among which 443,944 full-length non-chimeric (FLNC) sequences were identified. Through transcript clustering analysis of FLNC reads, 32,755 consensus isoforms were identified, including 32,095 high-quality consensus sequences. After removing redundant reads, 16,388 transcripts were obtained, and 641 fusion transcripts were derived by performing fusion transcript prediction of consensus sequences. Alternative splicing analysis of the 16,388 transcripts was performed after accounting for redundancy, and 9097 gene loci were detected, including 1607 new gene loci and 14,946 newly discovered transcripts. The original boundary of 11,235 genes on the chromosomes was corrected, 12,025 complete open reading frame sequences and 635 long non-coding RNAs (LncRNAs) were predicted, and functional annotation of 13,482 new transcripts was achieved. Two thousand three hundred eighteen alternative splicing events were detected. A total of 228 differentially expressed transcripts (DETs) were identified between the largest (L) and smallest (S) pearl oysters. Compared with the S, the L showed 99 and 129 significantly up-and down-regulated DETs, respectively. Six of these DETs were further confirmed by quantitative real-time RT-PCR (RT-qPCR) in independent experiment. CONCLUSIONS: Our results significantly improve existing gene models and genome annotations, optimise the genome structure, and in-depth understanding of the complexity and diversity of the differential growth patterns of P. f. martensii. BioMed Central 2020-07-13 /pmc/articles/PMC7359550/ /pubmed/32660426 http://dx.doi.org/10.1186/s12864-020-06894-3 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Zhang, Hua
Xu, Hanzhi
Liu, Huiru
Pan, Xiaolan
Xu, Meng
Zhang, Gege
He, Maoxian
PacBio single molecule long-read sequencing provides insight into the complexity and diversity of the Pinctada fucata martensii transcriptome
title PacBio single molecule long-read sequencing provides insight into the complexity and diversity of the Pinctada fucata martensii transcriptome
title_full PacBio single molecule long-read sequencing provides insight into the complexity and diversity of the Pinctada fucata martensii transcriptome
title_fullStr PacBio single molecule long-read sequencing provides insight into the complexity and diversity of the Pinctada fucata martensii transcriptome
title_full_unstemmed PacBio single molecule long-read sequencing provides insight into the complexity and diversity of the Pinctada fucata martensii transcriptome
title_short PacBio single molecule long-read sequencing provides insight into the complexity and diversity of the Pinctada fucata martensii transcriptome
title_sort pacbio single molecule long-read sequencing provides insight into the complexity and diversity of the pinctada fucata martensii transcriptome
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7359550/
https://www.ncbi.nlm.nih.gov/pubmed/32660426
http://dx.doi.org/10.1186/s12864-020-06894-3
work_keys_str_mv AT zhanghua pacbiosinglemoleculelongreadsequencingprovidesinsightintothecomplexityanddiversityofthepinctadafucatamartensiitranscriptome
AT xuhanzhi pacbiosinglemoleculelongreadsequencingprovidesinsightintothecomplexityanddiversityofthepinctadafucatamartensiitranscriptome
AT liuhuiru pacbiosinglemoleculelongreadsequencingprovidesinsightintothecomplexityanddiversityofthepinctadafucatamartensiitranscriptome
AT panxiaolan pacbiosinglemoleculelongreadsequencingprovidesinsightintothecomplexityanddiversityofthepinctadafucatamartensiitranscriptome
AT xumeng pacbiosinglemoleculelongreadsequencingprovidesinsightintothecomplexityanddiversityofthepinctadafucatamartensiitranscriptome
AT zhanggege pacbiosinglemoleculelongreadsequencingprovidesinsightintothecomplexityanddiversityofthepinctadafucatamartensiitranscriptome
AT hemaoxian pacbiosinglemoleculelongreadsequencingprovidesinsightintothecomplexityanddiversityofthepinctadafucatamartensiitranscriptome