Cargando…

Full-Length SMRT Transcriptome Sequencing and SSR Analysis of Bactrocera dorsalis (Hendel)

SIMPLE SUMMARY: In this study, a full-length transcriptome was analyzed with single-molecule real-time (SMRT) sequencing, which was first used to discover simple sequence repeat (SSR) genetic markers from B. dorsalis. Moreover, SSR markers from isoforms were screened for the identification of specie...

Descripción completa

Detalles Bibliográficos
Autores principales: Ouyang, Huili, Wang, Xiaoyun, Zheng, Xialin, Lu, Wen, Qin, Fengping, Chen, Chao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8537375/
https://www.ncbi.nlm.nih.gov/pubmed/34680707
http://dx.doi.org/10.3390/insects12100938
_version_ 1784588235569102848
author Ouyang, Huili
Wang, Xiaoyun
Zheng, Xialin
Lu, Wen
Qin, Fengping
Chen, Chao
author_facet Ouyang, Huili
Wang, Xiaoyun
Zheng, Xialin
Lu, Wen
Qin, Fengping
Chen, Chao
author_sort Ouyang, Huili
collection PubMed
description SIMPLE SUMMARY: In this study, a full-length transcriptome was analyzed with single-molecule real-time (SMRT) sequencing, which was first used to discover simple sequence repeat (SSR) genetic markers from B. dorsalis. Moreover, SSR markers from isoforms were screened for the identification of species diversity. These results could provide molecular biology methods for further population research. ABSTRACT: Bactrocera dorsalis (Hendel), as one of the most notorious and destructive invasive agricultural pests in the world, causes damage to over 250 different types of fruits and vegetables throughout tropical and subtropical areas. PacBio single-molecule real-time (SMRT) sequencing was used to generate the full-length transcriptome data of B. dorsalis. A total of 40,319,890 subreads (76.6 Gb, clean reads) were generated, including 535,241 circular consensus sequences (CCSs) and 386,916 full-length non-concatemer reads (FLNCs). Transcript cluster analysis of the FLNC reads revealed 22,780 high-quality reads (HQs). In total, 12,274 transcripts were functionally annotated based on four different databases. A total of 1978 SSR loci were distributed throughout 1714 HQ transcripts, of which 1926 were complete SSRs and 52 were complex SSRs. Among the total SSR loci, 2–3 nucleotide repeats were dominant, occupying 83.62%, of which di- and tri- nucleotide repeats were 39.38% and 44.24%, respectively. We detected 105 repeat motifs, of which AT/AT (50.19%), AC/GT (39.15%), CAA/TTG (32.46%), and ACA/TGT (10.86%) were the most common in di- and tri-nucleotide repeats. The repeat SSR motifs were 12–190 bp in length, and 1638 (88.02%) were shorter than 20 bp. According to the randomly selected microsatellite sequence, 80 pairs of primers were designed, and 174 individuals were randomly amplified by PCR using primers. The number of primers that had amplification products with clear bands and showed good polymorphism came to 41, indicating that this was a feasible way to explore SSR markers from the transcriptomic data of B. dorsalis. These results lay a foundation for developing highly polymorphic microsatellites for researching the functional genomics, population genetic structure, and genetic diversity of B. dorsalis.
format Online
Article
Text
id pubmed-8537375
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-85373752021-10-24 Full-Length SMRT Transcriptome Sequencing and SSR Analysis of Bactrocera dorsalis (Hendel) Ouyang, Huili Wang, Xiaoyun Zheng, Xialin Lu, Wen Qin, Fengping Chen, Chao Insects Article SIMPLE SUMMARY: In this study, a full-length transcriptome was analyzed with single-molecule real-time (SMRT) sequencing, which was first used to discover simple sequence repeat (SSR) genetic markers from B. dorsalis. Moreover, SSR markers from isoforms were screened for the identification of species diversity. These results could provide molecular biology methods for further population research. ABSTRACT: Bactrocera dorsalis (Hendel), as one of the most notorious and destructive invasive agricultural pests in the world, causes damage to over 250 different types of fruits and vegetables throughout tropical and subtropical areas. PacBio single-molecule real-time (SMRT) sequencing was used to generate the full-length transcriptome data of B. dorsalis. A total of 40,319,890 subreads (76.6 Gb, clean reads) were generated, including 535,241 circular consensus sequences (CCSs) and 386,916 full-length non-concatemer reads (FLNCs). Transcript cluster analysis of the FLNC reads revealed 22,780 high-quality reads (HQs). In total, 12,274 transcripts were functionally annotated based on four different databases. A total of 1978 SSR loci were distributed throughout 1714 HQ transcripts, of which 1926 were complete SSRs and 52 were complex SSRs. Among the total SSR loci, 2–3 nucleotide repeats were dominant, occupying 83.62%, of which di- and tri- nucleotide repeats were 39.38% and 44.24%, respectively. We detected 105 repeat motifs, of which AT/AT (50.19%), AC/GT (39.15%), CAA/TTG (32.46%), and ACA/TGT (10.86%) were the most common in di- and tri-nucleotide repeats. The repeat SSR motifs were 12–190 bp in length, and 1638 (88.02%) were shorter than 20 bp. According to the randomly selected microsatellite sequence, 80 pairs of primers were designed, and 174 individuals were randomly amplified by PCR using primers. The number of primers that had amplification products with clear bands and showed good polymorphism came to 41, indicating that this was a feasible way to explore SSR markers from the transcriptomic data of B. dorsalis. These results lay a foundation for developing highly polymorphic microsatellites for researching the functional genomics, population genetic structure, and genetic diversity of B. dorsalis. MDPI 2021-10-14 /pmc/articles/PMC8537375/ /pubmed/34680707 http://dx.doi.org/10.3390/insects12100938 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Ouyang, Huili
Wang, Xiaoyun
Zheng, Xialin
Lu, Wen
Qin, Fengping
Chen, Chao
Full-Length SMRT Transcriptome Sequencing and SSR Analysis of Bactrocera dorsalis (Hendel)
title Full-Length SMRT Transcriptome Sequencing and SSR Analysis of Bactrocera dorsalis (Hendel)
title_full Full-Length SMRT Transcriptome Sequencing and SSR Analysis of Bactrocera dorsalis (Hendel)
title_fullStr Full-Length SMRT Transcriptome Sequencing and SSR Analysis of Bactrocera dorsalis (Hendel)
title_full_unstemmed Full-Length SMRT Transcriptome Sequencing and SSR Analysis of Bactrocera dorsalis (Hendel)
title_short Full-Length SMRT Transcriptome Sequencing and SSR Analysis of Bactrocera dorsalis (Hendel)
title_sort full-length smrt transcriptome sequencing and ssr analysis of bactrocera dorsalis (hendel)
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8537375/
https://www.ncbi.nlm.nih.gov/pubmed/34680707
http://dx.doi.org/10.3390/insects12100938
work_keys_str_mv AT ouyanghuili fulllengthsmrttranscriptomesequencingandssranalysisofbactroceradorsalishendel
AT wangxiaoyun fulllengthsmrttranscriptomesequencingandssranalysisofbactroceradorsalishendel
AT zhengxialin fulllengthsmrttranscriptomesequencingandssranalysisofbactroceradorsalishendel
AT luwen fulllengthsmrttranscriptomesequencingandssranalysisofbactroceradorsalishendel
AT qinfengping fulllengthsmrttranscriptomesequencingandssranalysisofbactroceradorsalishendel
AT chenchao fulllengthsmrttranscriptomesequencingandssranalysisofbactroceradorsalishendel