Cargando…
Single-molecule real-time sequencing identifies massive full-length cDNAs and alternative-splicing events that facilitate comparative and functional genomics study in the hexaploid crop sweet potato
BACKGROUND: Sweet potato (Ipomoea batatas (L.) Lam.) is one of the most important crops in many developing countries and provides a candidate source of bioenergy. However, neither a complete reference genome nor large-scale full-length cDNA sequences for this outcrossing hexaploid crop are available...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
PeerJ Inc.
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6859871/ https://www.ncbi.nlm.nih.gov/pubmed/31741783 http://dx.doi.org/10.7717/peerj.7933 |
_version_ | 1783471205329666048 |
---|---|
author | Ding, Na Cui, Huihui Miao, Ying Tang, Jun Cao, Qinghe Luo, Yonghai |
author_facet | Ding, Na Cui, Huihui Miao, Ying Tang, Jun Cao, Qinghe Luo, Yonghai |
author_sort | Ding, Na |
collection | PubMed |
description | BACKGROUND: Sweet potato (Ipomoea batatas (L.) Lam.) is one of the most important crops in many developing countries and provides a candidate source of bioenergy. However, neither a complete reference genome nor large-scale full-length cDNA sequences for this outcrossing hexaploid crop are available, which in turn impedes progress in research studies in I. batatas functional genomics and molecular breeding. METHODS: In this study, we sequenced full-length transcriptomes in I. batatas and its diploid ancestor I. trifida by single-molecule real-time sequencing and Illumina second-generation sequencing technologies. With the generated datasets, we conducted comprehensive intraspecific and interspecific sequence analyses and experimental characterization. RESULTS: A total of 53,861/51,184 high-quality long-read transcripts were obtained, which covered about 10,439/10,452 loci in the I. batatas/I. trifida genome. These datasets enabled us to predict open reading frames successfully in 96.83%/96.82% of transcripts and identify 34,963/33,637 full-length cDNA sequences, 1,401/1,457 transcription factors, 25,315/27,090 simple sequence repeats, 1,656/1,389 long non-coding RNAs, and 5,251/8,901 alternative splicing events. Approximately, 32.34%/38.54% of transcripts and 46.22%/51.18% multi-exon transcripts underwent alternative splicing in I. batatas/I. trifida. Moreover, we validated one alternative splicing event in each of 10 genes and identified tuberous-root-specific expressed isoforms from a starch-branching enzyme, an alpha-glucan phosphorylase, a neutral invertase, and several ABC transporters. Overall, the collection and analysis of large-scale long-read transcripts generated in this study will serve as a valuable resource for the I. batatas research community, which may accelerate the progress in its structural, functional, and comparative genomics studies. |
format | Online Article Text |
id | pubmed-6859871 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | PeerJ Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-68598712019-11-18 Single-molecule real-time sequencing identifies massive full-length cDNAs and alternative-splicing events that facilitate comparative and functional genomics study in the hexaploid crop sweet potato Ding, Na Cui, Huihui Miao, Ying Tang, Jun Cao, Qinghe Luo, Yonghai PeerJ Agricultural Science BACKGROUND: Sweet potato (Ipomoea batatas (L.) Lam.) is one of the most important crops in many developing countries and provides a candidate source of bioenergy. However, neither a complete reference genome nor large-scale full-length cDNA sequences for this outcrossing hexaploid crop are available, which in turn impedes progress in research studies in I. batatas functional genomics and molecular breeding. METHODS: In this study, we sequenced full-length transcriptomes in I. batatas and its diploid ancestor I. trifida by single-molecule real-time sequencing and Illumina second-generation sequencing technologies. With the generated datasets, we conducted comprehensive intraspecific and interspecific sequence analyses and experimental characterization. RESULTS: A total of 53,861/51,184 high-quality long-read transcripts were obtained, which covered about 10,439/10,452 loci in the I. batatas/I. trifida genome. These datasets enabled us to predict open reading frames successfully in 96.83%/96.82% of transcripts and identify 34,963/33,637 full-length cDNA sequences, 1,401/1,457 transcription factors, 25,315/27,090 simple sequence repeats, 1,656/1,389 long non-coding RNAs, and 5,251/8,901 alternative splicing events. Approximately, 32.34%/38.54% of transcripts and 46.22%/51.18% multi-exon transcripts underwent alternative splicing in I. batatas/I. trifida. Moreover, we validated one alternative splicing event in each of 10 genes and identified tuberous-root-specific expressed isoforms from a starch-branching enzyme, an alpha-glucan phosphorylase, a neutral invertase, and several ABC transporters. Overall, the collection and analysis of large-scale long-read transcripts generated in this study will serve as a valuable resource for the I. batatas research community, which may accelerate the progress in its structural, functional, and comparative genomics studies. PeerJ Inc. 2019-11-15 /pmc/articles/PMC6859871/ /pubmed/31741783 http://dx.doi.org/10.7717/peerj.7933 Text en ©2019 Ding et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited. |
spellingShingle | Agricultural Science Ding, Na Cui, Huihui Miao, Ying Tang, Jun Cao, Qinghe Luo, Yonghai Single-molecule real-time sequencing identifies massive full-length cDNAs and alternative-splicing events that facilitate comparative and functional genomics study in the hexaploid crop sweet potato |
title | Single-molecule real-time sequencing identifies massive full-length cDNAs and alternative-splicing events that facilitate comparative and functional genomics study in the hexaploid crop sweet potato |
title_full | Single-molecule real-time sequencing identifies massive full-length cDNAs and alternative-splicing events that facilitate comparative and functional genomics study in the hexaploid crop sweet potato |
title_fullStr | Single-molecule real-time sequencing identifies massive full-length cDNAs and alternative-splicing events that facilitate comparative and functional genomics study in the hexaploid crop sweet potato |
title_full_unstemmed | Single-molecule real-time sequencing identifies massive full-length cDNAs and alternative-splicing events that facilitate comparative and functional genomics study in the hexaploid crop sweet potato |
title_short | Single-molecule real-time sequencing identifies massive full-length cDNAs and alternative-splicing events that facilitate comparative and functional genomics study in the hexaploid crop sweet potato |
title_sort | single-molecule real-time sequencing identifies massive full-length cdnas and alternative-splicing events that facilitate comparative and functional genomics study in the hexaploid crop sweet potato |
topic | Agricultural Science |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6859871/ https://www.ncbi.nlm.nih.gov/pubmed/31741783 http://dx.doi.org/10.7717/peerj.7933 |
work_keys_str_mv | AT dingna singlemoleculerealtimesequencingidentifiesmassivefulllengthcdnasandalternativesplicingeventsthatfacilitatecomparativeandfunctionalgenomicsstudyinthehexaploidcropsweetpotato AT cuihuihui singlemoleculerealtimesequencingidentifiesmassivefulllengthcdnasandalternativesplicingeventsthatfacilitatecomparativeandfunctionalgenomicsstudyinthehexaploidcropsweetpotato AT miaoying singlemoleculerealtimesequencingidentifiesmassivefulllengthcdnasandalternativesplicingeventsthatfacilitatecomparativeandfunctionalgenomicsstudyinthehexaploidcropsweetpotato AT tangjun singlemoleculerealtimesequencingidentifiesmassivefulllengthcdnasandalternativesplicingeventsthatfacilitatecomparativeandfunctionalgenomicsstudyinthehexaploidcropsweetpotato AT caoqinghe singlemoleculerealtimesequencingidentifiesmassivefulllengthcdnasandalternativesplicingeventsthatfacilitatecomparativeandfunctionalgenomicsstudyinthehexaploidcropsweetpotato AT luoyonghai singlemoleculerealtimesequencingidentifiesmassivefulllengthcdnasandalternativesplicingeventsthatfacilitatecomparativeandfunctionalgenomicsstudyinthehexaploidcropsweetpotato |