Cargando…

Single-molecule real-time sequencing identifies massive full-length cDNAs and alternative-splicing events that facilitate comparative and functional genomics study in the hexaploid crop sweet potato

BACKGROUND: Sweet potato (Ipomoea batatas (L.) Lam.) is one of the most important crops in many developing countries and provides a candidate source of bioenergy. However, neither a complete reference genome nor large-scale full-length cDNA sequences for this outcrossing hexaploid crop are available...

Descripción completa

Detalles Bibliográficos
Autores principales: Ding, Na, Cui, Huihui, Miao, Ying, Tang, Jun, Cao, Qinghe, Luo, Yonghai
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6859871/
https://www.ncbi.nlm.nih.gov/pubmed/31741783
http://dx.doi.org/10.7717/peerj.7933
_version_ 1783471205329666048
author Ding, Na
Cui, Huihui
Miao, Ying
Tang, Jun
Cao, Qinghe
Luo, Yonghai
author_facet Ding, Na
Cui, Huihui
Miao, Ying
Tang, Jun
Cao, Qinghe
Luo, Yonghai
author_sort Ding, Na
collection PubMed
description BACKGROUND: Sweet potato (Ipomoea batatas (L.) Lam.) is one of the most important crops in many developing countries and provides a candidate source of bioenergy. However, neither a complete reference genome nor large-scale full-length cDNA sequences for this outcrossing hexaploid crop are available, which in turn impedes progress in research studies in I. batatas functional genomics and molecular breeding. METHODS: In this study, we sequenced full-length transcriptomes in I. batatas and its diploid ancestor I. trifida by single-molecule real-time sequencing and Illumina second-generation sequencing technologies. With the generated datasets, we conducted comprehensive intraspecific and interspecific sequence analyses and experimental characterization. RESULTS: A total of 53,861/51,184 high-quality long-read transcripts were obtained, which covered about 10,439/10,452 loci in the I. batatas/I. trifida genome. These datasets enabled us to predict open reading frames successfully in 96.83%/96.82% of transcripts and identify 34,963/33,637 full-length cDNA sequences, 1,401/1,457 transcription factors, 25,315/27,090 simple sequence repeats, 1,656/1,389 long non-coding RNAs, and 5,251/8,901 alternative splicing events. Approximately, 32.34%/38.54% of transcripts and 46.22%/51.18% multi-exon transcripts underwent alternative splicing in I. batatas/I. trifida. Moreover, we validated one alternative splicing event in each of 10 genes and identified tuberous-root-specific expressed isoforms from a starch-branching enzyme, an alpha-glucan phosphorylase, a neutral invertase, and several ABC transporters. Overall, the collection and analysis of large-scale long-read transcripts generated in this study will serve as a valuable resource for the I. batatas research community, which may accelerate the progress in its structural, functional, and comparative genomics studies.
format Online
Article
Text
id pubmed-6859871
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-68598712019-11-18 Single-molecule real-time sequencing identifies massive full-length cDNAs and alternative-splicing events that facilitate comparative and functional genomics study in the hexaploid crop sweet potato Ding, Na Cui, Huihui Miao, Ying Tang, Jun Cao, Qinghe Luo, Yonghai PeerJ Agricultural Science BACKGROUND: Sweet potato (Ipomoea batatas (L.) Lam.) is one of the most important crops in many developing countries and provides a candidate source of bioenergy. However, neither a complete reference genome nor large-scale full-length cDNA sequences for this outcrossing hexaploid crop are available, which in turn impedes progress in research studies in I. batatas functional genomics and molecular breeding. METHODS: In this study, we sequenced full-length transcriptomes in I. batatas and its diploid ancestor I. trifida by single-molecule real-time sequencing and Illumina second-generation sequencing technologies. With the generated datasets, we conducted comprehensive intraspecific and interspecific sequence analyses and experimental characterization. RESULTS: A total of 53,861/51,184 high-quality long-read transcripts were obtained, which covered about 10,439/10,452 loci in the I. batatas/I. trifida genome. These datasets enabled us to predict open reading frames successfully in 96.83%/96.82% of transcripts and identify 34,963/33,637 full-length cDNA sequences, 1,401/1,457 transcription factors, 25,315/27,090 simple sequence repeats, 1,656/1,389 long non-coding RNAs, and 5,251/8,901 alternative splicing events. Approximately, 32.34%/38.54% of transcripts and 46.22%/51.18% multi-exon transcripts underwent alternative splicing in I. batatas/I. trifida. Moreover, we validated one alternative splicing event in each of 10 genes and identified tuberous-root-specific expressed isoforms from a starch-branching enzyme, an alpha-glucan phosphorylase, a neutral invertase, and several ABC transporters. Overall, the collection and analysis of large-scale long-read transcripts generated in this study will serve as a valuable resource for the I. batatas research community, which may accelerate the progress in its structural, functional, and comparative genomics studies. PeerJ Inc. 2019-11-15 /pmc/articles/PMC6859871/ /pubmed/31741783 http://dx.doi.org/10.7717/peerj.7933 Text en ©2019 Ding et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
spellingShingle Agricultural Science
Ding, Na
Cui, Huihui
Miao, Ying
Tang, Jun
Cao, Qinghe
Luo, Yonghai
Single-molecule real-time sequencing identifies massive full-length cDNAs and alternative-splicing events that facilitate comparative and functional genomics study in the hexaploid crop sweet potato
title Single-molecule real-time sequencing identifies massive full-length cDNAs and alternative-splicing events that facilitate comparative and functional genomics study in the hexaploid crop sweet potato
title_full Single-molecule real-time sequencing identifies massive full-length cDNAs and alternative-splicing events that facilitate comparative and functional genomics study in the hexaploid crop sweet potato
title_fullStr Single-molecule real-time sequencing identifies massive full-length cDNAs and alternative-splicing events that facilitate comparative and functional genomics study in the hexaploid crop sweet potato
title_full_unstemmed Single-molecule real-time sequencing identifies massive full-length cDNAs and alternative-splicing events that facilitate comparative and functional genomics study in the hexaploid crop sweet potato
title_short Single-molecule real-time sequencing identifies massive full-length cDNAs and alternative-splicing events that facilitate comparative and functional genomics study in the hexaploid crop sweet potato
title_sort single-molecule real-time sequencing identifies massive full-length cdnas and alternative-splicing events that facilitate comparative and functional genomics study in the hexaploid crop sweet potato
topic Agricultural Science
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6859871/
https://www.ncbi.nlm.nih.gov/pubmed/31741783
http://dx.doi.org/10.7717/peerj.7933
work_keys_str_mv AT dingna singlemoleculerealtimesequencingidentifiesmassivefulllengthcdnasandalternativesplicingeventsthatfacilitatecomparativeandfunctionalgenomicsstudyinthehexaploidcropsweetpotato
AT cuihuihui singlemoleculerealtimesequencingidentifiesmassivefulllengthcdnasandalternativesplicingeventsthatfacilitatecomparativeandfunctionalgenomicsstudyinthehexaploidcropsweetpotato
AT miaoying singlemoleculerealtimesequencingidentifiesmassivefulllengthcdnasandalternativesplicingeventsthatfacilitatecomparativeandfunctionalgenomicsstudyinthehexaploidcropsweetpotato
AT tangjun singlemoleculerealtimesequencingidentifiesmassivefulllengthcdnasandalternativesplicingeventsthatfacilitatecomparativeandfunctionalgenomicsstudyinthehexaploidcropsweetpotato
AT caoqinghe singlemoleculerealtimesequencingidentifiesmassivefulllengthcdnasandalternativesplicingeventsthatfacilitatecomparativeandfunctionalgenomicsstudyinthehexaploidcropsweetpotato
AT luoyonghai singlemoleculerealtimesequencingidentifiesmassivefulllengthcdnasandalternativesplicingeventsthatfacilitatecomparativeandfunctionalgenomicsstudyinthehexaploidcropsweetpotato