Cargando…

Mining sequence variations in representative polyploid sugarcane germplasm accessions

BACKGROUND: Sugarcane (Saccharum spp.) is one of the most important economic crops because of its high sugar production and biofuel potential. Due to the high polyploid level and complex genome of sugarcane, it has been a huge challenge to investigate genomic sequence variations, which are critical...

Descripción completa

Detalles Bibliográficos
Autores principales: Yang, Xiping, Song, Jian, You, Qian, Paudel, Dev R., Zhang, Jisen, Wang, Jianping
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5551020/
https://www.ncbi.nlm.nih.gov/pubmed/28793856
http://dx.doi.org/10.1186/s12864-017-3980-3
_version_ 1783256230978912256
author Yang, Xiping
Song, Jian
You, Qian
Paudel, Dev R.
Zhang, Jisen
Wang, Jianping
author_facet Yang, Xiping
Song, Jian
You, Qian
Paudel, Dev R.
Zhang, Jisen
Wang, Jianping
author_sort Yang, Xiping
collection PubMed
description BACKGROUND: Sugarcane (Saccharum spp.) is one of the most important economic crops because of its high sugar production and biofuel potential. Due to the high polyploid level and complex genome of sugarcane, it has been a huge challenge to investigate genomic sequence variations, which are critical for identifying alleles contributing to important agronomic traits. In order to mine the genetic variations in sugarcane, genotyping by sequencing (GBS), was used to genotype 14 representative Saccharum complex accessions. GBS is a method to generate a large number of markers, enabled by next generation sequencing (NGS) and the genome complexity reduction using restriction enzymes. RESULTS: To use GBS for high throughput genotyping highly polyploid sugarcane, the GBS analysis pipelines in 14 Saccharum complex accessions were established by evaluating different alignment methods, sequence variants callers, and sequence depth for single nucleotide polymorphism (SNP) filtering. By using the established pipeline, a total of 76,251 non-redundant SNPs, 5642 InDels, 6380 presence/absence variants (PAVs), and 826 copy number variations (CNVs) were detected among the 14 accessions. In addition, non-reference based universal network enabled analysis kit and Stacks de novo called 34,353 and 109,043 SNPs, respectively. In the 14 accessions, the percentages of single dose SNPs ranged from 38.3% to 62.3% with an average of 49.6%, much more than the portions of multiple dosage SNPs. Concordantly called SNPs were used to evaluate the phylogenetic relationship among the 14 accessions. The results showed that the divergence time between the Erianthus genus and the Saccharum genus was more than 10 million years ago (MYA). The Saccharum species separated from their common ancestors ranging from 0.19 to 1.65 MYA. CONCLUSIONS: The GBS pipelines including the reference sequences, alignment methods, sequence variant callers, and sequence depth were recommended and discussed for the Saccharum complex and other related species. A large number of sequence variations were discovered in the Saccharum complex, including SNPs, InDels, PAVs, and CNVs. Genome-wide SNPs were further used to illustrate sequence features of polyploid species and demonstrated the divergence of different species in the Saccharum complex. The results of this study showed that GBS was an effective NGS-based method to discover genomic sequence variations in highly polyploid and heterozygous species.
format Online
Article
Text
id pubmed-5551020
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-55510202017-08-14 Mining sequence variations in representative polyploid sugarcane germplasm accessions Yang, Xiping Song, Jian You, Qian Paudel, Dev R. Zhang, Jisen Wang, Jianping BMC Genomics Research Article BACKGROUND: Sugarcane (Saccharum spp.) is one of the most important economic crops because of its high sugar production and biofuel potential. Due to the high polyploid level and complex genome of sugarcane, it has been a huge challenge to investigate genomic sequence variations, which are critical for identifying alleles contributing to important agronomic traits. In order to mine the genetic variations in sugarcane, genotyping by sequencing (GBS), was used to genotype 14 representative Saccharum complex accessions. GBS is a method to generate a large number of markers, enabled by next generation sequencing (NGS) and the genome complexity reduction using restriction enzymes. RESULTS: To use GBS for high throughput genotyping highly polyploid sugarcane, the GBS analysis pipelines in 14 Saccharum complex accessions were established by evaluating different alignment methods, sequence variants callers, and sequence depth for single nucleotide polymorphism (SNP) filtering. By using the established pipeline, a total of 76,251 non-redundant SNPs, 5642 InDels, 6380 presence/absence variants (PAVs), and 826 copy number variations (CNVs) were detected among the 14 accessions. In addition, non-reference based universal network enabled analysis kit and Stacks de novo called 34,353 and 109,043 SNPs, respectively. In the 14 accessions, the percentages of single dose SNPs ranged from 38.3% to 62.3% with an average of 49.6%, much more than the portions of multiple dosage SNPs. Concordantly called SNPs were used to evaluate the phylogenetic relationship among the 14 accessions. The results showed that the divergence time between the Erianthus genus and the Saccharum genus was more than 10 million years ago (MYA). The Saccharum species separated from their common ancestors ranging from 0.19 to 1.65 MYA. CONCLUSIONS: The GBS pipelines including the reference sequences, alignment methods, sequence variant callers, and sequence depth were recommended and discussed for the Saccharum complex and other related species. A large number of sequence variations were discovered in the Saccharum complex, including SNPs, InDels, PAVs, and CNVs. Genome-wide SNPs were further used to illustrate sequence features of polyploid species and demonstrated the divergence of different species in the Saccharum complex. The results of this study showed that GBS was an effective NGS-based method to discover genomic sequence variations in highly polyploid and heterozygous species. BioMed Central 2017-08-09 /pmc/articles/PMC5551020/ /pubmed/28793856 http://dx.doi.org/10.1186/s12864-017-3980-3 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Yang, Xiping
Song, Jian
You, Qian
Paudel, Dev R.
Zhang, Jisen
Wang, Jianping
Mining sequence variations in representative polyploid sugarcane germplasm accessions
title Mining sequence variations in representative polyploid sugarcane germplasm accessions
title_full Mining sequence variations in representative polyploid sugarcane germplasm accessions
title_fullStr Mining sequence variations in representative polyploid sugarcane germplasm accessions
title_full_unstemmed Mining sequence variations in representative polyploid sugarcane germplasm accessions
title_short Mining sequence variations in representative polyploid sugarcane germplasm accessions
title_sort mining sequence variations in representative polyploid sugarcane germplasm accessions
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5551020/
https://www.ncbi.nlm.nih.gov/pubmed/28793856
http://dx.doi.org/10.1186/s12864-017-3980-3
work_keys_str_mv AT yangxiping miningsequencevariationsinrepresentativepolyploidsugarcanegermplasmaccessions
AT songjian miningsequencevariationsinrepresentativepolyploidsugarcanegermplasmaccessions
AT youqian miningsequencevariationsinrepresentativepolyploidsugarcanegermplasmaccessions
AT paudeldevr miningsequencevariationsinrepresentativepolyploidsugarcanegermplasmaccessions
AT zhangjisen miningsequencevariationsinrepresentativepolyploidsugarcanegermplasmaccessions
AT wangjianping miningsequencevariationsinrepresentativepolyploidsugarcanegermplasmaccessions