Cargando…
Genome wide SNP discovery in flax through next generation sequencing of reduced representation libraries
BACKGROUND: Flax (Linum usitatissimum L.) is a significant fibre and oilseed crop. Current flax molecular markers, including isozymes, RAPDs, AFLPs and SSRs are of limited use in the construction of high density linkage maps and for association mapping applications due to factors such as low reprodu...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3557168/ https://www.ncbi.nlm.nih.gov/pubmed/23216845 http://dx.doi.org/10.1186/1471-2164-13-684 |
_version_ | 1782257274959429632 |
---|---|
author | Kumar, Santosh You, Frank M Cloutier, Sylvie |
author_facet | Kumar, Santosh You, Frank M Cloutier, Sylvie |
author_sort | Kumar, Santosh |
collection | PubMed |
description | BACKGROUND: Flax (Linum usitatissimum L.) is a significant fibre and oilseed crop. Current flax molecular markers, including isozymes, RAPDs, AFLPs and SSRs are of limited use in the construction of high density linkage maps and for association mapping applications due to factors such as low reproducibility, intense labour requirements and/or limited numbers. We report here on the use of a reduced representation library strategy combined with next generation Illumina sequencing for rapid and large scale discovery of SNPs in eight flax genotypes. SNP discovery was performed through in silico analysis of the sequencing data against the whole genome shotgun sequence assembly of flax genotype CDC Bethune. Genotyping-by-sequencing of an F(6)-derived recombinant inbred line population provided validation of the SNPs. RESULTS: Reduced representation libraries of eight flax genotypes were sequenced on the Illumina sequencing platform resulting in sequence coverage ranging from 4.33 to 15.64X (genome equivalents). Depending on the relatedness of the genotypes and the number and length of the reads, between 78% and 93% of the reads mapped onto the CDC Bethune whole genome shotgun sequence assembly. A total of 55,465 SNPs were discovered with the largest number of SNPs belonging to the genotypes with the highest mapping coverage percentage. Approximately 84% of the SNPs discovered were identified in a single genotype, 13% were shared between any two genotypes and the remaining 3% in three or more. Nearly a quarter of the SNPs were found in genic regions. A total of 4,706 out of 4,863 SNPs discovered in Macbeth were validated using genotyping-by-sequencing of 96 F(6) individuals from a recombinant inbred line population derived from a cross between CDC Bethune and Macbeth, corresponding to a validation rate of 96.8%. CONCLUSIONS: Next generation sequencing of reduced representation libraries was successfully implemented for genome-wide SNP discovery from flax. The genotyping-by-sequencing approach proved to be efficient for validation. The SNP resources generated in this work will assist in generating high density maps of flax and facilitate QTL discovery, marker-assisted selection, phylogenetic analyses, association mapping and anchoring of the whole genome shotgun sequence. |
format | Online Article Text |
id | pubmed-3557168 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-35571682013-01-31 Genome wide SNP discovery in flax through next generation sequencing of reduced representation libraries Kumar, Santosh You, Frank M Cloutier, Sylvie BMC Genomics Research Article BACKGROUND: Flax (Linum usitatissimum L.) is a significant fibre and oilseed crop. Current flax molecular markers, including isozymes, RAPDs, AFLPs and SSRs are of limited use in the construction of high density linkage maps and for association mapping applications due to factors such as low reproducibility, intense labour requirements and/or limited numbers. We report here on the use of a reduced representation library strategy combined with next generation Illumina sequencing for rapid and large scale discovery of SNPs in eight flax genotypes. SNP discovery was performed through in silico analysis of the sequencing data against the whole genome shotgun sequence assembly of flax genotype CDC Bethune. Genotyping-by-sequencing of an F(6)-derived recombinant inbred line population provided validation of the SNPs. RESULTS: Reduced representation libraries of eight flax genotypes were sequenced on the Illumina sequencing platform resulting in sequence coverage ranging from 4.33 to 15.64X (genome equivalents). Depending on the relatedness of the genotypes and the number and length of the reads, between 78% and 93% of the reads mapped onto the CDC Bethune whole genome shotgun sequence assembly. A total of 55,465 SNPs were discovered with the largest number of SNPs belonging to the genotypes with the highest mapping coverage percentage. Approximately 84% of the SNPs discovered were identified in a single genotype, 13% were shared between any two genotypes and the remaining 3% in three or more. Nearly a quarter of the SNPs were found in genic regions. A total of 4,706 out of 4,863 SNPs discovered in Macbeth were validated using genotyping-by-sequencing of 96 F(6) individuals from a recombinant inbred line population derived from a cross between CDC Bethune and Macbeth, corresponding to a validation rate of 96.8%. CONCLUSIONS: Next generation sequencing of reduced representation libraries was successfully implemented for genome-wide SNP discovery from flax. The genotyping-by-sequencing approach proved to be efficient for validation. The SNP resources generated in this work will assist in generating high density maps of flax and facilitate QTL discovery, marker-assisted selection, phylogenetic analyses, association mapping and anchoring of the whole genome shotgun sequence. BioMed Central 2012-12-06 /pmc/articles/PMC3557168/ /pubmed/23216845 http://dx.doi.org/10.1186/1471-2164-13-684 Text en Copyright ©2012 Kumar et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Kumar, Santosh You, Frank M Cloutier, Sylvie Genome wide SNP discovery in flax through next generation sequencing of reduced representation libraries |
title | Genome wide SNP discovery in flax through next generation sequencing of reduced representation libraries |
title_full | Genome wide SNP discovery in flax through next generation sequencing of reduced representation libraries |
title_fullStr | Genome wide SNP discovery in flax through next generation sequencing of reduced representation libraries |
title_full_unstemmed | Genome wide SNP discovery in flax through next generation sequencing of reduced representation libraries |
title_short | Genome wide SNP discovery in flax through next generation sequencing of reduced representation libraries |
title_sort | genome wide snp discovery in flax through next generation sequencing of reduced representation libraries |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3557168/ https://www.ncbi.nlm.nih.gov/pubmed/23216845 http://dx.doi.org/10.1186/1471-2164-13-684 |
work_keys_str_mv | AT kumarsantosh genomewidesnpdiscoveryinflaxthroughnextgenerationsequencingofreducedrepresentationlibraries AT youfrankm genomewidesnpdiscoveryinflaxthroughnextgenerationsequencingofreducedrepresentationlibraries AT cloutiersylvie genomewidesnpdiscoveryinflaxthroughnextgenerationsequencingofreducedrepresentationlibraries |