Cargando…

Oligonucleotide array discovery of polymorphisms in cultivated tomato (Solanum lycopersicum L.) reveals patterns of SNP variation associated with breeding

BACKGROUND: Cultivated tomato (Solanum lycopersicum L.) has narrow genetic diversity that makes it difficult to identify polymorphisms between elite germplasm. We explored array-based single feature polymorphism (SFP) discovery as a high-throughput approach for marker development in cultivated tomat...

Descripción completa

Detalles Bibliográficos
Autores principales: Sim, Sung-Chur, Robbins, Matthew D, Chilcott, Charles, Zhu, Tong, Francis, David M
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2763011/
https://www.ncbi.nlm.nih.gov/pubmed/19818135
http://dx.doi.org/10.1186/1471-2164-10-466
_version_ 1782172976159916032
author Sim, Sung-Chur
Robbins, Matthew D
Chilcott, Charles
Zhu, Tong
Francis, David M
author_facet Sim, Sung-Chur
Robbins, Matthew D
Chilcott, Charles
Zhu, Tong
Francis, David M
author_sort Sim, Sung-Chur
collection PubMed
description BACKGROUND: Cultivated tomato (Solanum lycopersicum L.) has narrow genetic diversity that makes it difficult to identify polymorphisms between elite germplasm. We explored array-based single feature polymorphism (SFP) discovery as a high-throughput approach for marker development in cultivated tomato. RESULTS: Three varieties, FL7600 (fresh-market), OH9242 (processing), and PI114490 (cherry) were used as a source of genomic DNA for hybridization to oligonucleotide arrays. Identification of SFPs was based on outlier detection using regression analysis of normalized hybridization data within a probe set for each gene. A subset of 189 putative SFPs was sequenced for validation. The rate of validation depended on the desired level of significance (α) used to define the confidence interval (CI), and ranged from 76% for polymorphisms identified at α ≤ 10(-6 )to 60% for those identified at α ≤ 10(-2). Validation percentage reached a plateau between α ≤ 10(-4 )and α ≤ 10(-7), but failure to identify known SFPs (Type II error) increased dramatically at α ≤ 10(-6). Trough sequence validation, we identified 279 SNPs and 27 InDels in 111 loci. Sixty loci contained ≥ 2 SNPs per locus. We used a subset of validated SNPs for genetic diversity analysis of 92 tomato varieties and accessions. Pairwise estimation of θ (Fst) suggested significant differentiation between collections of fresh-market, processing, vintage, Latin American (landrace), and S. pimpinellifolium accessions. The fresh-market and processing groups displayed high genetic diversity relative to vintage and landrace groups. Furthermore, the patterns of SNP variation indicated that domestication and early breeding practices have led to progressive genetic bottlenecks while modern breeding practices have reintroduced genetic variation into the crop from wild species. Finally, we examined the ratio of non-synonymous (Ka) to synonymous substitutions (Ks) for 20 loci with multiple SNPs (≥ 4 per locus). Six of 20 loci showed ratios of Ka/Ks ≥ 0.9. CONCLUSION: Array-based SFP discovery was an efficient method to identify a large number of molecular markers for genetics and breeding in elite tomato germplasm. Patterns of sequence variation across five major tomato groups provided insight into to the effect of human selection on genetic variation.
format Text
id pubmed-2763011
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-27630112009-10-17 Oligonucleotide array discovery of polymorphisms in cultivated tomato (Solanum lycopersicum L.) reveals patterns of SNP variation associated with breeding Sim, Sung-Chur Robbins, Matthew D Chilcott, Charles Zhu, Tong Francis, David M BMC Genomics Research Article BACKGROUND: Cultivated tomato (Solanum lycopersicum L.) has narrow genetic diversity that makes it difficult to identify polymorphisms between elite germplasm. We explored array-based single feature polymorphism (SFP) discovery as a high-throughput approach for marker development in cultivated tomato. RESULTS: Three varieties, FL7600 (fresh-market), OH9242 (processing), and PI114490 (cherry) were used as a source of genomic DNA for hybridization to oligonucleotide arrays. Identification of SFPs was based on outlier detection using regression analysis of normalized hybridization data within a probe set for each gene. A subset of 189 putative SFPs was sequenced for validation. The rate of validation depended on the desired level of significance (α) used to define the confidence interval (CI), and ranged from 76% for polymorphisms identified at α ≤ 10(-6 )to 60% for those identified at α ≤ 10(-2). Validation percentage reached a plateau between α ≤ 10(-4 )and α ≤ 10(-7), but failure to identify known SFPs (Type II error) increased dramatically at α ≤ 10(-6). Trough sequence validation, we identified 279 SNPs and 27 InDels in 111 loci. Sixty loci contained ≥ 2 SNPs per locus. We used a subset of validated SNPs for genetic diversity analysis of 92 tomato varieties and accessions. Pairwise estimation of θ (Fst) suggested significant differentiation between collections of fresh-market, processing, vintage, Latin American (landrace), and S. pimpinellifolium accessions. The fresh-market and processing groups displayed high genetic diversity relative to vintage and landrace groups. Furthermore, the patterns of SNP variation indicated that domestication and early breeding practices have led to progressive genetic bottlenecks while modern breeding practices have reintroduced genetic variation into the crop from wild species. Finally, we examined the ratio of non-synonymous (Ka) to synonymous substitutions (Ks) for 20 loci with multiple SNPs (≥ 4 per locus). Six of 20 loci showed ratios of Ka/Ks ≥ 0.9. CONCLUSION: Array-based SFP discovery was an efficient method to identify a large number of molecular markers for genetics and breeding in elite tomato germplasm. Patterns of sequence variation across five major tomato groups provided insight into to the effect of human selection on genetic variation. BioMed Central 2009-10-09 /pmc/articles/PMC2763011/ /pubmed/19818135 http://dx.doi.org/10.1186/1471-2164-10-466 Text en Copyright © 2009 Sim et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Sim, Sung-Chur
Robbins, Matthew D
Chilcott, Charles
Zhu, Tong
Francis, David M
Oligonucleotide array discovery of polymorphisms in cultivated tomato (Solanum lycopersicum L.) reveals patterns of SNP variation associated with breeding
title Oligonucleotide array discovery of polymorphisms in cultivated tomato (Solanum lycopersicum L.) reveals patterns of SNP variation associated with breeding
title_full Oligonucleotide array discovery of polymorphisms in cultivated tomato (Solanum lycopersicum L.) reveals patterns of SNP variation associated with breeding
title_fullStr Oligonucleotide array discovery of polymorphisms in cultivated tomato (Solanum lycopersicum L.) reveals patterns of SNP variation associated with breeding
title_full_unstemmed Oligonucleotide array discovery of polymorphisms in cultivated tomato (Solanum lycopersicum L.) reveals patterns of SNP variation associated with breeding
title_short Oligonucleotide array discovery of polymorphisms in cultivated tomato (Solanum lycopersicum L.) reveals patterns of SNP variation associated with breeding
title_sort oligonucleotide array discovery of polymorphisms in cultivated tomato (solanum lycopersicum l.) reveals patterns of snp variation associated with breeding
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2763011/
https://www.ncbi.nlm.nih.gov/pubmed/19818135
http://dx.doi.org/10.1186/1471-2164-10-466
work_keys_str_mv AT simsungchur oligonucleotidearraydiscoveryofpolymorphismsincultivatedtomatosolanumlycopersicumlrevealspatternsofsnpvariationassociatedwithbreeding
AT robbinsmatthewd oligonucleotidearraydiscoveryofpolymorphismsincultivatedtomatosolanumlycopersicumlrevealspatternsofsnpvariationassociatedwithbreeding
AT chilcottcharles oligonucleotidearraydiscoveryofpolymorphismsincultivatedtomatosolanumlycopersicumlrevealspatternsofsnpvariationassociatedwithbreeding
AT zhutong oligonucleotidearraydiscoveryofpolymorphismsincultivatedtomatosolanumlycopersicumlrevealspatternsofsnpvariationassociatedwithbreeding
AT francisdavidm oligonucleotidearraydiscoveryofpolymorphismsincultivatedtomatosolanumlycopersicumlrevealspatternsofsnpvariationassociatedwithbreeding