Cargando…

Genome-wide characterization of simple sequence repeats in cucumber (Cucumis sativus L.)

BACKGROUND: Cucumber, Cucumis sativus L. is an important vegetable crop worldwide. Until very recently, cucumber genetic and genomic resources, especially molecular markers, have been very limited, impeding progress of cucumber breeding efforts. Microsatellites are short tandemly repeated DNA sequen...

Descripción completa

Detalles Bibliográficos
Autores principales: Cavagnaro, Pablo F, Senalik, Douglas A, Yang, Luming, Simon, Philipp W, Harkins, Timothy T, Kodira, Chinnappa D, Huang, Sanwen, Weng, Yiqun
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3091718/
https://www.ncbi.nlm.nih.gov/pubmed/20950470
http://dx.doi.org/10.1186/1471-2164-11-569
_version_ 1782203311528607744
author Cavagnaro, Pablo F
Senalik, Douglas A
Yang, Luming
Simon, Philipp W
Harkins, Timothy T
Kodira, Chinnappa D
Huang, Sanwen
Weng, Yiqun
author_facet Cavagnaro, Pablo F
Senalik, Douglas A
Yang, Luming
Simon, Philipp W
Harkins, Timothy T
Kodira, Chinnappa D
Huang, Sanwen
Weng, Yiqun
author_sort Cavagnaro, Pablo F
collection PubMed
description BACKGROUND: Cucumber, Cucumis sativus L. is an important vegetable crop worldwide. Until very recently, cucumber genetic and genomic resources, especially molecular markers, have been very limited, impeding progress of cucumber breeding efforts. Microsatellites are short tandemly repeated DNA sequences, which are frequently favored as genetic markers due to their high level of polymorphism and codominant inheritance. Data from previously characterized genomes has shown that these repeats vary in frequency, motif sequence, and genomic location across taxa. During the last year, the genomes of two cucumber genotypes were sequenced including the Chinese fresh market type inbred line '9930' and the North American pickling type inbred line 'Gy14'. These sequences provide a powerful tool for developing markers in a large scale. In this study, we surveyed and characterized the distribution and frequency of perfect microsatellites in 203 Mbp assembled Gy14 DNA sequences, representing 55% of its nuclear genome, and in cucumber EST sequences. Similar analyses were performed in genomic and EST data from seven other plant species, and the results were compared with those of cucumber. RESULTS: A total of 112,073 perfect repeats were detected in the Gy14 cucumber genome sequence, accounting for 0.9% of the assembled Gy14 genome, with an overall density of 551.9 SSRs/Mbp. While tetranucleotides were the most frequent microsatellites in genomic DNA sequence, dinucleotide repeats, which had more repeat units than any other SSR type, had the highest cumulative sequence length. Coding regions (ESTs) of the cucumber genome had fewer microsatellites compared to its genomic sequence, with trinucleotides predominating in EST sequences. AAG was the most frequent repeat in cucumber ESTs. Overall, AT-rich motifs prevailed in both genomic and EST data. Compared to the other species examined, cucumber genomic sequence had the highest density of SSRs (although comparable to the density of poplar, grapevine and rice), and was richest in AT dinucleotides. Using an electronic PCR strategy, we investigated the polymorphism between 9930 and Gy14 at 1,006 SSR loci, and found unexpectedly high degree of polymorphism (48.3%) between the two genotypes. The level of polymorphism seems to be positively associated with the number of repeat units in the microsatellite. The in silico PCR results were validated empirically in 660 of the 1,006 SSR loci. In addition, primer sequences for more than 83,000 newly-discovered cucumber microsatellites, and their exact positions in the Gy14 genome assembly were made publicly available. CONCLUSIONS: The cucumber genome is rich in microsatellites; AT and AAG are the most abundant repeat motifs in genomic and EST sequences of cucumber, respectively. Considering all the species investigated, some commonalities were noted, especially within the monocot and dicot groups, although the distribution of motifs and the frequency of certain repeats were characteristic of the species examined. The large number of SSR markers developed from this study should be a significant contribution to the cucurbit research community.
format Text
id pubmed-3091718
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-30917182011-05-11 Genome-wide characterization of simple sequence repeats in cucumber (Cucumis sativus L.) Cavagnaro, Pablo F Senalik, Douglas A Yang, Luming Simon, Philipp W Harkins, Timothy T Kodira, Chinnappa D Huang, Sanwen Weng, Yiqun BMC Genomics Research Article BACKGROUND: Cucumber, Cucumis sativus L. is an important vegetable crop worldwide. Until very recently, cucumber genetic and genomic resources, especially molecular markers, have been very limited, impeding progress of cucumber breeding efforts. Microsatellites are short tandemly repeated DNA sequences, which are frequently favored as genetic markers due to their high level of polymorphism and codominant inheritance. Data from previously characterized genomes has shown that these repeats vary in frequency, motif sequence, and genomic location across taxa. During the last year, the genomes of two cucumber genotypes were sequenced including the Chinese fresh market type inbred line '9930' and the North American pickling type inbred line 'Gy14'. These sequences provide a powerful tool for developing markers in a large scale. In this study, we surveyed and characterized the distribution and frequency of perfect microsatellites in 203 Mbp assembled Gy14 DNA sequences, representing 55% of its nuclear genome, and in cucumber EST sequences. Similar analyses were performed in genomic and EST data from seven other plant species, and the results were compared with those of cucumber. RESULTS: A total of 112,073 perfect repeats were detected in the Gy14 cucumber genome sequence, accounting for 0.9% of the assembled Gy14 genome, with an overall density of 551.9 SSRs/Mbp. While tetranucleotides were the most frequent microsatellites in genomic DNA sequence, dinucleotide repeats, which had more repeat units than any other SSR type, had the highest cumulative sequence length. Coding regions (ESTs) of the cucumber genome had fewer microsatellites compared to its genomic sequence, with trinucleotides predominating in EST sequences. AAG was the most frequent repeat in cucumber ESTs. Overall, AT-rich motifs prevailed in both genomic and EST data. Compared to the other species examined, cucumber genomic sequence had the highest density of SSRs (although comparable to the density of poplar, grapevine and rice), and was richest in AT dinucleotides. Using an electronic PCR strategy, we investigated the polymorphism between 9930 and Gy14 at 1,006 SSR loci, and found unexpectedly high degree of polymorphism (48.3%) between the two genotypes. The level of polymorphism seems to be positively associated with the number of repeat units in the microsatellite. The in silico PCR results were validated empirically in 660 of the 1,006 SSR loci. In addition, primer sequences for more than 83,000 newly-discovered cucumber microsatellites, and their exact positions in the Gy14 genome assembly were made publicly available. CONCLUSIONS: The cucumber genome is rich in microsatellites; AT and AAG are the most abundant repeat motifs in genomic and EST sequences of cucumber, respectively. Considering all the species investigated, some commonalities were noted, especially within the monocot and dicot groups, although the distribution of motifs and the frequency of certain repeats were characteristic of the species examined. The large number of SSR markers developed from this study should be a significant contribution to the cucurbit research community. BioMed Central 2010-10-15 /pmc/articles/PMC3091718/ /pubmed/20950470 http://dx.doi.org/10.1186/1471-2164-11-569 Text en Copyright ©2010 Cavagnaro et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Cavagnaro, Pablo F
Senalik, Douglas A
Yang, Luming
Simon, Philipp W
Harkins, Timothy T
Kodira, Chinnappa D
Huang, Sanwen
Weng, Yiqun
Genome-wide characterization of simple sequence repeats in cucumber (Cucumis sativus L.)
title Genome-wide characterization of simple sequence repeats in cucumber (Cucumis sativus L.)
title_full Genome-wide characterization of simple sequence repeats in cucumber (Cucumis sativus L.)
title_fullStr Genome-wide characterization of simple sequence repeats in cucumber (Cucumis sativus L.)
title_full_unstemmed Genome-wide characterization of simple sequence repeats in cucumber (Cucumis sativus L.)
title_short Genome-wide characterization of simple sequence repeats in cucumber (Cucumis sativus L.)
title_sort genome-wide characterization of simple sequence repeats in cucumber (cucumis sativus l.)
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3091718/
https://www.ncbi.nlm.nih.gov/pubmed/20950470
http://dx.doi.org/10.1186/1471-2164-11-569
work_keys_str_mv AT cavagnaropablof genomewidecharacterizationofsimplesequencerepeatsincucumbercucumissativusl
AT senalikdouglasa genomewidecharacterizationofsimplesequencerepeatsincucumbercucumissativusl
AT yangluming genomewidecharacterizationofsimplesequencerepeatsincucumbercucumissativusl
AT simonphilippw genomewidecharacterizationofsimplesequencerepeatsincucumbercucumissativusl
AT harkinstimothyt genomewidecharacterizationofsimplesequencerepeatsincucumbercucumissativusl
AT kodirachinnappad genomewidecharacterizationofsimplesequencerepeatsincucumbercucumissativusl
AT huangsanwen genomewidecharacterizationofsimplesequencerepeatsincucumbercucumissativusl
AT wengyiqun genomewidecharacterizationofsimplesequencerepeatsincucumbercucumissativusl