Cargando…

Distribution and characterization of simple sequence repeats in Gossypium raimondii genome

Simple sequence repeats (SSRs) can be derived from the complete genome sequence. These markers are important for gene mapping as well as marker-assisted selection (MAS). To develop SSRs for cotton gene mapping, we selected the complete genome sequence of Gossypium raimondii, which consisted of 4447...

Descripción completa

Detalles Bibliográficos
Autores principales: Zou, Changsong, Lu, Cairui, Zhang, Youping, Song, Guoli
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Biomedical Informatics 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3488841/
https://www.ncbi.nlm.nih.gov/pubmed/23139588
http://dx.doi.org/10.6026/97320630008801
_version_ 1782248686829436928
author Zou, Changsong
Lu, Cairui
Zhang, Youping
Song, Guoli
author_facet Zou, Changsong
Lu, Cairui
Zhang, Youping
Song, Guoli
author_sort Zou, Changsong
collection PubMed
description Simple sequence repeats (SSRs) can be derived from the complete genome sequence. These markers are important for gene mapping as well as marker-assisted selection (MAS). To develop SSRs for cotton gene mapping, we selected the complete genome sequence of Gossypium raimondii, which consisted of 4447 non-redundant scaffolds. Out of 775.2 Mb sequence examined, a total of 136,345 microsatellites were identified with a density of 5.69 kb per SSR in the G. raimondii genome leading to development of 112,177 primer pairs. The distributions of SSRs in the genome were non-random. Among the different motifs ranging from 1 to 6 bp, penta-nucleotide repeats were most abundant (30.5%), followed by tetra-nucleotide repeats (18.2%) and di-nucleotide repeats (16.9%). Among all identified 457 motif types, the most frequently occurring repeat motifs were poly-AT/TA, which accounted for 79.8% of the total di-nt SSRs, followed by AAAT/TTTA with 51.5% of the total tetra-nucleotede. Further, 18,834 microsatellites were detected from the protein-coding genes, and the frequency of gene containing SSRs was 46.0% in 40,976 genes of G. raimondii. These genome-based SSRs developed in the present study will lay the groundwork for developing large numbers of SSR markers for genetic mapping, gene discovery, genetic diversity analysis, and MAS breeding in cotton.
format Online
Article
Text
id pubmed-3488841
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Biomedical Informatics
record_format MEDLINE/PubMed
spelling pubmed-34888412012-11-08 Distribution and characterization of simple sequence repeats in Gossypium raimondii genome Zou, Changsong Lu, Cairui Zhang, Youping Song, Guoli Bioinformation Hypothesis Simple sequence repeats (SSRs) can be derived from the complete genome sequence. These markers are important for gene mapping as well as marker-assisted selection (MAS). To develop SSRs for cotton gene mapping, we selected the complete genome sequence of Gossypium raimondii, which consisted of 4447 non-redundant scaffolds. Out of 775.2 Mb sequence examined, a total of 136,345 microsatellites were identified with a density of 5.69 kb per SSR in the G. raimondii genome leading to development of 112,177 primer pairs. The distributions of SSRs in the genome were non-random. Among the different motifs ranging from 1 to 6 bp, penta-nucleotide repeats were most abundant (30.5%), followed by tetra-nucleotide repeats (18.2%) and di-nucleotide repeats (16.9%). Among all identified 457 motif types, the most frequently occurring repeat motifs were poly-AT/TA, which accounted for 79.8% of the total di-nt SSRs, followed by AAAT/TTTA with 51.5% of the total tetra-nucleotede. Further, 18,834 microsatellites were detected from the protein-coding genes, and the frequency of gene containing SSRs was 46.0% in 40,976 genes of G. raimondii. These genome-based SSRs developed in the present study will lay the groundwork for developing large numbers of SSR markers for genetic mapping, gene discovery, genetic diversity analysis, and MAS breeding in cotton. Biomedical Informatics 2012-09-11 /pmc/articles/PMC3488841/ /pubmed/23139588 http://dx.doi.org/10.6026/97320630008801 Text en © 2012 Biomedical Informatics This is an open-access article, which permits unrestricted use, distribution, and reproduction in any medium, for non-commercial purposes, provided the original author and source are credited.
spellingShingle Hypothesis
Zou, Changsong
Lu, Cairui
Zhang, Youping
Song, Guoli
Distribution and characterization of simple sequence repeats in Gossypium raimondii genome
title Distribution and characterization of simple sequence repeats in Gossypium raimondii genome
title_full Distribution and characterization of simple sequence repeats in Gossypium raimondii genome
title_fullStr Distribution and characterization of simple sequence repeats in Gossypium raimondii genome
title_full_unstemmed Distribution and characterization of simple sequence repeats in Gossypium raimondii genome
title_short Distribution and characterization of simple sequence repeats in Gossypium raimondii genome
title_sort distribution and characterization of simple sequence repeats in gossypium raimondii genome
topic Hypothesis
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3488841/
https://www.ncbi.nlm.nih.gov/pubmed/23139588
http://dx.doi.org/10.6026/97320630008801
work_keys_str_mv AT zouchangsong distributionandcharacterizationofsimplesequencerepeatsingossypiumraimondiigenome
AT lucairui distributionandcharacterizationofsimplesequencerepeatsingossypiumraimondiigenome
AT zhangyouping distributionandcharacterizationofsimplesequencerepeatsingossypiumraimondiigenome
AT songguoli distributionandcharacterizationofsimplesequencerepeatsingossypiumraimondiigenome