Cargando…
Distribution and characterization of simple sequence repeats in Gossypium raimondii genome
Simple sequence repeats (SSRs) can be derived from the complete genome sequence. These markers are important for gene mapping as well as marker-assisted selection (MAS). To develop SSRs for cotton gene mapping, we selected the complete genome sequence of Gossypium raimondii, which consisted of 4447...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Biomedical Informatics
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3488841/ https://www.ncbi.nlm.nih.gov/pubmed/23139588 http://dx.doi.org/10.6026/97320630008801 |
_version_ | 1782248686829436928 |
---|---|
author | Zou, Changsong Lu, Cairui Zhang, Youping Song, Guoli |
author_facet | Zou, Changsong Lu, Cairui Zhang, Youping Song, Guoli |
author_sort | Zou, Changsong |
collection | PubMed |
description | Simple sequence repeats (SSRs) can be derived from the complete genome sequence. These markers are important for gene mapping as well as marker-assisted selection (MAS). To develop SSRs for cotton gene mapping, we selected the complete genome sequence of Gossypium raimondii, which consisted of 4447 non-redundant scaffolds. Out of 775.2 Mb sequence examined, a total of 136,345 microsatellites were identified with a density of 5.69 kb per SSR in the G. raimondii genome leading to development of 112,177 primer pairs. The distributions of SSRs in the genome were non-random. Among the different motifs ranging from 1 to 6 bp, penta-nucleotide repeats were most abundant (30.5%), followed by tetra-nucleotide repeats (18.2%) and di-nucleotide repeats (16.9%). Among all identified 457 motif types, the most frequently occurring repeat motifs were poly-AT/TA, which accounted for 79.8% of the total di-nt SSRs, followed by AAAT/TTTA with 51.5% of the total tetra-nucleotede. Further, 18,834 microsatellites were detected from the protein-coding genes, and the frequency of gene containing SSRs was 46.0% in 40,976 genes of G. raimondii. These genome-based SSRs developed in the present study will lay the groundwork for developing large numbers of SSR markers for genetic mapping, gene discovery, genetic diversity analysis, and MAS breeding in cotton. |
format | Online Article Text |
id | pubmed-3488841 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | Biomedical Informatics |
record_format | MEDLINE/PubMed |
spelling | pubmed-34888412012-11-08 Distribution and characterization of simple sequence repeats in Gossypium raimondii genome Zou, Changsong Lu, Cairui Zhang, Youping Song, Guoli Bioinformation Hypothesis Simple sequence repeats (SSRs) can be derived from the complete genome sequence. These markers are important for gene mapping as well as marker-assisted selection (MAS). To develop SSRs for cotton gene mapping, we selected the complete genome sequence of Gossypium raimondii, which consisted of 4447 non-redundant scaffolds. Out of 775.2 Mb sequence examined, a total of 136,345 microsatellites were identified with a density of 5.69 kb per SSR in the G. raimondii genome leading to development of 112,177 primer pairs. The distributions of SSRs in the genome were non-random. Among the different motifs ranging from 1 to 6 bp, penta-nucleotide repeats were most abundant (30.5%), followed by tetra-nucleotide repeats (18.2%) and di-nucleotide repeats (16.9%). Among all identified 457 motif types, the most frequently occurring repeat motifs were poly-AT/TA, which accounted for 79.8% of the total di-nt SSRs, followed by AAAT/TTTA with 51.5% of the total tetra-nucleotede. Further, 18,834 microsatellites were detected from the protein-coding genes, and the frequency of gene containing SSRs was 46.0% in 40,976 genes of G. raimondii. These genome-based SSRs developed in the present study will lay the groundwork for developing large numbers of SSR markers for genetic mapping, gene discovery, genetic diversity analysis, and MAS breeding in cotton. Biomedical Informatics 2012-09-11 /pmc/articles/PMC3488841/ /pubmed/23139588 http://dx.doi.org/10.6026/97320630008801 Text en © 2012 Biomedical Informatics This is an open-access article, which permits unrestricted use, distribution, and reproduction in any medium, for non-commercial purposes, provided the original author and source are credited. |
spellingShingle | Hypothesis Zou, Changsong Lu, Cairui Zhang, Youping Song, Guoli Distribution and characterization of simple sequence repeats in Gossypium raimondii genome |
title | Distribution and characterization of simple sequence repeats in Gossypium raimondii genome |
title_full | Distribution and characterization of simple sequence repeats in Gossypium raimondii genome |
title_fullStr | Distribution and characterization of simple sequence repeats in Gossypium raimondii genome |
title_full_unstemmed | Distribution and characterization of simple sequence repeats in Gossypium raimondii genome |
title_short | Distribution and characterization of simple sequence repeats in Gossypium raimondii genome |
title_sort | distribution and characterization of simple sequence repeats in gossypium raimondii genome |
topic | Hypothesis |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3488841/ https://www.ncbi.nlm.nih.gov/pubmed/23139588 http://dx.doi.org/10.6026/97320630008801 |
work_keys_str_mv | AT zouchangsong distributionandcharacterizationofsimplesequencerepeatsingossypiumraimondiigenome AT lucairui distributionandcharacterizationofsimplesequencerepeatsingossypiumraimondiigenome AT zhangyouping distributionandcharacterizationofsimplesequencerepeatsingossypiumraimondiigenome AT songguoli distributionandcharacterizationofsimplesequencerepeatsingossypiumraimondiigenome |