Cargando…

Large-scale identification of polymorphic microsatellites using an in silico approach

BACKGROUND: Simple Sequence Repeat (SSR) or microsatellite markers are valuable for genetic research. Experimental methods to develop SSR markers are laborious, time consuming and expensive. In silico approaches have become a practicable and relatively inexpensive alternative during the last decade,...

Descripción completa

Detalles Bibliográficos
Autores principales: Tang, Jifeng, Baldwin, Samantha J, Jacobs, Jeanne ME, Linden, C Gerard van der, Voorrips, Roeland E, Leunissen, Jack AM, van Eck, Herman, Vosman, Ben
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2562394/
https://www.ncbi.nlm.nih.gov/pubmed/18793407
http://dx.doi.org/10.1186/1471-2105-9-374
_version_ 1782159747416326144
author Tang, Jifeng
Baldwin, Samantha J
Jacobs, Jeanne ME
Linden, C Gerard van der
Voorrips, Roeland E
Leunissen, Jack AM
van Eck, Herman
Vosman, Ben
author_facet Tang, Jifeng
Baldwin, Samantha J
Jacobs, Jeanne ME
Linden, C Gerard van der
Voorrips, Roeland E
Leunissen, Jack AM
van Eck, Herman
Vosman, Ben
author_sort Tang, Jifeng
collection PubMed
description BACKGROUND: Simple Sequence Repeat (SSR) or microsatellite markers are valuable for genetic research. Experimental methods to develop SSR markers are laborious, time consuming and expensive. In silico approaches have become a practicable and relatively inexpensive alternative during the last decade, although testing putative SSR markers still is time consuming and expensive. In many species only a relatively small percentage of SSR markers turn out to be polymorphic. This is particularly true for markers derived from expressed sequence tags (ESTs). In EST databases a large redundancy of sequences is present, which may contain information on length-polymorphisms in the SSR they contain, and whether they have been derived from heterozygotes or from different genotypes. Up to now, although a number of programs have been developed to identify SSRs in EST sequences, no software can detect putatively polymorphic SSRs. RESULTS: We have developed PolySSR, a new pipeline to identify polymorphic SSRs rather than just SSRs. Sequence information is obtained from public EST databases derived from heterozygous individuals and/or at least two different genotypes. The pipeline includes PCR-primer design for the putatively polymorphic SSR markers, taking into account Single Nucleotide Polymorphisms (SNPs) in the flanking regions, thereby improving the success rate of the potential markers. A large number of polymorphic SSRs were identified using publicly available EST sequences of potato, tomato, rice, Arabidopsis, Brassica and chicken. The SSRs obtained were divided into long and short based on the number of times the motif was repeated. Surprisingly, the frequency of polymorphic SSRs was much higher in the short SSRs. CONCLUSION: PolySSR is a very effective tool to identify polymorphic SSRs. Using PolySSR, several hundred putative markers were developed and stored in a searchable database. Validation experiments showed that almost all markers that were indicated as putatively polymorphic by polySSR were indeed polymorphic. This greatly improves the efficiency of marker development, especially in species where there are low levels of polymorphism, like tomato. When combined with the new sequencing technologies PolySSR will have a big impact on the development of polymorphic SSRs in any species. PolySSR and the polymorphic SSR marker database are available from .
format Text
id pubmed-2562394
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-25623942008-10-07 Large-scale identification of polymorphic microsatellites using an in silico approach Tang, Jifeng Baldwin, Samantha J Jacobs, Jeanne ME Linden, C Gerard van der Voorrips, Roeland E Leunissen, Jack AM van Eck, Herman Vosman, Ben BMC Bioinformatics Methodology Article BACKGROUND: Simple Sequence Repeat (SSR) or microsatellite markers are valuable for genetic research. Experimental methods to develop SSR markers are laborious, time consuming and expensive. In silico approaches have become a practicable and relatively inexpensive alternative during the last decade, although testing putative SSR markers still is time consuming and expensive. In many species only a relatively small percentage of SSR markers turn out to be polymorphic. This is particularly true for markers derived from expressed sequence tags (ESTs). In EST databases a large redundancy of sequences is present, which may contain information on length-polymorphisms in the SSR they contain, and whether they have been derived from heterozygotes or from different genotypes. Up to now, although a number of programs have been developed to identify SSRs in EST sequences, no software can detect putatively polymorphic SSRs. RESULTS: We have developed PolySSR, a new pipeline to identify polymorphic SSRs rather than just SSRs. Sequence information is obtained from public EST databases derived from heterozygous individuals and/or at least two different genotypes. The pipeline includes PCR-primer design for the putatively polymorphic SSR markers, taking into account Single Nucleotide Polymorphisms (SNPs) in the flanking regions, thereby improving the success rate of the potential markers. A large number of polymorphic SSRs were identified using publicly available EST sequences of potato, tomato, rice, Arabidopsis, Brassica and chicken. The SSRs obtained were divided into long and short based on the number of times the motif was repeated. Surprisingly, the frequency of polymorphic SSRs was much higher in the short SSRs. CONCLUSION: PolySSR is a very effective tool to identify polymorphic SSRs. Using PolySSR, several hundred putative markers were developed and stored in a searchable database. Validation experiments showed that almost all markers that were indicated as putatively polymorphic by polySSR were indeed polymorphic. This greatly improves the efficiency of marker development, especially in species where there are low levels of polymorphism, like tomato. When combined with the new sequencing technologies PolySSR will have a big impact on the development of polymorphic SSRs in any species. PolySSR and the polymorphic SSR marker database are available from . BioMed Central 2008-09-15 /pmc/articles/PMC2562394/ /pubmed/18793407 http://dx.doi.org/10.1186/1471-2105-9-374 Text en Copyright © 2008 Tang et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Tang, Jifeng
Baldwin, Samantha J
Jacobs, Jeanne ME
Linden, C Gerard van der
Voorrips, Roeland E
Leunissen, Jack AM
van Eck, Herman
Vosman, Ben
Large-scale identification of polymorphic microsatellites using an in silico approach
title Large-scale identification of polymorphic microsatellites using an in silico approach
title_full Large-scale identification of polymorphic microsatellites using an in silico approach
title_fullStr Large-scale identification of polymorphic microsatellites using an in silico approach
title_full_unstemmed Large-scale identification of polymorphic microsatellites using an in silico approach
title_short Large-scale identification of polymorphic microsatellites using an in silico approach
title_sort large-scale identification of polymorphic microsatellites using an in silico approach
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2562394/
https://www.ncbi.nlm.nih.gov/pubmed/18793407
http://dx.doi.org/10.1186/1471-2105-9-374
work_keys_str_mv AT tangjifeng largescaleidentificationofpolymorphicmicrosatellitesusinganinsilicoapproach
AT baldwinsamanthaj largescaleidentificationofpolymorphicmicrosatellitesusinganinsilicoapproach
AT jacobsjeanneme largescaleidentificationofpolymorphicmicrosatellitesusinganinsilicoapproach
AT lindencgerardvander largescaleidentificationofpolymorphicmicrosatellitesusinganinsilicoapproach
AT voorripsroelande largescaleidentificationofpolymorphicmicrosatellitesusinganinsilicoapproach
AT leunissenjackam largescaleidentificationofpolymorphicmicrosatellitesusinganinsilicoapproach
AT vaneckherman largescaleidentificationofpolymorphicmicrosatellitesusinganinsilicoapproach
AT vosmanben largescaleidentificationofpolymorphicmicrosatellitesusinganinsilicoapproach