Cargando…

Detecting microsatellites within genomes: significant variation among algorithms

BACKGROUND: Microsatellites are short, tandemly-repeated DNA sequences which are widely distributed among genomes. Their structure, role and evolution can be analyzed based on exhaustive extraction from sequenced genomes. Several dedicated algorithms have been developed for this purpose. Here, we co...

Descripción completa

Detalles Bibliográficos
Autores principales: Leclercq, Sébastien, Rivals, Eric, Jarne, Philippe
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1876248/
https://www.ncbi.nlm.nih.gov/pubmed/17442102
http://dx.doi.org/10.1186/1471-2105-8-125
_version_ 1782133520074801152
author Leclercq, Sébastien
Rivals, Eric
Jarne, Philippe
author_facet Leclercq, Sébastien
Rivals, Eric
Jarne, Philippe
author_sort Leclercq, Sébastien
collection PubMed
description BACKGROUND: Microsatellites are short, tandemly-repeated DNA sequences which are widely distributed among genomes. Their structure, role and evolution can be analyzed based on exhaustive extraction from sequenced genomes. Several dedicated algorithms have been developed for this purpose. Here, we compared the detection efficiency of five of them (TRF, Mreps, Sputnik, STAR, and RepeatMasker). RESULTS: Our analysis was first conducted on the human X chromosome, and microsatellite distributions were characterized by microsatellite number, length, and divergence from a pure motif. The algorithms work with user-defined parameters, and we demonstrate that the parameter values chosen can strongly influence microsatellite distributions. The five algorithms were then compared by fixing parameters settings, and the analysis was extended to three other genomes (Saccharomyces cerevisiae, Neurospora crassa and Drosophila melanogaster) spanning a wide range of size and structure. Significant differences for all characteristics of microsatellites were observed among algorithms, but not among genomes, for both perfect and imperfect microsatellites. Striking differences were detected for short microsatellites (below 20 bp), regardless of motif. CONCLUSION: Since the algorithm used strongly influences empirical distributions, studies analyzing microsatellite evolution based on a comparison between empirical and theoretical size distributions should therefore be considered with caution. We also discuss why a typological definition of microsatellites limits our capacity to capture their genomic distributions.
format Text
id pubmed-1876248
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-18762482007-05-22 Detecting microsatellites within genomes: significant variation among algorithms Leclercq, Sébastien Rivals, Eric Jarne, Philippe BMC Bioinformatics Research Article BACKGROUND: Microsatellites are short, tandemly-repeated DNA sequences which are widely distributed among genomes. Their structure, role and evolution can be analyzed based on exhaustive extraction from sequenced genomes. Several dedicated algorithms have been developed for this purpose. Here, we compared the detection efficiency of five of them (TRF, Mreps, Sputnik, STAR, and RepeatMasker). RESULTS: Our analysis was first conducted on the human X chromosome, and microsatellite distributions were characterized by microsatellite number, length, and divergence from a pure motif. The algorithms work with user-defined parameters, and we demonstrate that the parameter values chosen can strongly influence microsatellite distributions. The five algorithms were then compared by fixing parameters settings, and the analysis was extended to three other genomes (Saccharomyces cerevisiae, Neurospora crassa and Drosophila melanogaster) spanning a wide range of size and structure. Significant differences for all characteristics of microsatellites were observed among algorithms, but not among genomes, for both perfect and imperfect microsatellites. Striking differences were detected for short microsatellites (below 20 bp), regardless of motif. CONCLUSION: Since the algorithm used strongly influences empirical distributions, studies analyzing microsatellite evolution based on a comparison between empirical and theoretical size distributions should therefore be considered with caution. We also discuss why a typological definition of microsatellites limits our capacity to capture their genomic distributions. BioMed Central 2007-04-18 /pmc/articles/PMC1876248/ /pubmed/17442102 http://dx.doi.org/10.1186/1471-2105-8-125 Text en Copyright © 2007 Leclercq et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Leclercq, Sébastien
Rivals, Eric
Jarne, Philippe
Detecting microsatellites within genomes: significant variation among algorithms
title Detecting microsatellites within genomes: significant variation among algorithms
title_full Detecting microsatellites within genomes: significant variation among algorithms
title_fullStr Detecting microsatellites within genomes: significant variation among algorithms
title_full_unstemmed Detecting microsatellites within genomes: significant variation among algorithms
title_short Detecting microsatellites within genomes: significant variation among algorithms
title_sort detecting microsatellites within genomes: significant variation among algorithms
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1876248/
https://www.ncbi.nlm.nih.gov/pubmed/17442102
http://dx.doi.org/10.1186/1471-2105-8-125
work_keys_str_mv AT leclercqsebastien detectingmicrosatelliteswithingenomessignificantvariationamongalgorithms
AT rivalseric detectingmicrosatelliteswithingenomessignificantvariationamongalgorithms
AT jarnephilippe detectingmicrosatelliteswithingenomessignificantvariationamongalgorithms