Cargando…

Patterns of microsatellite distribution across eukaryotic genomes

BACKGROUND: Microsatellites, or Simple Sequence Repeats (SSRs), are short tandem repeats of 1–6 nt motifs present in all genomes. Emerging evidence points to their role in cellular processes and gene regulation. Despite the huge resource of genomic information currently available, SSRs have been stu...

Descripción completa

Detalles Bibliográficos
Autores principales: Srivastava, Surabhi, Avvaru, Akshay Kumar, Sowpati, Divya Tej, Mishra, Rakesh K.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6387519/
https://www.ncbi.nlm.nih.gov/pubmed/30795733
http://dx.doi.org/10.1186/s12864-019-5516-5
_version_ 1783397600494354432
author Srivastava, Surabhi
Avvaru, Akshay Kumar
Sowpati, Divya Tej
Mishra, Rakesh K.
author_facet Srivastava, Surabhi
Avvaru, Akshay Kumar
Sowpati, Divya Tej
Mishra, Rakesh K.
author_sort Srivastava, Surabhi
collection PubMed
description BACKGROUND: Microsatellites, or Simple Sequence Repeats (SSRs), are short tandem repeats of 1–6 nt motifs present in all genomes. Emerging evidence points to their role in cellular processes and gene regulation. Despite the huge resource of genomic information currently available, SSRs have been studied in a limited context and compared across relatively few species. RESULTS: We have identified ~ 685 million eukaryotic microsatellites and analyzed their genomic trends across 15 taxonomic subgroups from protists to mammals. The distribution of SSRs reveals taxon-specific variations in their exonic, intronic and intergenic densities. Our analysis reveals the differences among non-related species and novel patterns uniquely demarcating closely related species. We document several repeats common across subgroups as well as rare SSRs that are excluded almost throughout evolution. We further identify species-specific signatures in pathogens like Leishmania as well as in cereal crops, Drosophila, birds and primates. We also find that distinct SSRs preferentially exist as long repeating units in different subgroups; most unicellular organisms show no length preference for any SSR class, while many SSR motifs accumulate as long repeats in complex organisms, especially in mammals. CONCLUSIONS: We present a comprehensive analysis of SSRs across taxa at an unprecedented scale. Our analysis indicates that the SSR composition of organisms with heterogeneous cell types is highly constrained, while simpler organisms such as protists, green algae and fungi show greater diversity in motif abundance, density and GC content. The microsatellite dataset generated in this work provides a large number of candidates for functional analysis and for studying their roles across the evolutionary landscape. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-019-5516-5) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6387519
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-63875192019-03-04 Patterns of microsatellite distribution across eukaryotic genomes Srivastava, Surabhi Avvaru, Akshay Kumar Sowpati, Divya Tej Mishra, Rakesh K. BMC Genomics Research Article BACKGROUND: Microsatellites, or Simple Sequence Repeats (SSRs), are short tandem repeats of 1–6 nt motifs present in all genomes. Emerging evidence points to their role in cellular processes and gene regulation. Despite the huge resource of genomic information currently available, SSRs have been studied in a limited context and compared across relatively few species. RESULTS: We have identified ~ 685 million eukaryotic microsatellites and analyzed their genomic trends across 15 taxonomic subgroups from protists to mammals. The distribution of SSRs reveals taxon-specific variations in their exonic, intronic and intergenic densities. Our analysis reveals the differences among non-related species and novel patterns uniquely demarcating closely related species. We document several repeats common across subgroups as well as rare SSRs that are excluded almost throughout evolution. We further identify species-specific signatures in pathogens like Leishmania as well as in cereal crops, Drosophila, birds and primates. We also find that distinct SSRs preferentially exist as long repeating units in different subgroups; most unicellular organisms show no length preference for any SSR class, while many SSR motifs accumulate as long repeats in complex organisms, especially in mammals. CONCLUSIONS: We present a comprehensive analysis of SSRs across taxa at an unprecedented scale. Our analysis indicates that the SSR composition of organisms with heterogeneous cell types is highly constrained, while simpler organisms such as protists, green algae and fungi show greater diversity in motif abundance, density and GC content. The microsatellite dataset generated in this work provides a large number of candidates for functional analysis and for studying their roles across the evolutionary landscape. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-019-5516-5) contains supplementary material, which is available to authorized users. BioMed Central 2019-02-22 /pmc/articles/PMC6387519/ /pubmed/30795733 http://dx.doi.org/10.1186/s12864-019-5516-5 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Srivastava, Surabhi
Avvaru, Akshay Kumar
Sowpati, Divya Tej
Mishra, Rakesh K.
Patterns of microsatellite distribution across eukaryotic genomes
title Patterns of microsatellite distribution across eukaryotic genomes
title_full Patterns of microsatellite distribution across eukaryotic genomes
title_fullStr Patterns of microsatellite distribution across eukaryotic genomes
title_full_unstemmed Patterns of microsatellite distribution across eukaryotic genomes
title_short Patterns of microsatellite distribution across eukaryotic genomes
title_sort patterns of microsatellite distribution across eukaryotic genomes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6387519/
https://www.ncbi.nlm.nih.gov/pubmed/30795733
http://dx.doi.org/10.1186/s12864-019-5516-5
work_keys_str_mv AT srivastavasurabhi patternsofmicrosatellitedistributionacrosseukaryoticgenomes
AT avvaruakshaykumar patternsofmicrosatellitedistributionacrosseukaryoticgenomes
AT sowpatidivyatej patternsofmicrosatellitedistributionacrosseukaryoticgenomes
AT mishrarakeshk patternsofmicrosatellitedistributionacrosseukaryoticgenomes