Cargando…

Large-scale analysis reveals that the genome features of simple sequence repeats are generally conserved at the family level in insects

BACKGROUND: Simple sequence repeats (SSR), also called microsatellites, have been widely used as genetic markers, and have been extensively studied in some model insects. At present, the genomes of more than 100 insect species are available. However, the features of SSRs in most insect genomes remai...

Descripción completa

Detalles Bibliográficos
Autores principales: Ding, Simin, Wang, Shuping, He, Kang, Jiang, Mingxing, Li, Fei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5674736/
https://www.ncbi.nlm.nih.gov/pubmed/29110701
http://dx.doi.org/10.1186/s12864-017-4234-0
_version_ 1783276838451150848
author Ding, Simin
Wang, Shuping
He, Kang
Jiang, Mingxing
Li, Fei
author_facet Ding, Simin
Wang, Shuping
He, Kang
Jiang, Mingxing
Li, Fei
author_sort Ding, Simin
collection PubMed
description BACKGROUND: Simple sequence repeats (SSR), also called microsatellites, have been widely used as genetic markers, and have been extensively studied in some model insects. At present, the genomes of more than 100 insect species are available. However, the features of SSRs in most insect genomes remain largely unknown. RESULTS: We identified 15.01 million SSRs across 136 insect genomes. The number of identified SSRs was positively associated with genome size in insects, but the frequency and density per megabase of genomes were not. Most insect SSRs (56.2−93.1%) were perfect (no mismatch). Imperfect (at least one mismatch) SSRs (average length 22−73 bp) were longer than perfect SSRs (16−30 bp). The most abundant insect SSRs were the di- and trinucleotide types, which accounted for 27.2% and 22.0% of all SSRs, respectively. On average, 59.1%, 36.8%, and 3.7% of insect SSRs were located in intergenic, intronic, and exonic regions, respectively. The percentages of various types of SSRs were similar among insects from the same family. However, they were dissimilar among insects from different families within orders. We carried out a phylogenetic analysis using the SSR frequencies. Species from the same family were generally clustered together in the evolutionary tree. However, insects from the same order but not in the same family did not cluster together. These results indicated that although SSRs undergo rapid expansions and contractions in different populations of the same species, the general genomic features of insect SSRs remain conserved at the family level. CONCLUSION: Millions of insect SSRs were identified and their genome features were analyzed. Most insect SSRs were perfect and were located in intergenic regions. We presented evidence that the variance of insect SSRs accumulated after the differentiation of insect families. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-017-4234-0) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5674736
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-56747362017-11-15 Large-scale analysis reveals that the genome features of simple sequence repeats are generally conserved at the family level in insects Ding, Simin Wang, Shuping He, Kang Jiang, Mingxing Li, Fei BMC Genomics Research Article BACKGROUND: Simple sequence repeats (SSR), also called microsatellites, have been widely used as genetic markers, and have been extensively studied in some model insects. At present, the genomes of more than 100 insect species are available. However, the features of SSRs in most insect genomes remain largely unknown. RESULTS: We identified 15.01 million SSRs across 136 insect genomes. The number of identified SSRs was positively associated with genome size in insects, but the frequency and density per megabase of genomes were not. Most insect SSRs (56.2−93.1%) were perfect (no mismatch). Imperfect (at least one mismatch) SSRs (average length 22−73 bp) were longer than perfect SSRs (16−30 bp). The most abundant insect SSRs were the di- and trinucleotide types, which accounted for 27.2% and 22.0% of all SSRs, respectively. On average, 59.1%, 36.8%, and 3.7% of insect SSRs were located in intergenic, intronic, and exonic regions, respectively. The percentages of various types of SSRs were similar among insects from the same family. However, they were dissimilar among insects from different families within orders. We carried out a phylogenetic analysis using the SSR frequencies. Species from the same family were generally clustered together in the evolutionary tree. However, insects from the same order but not in the same family did not cluster together. These results indicated that although SSRs undergo rapid expansions and contractions in different populations of the same species, the general genomic features of insect SSRs remain conserved at the family level. CONCLUSION: Millions of insect SSRs were identified and their genome features were analyzed. Most insect SSRs were perfect and were located in intergenic regions. We presented evidence that the variance of insect SSRs accumulated after the differentiation of insect families. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-017-4234-0) contains supplementary material, which is available to authorized users. BioMed Central 2017-11-06 /pmc/articles/PMC5674736/ /pubmed/29110701 http://dx.doi.org/10.1186/s12864-017-4234-0 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Ding, Simin
Wang, Shuping
He, Kang
Jiang, Mingxing
Li, Fei
Large-scale analysis reveals that the genome features of simple sequence repeats are generally conserved at the family level in insects
title Large-scale analysis reveals that the genome features of simple sequence repeats are generally conserved at the family level in insects
title_full Large-scale analysis reveals that the genome features of simple sequence repeats are generally conserved at the family level in insects
title_fullStr Large-scale analysis reveals that the genome features of simple sequence repeats are generally conserved at the family level in insects
title_full_unstemmed Large-scale analysis reveals that the genome features of simple sequence repeats are generally conserved at the family level in insects
title_short Large-scale analysis reveals that the genome features of simple sequence repeats are generally conserved at the family level in insects
title_sort large-scale analysis reveals that the genome features of simple sequence repeats are generally conserved at the family level in insects
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5674736/
https://www.ncbi.nlm.nih.gov/pubmed/29110701
http://dx.doi.org/10.1186/s12864-017-4234-0
work_keys_str_mv AT dingsimin largescaleanalysisrevealsthatthegenomefeaturesofsimplesequencerepeatsaregenerallyconservedatthefamilylevelininsects
AT wangshuping largescaleanalysisrevealsthatthegenomefeaturesofsimplesequencerepeatsaregenerallyconservedatthefamilylevelininsects
AT hekang largescaleanalysisrevealsthatthegenomefeaturesofsimplesequencerepeatsaregenerallyconservedatthefamilylevelininsects
AT jiangmingxing largescaleanalysisrevealsthatthegenomefeaturesofsimplesequencerepeatsaregenerallyconservedatthefamilylevelininsects
AT lifei largescaleanalysisrevealsthatthegenomefeaturesofsimplesequencerepeatsaregenerallyconservedatthefamilylevelininsects