Cargando…
Large-scale analysis reveals that the genome features of simple sequence repeats are generally conserved at the family level in insects
BACKGROUND: Simple sequence repeats (SSR), also called microsatellites, have been widely used as genetic markers, and have been extensively studied in some model insects. At present, the genomes of more than 100 insect species are available. However, the features of SSRs in most insect genomes remai...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5674736/ https://www.ncbi.nlm.nih.gov/pubmed/29110701 http://dx.doi.org/10.1186/s12864-017-4234-0 |
_version_ | 1783276838451150848 |
---|---|
author | Ding, Simin Wang, Shuping He, Kang Jiang, Mingxing Li, Fei |
author_facet | Ding, Simin Wang, Shuping He, Kang Jiang, Mingxing Li, Fei |
author_sort | Ding, Simin |
collection | PubMed |
description | BACKGROUND: Simple sequence repeats (SSR), also called microsatellites, have been widely used as genetic markers, and have been extensively studied in some model insects. At present, the genomes of more than 100 insect species are available. However, the features of SSRs in most insect genomes remain largely unknown. RESULTS: We identified 15.01 million SSRs across 136 insect genomes. The number of identified SSRs was positively associated with genome size in insects, but the frequency and density per megabase of genomes were not. Most insect SSRs (56.2−93.1%) were perfect (no mismatch). Imperfect (at least one mismatch) SSRs (average length 22−73 bp) were longer than perfect SSRs (16−30 bp). The most abundant insect SSRs were the di- and trinucleotide types, which accounted for 27.2% and 22.0% of all SSRs, respectively. On average, 59.1%, 36.8%, and 3.7% of insect SSRs were located in intergenic, intronic, and exonic regions, respectively. The percentages of various types of SSRs were similar among insects from the same family. However, they were dissimilar among insects from different families within orders. We carried out a phylogenetic analysis using the SSR frequencies. Species from the same family were generally clustered together in the evolutionary tree. However, insects from the same order but not in the same family did not cluster together. These results indicated that although SSRs undergo rapid expansions and contractions in different populations of the same species, the general genomic features of insect SSRs remain conserved at the family level. CONCLUSION: Millions of insect SSRs were identified and their genome features were analyzed. Most insect SSRs were perfect and were located in intergenic regions. We presented evidence that the variance of insect SSRs accumulated after the differentiation of insect families. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-017-4234-0) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5674736 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-56747362017-11-15 Large-scale analysis reveals that the genome features of simple sequence repeats are generally conserved at the family level in insects Ding, Simin Wang, Shuping He, Kang Jiang, Mingxing Li, Fei BMC Genomics Research Article BACKGROUND: Simple sequence repeats (SSR), also called microsatellites, have been widely used as genetic markers, and have been extensively studied in some model insects. At present, the genomes of more than 100 insect species are available. However, the features of SSRs in most insect genomes remain largely unknown. RESULTS: We identified 15.01 million SSRs across 136 insect genomes. The number of identified SSRs was positively associated with genome size in insects, but the frequency and density per megabase of genomes were not. Most insect SSRs (56.2−93.1%) were perfect (no mismatch). Imperfect (at least one mismatch) SSRs (average length 22−73 bp) were longer than perfect SSRs (16−30 bp). The most abundant insect SSRs were the di- and trinucleotide types, which accounted for 27.2% and 22.0% of all SSRs, respectively. On average, 59.1%, 36.8%, and 3.7% of insect SSRs were located in intergenic, intronic, and exonic regions, respectively. The percentages of various types of SSRs were similar among insects from the same family. However, they were dissimilar among insects from different families within orders. We carried out a phylogenetic analysis using the SSR frequencies. Species from the same family were generally clustered together in the evolutionary tree. However, insects from the same order but not in the same family did not cluster together. These results indicated that although SSRs undergo rapid expansions and contractions in different populations of the same species, the general genomic features of insect SSRs remain conserved at the family level. CONCLUSION: Millions of insect SSRs were identified and their genome features were analyzed. Most insect SSRs were perfect and were located in intergenic regions. We presented evidence that the variance of insect SSRs accumulated after the differentiation of insect families. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-017-4234-0) contains supplementary material, which is available to authorized users. BioMed Central 2017-11-06 /pmc/articles/PMC5674736/ /pubmed/29110701 http://dx.doi.org/10.1186/s12864-017-4234-0 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Ding, Simin Wang, Shuping He, Kang Jiang, Mingxing Li, Fei Large-scale analysis reveals that the genome features of simple sequence repeats are generally conserved at the family level in insects |
title | Large-scale analysis reveals that the genome features of simple sequence repeats are generally conserved at the family level in insects |
title_full | Large-scale analysis reveals that the genome features of simple sequence repeats are generally conserved at the family level in insects |
title_fullStr | Large-scale analysis reveals that the genome features of simple sequence repeats are generally conserved at the family level in insects |
title_full_unstemmed | Large-scale analysis reveals that the genome features of simple sequence repeats are generally conserved at the family level in insects |
title_short | Large-scale analysis reveals that the genome features of simple sequence repeats are generally conserved at the family level in insects |
title_sort | large-scale analysis reveals that the genome features of simple sequence repeats are generally conserved at the family level in insects |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5674736/ https://www.ncbi.nlm.nih.gov/pubmed/29110701 http://dx.doi.org/10.1186/s12864-017-4234-0 |
work_keys_str_mv | AT dingsimin largescaleanalysisrevealsthatthegenomefeaturesofsimplesequencerepeatsaregenerallyconservedatthefamilylevelininsects AT wangshuping largescaleanalysisrevealsthatthegenomefeaturesofsimplesequencerepeatsaregenerallyconservedatthefamilylevelininsects AT hekang largescaleanalysisrevealsthatthegenomefeaturesofsimplesequencerepeatsaregenerallyconservedatthefamilylevelininsects AT jiangmingxing largescaleanalysisrevealsthatthegenomefeaturesofsimplesequencerepeatsaregenerallyconservedatthefamilylevelininsects AT lifei largescaleanalysisrevealsthatthegenomefeaturesofsimplesequencerepeatsaregenerallyconservedatthefamilylevelininsects |