Cargando…

LINE Insertion Polymorphisms are Abundant but at Low Frequencies across Populations of Anolis carolinensis

Vertebrate genomes differ considerably in size and structure. Among the features that show the most variation is the abundance of Long Interspersed Nuclear Elements (LINEs). Mammalian genomes contain 100,000s LINEs that belong to a single clade, L1, and in most species a single family is usually act...

Descripción completa

Detalles Bibliográficos
Autores principales: Ruggiero, Robert P., Bourgeois, Yann, Boissinot, Stéphane
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5389967/
https://www.ncbi.nlm.nih.gov/pubmed/28450881
http://dx.doi.org/10.3389/fgene.2017.00044
_version_ 1782521361892114432
author Ruggiero, Robert P.
Bourgeois, Yann
Boissinot, Stéphane
author_facet Ruggiero, Robert P.
Bourgeois, Yann
Boissinot, Stéphane
author_sort Ruggiero, Robert P.
collection PubMed
description Vertebrate genomes differ considerably in size and structure. Among the features that show the most variation is the abundance of Long Interspersed Nuclear Elements (LINEs). Mammalian genomes contain 100,000s LINEs that belong to a single clade, L1, and in most species a single family is usually active at a time. In contrast, non-mammalian vertebrates (fish, amphibians and reptiles) contain multiple active families, belonging to several clades, but each of them is represented by a small number of recently inserted copies. It is unclear why vertebrate genomes harbor such drastic differences in LINE composition. To address this issue, we conducted whole genome resequencing to investigate the population genomics of LINEs across 13 genomes of the lizard Anolis carolinensis sampled from two geographically and genetically distinct populations in the Eastern Florida and the Gulf Atlantic regions of the United States. We used the Mobile Element Locator Tool to identify and genotype polymorphic insertions from five major clades of LINEs (CR1, L1, L2, RTE and R4) and the 41 subfamilies that constitute them. Across these groups we found large variation in the frequency of polymorphic insertions and the observed length distributions of these insertions, suggesting these groups vary in their activity and how frequently they successfully generate full-length, potentially active copies. Though we found an abundance of polymorphic insertions (over 45,000) most of these were observed at low frequencies and typically appeared as singletons. Site frequency spectra for most LINEs showed a significant shift toward low frequency alleles compared to the spectra observed for total genomic single nucleotide polymorphisms. Using Tajima’s D, F(ST) and the mean number of pairwise differences in LINE insertion polymorphisms, we found evidence that negative selection is acting on LINE families in a length-dependent manner, its effects being stronger in the larger Eastern Florida population. Our results suggest that a large effective population size and negative selection limit the expansion of polymorphic LINE insertions across these populations and that the probability of LINE polymorphisms reaching fixation is extremely low.
format Online
Article
Text
id pubmed-5389967
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-53899672017-04-27 LINE Insertion Polymorphisms are Abundant but at Low Frequencies across Populations of Anolis carolinensis Ruggiero, Robert P. Bourgeois, Yann Boissinot, Stéphane Front Genet Genetics Vertebrate genomes differ considerably in size and structure. Among the features that show the most variation is the abundance of Long Interspersed Nuclear Elements (LINEs). Mammalian genomes contain 100,000s LINEs that belong to a single clade, L1, and in most species a single family is usually active at a time. In contrast, non-mammalian vertebrates (fish, amphibians and reptiles) contain multiple active families, belonging to several clades, but each of them is represented by a small number of recently inserted copies. It is unclear why vertebrate genomes harbor such drastic differences in LINE composition. To address this issue, we conducted whole genome resequencing to investigate the population genomics of LINEs across 13 genomes of the lizard Anolis carolinensis sampled from two geographically and genetically distinct populations in the Eastern Florida and the Gulf Atlantic regions of the United States. We used the Mobile Element Locator Tool to identify and genotype polymorphic insertions from five major clades of LINEs (CR1, L1, L2, RTE and R4) and the 41 subfamilies that constitute them. Across these groups we found large variation in the frequency of polymorphic insertions and the observed length distributions of these insertions, suggesting these groups vary in their activity and how frequently they successfully generate full-length, potentially active copies. Though we found an abundance of polymorphic insertions (over 45,000) most of these were observed at low frequencies and typically appeared as singletons. Site frequency spectra for most LINEs showed a significant shift toward low frequency alleles compared to the spectra observed for total genomic single nucleotide polymorphisms. Using Tajima’s D, F(ST) and the mean number of pairwise differences in LINE insertion polymorphisms, we found evidence that negative selection is acting on LINE families in a length-dependent manner, its effects being stronger in the larger Eastern Florida population. Our results suggest that a large effective population size and negative selection limit the expansion of polymorphic LINE insertions across these populations and that the probability of LINE polymorphisms reaching fixation is extremely low. Frontiers Media S.A. 2017-04-13 /pmc/articles/PMC5389967/ /pubmed/28450881 http://dx.doi.org/10.3389/fgene.2017.00044 Text en Copyright © 2017 Ruggiero, Bourgeois and Boissinot. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Ruggiero, Robert P.
Bourgeois, Yann
Boissinot, Stéphane
LINE Insertion Polymorphisms are Abundant but at Low Frequencies across Populations of Anolis carolinensis
title LINE Insertion Polymorphisms are Abundant but at Low Frequencies across Populations of Anolis carolinensis
title_full LINE Insertion Polymorphisms are Abundant but at Low Frequencies across Populations of Anolis carolinensis
title_fullStr LINE Insertion Polymorphisms are Abundant but at Low Frequencies across Populations of Anolis carolinensis
title_full_unstemmed LINE Insertion Polymorphisms are Abundant but at Low Frequencies across Populations of Anolis carolinensis
title_short LINE Insertion Polymorphisms are Abundant but at Low Frequencies across Populations of Anolis carolinensis
title_sort line insertion polymorphisms are abundant but at low frequencies across populations of anolis carolinensis
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5389967/
https://www.ncbi.nlm.nih.gov/pubmed/28450881
http://dx.doi.org/10.3389/fgene.2017.00044
work_keys_str_mv AT ruggierorobertp lineinsertionpolymorphismsareabundantbutatlowfrequenciesacrosspopulationsofanoliscarolinensis
AT bourgeoisyann lineinsertionpolymorphismsareabundantbutatlowfrequenciesacrosspopulationsofanoliscarolinensis
AT boissinotstephane lineinsertionpolymorphismsareabundantbutatlowfrequenciesacrosspopulationsofanoliscarolinensis