Cargando…

Sequence Analysis and Characterization of Active Human Alu Subfamilies Based on the 1000 Genomes Pilot Project

The goal of the 1000 Genomes Consortium is to characterize human genome structural variation (SV), including forms of copy number variations such as deletions, duplications, and insertions. Mobile element insertions, particularly Alu elements, are major contributors to genomic SV among humans. Durin...

Descripción completa

Detalles Bibliográficos
Autores principales: Konkel, Miriam K., Walker, Jerilyn A., Hotard, Ashley B., Ranck, Megan C., Fontenot, Catherine C., Storer, Jessica, Stewart, Chip, Marth, Gabor T., Batzer, Mark A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4607524/
https://www.ncbi.nlm.nih.gov/pubmed/26319576
http://dx.doi.org/10.1093/gbe/evv167
_version_ 1782395529489022976
author Konkel, Miriam K.
Walker, Jerilyn A.
Hotard, Ashley B.
Ranck, Megan C.
Fontenot, Catherine C.
Storer, Jessica
Stewart, Chip
Marth, Gabor T.
Batzer, Mark A.
author_facet Konkel, Miriam K.
Walker, Jerilyn A.
Hotard, Ashley B.
Ranck, Megan C.
Fontenot, Catherine C.
Storer, Jessica
Stewart, Chip
Marth, Gabor T.
Batzer, Mark A.
author_sort Konkel, Miriam K.
collection PubMed
description The goal of the 1000 Genomes Consortium is to characterize human genome structural variation (SV), including forms of copy number variations such as deletions, duplications, and insertions. Mobile element insertions, particularly Alu elements, are major contributors to genomic SV among humans. During the pilot phase of the project we experimentally validated 645 (611 intergenic and 34 exon targeted) polymorphic “young” Alu insertion events, absent from the human reference genome. Here, we report high resolution sequencing of 343 (322 unique) recent Alu insertion events, along with their respective target site duplications, precise genomic breakpoint coordinates, subfamily assignment, percent divergence, and estimated A-rich tail lengths. All the sequenced Alu loci were derived from the AluY lineage with no evidence of retrotransposition activity involving older Alu families (e.g., AluJ and AluS). AluYa5 is currently the most active Alu subfamily in the human lineage, followed by AluYb8, and many others including three newly identified subfamilies we have termed AluYb7a3, AluYb8b1, and AluYa4a1. This report provides the structural details of 322 unique Alu variants from individual human genomes collectively adding about 100 kb of genomic variation. Many Alu subfamilies are currently active in human populations, including a surprising level of AluY retrotransposition. Human Alu subfamilies exhibit continuous evolution with potential drivers sprouting new Alu lineages.
format Online
Article
Text
id pubmed-4607524
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-46075242015-10-19 Sequence Analysis and Characterization of Active Human Alu Subfamilies Based on the 1000 Genomes Pilot Project Konkel, Miriam K. Walker, Jerilyn A. Hotard, Ashley B. Ranck, Megan C. Fontenot, Catherine C. Storer, Jessica Stewart, Chip Marth, Gabor T. Batzer, Mark A. Genome Biol Evol Research Article The goal of the 1000 Genomes Consortium is to characterize human genome structural variation (SV), including forms of copy number variations such as deletions, duplications, and insertions. Mobile element insertions, particularly Alu elements, are major contributors to genomic SV among humans. During the pilot phase of the project we experimentally validated 645 (611 intergenic and 34 exon targeted) polymorphic “young” Alu insertion events, absent from the human reference genome. Here, we report high resolution sequencing of 343 (322 unique) recent Alu insertion events, along with their respective target site duplications, precise genomic breakpoint coordinates, subfamily assignment, percent divergence, and estimated A-rich tail lengths. All the sequenced Alu loci were derived from the AluY lineage with no evidence of retrotransposition activity involving older Alu families (e.g., AluJ and AluS). AluYa5 is currently the most active Alu subfamily in the human lineage, followed by AluYb8, and many others including three newly identified subfamilies we have termed AluYb7a3, AluYb8b1, and AluYa4a1. This report provides the structural details of 322 unique Alu variants from individual human genomes collectively adding about 100 kb of genomic variation. Many Alu subfamilies are currently active in human populations, including a surprising level of AluY retrotransposition. Human Alu subfamilies exhibit continuous evolution with potential drivers sprouting new Alu lineages. Oxford University Press 2015-08-29 /pmc/articles/PMC4607524/ /pubmed/26319576 http://dx.doi.org/10.1093/gbe/evv167 Text en © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Research Article
Konkel, Miriam K.
Walker, Jerilyn A.
Hotard, Ashley B.
Ranck, Megan C.
Fontenot, Catherine C.
Storer, Jessica
Stewart, Chip
Marth, Gabor T.
Batzer, Mark A.
Sequence Analysis and Characterization of Active Human Alu Subfamilies Based on the 1000 Genomes Pilot Project
title Sequence Analysis and Characterization of Active Human Alu Subfamilies Based on the 1000 Genomes Pilot Project
title_full Sequence Analysis and Characterization of Active Human Alu Subfamilies Based on the 1000 Genomes Pilot Project
title_fullStr Sequence Analysis and Characterization of Active Human Alu Subfamilies Based on the 1000 Genomes Pilot Project
title_full_unstemmed Sequence Analysis and Characterization of Active Human Alu Subfamilies Based on the 1000 Genomes Pilot Project
title_short Sequence Analysis and Characterization of Active Human Alu Subfamilies Based on the 1000 Genomes Pilot Project
title_sort sequence analysis and characterization of active human alu subfamilies based on the 1000 genomes pilot project
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4607524/
https://www.ncbi.nlm.nih.gov/pubmed/26319576
http://dx.doi.org/10.1093/gbe/evv167
work_keys_str_mv AT konkelmiriamk sequenceanalysisandcharacterizationofactivehumanalusubfamiliesbasedonthe1000genomespilotproject
AT walkerjerilyna sequenceanalysisandcharacterizationofactivehumanalusubfamiliesbasedonthe1000genomespilotproject
AT hotardashleyb sequenceanalysisandcharacterizationofactivehumanalusubfamiliesbasedonthe1000genomespilotproject
AT ranckmeganc sequenceanalysisandcharacterizationofactivehumanalusubfamiliesbasedonthe1000genomespilotproject
AT fontenotcatherinec sequenceanalysisandcharacterizationofactivehumanalusubfamiliesbasedonthe1000genomespilotproject
AT storerjessica sequenceanalysisandcharacterizationofactivehumanalusubfamiliesbasedonthe1000genomespilotproject
AT stewartchip sequenceanalysisandcharacterizationofactivehumanalusubfamiliesbasedonthe1000genomespilotproject
AT marthgabort sequenceanalysisandcharacterizationofactivehumanalusubfamiliesbasedonthe1000genomespilotproject
AT sequenceanalysisandcharacterizationofactivehumanalusubfamiliesbasedonthe1000genomespilotproject
AT batzermarka sequenceanalysisandcharacterizationofactivehumanalusubfamiliesbasedonthe1000genomespilotproject