Cargando…

Elevated variant density around SV breakpoints in germline lineage lends support to error-prone replication hypothesis

Copy number variants (CNVs) are a class of structural variants that may involve complex genomic rearrangements (CGRs) and are hypothesized to have additional mutations around their breakpoints. Understanding the mechanisms underlying CNV formation is fundamental for understanding the repair and muta...

Descripción completa

Detalles Bibliográficos
Autores principales: Dhokarh, Dhananjay, Abyzov, Alexej
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4937565/
https://www.ncbi.nlm.nih.gov/pubmed/27216746
http://dx.doi.org/10.1101/gr.205484.116
_version_ 1782441729735000064
author Dhokarh, Dhananjay
Abyzov, Alexej
author_facet Dhokarh, Dhananjay
Abyzov, Alexej
author_sort Dhokarh, Dhananjay
collection PubMed
description Copy number variants (CNVs) are a class of structural variants that may involve complex genomic rearrangements (CGRs) and are hypothesized to have additional mutations around their breakpoints. Understanding the mechanisms underlying CNV formation is fundamental for understanding the repair and mutation mechanisms in cells, thereby shedding light on evolution, genomic disorders, cancer, and complex human traits. In this study, we used data from the 1000 Genomes Project to analyze hundreds of loci harboring heterozygous germline deletions in the subjects NA12878 and NA19240. By utilizing synthetic long-read data (longer than 2 kbp) in combination with high coverage short-read data and, in parallel, by comparing with parental genomes, we interrogated the phasing of these deletions with the flanking tens of thousands of heterozygous SNPs and indels. We found that the density of SNPs/indels flanking the breakpoints of deletions (in-phase variants) is approximately twice as high as the corresponding density for the variants on the haplotype without deletion (out-of-phase variants). This fold change was even larger for the subset of deletions with signatures of replication-based mechanism of formation. The allele frequency (AF) spectrum for deletions is enriched for rare events; and the AF spectrum for in-phase SNPs is shifted toward this deletion spectrum, thus offering evidence consistent with the concomitance of the in-phase SNPs/indels with the deletion events. These findings therefore lend support to the hypothesis that the mutational mechanisms underlying CNV formation are error prone. Our results could also be relevant for resolving mutation-rate discrepancies in human and to explain kataegis.
format Online
Article
Text
id pubmed-4937565
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-49375652017-01-01 Elevated variant density around SV breakpoints in germline lineage lends support to error-prone replication hypothesis Dhokarh, Dhananjay Abyzov, Alexej Genome Res Research Copy number variants (CNVs) are a class of structural variants that may involve complex genomic rearrangements (CGRs) and are hypothesized to have additional mutations around their breakpoints. Understanding the mechanisms underlying CNV formation is fundamental for understanding the repair and mutation mechanisms in cells, thereby shedding light on evolution, genomic disorders, cancer, and complex human traits. In this study, we used data from the 1000 Genomes Project to analyze hundreds of loci harboring heterozygous germline deletions in the subjects NA12878 and NA19240. By utilizing synthetic long-read data (longer than 2 kbp) in combination with high coverage short-read data and, in parallel, by comparing with parental genomes, we interrogated the phasing of these deletions with the flanking tens of thousands of heterozygous SNPs and indels. We found that the density of SNPs/indels flanking the breakpoints of deletions (in-phase variants) is approximately twice as high as the corresponding density for the variants on the haplotype without deletion (out-of-phase variants). This fold change was even larger for the subset of deletions with signatures of replication-based mechanism of formation. The allele frequency (AF) spectrum for deletions is enriched for rare events; and the AF spectrum for in-phase SNPs is shifted toward this deletion spectrum, thus offering evidence consistent with the concomitance of the in-phase SNPs/indels with the deletion events. These findings therefore lend support to the hypothesis that the mutational mechanisms underlying CNV formation are error prone. Our results could also be relevant for resolving mutation-rate discrepancies in human and to explain kataegis. Cold Spring Harbor Laboratory Press 2016-07 /pmc/articles/PMC4937565/ /pubmed/27216746 http://dx.doi.org/10.1101/gr.205484.116 Text en © 2016 Dhokarh and Abyzov; Published by Cold Spring Harbor Laboratory Press http://creativecommons.org/licenses/by-nc/4.0/ This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.
spellingShingle Research
Dhokarh, Dhananjay
Abyzov, Alexej
Elevated variant density around SV breakpoints in germline lineage lends support to error-prone replication hypothesis
title Elevated variant density around SV breakpoints in germline lineage lends support to error-prone replication hypothesis
title_full Elevated variant density around SV breakpoints in germline lineage lends support to error-prone replication hypothesis
title_fullStr Elevated variant density around SV breakpoints in germline lineage lends support to error-prone replication hypothesis
title_full_unstemmed Elevated variant density around SV breakpoints in germline lineage lends support to error-prone replication hypothesis
title_short Elevated variant density around SV breakpoints in germline lineage lends support to error-prone replication hypothesis
title_sort elevated variant density around sv breakpoints in germline lineage lends support to error-prone replication hypothesis
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4937565/
https://www.ncbi.nlm.nih.gov/pubmed/27216746
http://dx.doi.org/10.1101/gr.205484.116
work_keys_str_mv AT dhokarhdhananjay elevatedvariantdensityaroundsvbreakpointsingermlinelineagelendssupporttoerrorpronereplicationhypothesis
AT abyzovalexej elevatedvariantdensityaroundsvbreakpointsingermlinelineagelendssupporttoerrorpronereplicationhypothesis