Cargando…
Hardy-Weinberg Equilibrium in the Large Scale Genomic Sequencing Era
Hardy-Weinberg Equilibrium (HWE) is used to estimate the number of homozygous and heterozygous variant carriers based on its allele frequency in populations that are not evolving. Deviations from HWE in large population databases have been used to detect genotyping errors, which can result in extrem...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7083100/ https://www.ncbi.nlm.nih.gov/pubmed/32231685 http://dx.doi.org/10.3389/fgene.2020.00210 |
_version_ | 1783508474547666944 |
---|---|
author | Abramovs, Nikita Brass, Andrew Tassabehji, May |
author_facet | Abramovs, Nikita Brass, Andrew Tassabehji, May |
author_sort | Abramovs, Nikita |
collection | PubMed |
description | Hardy-Weinberg Equilibrium (HWE) is used to estimate the number of homozygous and heterozygous variant carriers based on its allele frequency in populations that are not evolving. Deviations from HWE in large population databases have been used to detect genotyping errors, which can result in extreme heterozygote excess (HetExc). However, HetExc might also be a sign of natural selection since recessive disease causing variants should occur less frequently in a homozygous state in the population, but may reach high allele frequency in a heterozygous state, especially if they are advantageous. We developed a filtering strategy to detect these variants and applied it on genome data from 137,842 individuals. The main limitations of this approach were quality of genotype calls and insufficient population sizes, whereas population structure and inbreeding can reduce sensitivity, but not precision, in certain populations. Nevertheless, we identified 161 HetExc variants in 149 genes, most of which were specific to African/African American populations (∼79.5%). Although the majority of them were not associated with known diseases, or were classified as clinically “benign,” they were enriched in genes associated with autosomal recessive diseases. The resulting dataset also contained two known recessive disease causing variants with evidence of heterozygote advantage in the sickle-cell anemia (HBB) and cystic fibrosis (CFTR). Finally, we provide supporting in silico evidence of a novel heterozygote advantageous variant in the chromodomain helicase DNA binding protein 6 gene (CHD6; involved in influenza virus replication). We anticipate that our approach will aid the detection of rare recessive disease causing variants in the future. |
format | Online Article Text |
id | pubmed-7083100 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-70831002020-03-30 Hardy-Weinberg Equilibrium in the Large Scale Genomic Sequencing Era Abramovs, Nikita Brass, Andrew Tassabehji, May Front Genet Genetics Hardy-Weinberg Equilibrium (HWE) is used to estimate the number of homozygous and heterozygous variant carriers based on its allele frequency in populations that are not evolving. Deviations from HWE in large population databases have been used to detect genotyping errors, which can result in extreme heterozygote excess (HetExc). However, HetExc might also be a sign of natural selection since recessive disease causing variants should occur less frequently in a homozygous state in the population, but may reach high allele frequency in a heterozygous state, especially if they are advantageous. We developed a filtering strategy to detect these variants and applied it on genome data from 137,842 individuals. The main limitations of this approach were quality of genotype calls and insufficient population sizes, whereas population structure and inbreeding can reduce sensitivity, but not precision, in certain populations. Nevertheless, we identified 161 HetExc variants in 149 genes, most of which were specific to African/African American populations (∼79.5%). Although the majority of them were not associated with known diseases, or were classified as clinically “benign,” they were enriched in genes associated with autosomal recessive diseases. The resulting dataset also contained two known recessive disease causing variants with evidence of heterozygote advantage in the sickle-cell anemia (HBB) and cystic fibrosis (CFTR). Finally, we provide supporting in silico evidence of a novel heterozygote advantageous variant in the chromodomain helicase DNA binding protein 6 gene (CHD6; involved in influenza virus replication). We anticipate that our approach will aid the detection of rare recessive disease causing variants in the future. Frontiers Media S.A. 2020-03-13 /pmc/articles/PMC7083100/ /pubmed/32231685 http://dx.doi.org/10.3389/fgene.2020.00210 Text en Copyright © 2020 Abramovs, Brass and Tassabehji. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Abramovs, Nikita Brass, Andrew Tassabehji, May Hardy-Weinberg Equilibrium in the Large Scale Genomic Sequencing Era |
title | Hardy-Weinberg Equilibrium in the Large Scale Genomic Sequencing Era |
title_full | Hardy-Weinberg Equilibrium in the Large Scale Genomic Sequencing Era |
title_fullStr | Hardy-Weinberg Equilibrium in the Large Scale Genomic Sequencing Era |
title_full_unstemmed | Hardy-Weinberg Equilibrium in the Large Scale Genomic Sequencing Era |
title_short | Hardy-Weinberg Equilibrium in the Large Scale Genomic Sequencing Era |
title_sort | hardy-weinberg equilibrium in the large scale genomic sequencing era |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7083100/ https://www.ncbi.nlm.nih.gov/pubmed/32231685 http://dx.doi.org/10.3389/fgene.2020.00210 |
work_keys_str_mv | AT abramovsnikita hardyweinbergequilibriuminthelargescalegenomicsequencingera AT brassandrew hardyweinbergequilibriuminthelargescalegenomicsequencingera AT tassabehjimay hardyweinbergequilibriuminthelargescalegenomicsequencingera |