Cargando…

Major sex differences in allele frequencies for X chromosomal variants in both the 1000 Genomes Project and gnomAD

An unexpectedly high proportion of SNPs on the X chromosome in the 1000 Genomes Project phase 3 data were identified with significant sex differences in minor allele frequencies (sdMAF). sdMAF persisted for many of these SNPs in the recently released high coverage whole genome sequence of the 1000 G...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Zhong, Sun, Lei, Paterson, Andrew D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9187127/
https://www.ncbi.nlm.nih.gov/pubmed/35639794
http://dx.doi.org/10.1371/journal.pgen.1010231
_version_ 1784725101968621568
author Wang, Zhong
Sun, Lei
Paterson, Andrew D.
author_facet Wang, Zhong
Sun, Lei
Paterson, Andrew D.
author_sort Wang, Zhong
collection PubMed
description An unexpectedly high proportion of SNPs on the X chromosome in the 1000 Genomes Project phase 3 data were identified with significant sex differences in minor allele frequencies (sdMAF). sdMAF persisted for many of these SNPs in the recently released high coverage whole genome sequence of the 1000 Genomes Project that was aligned to GRCh38, and it was consistent between the five super-populations. Among the 245,825 common (MAF>5%) biallelic X-chromosomal SNPs in the phase 3 data presumed to be of high quality, 2,039 have genome-wide significant sdMAF (p-value <5e-8). sdMAF varied by location: non-pseudo-autosomal region (NPR) = 0.83%, pseudo-autosomal regions (PAR1) = 0.29%, PAR2 = 13.1%, and X-transposed region (XTR)/PAR3 = 0.85% of SNPs had sdMAF, and they were clustered at the NPR-PAR boundaries, among others. sdMAF at the NPR-PAR boundaries are biologically expected due to sex-linkage, but have generally been ignored in association studies. For comparison, similar analyses found only 6, 1 and 0 SNPs with significant sdMAF on chromosomes 1, 7 and 22, respectively. Similar sdMAF results for the X chromosome were obtained from the high coverage whole genome sequence data from gnomAD V 3.1.2 for both the non-Finnish European and African/African American samples. Future X chromosome analyses need to take sdMAF into account.
format Online
Article
Text
id pubmed-9187127
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-91871272022-06-11 Major sex differences in allele frequencies for X chromosomal variants in both the 1000 Genomes Project and gnomAD Wang, Zhong Sun, Lei Paterson, Andrew D. PLoS Genet Research Article An unexpectedly high proportion of SNPs on the X chromosome in the 1000 Genomes Project phase 3 data were identified with significant sex differences in minor allele frequencies (sdMAF). sdMAF persisted for many of these SNPs in the recently released high coverage whole genome sequence of the 1000 Genomes Project that was aligned to GRCh38, and it was consistent between the five super-populations. Among the 245,825 common (MAF>5%) biallelic X-chromosomal SNPs in the phase 3 data presumed to be of high quality, 2,039 have genome-wide significant sdMAF (p-value <5e-8). sdMAF varied by location: non-pseudo-autosomal region (NPR) = 0.83%, pseudo-autosomal regions (PAR1) = 0.29%, PAR2 = 13.1%, and X-transposed region (XTR)/PAR3 = 0.85% of SNPs had sdMAF, and they were clustered at the NPR-PAR boundaries, among others. sdMAF at the NPR-PAR boundaries are biologically expected due to sex-linkage, but have generally been ignored in association studies. For comparison, similar analyses found only 6, 1 and 0 SNPs with significant sdMAF on chromosomes 1, 7 and 22, respectively. Similar sdMAF results for the X chromosome were obtained from the high coverage whole genome sequence data from gnomAD V 3.1.2 for both the non-Finnish European and African/African American samples. Future X chromosome analyses need to take sdMAF into account. Public Library of Science 2022-05-31 /pmc/articles/PMC9187127/ /pubmed/35639794 http://dx.doi.org/10.1371/journal.pgen.1010231 Text en © 2022 Wang et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Wang, Zhong
Sun, Lei
Paterson, Andrew D.
Major sex differences in allele frequencies for X chromosomal variants in both the 1000 Genomes Project and gnomAD
title Major sex differences in allele frequencies for X chromosomal variants in both the 1000 Genomes Project and gnomAD
title_full Major sex differences in allele frequencies for X chromosomal variants in both the 1000 Genomes Project and gnomAD
title_fullStr Major sex differences in allele frequencies for X chromosomal variants in both the 1000 Genomes Project and gnomAD
title_full_unstemmed Major sex differences in allele frequencies for X chromosomal variants in both the 1000 Genomes Project and gnomAD
title_short Major sex differences in allele frequencies for X chromosomal variants in both the 1000 Genomes Project and gnomAD
title_sort major sex differences in allele frequencies for x chromosomal variants in both the 1000 genomes project and gnomad
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9187127/
https://www.ncbi.nlm.nih.gov/pubmed/35639794
http://dx.doi.org/10.1371/journal.pgen.1010231
work_keys_str_mv AT wangzhong majorsexdifferencesinallelefrequenciesforxchromosomalvariantsinboththe1000genomesprojectandgnomad
AT sunlei majorsexdifferencesinallelefrequenciesforxchromosomalvariantsinboththe1000genomesprojectandgnomad
AT patersonandrewd majorsexdifferencesinallelefrequenciesforxchromosomalvariantsinboththe1000genomesprojectandgnomad