Cargando…

An evaluation of approaches for rare variant association analyses of binary traits in related samples

Recognizing that family data provide unique advantage of identifying rare risk variants in genetic association studies, many cohorts with related samples have gone through whole genome sequencing in large initiatives such as the NHLBI Trans-Omics for Precision Medicine (TOPMed) program. Analyzing ra...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Ming-Huei, Pitsillides, Achilleas, Yang, Qiong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7862354/
https://www.ncbi.nlm.nih.gov/pubmed/33542345
http://dx.doi.org/10.1038/s41598-021-82547-z
_version_ 1783647272638087168
author Chen, Ming-Huei
Pitsillides, Achilleas
Yang, Qiong
author_facet Chen, Ming-Huei
Pitsillides, Achilleas
Yang, Qiong
author_sort Chen, Ming-Huei
collection PubMed
description Recognizing that family data provide unique advantage of identifying rare risk variants in genetic association studies, many cohorts with related samples have gone through whole genome sequencing in large initiatives such as the NHLBI Trans-Omics for Precision Medicine (TOPMed) program. Analyzing rare variants poses challenges for binary traits in that some genotype categories may have few or no observed events, causing bias and inflation in commonly used methods. Several methods have recently been proposed to better handle rare variants while accounting for family relationship, but their performances have not been thoroughly evaluated together. Here we compare several existing approaches including SAIGE but not limited to related samples using simulations based on the Framingham Heart Study samples and genotype data from Illumina HumanExome BeadChip where rare variants are the majority. We found that logistic regression with likelihood ratio test applied to related samples was the only approach that did not have inflated type I error rates in both single variant test (SVT) and gene-based tests, followed by Firth logistic regression that had inflation in its direction insensitive gene-based test at prevalence 0.01 only, applied to either related or unrelated samples, though theoretically logistic regression and Firth logistic regression do not account for relatedness in samples. SAIGE had inflation in SVT at prevalence 0.1 or lower and the inflation was eliminated with a minor allele count filter of 5. As for power, there was no approach that outperformed others consistently among all single variant tests and gene-based tests.
format Online
Article
Text
id pubmed-7862354
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-78623542021-02-05 An evaluation of approaches for rare variant association analyses of binary traits in related samples Chen, Ming-Huei Pitsillides, Achilleas Yang, Qiong Sci Rep Article Recognizing that family data provide unique advantage of identifying rare risk variants in genetic association studies, many cohorts with related samples have gone through whole genome sequencing in large initiatives such as the NHLBI Trans-Omics for Precision Medicine (TOPMed) program. Analyzing rare variants poses challenges for binary traits in that some genotype categories may have few or no observed events, causing bias and inflation in commonly used methods. Several methods have recently been proposed to better handle rare variants while accounting for family relationship, but their performances have not been thoroughly evaluated together. Here we compare several existing approaches including SAIGE but not limited to related samples using simulations based on the Framingham Heart Study samples and genotype data from Illumina HumanExome BeadChip where rare variants are the majority. We found that logistic regression with likelihood ratio test applied to related samples was the only approach that did not have inflated type I error rates in both single variant test (SVT) and gene-based tests, followed by Firth logistic regression that had inflation in its direction insensitive gene-based test at prevalence 0.01 only, applied to either related or unrelated samples, though theoretically logistic regression and Firth logistic regression do not account for relatedness in samples. SAIGE had inflation in SVT at prevalence 0.1 or lower and the inflation was eliminated with a minor allele count filter of 5. As for power, there was no approach that outperformed others consistently among all single variant tests and gene-based tests. Nature Publishing Group UK 2021-02-04 /pmc/articles/PMC7862354/ /pubmed/33542345 http://dx.doi.org/10.1038/s41598-021-82547-z Text en © This is a U.S. Government work and not under copyright protection in the US; foreign copyright protection may apply 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Chen, Ming-Huei
Pitsillides, Achilleas
Yang, Qiong
An evaluation of approaches for rare variant association analyses of binary traits in related samples
title An evaluation of approaches for rare variant association analyses of binary traits in related samples
title_full An evaluation of approaches for rare variant association analyses of binary traits in related samples
title_fullStr An evaluation of approaches for rare variant association analyses of binary traits in related samples
title_full_unstemmed An evaluation of approaches for rare variant association analyses of binary traits in related samples
title_short An evaluation of approaches for rare variant association analyses of binary traits in related samples
title_sort evaluation of approaches for rare variant association analyses of binary traits in related samples
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7862354/
https://www.ncbi.nlm.nih.gov/pubmed/33542345
http://dx.doi.org/10.1038/s41598-021-82547-z
work_keys_str_mv AT chenminghuei anevaluationofapproachesforrarevariantassociationanalysesofbinarytraitsinrelatedsamples
AT pitsillidesachilleas anevaluationofapproachesforrarevariantassociationanalysesofbinarytraitsinrelatedsamples
AT yangqiong anevaluationofapproachesforrarevariantassociationanalysesofbinarytraitsinrelatedsamples
AT chenminghuei evaluationofapproachesforrarevariantassociationanalysesofbinarytraitsinrelatedsamples
AT pitsillidesachilleas evaluationofapproachesforrarevariantassociationanalysesofbinarytraitsinrelatedsamples
AT yangqiong evaluationofapproachesforrarevariantassociationanalysesofbinarytraitsinrelatedsamples