Cargando…

Fast score test with global null estimation regardless of missing genotypes

In genome-wide association studies (GWASs) for binary traits (or case-control samples) in the presence of covariates to be adjusted for, researchers often use a logistic regression model to test variants for disease association. Popular tests include Wald, likelihood ratio, and score tests. For like...

Descripción completa

Detalles Bibliográficos
Autores principales: Sato, Shuntaro, Ueki, Masao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6033421/
https://www.ncbi.nlm.nih.gov/pubmed/29975732
http://dx.doi.org/10.1371/journal.pone.0199692
_version_ 1783337700412096512
author Sato, Shuntaro
Ueki, Masao
author_facet Sato, Shuntaro
Ueki, Masao
author_sort Sato, Shuntaro
collection PubMed
description In genome-wide association studies (GWASs) for binary traits (or case-control samples) in the presence of covariates to be adjusted for, researchers often use a logistic regression model to test variants for disease association. Popular tests include Wald, likelihood ratio, and score tests. For likelihood ratio test and Wald test, maximum likelihood estimation (MLE), which requires iterative procedure, must be computed for each single nucleotide polymorphism (SNP). In contrast, the score test only requires MLE under the null model, being lower in computational cost than other tests. Usually, genotype data include missing genotypes because of assay failures. It loses computational efficiency in the conventional score test (CST), which requires null estimation by excluding individuals with missing genotype for each SNP. In this study, we propose two new score tests, called PM1 and PM2, that use a single global null estimator for all SNPs regardless of missing genotypes, thereby enabling faster computation than CST. We prove that PM2 and CST have an equivalent asymptotic power and that the power of PM1 is asymptotically lower than that of PM2. We evaluate the performance of the proposed methods in terms of type I error rates and power by simulation studies and application to real GWAS data provided by the Alzheimer’s Disease Neuroimaging Initiative (ADNI), confirming our theoretical results. ADNI-GWAS application demonstrated that the proposed score tests improve computational speed about 6–18 times faster than the existing tests, CST, Wald tests and likelihood ratio tests. Our score tests are general and applicable to other regression models.
format Online
Article
Text
id pubmed-6033421
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-60334212018-07-19 Fast score test with global null estimation regardless of missing genotypes Sato, Shuntaro Ueki, Masao PLoS One Research Article In genome-wide association studies (GWASs) for binary traits (or case-control samples) in the presence of covariates to be adjusted for, researchers often use a logistic regression model to test variants for disease association. Popular tests include Wald, likelihood ratio, and score tests. For likelihood ratio test and Wald test, maximum likelihood estimation (MLE), which requires iterative procedure, must be computed for each single nucleotide polymorphism (SNP). In contrast, the score test only requires MLE under the null model, being lower in computational cost than other tests. Usually, genotype data include missing genotypes because of assay failures. It loses computational efficiency in the conventional score test (CST), which requires null estimation by excluding individuals with missing genotype for each SNP. In this study, we propose two new score tests, called PM1 and PM2, that use a single global null estimator for all SNPs regardless of missing genotypes, thereby enabling faster computation than CST. We prove that PM2 and CST have an equivalent asymptotic power and that the power of PM1 is asymptotically lower than that of PM2. We evaluate the performance of the proposed methods in terms of type I error rates and power by simulation studies and application to real GWAS data provided by the Alzheimer’s Disease Neuroimaging Initiative (ADNI), confirming our theoretical results. ADNI-GWAS application demonstrated that the proposed score tests improve computational speed about 6–18 times faster than the existing tests, CST, Wald tests and likelihood ratio tests. Our score tests are general and applicable to other regression models. Public Library of Science 2018-07-05 /pmc/articles/PMC6033421/ /pubmed/29975732 http://dx.doi.org/10.1371/journal.pone.0199692 Text en © 2018 Sato et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Sato, Shuntaro
Ueki, Masao
Fast score test with global null estimation regardless of missing genotypes
title Fast score test with global null estimation regardless of missing genotypes
title_full Fast score test with global null estimation regardless of missing genotypes
title_fullStr Fast score test with global null estimation regardless of missing genotypes
title_full_unstemmed Fast score test with global null estimation regardless of missing genotypes
title_short Fast score test with global null estimation regardless of missing genotypes
title_sort fast score test with global null estimation regardless of missing genotypes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6033421/
https://www.ncbi.nlm.nih.gov/pubmed/29975732
http://dx.doi.org/10.1371/journal.pone.0199692
work_keys_str_mv AT satoshuntaro fastscoretestwithglobalnullestimationregardlessofmissinggenotypes
AT uekimasao fastscoretestwithglobalnullestimationregardlessofmissinggenotypes
AT fastscoretestwithglobalnullestimationregardlessofmissinggenotypes