Cargando…

Multiple linear combination (MLC) regression tests for common variants adapted to linkage disequilibrium structure

By jointly analyzing multiple variants within a gene, instead of one at a time, gene‐based multiple regression can improve power, robustness, and interpretation in genetic association analysis. We investigate multiple linear combination (MLC) test statistics for analysis of common variants under rea...

Descripción completa

Detalles Bibliográficos
Autores principales: Yoo, Yun Joo, Sun, Lei, Poirier, Julia G., Paterson, Andrew D., Bull, Shelley B.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5245123/
https://www.ncbi.nlm.nih.gov/pubmed/27885705
http://dx.doi.org/10.1002/gepi.22024
_version_ 1782496772848877568
author Yoo, Yun Joo
Sun, Lei
Poirier, Julia G.
Paterson, Andrew D.
Bull, Shelley B.
author_facet Yoo, Yun Joo
Sun, Lei
Poirier, Julia G.
Paterson, Andrew D.
Bull, Shelley B.
author_sort Yoo, Yun Joo
collection PubMed
description By jointly analyzing multiple variants within a gene, instead of one at a time, gene‐based multiple regression can improve power, robustness, and interpretation in genetic association analysis. We investigate multiple linear combination (MLC) test statistics for analysis of common variants under realistic trait models with linkage disequilibrium (LD) based on HapMap Asian haplotypes. MLC is a directional test that exploits LD structure in a gene to construct clusters of closely correlated variants recoded such that the majority of pairwise correlations are positive. It combines variant effects within the same cluster linearly, and aggregates cluster‐specific effects in a quadratic sum of squares and cross‐products, producing a test statistic with reduced degrees of freedom (df) equal to the number of clusters. By simulation studies of 1000 genes from across the genome, we demonstrate that MLC is a well‐powered and robust choice among existing methods across a broad range of gene structures. Compared to minimum P‐value, variance‐component, and principal‐component methods, the mean power of MLC is never much lower than that of other methods, and can be higher, particularly with multiple causal variants. Moreover, the variation in gene‐specific MLC test size and power across 1000 genes is less than that of other methods, suggesting it is a complementary approach for discovery in genome‐wide analysis. The cluster construction of the MLC test statistics helps reveal within‐gene LD structure, allowing interpretation of clustered variants as haplotypic effects, while multiple regression helps to distinguish direct and indirect associations.
format Online
Article
Text
id pubmed-5245123
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-52451232017-02-01 Multiple linear combination (MLC) regression tests for common variants adapted to linkage disequilibrium structure Yoo, Yun Joo Sun, Lei Poirier, Julia G. Paterson, Andrew D. Bull, Shelley B. Genet Epidemiol Research Articles By jointly analyzing multiple variants within a gene, instead of one at a time, gene‐based multiple regression can improve power, robustness, and interpretation in genetic association analysis. We investigate multiple linear combination (MLC) test statistics for analysis of common variants under realistic trait models with linkage disequilibrium (LD) based on HapMap Asian haplotypes. MLC is a directional test that exploits LD structure in a gene to construct clusters of closely correlated variants recoded such that the majority of pairwise correlations are positive. It combines variant effects within the same cluster linearly, and aggregates cluster‐specific effects in a quadratic sum of squares and cross‐products, producing a test statistic with reduced degrees of freedom (df) equal to the number of clusters. By simulation studies of 1000 genes from across the genome, we demonstrate that MLC is a well‐powered and robust choice among existing methods across a broad range of gene structures. Compared to minimum P‐value, variance‐component, and principal‐component methods, the mean power of MLC is never much lower than that of other methods, and can be higher, particularly with multiple causal variants. Moreover, the variation in gene‐specific MLC test size and power across 1000 genes is less than that of other methods, suggesting it is a complementary approach for discovery in genome‐wide analysis. The cluster construction of the MLC test statistics helps reveal within‐gene LD structure, allowing interpretation of clustered variants as haplotypic effects, while multiple regression helps to distinguish direct and indirect associations. John Wiley and Sons Inc. 2016-11-25 2017-02 /pmc/articles/PMC5245123/ /pubmed/27885705 http://dx.doi.org/10.1002/gepi.22024 Text en © 2016 The Authors Genetic Epidemiology Published by Wiley Periodicals, Inc. This is an open access article under the terms of the Creative Commons Attribution‐NonCommercial (http://creativecommons.org/licenses/by-nc/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes.
spellingShingle Research Articles
Yoo, Yun Joo
Sun, Lei
Poirier, Julia G.
Paterson, Andrew D.
Bull, Shelley B.
Multiple linear combination (MLC) regression tests for common variants adapted to linkage disequilibrium structure
title Multiple linear combination (MLC) regression tests for common variants adapted to linkage disequilibrium structure
title_full Multiple linear combination (MLC) regression tests for common variants adapted to linkage disequilibrium structure
title_fullStr Multiple linear combination (MLC) regression tests for common variants adapted to linkage disequilibrium structure
title_full_unstemmed Multiple linear combination (MLC) regression tests for common variants adapted to linkage disequilibrium structure
title_short Multiple linear combination (MLC) regression tests for common variants adapted to linkage disequilibrium structure
title_sort multiple linear combination (mlc) regression tests for common variants adapted to linkage disequilibrium structure
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5245123/
https://www.ncbi.nlm.nih.gov/pubmed/27885705
http://dx.doi.org/10.1002/gepi.22024
work_keys_str_mv AT yooyunjoo multiplelinearcombinationmlcregressiontestsforcommonvariantsadaptedtolinkagedisequilibriumstructure
AT sunlei multiplelinearcombinationmlcregressiontestsforcommonvariantsadaptedtolinkagedisequilibriumstructure
AT poirierjuliag multiplelinearcombinationmlcregressiontestsforcommonvariantsadaptedtolinkagedisequilibriumstructure
AT patersonandrewd multiplelinearcombinationmlcregressiontestsforcommonvariantsadaptedtolinkagedisequilibriumstructure
AT bullshelleyb multiplelinearcombinationmlcregressiontestsforcommonvariantsadaptedtolinkagedisequilibriumstructure