Cargando…
Multiple linear combination (MLC) regression tests for common variants adapted to linkage disequilibrium structure
By jointly analyzing multiple variants within a gene, instead of one at a time, gene‐based multiple regression can improve power, robustness, and interpretation in genetic association analysis. We investigate multiple linear combination (MLC) test statistics for analysis of common variants under rea...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley and Sons Inc.
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5245123/ https://www.ncbi.nlm.nih.gov/pubmed/27885705 http://dx.doi.org/10.1002/gepi.22024 |
_version_ | 1782496772848877568 |
---|---|
author | Yoo, Yun Joo Sun, Lei Poirier, Julia G. Paterson, Andrew D. Bull, Shelley B. |
author_facet | Yoo, Yun Joo Sun, Lei Poirier, Julia G. Paterson, Andrew D. Bull, Shelley B. |
author_sort | Yoo, Yun Joo |
collection | PubMed |
description | By jointly analyzing multiple variants within a gene, instead of one at a time, gene‐based multiple regression can improve power, robustness, and interpretation in genetic association analysis. We investigate multiple linear combination (MLC) test statistics for analysis of common variants under realistic trait models with linkage disequilibrium (LD) based on HapMap Asian haplotypes. MLC is a directional test that exploits LD structure in a gene to construct clusters of closely correlated variants recoded such that the majority of pairwise correlations are positive. It combines variant effects within the same cluster linearly, and aggregates cluster‐specific effects in a quadratic sum of squares and cross‐products, producing a test statistic with reduced degrees of freedom (df) equal to the number of clusters. By simulation studies of 1000 genes from across the genome, we demonstrate that MLC is a well‐powered and robust choice among existing methods across a broad range of gene structures. Compared to minimum P‐value, variance‐component, and principal‐component methods, the mean power of MLC is never much lower than that of other methods, and can be higher, particularly with multiple causal variants. Moreover, the variation in gene‐specific MLC test size and power across 1000 genes is less than that of other methods, suggesting it is a complementary approach for discovery in genome‐wide analysis. The cluster construction of the MLC test statistics helps reveal within‐gene LD structure, allowing interpretation of clustered variants as haplotypic effects, while multiple regression helps to distinguish direct and indirect associations. |
format | Online Article Text |
id | pubmed-5245123 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | John Wiley and Sons Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-52451232017-02-01 Multiple linear combination (MLC) regression tests for common variants adapted to linkage disequilibrium structure Yoo, Yun Joo Sun, Lei Poirier, Julia G. Paterson, Andrew D. Bull, Shelley B. Genet Epidemiol Research Articles By jointly analyzing multiple variants within a gene, instead of one at a time, gene‐based multiple regression can improve power, robustness, and interpretation in genetic association analysis. We investigate multiple linear combination (MLC) test statistics for analysis of common variants under realistic trait models with linkage disequilibrium (LD) based on HapMap Asian haplotypes. MLC is a directional test that exploits LD structure in a gene to construct clusters of closely correlated variants recoded such that the majority of pairwise correlations are positive. It combines variant effects within the same cluster linearly, and aggregates cluster‐specific effects in a quadratic sum of squares and cross‐products, producing a test statistic with reduced degrees of freedom (df) equal to the number of clusters. By simulation studies of 1000 genes from across the genome, we demonstrate that MLC is a well‐powered and robust choice among existing methods across a broad range of gene structures. Compared to minimum P‐value, variance‐component, and principal‐component methods, the mean power of MLC is never much lower than that of other methods, and can be higher, particularly with multiple causal variants. Moreover, the variation in gene‐specific MLC test size and power across 1000 genes is less than that of other methods, suggesting it is a complementary approach for discovery in genome‐wide analysis. The cluster construction of the MLC test statistics helps reveal within‐gene LD structure, allowing interpretation of clustered variants as haplotypic effects, while multiple regression helps to distinguish direct and indirect associations. John Wiley and Sons Inc. 2016-11-25 2017-02 /pmc/articles/PMC5245123/ /pubmed/27885705 http://dx.doi.org/10.1002/gepi.22024 Text en © 2016 The Authors Genetic Epidemiology Published by Wiley Periodicals, Inc. This is an open access article under the terms of the Creative Commons Attribution‐NonCommercial (http://creativecommons.org/licenses/by-nc/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes. |
spellingShingle | Research Articles Yoo, Yun Joo Sun, Lei Poirier, Julia G. Paterson, Andrew D. Bull, Shelley B. Multiple linear combination (MLC) regression tests for common variants adapted to linkage disequilibrium structure |
title | Multiple linear combination (MLC) regression tests for common variants adapted to linkage disequilibrium structure |
title_full | Multiple linear combination (MLC) regression tests for common variants adapted to linkage disequilibrium structure |
title_fullStr | Multiple linear combination (MLC) regression tests for common variants adapted to linkage disequilibrium structure |
title_full_unstemmed | Multiple linear combination (MLC) regression tests for common variants adapted to linkage disequilibrium structure |
title_short | Multiple linear combination (MLC) regression tests for common variants adapted to linkage disequilibrium structure |
title_sort | multiple linear combination (mlc) regression tests for common variants adapted to linkage disequilibrium structure |
topic | Research Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5245123/ https://www.ncbi.nlm.nih.gov/pubmed/27885705 http://dx.doi.org/10.1002/gepi.22024 |
work_keys_str_mv | AT yooyunjoo multiplelinearcombinationmlcregressiontestsforcommonvariantsadaptedtolinkagedisequilibriumstructure AT sunlei multiplelinearcombinationmlcregressiontestsforcommonvariantsadaptedtolinkagedisequilibriumstructure AT poirierjuliag multiplelinearcombinationmlcregressiontestsforcommonvariantsadaptedtolinkagedisequilibriumstructure AT patersonandrewd multiplelinearcombinationmlcregressiontestsforcommonvariantsadaptedtolinkagedisequilibriumstructure AT bullshelleyb multiplelinearcombinationmlcregressiontestsforcommonvariantsadaptedtolinkagedisequilibriumstructure |