Cargando…
Comparison of Methods to Account for Relatedness in Genome-Wide Association Studies with Family-Based Data
Approaches based on linear mixed models (LMMs) have recently gained popularity for modelling population substructure and relatedness in genome-wide association studies. In the last few years, a bewildering variety of different LMM methods/software packages have been developed, but it is not always c...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4102448/ https://www.ncbi.nlm.nih.gov/pubmed/25033443 http://dx.doi.org/10.1371/journal.pgen.1004445 |
_version_ | 1782481038690222080 |
---|---|
author | Eu-ahsunthornwattana, Jakris Miller, E. Nancy Fakiola, Michaela Jeronimo, Selma M. B. Blackwell, Jenefer M. Cordell, Heather J. |
author_facet | Eu-ahsunthornwattana, Jakris Miller, E. Nancy Fakiola, Michaela Jeronimo, Selma M. B. Blackwell, Jenefer M. Cordell, Heather J. |
author_sort | Eu-ahsunthornwattana, Jakris |
collection | PubMed |
description | Approaches based on linear mixed models (LMMs) have recently gained popularity for modelling population substructure and relatedness in genome-wide association studies. In the last few years, a bewildering variety of different LMM methods/software packages have been developed, but it is not always clear how (or indeed whether) any newly-proposed method differs from previously-proposed implementations. Here we compare the performance of several LMM approaches (and software implementations, including EMMAX, GenABEL, FaST-LMM, Mendel, GEMMA and MMM) via their application to a genome-wide association study of visceral leishmaniasis in 348 Brazilian families comprising 3626 individuals (1972 genotyped). The implementations differ in precise details of methodology implemented and through various user-chosen options such as the method and number of SNPs used to estimate the kinship (relatedness) matrix. We investigate sensitivity to these choices and the success (or otherwise) of the approaches in controlling the overall genome-wide error-rate for both real and simulated phenotypes. We compare the LMM results to those obtained using traditional family-based association tests (based on transmission of alleles within pedigrees) and to alternative approaches implemented in the software packages MQLS, ROADTRIPS and MASTOR. We find strong concordance between the results from different LMM approaches, and all are successful in controlling the genome-wide error rate (except for some approaches when applied naively to longitudinal data with many repeated measures). We also find high correlation between LMMs and alternative approaches (apart from transmission-based approaches when applied to SNPs with small or non-existent effects). We conclude that LMM approaches perform well in comparison to competing approaches. Given their strong concordance, in most applications, the choice of precise LMM implementation cannot be based on power/type I error considerations but must instead be based on considerations such as speed and ease-of-use. |
format | Online Article Text |
id | pubmed-4102448 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-41024482014-07-21 Comparison of Methods to Account for Relatedness in Genome-Wide Association Studies with Family-Based Data Eu-ahsunthornwattana, Jakris Miller, E. Nancy Fakiola, Michaela Jeronimo, Selma M. B. Blackwell, Jenefer M. Cordell, Heather J. PLoS Genet Research Article Approaches based on linear mixed models (LMMs) have recently gained popularity for modelling population substructure and relatedness in genome-wide association studies. In the last few years, a bewildering variety of different LMM methods/software packages have been developed, but it is not always clear how (or indeed whether) any newly-proposed method differs from previously-proposed implementations. Here we compare the performance of several LMM approaches (and software implementations, including EMMAX, GenABEL, FaST-LMM, Mendel, GEMMA and MMM) via their application to a genome-wide association study of visceral leishmaniasis in 348 Brazilian families comprising 3626 individuals (1972 genotyped). The implementations differ in precise details of methodology implemented and through various user-chosen options such as the method and number of SNPs used to estimate the kinship (relatedness) matrix. We investigate sensitivity to these choices and the success (or otherwise) of the approaches in controlling the overall genome-wide error-rate for both real and simulated phenotypes. We compare the LMM results to those obtained using traditional family-based association tests (based on transmission of alleles within pedigrees) and to alternative approaches implemented in the software packages MQLS, ROADTRIPS and MASTOR. We find strong concordance between the results from different LMM approaches, and all are successful in controlling the genome-wide error rate (except for some approaches when applied naively to longitudinal data with many repeated measures). We also find high correlation between LMMs and alternative approaches (apart from transmission-based approaches when applied to SNPs with small or non-existent effects). We conclude that LMM approaches perform well in comparison to competing approaches. Given their strong concordance, in most applications, the choice of precise LMM implementation cannot be based on power/type I error considerations but must instead be based on considerations such as speed and ease-of-use. Public Library of Science 2014-07-17 /pmc/articles/PMC4102448/ /pubmed/25033443 http://dx.doi.org/10.1371/journal.pgen.1004445 Text en © 2014 Eu-ahsunthornwattana et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Eu-ahsunthornwattana, Jakris Miller, E. Nancy Fakiola, Michaela Jeronimo, Selma M. B. Blackwell, Jenefer M. Cordell, Heather J. Comparison of Methods to Account for Relatedness in Genome-Wide Association Studies with Family-Based Data |
title | Comparison of Methods to Account for Relatedness in Genome-Wide Association Studies with Family-Based Data |
title_full | Comparison of Methods to Account for Relatedness in Genome-Wide Association Studies with Family-Based Data |
title_fullStr | Comparison of Methods to Account for Relatedness in Genome-Wide Association Studies with Family-Based Data |
title_full_unstemmed | Comparison of Methods to Account for Relatedness in Genome-Wide Association Studies with Family-Based Data |
title_short | Comparison of Methods to Account for Relatedness in Genome-Wide Association Studies with Family-Based Data |
title_sort | comparison of methods to account for relatedness in genome-wide association studies with family-based data |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4102448/ https://www.ncbi.nlm.nih.gov/pubmed/25033443 http://dx.doi.org/10.1371/journal.pgen.1004445 |
work_keys_str_mv | AT euahsunthornwattanajakris comparisonofmethodstoaccountforrelatednessingenomewideassociationstudieswithfamilybaseddata AT millerenancy comparisonofmethodstoaccountforrelatednessingenomewideassociationstudieswithfamilybaseddata AT fakiolamichaela comparisonofmethodstoaccountforrelatednessingenomewideassociationstudieswithfamilybaseddata AT comparisonofmethodstoaccountforrelatednessingenomewideassociationstudieswithfamilybaseddata AT jeronimoselmamb comparisonofmethodstoaccountforrelatednessingenomewideassociationstudieswithfamilybaseddata AT blackwelljeneferm comparisonofmethodstoaccountforrelatednessingenomewideassociationstudieswithfamilybaseddata AT cordellheatherj comparisonofmethodstoaccountforrelatednessingenomewideassociationstudieswithfamilybaseddata |