Cargando…

Comparison of Methods to Account for Relatedness in Genome-Wide Association Studies with Family-Based Data

Approaches based on linear mixed models (LMMs) have recently gained popularity for modelling population substructure and relatedness in genome-wide association studies. In the last few years, a bewildering variety of different LMM methods/software packages have been developed, but it is not always c...

Descripción completa

Detalles Bibliográficos
Autores principales: Eu-ahsunthornwattana, Jakris, Miller, E. Nancy, Fakiola, Michaela, Jeronimo, Selma M. B., Blackwell, Jenefer M., Cordell, Heather J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4102448/
https://www.ncbi.nlm.nih.gov/pubmed/25033443
http://dx.doi.org/10.1371/journal.pgen.1004445
_version_ 1782481038690222080
author Eu-ahsunthornwattana, Jakris
Miller, E. Nancy
Fakiola, Michaela
Jeronimo, Selma M. B.
Blackwell, Jenefer M.
Cordell, Heather J.
author_facet Eu-ahsunthornwattana, Jakris
Miller, E. Nancy
Fakiola, Michaela
Jeronimo, Selma M. B.
Blackwell, Jenefer M.
Cordell, Heather J.
author_sort Eu-ahsunthornwattana, Jakris
collection PubMed
description Approaches based on linear mixed models (LMMs) have recently gained popularity for modelling population substructure and relatedness in genome-wide association studies. In the last few years, a bewildering variety of different LMM methods/software packages have been developed, but it is not always clear how (or indeed whether) any newly-proposed method differs from previously-proposed implementations. Here we compare the performance of several LMM approaches (and software implementations, including EMMAX, GenABEL, FaST-LMM, Mendel, GEMMA and MMM) via their application to a genome-wide association study of visceral leishmaniasis in 348 Brazilian families comprising 3626 individuals (1972 genotyped). The implementations differ in precise details of methodology implemented and through various user-chosen options such as the method and number of SNPs used to estimate the kinship (relatedness) matrix. We investigate sensitivity to these choices and the success (or otherwise) of the approaches in controlling the overall genome-wide error-rate for both real and simulated phenotypes. We compare the LMM results to those obtained using traditional family-based association tests (based on transmission of alleles within pedigrees) and to alternative approaches implemented in the software packages MQLS, ROADTRIPS and MASTOR. We find strong concordance between the results from different LMM approaches, and all are successful in controlling the genome-wide error rate (except for some approaches when applied naively to longitudinal data with many repeated measures). We also find high correlation between LMMs and alternative approaches (apart from transmission-based approaches when applied to SNPs with small or non-existent effects). We conclude that LMM approaches perform well in comparison to competing approaches. Given their strong concordance, in most applications, the choice of precise LMM implementation cannot be based on power/type I error considerations but must instead be based on considerations such as speed and ease-of-use.
format Online
Article
Text
id pubmed-4102448
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-41024482014-07-21 Comparison of Methods to Account for Relatedness in Genome-Wide Association Studies with Family-Based Data Eu-ahsunthornwattana, Jakris Miller, E. Nancy Fakiola, Michaela Jeronimo, Selma M. B. Blackwell, Jenefer M. Cordell, Heather J. PLoS Genet Research Article Approaches based on linear mixed models (LMMs) have recently gained popularity for modelling population substructure and relatedness in genome-wide association studies. In the last few years, a bewildering variety of different LMM methods/software packages have been developed, but it is not always clear how (or indeed whether) any newly-proposed method differs from previously-proposed implementations. Here we compare the performance of several LMM approaches (and software implementations, including EMMAX, GenABEL, FaST-LMM, Mendel, GEMMA and MMM) via their application to a genome-wide association study of visceral leishmaniasis in 348 Brazilian families comprising 3626 individuals (1972 genotyped). The implementations differ in precise details of methodology implemented and through various user-chosen options such as the method and number of SNPs used to estimate the kinship (relatedness) matrix. We investigate sensitivity to these choices and the success (or otherwise) of the approaches in controlling the overall genome-wide error-rate for both real and simulated phenotypes. We compare the LMM results to those obtained using traditional family-based association tests (based on transmission of alleles within pedigrees) and to alternative approaches implemented in the software packages MQLS, ROADTRIPS and MASTOR. We find strong concordance between the results from different LMM approaches, and all are successful in controlling the genome-wide error rate (except for some approaches when applied naively to longitudinal data with many repeated measures). We also find high correlation between LMMs and alternative approaches (apart from transmission-based approaches when applied to SNPs with small or non-existent effects). We conclude that LMM approaches perform well in comparison to competing approaches. Given their strong concordance, in most applications, the choice of precise LMM implementation cannot be based on power/type I error considerations but must instead be based on considerations such as speed and ease-of-use. Public Library of Science 2014-07-17 /pmc/articles/PMC4102448/ /pubmed/25033443 http://dx.doi.org/10.1371/journal.pgen.1004445 Text en © 2014 Eu-ahsunthornwattana et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Eu-ahsunthornwattana, Jakris
Miller, E. Nancy
Fakiola, Michaela
Jeronimo, Selma M. B.
Blackwell, Jenefer M.
Cordell, Heather J.
Comparison of Methods to Account for Relatedness in Genome-Wide Association Studies with Family-Based Data
title Comparison of Methods to Account for Relatedness in Genome-Wide Association Studies with Family-Based Data
title_full Comparison of Methods to Account for Relatedness in Genome-Wide Association Studies with Family-Based Data
title_fullStr Comparison of Methods to Account for Relatedness in Genome-Wide Association Studies with Family-Based Data
title_full_unstemmed Comparison of Methods to Account for Relatedness in Genome-Wide Association Studies with Family-Based Data
title_short Comparison of Methods to Account for Relatedness in Genome-Wide Association Studies with Family-Based Data
title_sort comparison of methods to account for relatedness in genome-wide association studies with family-based data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4102448/
https://www.ncbi.nlm.nih.gov/pubmed/25033443
http://dx.doi.org/10.1371/journal.pgen.1004445
work_keys_str_mv AT euahsunthornwattanajakris comparisonofmethodstoaccountforrelatednessingenomewideassociationstudieswithfamilybaseddata
AT millerenancy comparisonofmethodstoaccountforrelatednessingenomewideassociationstudieswithfamilybaseddata
AT fakiolamichaela comparisonofmethodstoaccountforrelatednessingenomewideassociationstudieswithfamilybaseddata
AT comparisonofmethodstoaccountforrelatednessingenomewideassociationstudieswithfamilybaseddata
AT jeronimoselmamb comparisonofmethodstoaccountforrelatednessingenomewideassociationstudieswithfamilybaseddata
AT blackwelljeneferm comparisonofmethodstoaccountforrelatednessingenomewideassociationstudieswithfamilybaseddata
AT cordellheatherj comparisonofmethodstoaccountforrelatednessingenomewideassociationstudieswithfamilybaseddata