Cargando…

LFMM 2: Fast and Accurate Inference of Gene-Environment Associations in Genome-Wide Studies

Gene-environment association (GEA) studies are essential to understand the past and ongoing adaptations of organisms to their environment, but those studies are complicated by confounding due to unobserved demographic factors. Although the confounding problem has recently received considerable atten...

Descripción completa

Detalles Bibliográficos
Autores principales: Caye, Kevin, Jumentier, Basile, Lepeule, Johanna, François, Olivier
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6659841/
https://www.ncbi.nlm.nih.gov/pubmed/30657943
http://dx.doi.org/10.1093/molbev/msz008
_version_ 1783439210286415872
author Caye, Kevin
Jumentier, Basile
Lepeule, Johanna
François, Olivier
author_facet Caye, Kevin
Jumentier, Basile
Lepeule, Johanna
François, Olivier
author_sort Caye, Kevin
collection PubMed
description Gene-environment association (GEA) studies are essential to understand the past and ongoing adaptations of organisms to their environment, but those studies are complicated by confounding due to unobserved demographic factors. Although the confounding problem has recently received considerable attention, the proposed approaches do not scale with the high-dimensionality of genomic data. Here, we present a new estimation method for latent factor mixed models (LFMMs) implemented in an upgraded version of the corresponding computer program. We developed a least-squares estimation approach for confounder estimation that provides a unique framework for several categories of genomic data, not restricted to genotypes. The speed of the new algorithm is several order faster than existing GEA approaches and then our previous version of the LFMM program. In addition, the new method outperforms other fast approaches based on principal component or surrogate variable analysis. We illustrate the program use with analyses of the 1000 Genomes Project data set, leading to new findings on adaptation of humans to their environment, and with analyses of DNA methylation profiles providing insights on how tobacco consumption could affect DNA methylation in patients with rheumatoid arthritis. Software availability: Software is available in the R package lfmm at https://bcm-uga.github.io/lfmm/.
format Online
Article
Text
id pubmed-6659841
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-66598412019-08-02 LFMM 2: Fast and Accurate Inference of Gene-Environment Associations in Genome-Wide Studies Caye, Kevin Jumentier, Basile Lepeule, Johanna François, Olivier Mol Biol Evol Resources Gene-environment association (GEA) studies are essential to understand the past and ongoing adaptations of organisms to their environment, but those studies are complicated by confounding due to unobserved demographic factors. Although the confounding problem has recently received considerable attention, the proposed approaches do not scale with the high-dimensionality of genomic data. Here, we present a new estimation method for latent factor mixed models (LFMMs) implemented in an upgraded version of the corresponding computer program. We developed a least-squares estimation approach for confounder estimation that provides a unique framework for several categories of genomic data, not restricted to genotypes. The speed of the new algorithm is several order faster than existing GEA approaches and then our previous version of the LFMM program. In addition, the new method outperforms other fast approaches based on principal component or surrogate variable analysis. We illustrate the program use with analyses of the 1000 Genomes Project data set, leading to new findings on adaptation of humans to their environment, and with analyses of DNA methylation profiles providing insights on how tobacco consumption could affect DNA methylation in patients with rheumatoid arthritis. Software availability: Software is available in the R package lfmm at https://bcm-uga.github.io/lfmm/. Oxford University Press 2019-04 2019-01-17 /pmc/articles/PMC6659841/ /pubmed/30657943 http://dx.doi.org/10.1093/molbev/msz008 Text en © The Author(s) 2019. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Resources
Caye, Kevin
Jumentier, Basile
Lepeule, Johanna
François, Olivier
LFMM 2: Fast and Accurate Inference of Gene-Environment Associations in Genome-Wide Studies
title LFMM 2: Fast and Accurate Inference of Gene-Environment Associations in Genome-Wide Studies
title_full LFMM 2: Fast and Accurate Inference of Gene-Environment Associations in Genome-Wide Studies
title_fullStr LFMM 2: Fast and Accurate Inference of Gene-Environment Associations in Genome-Wide Studies
title_full_unstemmed LFMM 2: Fast and Accurate Inference of Gene-Environment Associations in Genome-Wide Studies
title_short LFMM 2: Fast and Accurate Inference of Gene-Environment Associations in Genome-Wide Studies
title_sort lfmm 2: fast and accurate inference of gene-environment associations in genome-wide studies
topic Resources
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6659841/
https://www.ncbi.nlm.nih.gov/pubmed/30657943
http://dx.doi.org/10.1093/molbev/msz008
work_keys_str_mv AT cayekevin lfmm2fastandaccurateinferenceofgeneenvironmentassociationsingenomewidestudies
AT jumentierbasile lfmm2fastandaccurateinferenceofgeneenvironmentassociationsingenomewidestudies
AT lepeulejohanna lfmm2fastandaccurateinferenceofgeneenvironmentassociationsingenomewidestudies
AT francoisolivier lfmm2fastandaccurateinferenceofgeneenvironmentassociationsingenomewidestudies