Cargando…
LFMM 2: Fast and Accurate Inference of Gene-Environment Associations in Genome-Wide Studies
Gene-environment association (GEA) studies are essential to understand the past and ongoing adaptations of organisms to their environment, but those studies are complicated by confounding due to unobserved demographic factors. Although the confounding problem has recently received considerable atten...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6659841/ https://www.ncbi.nlm.nih.gov/pubmed/30657943 http://dx.doi.org/10.1093/molbev/msz008 |
_version_ | 1783439210286415872 |
---|---|
author | Caye, Kevin Jumentier, Basile Lepeule, Johanna François, Olivier |
author_facet | Caye, Kevin Jumentier, Basile Lepeule, Johanna François, Olivier |
author_sort | Caye, Kevin |
collection | PubMed |
description | Gene-environment association (GEA) studies are essential to understand the past and ongoing adaptations of organisms to their environment, but those studies are complicated by confounding due to unobserved demographic factors. Although the confounding problem has recently received considerable attention, the proposed approaches do not scale with the high-dimensionality of genomic data. Here, we present a new estimation method for latent factor mixed models (LFMMs) implemented in an upgraded version of the corresponding computer program. We developed a least-squares estimation approach for confounder estimation that provides a unique framework for several categories of genomic data, not restricted to genotypes. The speed of the new algorithm is several order faster than existing GEA approaches and then our previous version of the LFMM program. In addition, the new method outperforms other fast approaches based on principal component or surrogate variable analysis. We illustrate the program use with analyses of the 1000 Genomes Project data set, leading to new findings on adaptation of humans to their environment, and with analyses of DNA methylation profiles providing insights on how tobacco consumption could affect DNA methylation in patients with rheumatoid arthritis. Software availability: Software is available in the R package lfmm at https://bcm-uga.github.io/lfmm/. |
format | Online Article Text |
id | pubmed-6659841 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-66598412019-08-02 LFMM 2: Fast and Accurate Inference of Gene-Environment Associations in Genome-Wide Studies Caye, Kevin Jumentier, Basile Lepeule, Johanna François, Olivier Mol Biol Evol Resources Gene-environment association (GEA) studies are essential to understand the past and ongoing adaptations of organisms to their environment, but those studies are complicated by confounding due to unobserved demographic factors. Although the confounding problem has recently received considerable attention, the proposed approaches do not scale with the high-dimensionality of genomic data. Here, we present a new estimation method for latent factor mixed models (LFMMs) implemented in an upgraded version of the corresponding computer program. We developed a least-squares estimation approach for confounder estimation that provides a unique framework for several categories of genomic data, not restricted to genotypes. The speed of the new algorithm is several order faster than existing GEA approaches and then our previous version of the LFMM program. In addition, the new method outperforms other fast approaches based on principal component or surrogate variable analysis. We illustrate the program use with analyses of the 1000 Genomes Project data set, leading to new findings on adaptation of humans to their environment, and with analyses of DNA methylation profiles providing insights on how tobacco consumption could affect DNA methylation in patients with rheumatoid arthritis. Software availability: Software is available in the R package lfmm at https://bcm-uga.github.io/lfmm/. Oxford University Press 2019-04 2019-01-17 /pmc/articles/PMC6659841/ /pubmed/30657943 http://dx.doi.org/10.1093/molbev/msz008 Text en © The Author(s) 2019. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Resources Caye, Kevin Jumentier, Basile Lepeule, Johanna François, Olivier LFMM 2: Fast and Accurate Inference of Gene-Environment Associations in Genome-Wide Studies |
title | LFMM 2: Fast and Accurate Inference of Gene-Environment Associations in Genome-Wide Studies |
title_full | LFMM 2: Fast and Accurate Inference of Gene-Environment Associations in Genome-Wide Studies |
title_fullStr | LFMM 2: Fast and Accurate Inference of Gene-Environment Associations in Genome-Wide Studies |
title_full_unstemmed | LFMM 2: Fast and Accurate Inference of Gene-Environment Associations in Genome-Wide Studies |
title_short | LFMM 2: Fast and Accurate Inference of Gene-Environment Associations in Genome-Wide Studies |
title_sort | lfmm 2: fast and accurate inference of gene-environment associations in genome-wide studies |
topic | Resources |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6659841/ https://www.ncbi.nlm.nih.gov/pubmed/30657943 http://dx.doi.org/10.1093/molbev/msz008 |
work_keys_str_mv | AT cayekevin lfmm2fastandaccurateinferenceofgeneenvironmentassociationsingenomewidestudies AT jumentierbasile lfmm2fastandaccurateinferenceofgeneenvironmentassociationsingenomewidestudies AT lepeulejohanna lfmm2fastandaccurateinferenceofgeneenvironmentassociationsingenomewidestudies AT francoisolivier lfmm2fastandaccurateinferenceofgeneenvironmentassociationsingenomewidestudies |