Cargando…

A phylogenetic method to perform genome-wide association studies in microbes that accounts for population structure and recombination

Genome-Wide Association Studies (GWAS) in microbial organisms have the potential to vastly improve the way we understand, manage, and treat infectious diseases. Yet, microbial GWAS methods established thus far remain insufficiently able to capitalise on the growing wealth of bacterial and viral gene...

Descripción completa

Detalles Bibliográficos
Autores principales: Collins, Caitlin, Didelot, Xavier
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5814097/
https://www.ncbi.nlm.nih.gov/pubmed/29401456
http://dx.doi.org/10.1371/journal.pcbi.1005958
_version_ 1783300283740192768
author Collins, Caitlin
Didelot, Xavier
author_facet Collins, Caitlin
Didelot, Xavier
author_sort Collins, Caitlin
collection PubMed
description Genome-Wide Association Studies (GWAS) in microbial organisms have the potential to vastly improve the way we understand, manage, and treat infectious diseases. Yet, microbial GWAS methods established thus far remain insufficiently able to capitalise on the growing wealth of bacterial and viral genetic sequence data. Facing clonal population structure and homologous recombination, existing GWAS methods struggle to achieve both the precision necessary to reject spurious findings and the power required to detect associations in microbes. In this paper, we introduce a novel phylogenetic approach that has been tailor-made for microbial GWAS, which is applicable to organisms ranging from purely clonal to frequently recombining, and to both binary and continuous phenotypes. Our approach is robust to the confounding effects of both population structure and recombination, while maintaining high statistical power to detect associations. Thorough testing via application to simulated data provides strong support for the power and specificity of our approach and demonstrates the advantages offered over alternative cluster-based and dimension-reduction methods. Two applications to Neisseria meningitidis illustrate the versatility and potential of our method, confirming previously-identified penicillin resistance loci and resulting in the identification of both well-characterised and novel drivers of invasive disease. Our method is implemented as an open-source R package called treeWAS which is freely available at https://github.com/caitiecollins/treeWAS.
format Online
Article
Text
id pubmed-5814097
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-58140972018-03-15 A phylogenetic method to perform genome-wide association studies in microbes that accounts for population structure and recombination Collins, Caitlin Didelot, Xavier PLoS Comput Biol Research Article Genome-Wide Association Studies (GWAS) in microbial organisms have the potential to vastly improve the way we understand, manage, and treat infectious diseases. Yet, microbial GWAS methods established thus far remain insufficiently able to capitalise on the growing wealth of bacterial and viral genetic sequence data. Facing clonal population structure and homologous recombination, existing GWAS methods struggle to achieve both the precision necessary to reject spurious findings and the power required to detect associations in microbes. In this paper, we introduce a novel phylogenetic approach that has been tailor-made for microbial GWAS, which is applicable to organisms ranging from purely clonal to frequently recombining, and to both binary and continuous phenotypes. Our approach is robust to the confounding effects of both population structure and recombination, while maintaining high statistical power to detect associations. Thorough testing via application to simulated data provides strong support for the power and specificity of our approach and demonstrates the advantages offered over alternative cluster-based and dimension-reduction methods. Two applications to Neisseria meningitidis illustrate the versatility and potential of our method, confirming previously-identified penicillin resistance loci and resulting in the identification of both well-characterised and novel drivers of invasive disease. Our method is implemented as an open-source R package called treeWAS which is freely available at https://github.com/caitiecollins/treeWAS. Public Library of Science 2018-02-05 /pmc/articles/PMC5814097/ /pubmed/29401456 http://dx.doi.org/10.1371/journal.pcbi.1005958 Text en © 2018 Collins, Didelot http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Collins, Caitlin
Didelot, Xavier
A phylogenetic method to perform genome-wide association studies in microbes that accounts for population structure and recombination
title A phylogenetic method to perform genome-wide association studies in microbes that accounts for population structure and recombination
title_full A phylogenetic method to perform genome-wide association studies in microbes that accounts for population structure and recombination
title_fullStr A phylogenetic method to perform genome-wide association studies in microbes that accounts for population structure and recombination
title_full_unstemmed A phylogenetic method to perform genome-wide association studies in microbes that accounts for population structure and recombination
title_short A phylogenetic method to perform genome-wide association studies in microbes that accounts for population structure and recombination
title_sort phylogenetic method to perform genome-wide association studies in microbes that accounts for population structure and recombination
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5814097/
https://www.ncbi.nlm.nih.gov/pubmed/29401456
http://dx.doi.org/10.1371/journal.pcbi.1005958
work_keys_str_mv AT collinscaitlin aphylogeneticmethodtoperformgenomewideassociationstudiesinmicrobesthataccountsforpopulationstructureandrecombination
AT didelotxavier aphylogeneticmethodtoperformgenomewideassociationstudiesinmicrobesthataccountsforpopulationstructureandrecombination
AT collinscaitlin phylogeneticmethodtoperformgenomewideassociationstudiesinmicrobesthataccountsforpopulationstructureandrecombination
AT didelotxavier phylogeneticmethodtoperformgenomewideassociationstudiesinmicrobesthataccountsforpopulationstructureandrecombination