Cargando…
A fast and agnostic method for bacterial genome-wide association studies: Bridging the gap between k-mers and genetic events
Genome-wide association study (GWAS) methods applied to bacterial genomes have shown promising results for genetic marker discovery or detailed assessment of marker effect. Recently, alignment-free methods based on k-mer composition have proven their ability to explore the accessory genome. However,...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6258240/ https://www.ncbi.nlm.nih.gov/pubmed/30419019 http://dx.doi.org/10.1371/journal.pgen.1007758 |
_version_ | 1783374460715270144 |
---|---|
author | Jaillard, Magali Lima, Leandro Tournoud, Maud Mahé, Pierre van Belkum, Alex Lacroix, Vincent Jacob, Laurent |
author_facet | Jaillard, Magali Lima, Leandro Tournoud, Maud Mahé, Pierre van Belkum, Alex Lacroix, Vincent Jacob, Laurent |
author_sort | Jaillard, Magali |
collection | PubMed |
description | Genome-wide association study (GWAS) methods applied to bacterial genomes have shown promising results for genetic marker discovery or detailed assessment of marker effect. Recently, alignment-free methods based on k-mer composition have proven their ability to explore the accessory genome. However, they lead to redundant descriptions and results which are sometimes hard to interpret. Here we introduce DBGWAS, an extended k-mer-based GWAS method producing interpretable genetic variants associated with distinct phenotypes. Relying on compacted De Bruijn graphs (cDBG), our method gathers cDBG nodes, identified by the association model, into subgraphs defined from their neighbourhood in the initial cDBG. DBGWAS is alignment-free and only requires a set of contigs and phenotypes. In particular, it does not require prior annotation or reference genomes. It produces subgraphs representing phenotype-associated genetic variants such as local polymorphisms and mobile genetic elements (MGE). It offers a graphical framework which helps interpret GWAS results. Importantly it is also computationally efficient—experiments took one hour and a half on average. We validated our method using antibiotic resistance phenotypes for three bacterial species. DBGWAS recovered known resistance determinants such as mutations in core genes in Mycobacterium tuberculosis, and genes acquired by horizontal transfer in Staphylococcus aureus and Pseudomonas aeruginosa—along with their MGE context. It also enabled us to formulate new hypotheses involving genetic variants not yet described in the antibiotic resistance literature. An open-source tool implementing DBGWAS is available at https://gitlab.com/leoisl/dbgwas. |
format | Online Article Text |
id | pubmed-6258240 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-62582402018-12-06 A fast and agnostic method for bacterial genome-wide association studies: Bridging the gap between k-mers and genetic events Jaillard, Magali Lima, Leandro Tournoud, Maud Mahé, Pierre van Belkum, Alex Lacroix, Vincent Jacob, Laurent PLoS Genet Research Article Genome-wide association study (GWAS) methods applied to bacterial genomes have shown promising results for genetic marker discovery or detailed assessment of marker effect. Recently, alignment-free methods based on k-mer composition have proven their ability to explore the accessory genome. However, they lead to redundant descriptions and results which are sometimes hard to interpret. Here we introduce DBGWAS, an extended k-mer-based GWAS method producing interpretable genetic variants associated with distinct phenotypes. Relying on compacted De Bruijn graphs (cDBG), our method gathers cDBG nodes, identified by the association model, into subgraphs defined from their neighbourhood in the initial cDBG. DBGWAS is alignment-free and only requires a set of contigs and phenotypes. In particular, it does not require prior annotation or reference genomes. It produces subgraphs representing phenotype-associated genetic variants such as local polymorphisms and mobile genetic elements (MGE). It offers a graphical framework which helps interpret GWAS results. Importantly it is also computationally efficient—experiments took one hour and a half on average. We validated our method using antibiotic resistance phenotypes for three bacterial species. DBGWAS recovered known resistance determinants such as mutations in core genes in Mycobacterium tuberculosis, and genes acquired by horizontal transfer in Staphylococcus aureus and Pseudomonas aeruginosa—along with their MGE context. It also enabled us to formulate new hypotheses involving genetic variants not yet described in the antibiotic resistance literature. An open-source tool implementing DBGWAS is available at https://gitlab.com/leoisl/dbgwas. Public Library of Science 2018-11-12 /pmc/articles/PMC6258240/ /pubmed/30419019 http://dx.doi.org/10.1371/journal.pgen.1007758 Text en © 2018 Jaillard et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Jaillard, Magali Lima, Leandro Tournoud, Maud Mahé, Pierre van Belkum, Alex Lacroix, Vincent Jacob, Laurent A fast and agnostic method for bacterial genome-wide association studies: Bridging the gap between k-mers and genetic events |
title | A fast and agnostic method for bacterial genome-wide association studies: Bridging the gap between k-mers and genetic events |
title_full | A fast and agnostic method for bacterial genome-wide association studies: Bridging the gap between k-mers and genetic events |
title_fullStr | A fast and agnostic method for bacterial genome-wide association studies: Bridging the gap between k-mers and genetic events |
title_full_unstemmed | A fast and agnostic method for bacterial genome-wide association studies: Bridging the gap between k-mers and genetic events |
title_short | A fast and agnostic method for bacterial genome-wide association studies: Bridging the gap between k-mers and genetic events |
title_sort | fast and agnostic method for bacterial genome-wide association studies: bridging the gap between k-mers and genetic events |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6258240/ https://www.ncbi.nlm.nih.gov/pubmed/30419019 http://dx.doi.org/10.1371/journal.pgen.1007758 |
work_keys_str_mv | AT jaillardmagali afastandagnosticmethodforbacterialgenomewideassociationstudiesbridgingthegapbetweenkmersandgeneticevents AT limaleandro afastandagnosticmethodforbacterialgenomewideassociationstudiesbridgingthegapbetweenkmersandgeneticevents AT tournoudmaud afastandagnosticmethodforbacterialgenomewideassociationstudiesbridgingthegapbetweenkmersandgeneticevents AT mahepierre afastandagnosticmethodforbacterialgenomewideassociationstudiesbridgingthegapbetweenkmersandgeneticevents AT vanbelkumalex afastandagnosticmethodforbacterialgenomewideassociationstudiesbridgingthegapbetweenkmersandgeneticevents AT lacroixvincent afastandagnosticmethodforbacterialgenomewideassociationstudiesbridgingthegapbetweenkmersandgeneticevents AT jacoblaurent afastandagnosticmethodforbacterialgenomewideassociationstudiesbridgingthegapbetweenkmersandgeneticevents AT jaillardmagali fastandagnosticmethodforbacterialgenomewideassociationstudiesbridgingthegapbetweenkmersandgeneticevents AT limaleandro fastandagnosticmethodforbacterialgenomewideassociationstudiesbridgingthegapbetweenkmersandgeneticevents AT tournoudmaud fastandagnosticmethodforbacterialgenomewideassociationstudiesbridgingthegapbetweenkmersandgeneticevents AT mahepierre fastandagnosticmethodforbacterialgenomewideassociationstudiesbridgingthegapbetweenkmersandgeneticevents AT vanbelkumalex fastandagnosticmethodforbacterialgenomewideassociationstudiesbridgingthegapbetweenkmersandgeneticevents AT lacroixvincent fastandagnosticmethodforbacterialgenomewideassociationstudiesbridgingthegapbetweenkmersandgeneticevents AT jacoblaurent fastandagnosticmethodforbacterialgenomewideassociationstudiesbridgingthegapbetweenkmersandgeneticevents |