Cargando…

A fast and agnostic method for bacterial genome-wide association studies: Bridging the gap between k-mers and genetic events

Genome-wide association study (GWAS) methods applied to bacterial genomes have shown promising results for genetic marker discovery or detailed assessment of marker effect. Recently, alignment-free methods based on k-mer composition have proven their ability to explore the accessory genome. However,...

Descripción completa

Detalles Bibliográficos
Autores principales: Jaillard, Magali, Lima, Leandro, Tournoud, Maud, Mahé, Pierre, van Belkum, Alex, Lacroix, Vincent, Jacob, Laurent
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6258240/
https://www.ncbi.nlm.nih.gov/pubmed/30419019
http://dx.doi.org/10.1371/journal.pgen.1007758
_version_ 1783374460715270144
author Jaillard, Magali
Lima, Leandro
Tournoud, Maud
Mahé, Pierre
van Belkum, Alex
Lacroix, Vincent
Jacob, Laurent
author_facet Jaillard, Magali
Lima, Leandro
Tournoud, Maud
Mahé, Pierre
van Belkum, Alex
Lacroix, Vincent
Jacob, Laurent
author_sort Jaillard, Magali
collection PubMed
description Genome-wide association study (GWAS) methods applied to bacterial genomes have shown promising results for genetic marker discovery or detailed assessment of marker effect. Recently, alignment-free methods based on k-mer composition have proven their ability to explore the accessory genome. However, they lead to redundant descriptions and results which are sometimes hard to interpret. Here we introduce DBGWAS, an extended k-mer-based GWAS method producing interpretable genetic variants associated with distinct phenotypes. Relying on compacted De Bruijn graphs (cDBG), our method gathers cDBG nodes, identified by the association model, into subgraphs defined from their neighbourhood in the initial cDBG. DBGWAS is alignment-free and only requires a set of contigs and phenotypes. In particular, it does not require prior annotation or reference genomes. It produces subgraphs representing phenotype-associated genetic variants such as local polymorphisms and mobile genetic elements (MGE). It offers a graphical framework which helps interpret GWAS results. Importantly it is also computationally efficient—experiments took one hour and a half on average. We validated our method using antibiotic resistance phenotypes for three bacterial species. DBGWAS recovered known resistance determinants such as mutations in core genes in Mycobacterium tuberculosis, and genes acquired by horizontal transfer in Staphylococcus aureus and Pseudomonas aeruginosa—along with their MGE context. It also enabled us to formulate new hypotheses involving genetic variants not yet described in the antibiotic resistance literature. An open-source tool implementing DBGWAS is available at https://gitlab.com/leoisl/dbgwas.
format Online
Article
Text
id pubmed-6258240
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-62582402018-12-06 A fast and agnostic method for bacterial genome-wide association studies: Bridging the gap between k-mers and genetic events Jaillard, Magali Lima, Leandro Tournoud, Maud Mahé, Pierre van Belkum, Alex Lacroix, Vincent Jacob, Laurent PLoS Genet Research Article Genome-wide association study (GWAS) methods applied to bacterial genomes have shown promising results for genetic marker discovery or detailed assessment of marker effect. Recently, alignment-free methods based on k-mer composition have proven their ability to explore the accessory genome. However, they lead to redundant descriptions and results which are sometimes hard to interpret. Here we introduce DBGWAS, an extended k-mer-based GWAS method producing interpretable genetic variants associated with distinct phenotypes. Relying on compacted De Bruijn graphs (cDBG), our method gathers cDBG nodes, identified by the association model, into subgraphs defined from their neighbourhood in the initial cDBG. DBGWAS is alignment-free and only requires a set of contigs and phenotypes. In particular, it does not require prior annotation or reference genomes. It produces subgraphs representing phenotype-associated genetic variants such as local polymorphisms and mobile genetic elements (MGE). It offers a graphical framework which helps interpret GWAS results. Importantly it is also computationally efficient—experiments took one hour and a half on average. We validated our method using antibiotic resistance phenotypes for three bacterial species. DBGWAS recovered known resistance determinants such as mutations in core genes in Mycobacterium tuberculosis, and genes acquired by horizontal transfer in Staphylococcus aureus and Pseudomonas aeruginosa—along with their MGE context. It also enabled us to formulate new hypotheses involving genetic variants not yet described in the antibiotic resistance literature. An open-source tool implementing DBGWAS is available at https://gitlab.com/leoisl/dbgwas. Public Library of Science 2018-11-12 /pmc/articles/PMC6258240/ /pubmed/30419019 http://dx.doi.org/10.1371/journal.pgen.1007758 Text en © 2018 Jaillard et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Jaillard, Magali
Lima, Leandro
Tournoud, Maud
Mahé, Pierre
van Belkum, Alex
Lacroix, Vincent
Jacob, Laurent
A fast and agnostic method for bacterial genome-wide association studies: Bridging the gap between k-mers and genetic events
title A fast and agnostic method for bacterial genome-wide association studies: Bridging the gap between k-mers and genetic events
title_full A fast and agnostic method for bacterial genome-wide association studies: Bridging the gap between k-mers and genetic events
title_fullStr A fast and agnostic method for bacterial genome-wide association studies: Bridging the gap between k-mers and genetic events
title_full_unstemmed A fast and agnostic method for bacterial genome-wide association studies: Bridging the gap between k-mers and genetic events
title_short A fast and agnostic method for bacterial genome-wide association studies: Bridging the gap between k-mers and genetic events
title_sort fast and agnostic method for bacterial genome-wide association studies: bridging the gap between k-mers and genetic events
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6258240/
https://www.ncbi.nlm.nih.gov/pubmed/30419019
http://dx.doi.org/10.1371/journal.pgen.1007758
work_keys_str_mv AT jaillardmagali afastandagnosticmethodforbacterialgenomewideassociationstudiesbridgingthegapbetweenkmersandgeneticevents
AT limaleandro afastandagnosticmethodforbacterialgenomewideassociationstudiesbridgingthegapbetweenkmersandgeneticevents
AT tournoudmaud afastandagnosticmethodforbacterialgenomewideassociationstudiesbridgingthegapbetweenkmersandgeneticevents
AT mahepierre afastandagnosticmethodforbacterialgenomewideassociationstudiesbridgingthegapbetweenkmersandgeneticevents
AT vanbelkumalex afastandagnosticmethodforbacterialgenomewideassociationstudiesbridgingthegapbetweenkmersandgeneticevents
AT lacroixvincent afastandagnosticmethodforbacterialgenomewideassociationstudiesbridgingthegapbetweenkmersandgeneticevents
AT jacoblaurent afastandagnosticmethodforbacterialgenomewideassociationstudiesbridgingthegapbetweenkmersandgeneticevents
AT jaillardmagali fastandagnosticmethodforbacterialgenomewideassociationstudiesbridgingthegapbetweenkmersandgeneticevents
AT limaleandro fastandagnosticmethodforbacterialgenomewideassociationstudiesbridgingthegapbetweenkmersandgeneticevents
AT tournoudmaud fastandagnosticmethodforbacterialgenomewideassociationstudiesbridgingthegapbetweenkmersandgeneticevents
AT mahepierre fastandagnosticmethodforbacterialgenomewideassociationstudiesbridgingthegapbetweenkmersandgeneticevents
AT vanbelkumalex fastandagnosticmethodforbacterialgenomewideassociationstudiesbridgingthegapbetweenkmersandgeneticevents
AT lacroixvincent fastandagnosticmethodforbacterialgenomewideassociationstudiesbridgingthegapbetweenkmersandgeneticevents
AT jacoblaurent fastandagnosticmethodforbacterialgenomewideassociationstudiesbridgingthegapbetweenkmersandgeneticevents