Cargando…

Indexed variation graphs for efficient and accurate resistome profiling

MOTIVATION: Antimicrobial resistance (AMR) remains a major threat to global health. Profiling the collective AMR genes within a metagenome (the ‘resistome’) facilitates greater understanding of AMR gene diversity and dynamics. In turn, this can allow for gene surveillance, individualized treatment o...

Descripción completa

Detalles Bibliográficos
Autores principales: Rowe, Will P M, Winn, Martyn D
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6198860/
https://www.ncbi.nlm.nih.gov/pubmed/29762644
http://dx.doi.org/10.1093/bioinformatics/bty387
_version_ 1783365030759104512
author Rowe, Will P M
Winn, Martyn D
author_facet Rowe, Will P M
Winn, Martyn D
author_sort Rowe, Will P M
collection PubMed
description MOTIVATION: Antimicrobial resistance (AMR) remains a major threat to global health. Profiling the collective AMR genes within a metagenome (the ‘resistome’) facilitates greater understanding of AMR gene diversity and dynamics. In turn, this can allow for gene surveillance, individualized treatment of bacterial infections and more sustainable use of antimicrobials. However, resistome profiling can be complicated by high similarity between reference genes, as well as the sheer volume of sequencing data and the complexity of analysis workflows. We have developed an efficient and accurate method for resistome profiling that addresses these complications and improves upon currently available tools. RESULTS: Our method combines a variation graph representation of gene sets with a locality-sensitive hashing Forest indexing scheme to allow for fast classification of metagenomic sequence reads using similarity-search queries. Subsequent hierarchical local alignment of classified reads against graph traversals enables accurate reconstruction of full-length gene sequences using a scoring scheme. We provide our implementation, graphing Resistance Out Of meTagenomes (GROOT), and show it to be both faster and more accurate than a current reference-dependent tool for resistome profiling. GROOT runs on a laptop and can process a typical 2 gigabyte metagenome in 2 min using a single CPU. Our method is not restricted to resistome profiling and has the potential to improve current metagenomic workflows. AVAILABILITY AND IMPLEMENTATION: GROOT is written in Go and is available at https://github.com/will-rowe/groot (MIT license). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-6198860
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-61988602018-10-26 Indexed variation graphs for efficient and accurate resistome profiling Rowe, Will P M Winn, Martyn D Bioinformatics Original Papers MOTIVATION: Antimicrobial resistance (AMR) remains a major threat to global health. Profiling the collective AMR genes within a metagenome (the ‘resistome’) facilitates greater understanding of AMR gene diversity and dynamics. In turn, this can allow for gene surveillance, individualized treatment of bacterial infections and more sustainable use of antimicrobials. However, resistome profiling can be complicated by high similarity between reference genes, as well as the sheer volume of sequencing data and the complexity of analysis workflows. We have developed an efficient and accurate method for resistome profiling that addresses these complications and improves upon currently available tools. RESULTS: Our method combines a variation graph representation of gene sets with a locality-sensitive hashing Forest indexing scheme to allow for fast classification of metagenomic sequence reads using similarity-search queries. Subsequent hierarchical local alignment of classified reads against graph traversals enables accurate reconstruction of full-length gene sequences using a scoring scheme. We provide our implementation, graphing Resistance Out Of meTagenomes (GROOT), and show it to be both faster and more accurate than a current reference-dependent tool for resistome profiling. GROOT runs on a laptop and can process a typical 2 gigabyte metagenome in 2 min using a single CPU. Our method is not restricted to resistome profiling and has the potential to improve current metagenomic workflows. AVAILABILITY AND IMPLEMENTATION: GROOT is written in Go and is available at https://github.com/will-rowe/groot (MIT license). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2018-11-01 2018-05-14 /pmc/articles/PMC6198860/ /pubmed/29762644 http://dx.doi.org/10.1093/bioinformatics/bty387 Text en © The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Rowe, Will P M
Winn, Martyn D
Indexed variation graphs for efficient and accurate resistome profiling
title Indexed variation graphs for efficient and accurate resistome profiling
title_full Indexed variation graphs for efficient and accurate resistome profiling
title_fullStr Indexed variation graphs for efficient and accurate resistome profiling
title_full_unstemmed Indexed variation graphs for efficient and accurate resistome profiling
title_short Indexed variation graphs for efficient and accurate resistome profiling
title_sort indexed variation graphs for efficient and accurate resistome profiling
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6198860/
https://www.ncbi.nlm.nih.gov/pubmed/29762644
http://dx.doi.org/10.1093/bioinformatics/bty387
work_keys_str_mv AT rowewillpm indexedvariationgraphsforefficientandaccurateresistomeprofiling
AT winnmartynd indexedvariationgraphsforefficientandaccurateresistomeprofiling