Cargando…
Indexed variation graphs for efficient and accurate resistome profiling
MOTIVATION: Antimicrobial resistance (AMR) remains a major threat to global health. Profiling the collective AMR genes within a metagenome (the ‘resistome’) facilitates greater understanding of AMR gene diversity and dynamics. In turn, this can allow for gene surveillance, individualized treatment o...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6198860/ https://www.ncbi.nlm.nih.gov/pubmed/29762644 http://dx.doi.org/10.1093/bioinformatics/bty387 |
_version_ | 1783365030759104512 |
---|---|
author | Rowe, Will P M Winn, Martyn D |
author_facet | Rowe, Will P M Winn, Martyn D |
author_sort | Rowe, Will P M |
collection | PubMed |
description | MOTIVATION: Antimicrobial resistance (AMR) remains a major threat to global health. Profiling the collective AMR genes within a metagenome (the ‘resistome’) facilitates greater understanding of AMR gene diversity and dynamics. In turn, this can allow for gene surveillance, individualized treatment of bacterial infections and more sustainable use of antimicrobials. However, resistome profiling can be complicated by high similarity between reference genes, as well as the sheer volume of sequencing data and the complexity of analysis workflows. We have developed an efficient and accurate method for resistome profiling that addresses these complications and improves upon currently available tools. RESULTS: Our method combines a variation graph representation of gene sets with a locality-sensitive hashing Forest indexing scheme to allow for fast classification of metagenomic sequence reads using similarity-search queries. Subsequent hierarchical local alignment of classified reads against graph traversals enables accurate reconstruction of full-length gene sequences using a scoring scheme. We provide our implementation, graphing Resistance Out Of meTagenomes (GROOT), and show it to be both faster and more accurate than a current reference-dependent tool for resistome profiling. GROOT runs on a laptop and can process a typical 2 gigabyte metagenome in 2 min using a single CPU. Our method is not restricted to resistome profiling and has the potential to improve current metagenomic workflows. AVAILABILITY AND IMPLEMENTATION: GROOT is written in Go and is available at https://github.com/will-rowe/groot (MIT license). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-6198860 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-61988602018-10-26 Indexed variation graphs for efficient and accurate resistome profiling Rowe, Will P M Winn, Martyn D Bioinformatics Original Papers MOTIVATION: Antimicrobial resistance (AMR) remains a major threat to global health. Profiling the collective AMR genes within a metagenome (the ‘resistome’) facilitates greater understanding of AMR gene diversity and dynamics. In turn, this can allow for gene surveillance, individualized treatment of bacterial infections and more sustainable use of antimicrobials. However, resistome profiling can be complicated by high similarity between reference genes, as well as the sheer volume of sequencing data and the complexity of analysis workflows. We have developed an efficient and accurate method for resistome profiling that addresses these complications and improves upon currently available tools. RESULTS: Our method combines a variation graph representation of gene sets with a locality-sensitive hashing Forest indexing scheme to allow for fast classification of metagenomic sequence reads using similarity-search queries. Subsequent hierarchical local alignment of classified reads against graph traversals enables accurate reconstruction of full-length gene sequences using a scoring scheme. We provide our implementation, graphing Resistance Out Of meTagenomes (GROOT), and show it to be both faster and more accurate than a current reference-dependent tool for resistome profiling. GROOT runs on a laptop and can process a typical 2 gigabyte metagenome in 2 min using a single CPU. Our method is not restricted to resistome profiling and has the potential to improve current metagenomic workflows. AVAILABILITY AND IMPLEMENTATION: GROOT is written in Go and is available at https://github.com/will-rowe/groot (MIT license). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2018-11-01 2018-05-14 /pmc/articles/PMC6198860/ /pubmed/29762644 http://dx.doi.org/10.1093/bioinformatics/bty387 Text en © The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Papers Rowe, Will P M Winn, Martyn D Indexed variation graphs for efficient and accurate resistome profiling |
title | Indexed variation graphs for efficient and accurate resistome profiling |
title_full | Indexed variation graphs for efficient and accurate resistome profiling |
title_fullStr | Indexed variation graphs for efficient and accurate resistome profiling |
title_full_unstemmed | Indexed variation graphs for efficient and accurate resistome profiling |
title_short | Indexed variation graphs for efficient and accurate resistome profiling |
title_sort | indexed variation graphs for efficient and accurate resistome profiling |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6198860/ https://www.ncbi.nlm.nih.gov/pubmed/29762644 http://dx.doi.org/10.1093/bioinformatics/bty387 |
work_keys_str_mv | AT rowewillpm indexedvariationgraphsforefficientandaccurateresistomeprofiling AT winnmartynd indexedvariationgraphsforefficientandaccurateresistomeprofiling |