Cargando…

AMR-meta: a k-mer and metafeature approach to classify antimicrobial resistance from high-throughput short-read metagenomics data

BACKGROUND: Antimicrobial resistance (AMR) is a global health concern. High-throughput metagenomic sequencing of microbial samples enables profiling of AMR genes through comparison with curated AMR databases. However, the performance of current methods is often hampered by database incompleteness an...

Descripción completa

Detalles Bibliográficos
Autores principales: Marini, Simone, Oliva, Marco, Slizovskiy, Ilya B, Das, Rishabh A, Noyes, Noelle Robertson, Kahveci, Tamer, Boucher, Christina, Prosperi, Mattia
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9116207/
https://www.ncbi.nlm.nih.gov/pubmed/35583675
http://dx.doi.org/10.1093/gigascience/giac029
_version_ 1784710069891366912
author Marini, Simone
Oliva, Marco
Slizovskiy, Ilya B
Das, Rishabh A
Noyes, Noelle Robertson
Kahveci, Tamer
Boucher, Christina
Prosperi, Mattia
author_facet Marini, Simone
Oliva, Marco
Slizovskiy, Ilya B
Das, Rishabh A
Noyes, Noelle Robertson
Kahveci, Tamer
Boucher, Christina
Prosperi, Mattia
author_sort Marini, Simone
collection PubMed
description BACKGROUND: Antimicrobial resistance (AMR) is a global health concern. High-throughput metagenomic sequencing of microbial samples enables profiling of AMR genes through comparison with curated AMR databases. However, the performance of current methods is often hampered by database incompleteness and the presence of homology/homoplasy with other non-AMR genes in sequenced samples. RESULTS: We present AMR-meta, a database-free and alignment-free approach, based on k-mers, which combines algebraic matrix factorization into metafeatures with regularized regression. Metafeatures capture multi-level gene diversity across the main antibiotic classes. AMR-meta takes in reads from metagenomic shotgun sequencing and outputs predictions about whether those reads contribute to resistance against specific classes of antibiotics. In addition, AMR-meta uses an augmented training strategy that joins an AMR gene database with non-AMR genes (used as negative examples). We compare AMR-meta with AMRPlusPlus, DeepARG, and Meta-MARC, further testing their ensemble via a voting system. In cross-validation, AMR-meta has a median f-score of 0.7 (interquartile range, 0.2–0.9). On semi-synthetic metagenomic data—external test—on average AMR-meta yields a 1.3-fold hit rate increase over existing methods. In terms of run-time, AMR-meta is 3 times faster than DeepARG, 30 times faster than Meta-MARC, and as fast as AMRPlusPlus. Finally, we note that differences in AMR ontologies and observed variance of all tools in classification outputs call for further development on standardization of benchmarking data and protocols. CONCLUSIONS: AMR-meta is a fast, accurate classifier that exploits non-AMR negative sets to improve sensitivity and specificity. The differences in AMR ontologies and the high variance of all tools in classification outputs call for the deployment of standard benchmarking data and protocols, to fairly compare AMR prediction tools.
format Online
Article
Text
id pubmed-9116207
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-91162072022-05-19 AMR-meta: a k-mer and metafeature approach to classify antimicrobial resistance from high-throughput short-read metagenomics data Marini, Simone Oliva, Marco Slizovskiy, Ilya B Das, Rishabh A Noyes, Noelle Robertson Kahveci, Tamer Boucher, Christina Prosperi, Mattia Gigascience Research BACKGROUND: Antimicrobial resistance (AMR) is a global health concern. High-throughput metagenomic sequencing of microbial samples enables profiling of AMR genes through comparison with curated AMR databases. However, the performance of current methods is often hampered by database incompleteness and the presence of homology/homoplasy with other non-AMR genes in sequenced samples. RESULTS: We present AMR-meta, a database-free and alignment-free approach, based on k-mers, which combines algebraic matrix factorization into metafeatures with regularized regression. Metafeatures capture multi-level gene diversity across the main antibiotic classes. AMR-meta takes in reads from metagenomic shotgun sequencing and outputs predictions about whether those reads contribute to resistance against specific classes of antibiotics. In addition, AMR-meta uses an augmented training strategy that joins an AMR gene database with non-AMR genes (used as negative examples). We compare AMR-meta with AMRPlusPlus, DeepARG, and Meta-MARC, further testing their ensemble via a voting system. In cross-validation, AMR-meta has a median f-score of 0.7 (interquartile range, 0.2–0.9). On semi-synthetic metagenomic data—external test—on average AMR-meta yields a 1.3-fold hit rate increase over existing methods. In terms of run-time, AMR-meta is 3 times faster than DeepARG, 30 times faster than Meta-MARC, and as fast as AMRPlusPlus. Finally, we note that differences in AMR ontologies and observed variance of all tools in classification outputs call for further development on standardization of benchmarking data and protocols. CONCLUSIONS: AMR-meta is a fast, accurate classifier that exploits non-AMR negative sets to improve sensitivity and specificity. The differences in AMR ontologies and the high variance of all tools in classification outputs call for the deployment of standard benchmarking data and protocols, to fairly compare AMR prediction tools. Oxford University Press 2022-05-18 /pmc/articles/PMC9116207/ /pubmed/35583675 http://dx.doi.org/10.1093/gigascience/giac029 Text en © The Author(s) 2022. Published by Oxford University Press GigaScience. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Marini, Simone
Oliva, Marco
Slizovskiy, Ilya B
Das, Rishabh A
Noyes, Noelle Robertson
Kahveci, Tamer
Boucher, Christina
Prosperi, Mattia
AMR-meta: a k-mer and metafeature approach to classify antimicrobial resistance from high-throughput short-read metagenomics data
title AMR-meta: a k-mer and metafeature approach to classify antimicrobial resistance from high-throughput short-read metagenomics data
title_full AMR-meta: a k-mer and metafeature approach to classify antimicrobial resistance from high-throughput short-read metagenomics data
title_fullStr AMR-meta: a k-mer and metafeature approach to classify antimicrobial resistance from high-throughput short-read metagenomics data
title_full_unstemmed AMR-meta: a k-mer and metafeature approach to classify antimicrobial resistance from high-throughput short-read metagenomics data
title_short AMR-meta: a k-mer and metafeature approach to classify antimicrobial resistance from high-throughput short-read metagenomics data
title_sort amr-meta: a k-mer and metafeature approach to classify antimicrobial resistance from high-throughput short-read metagenomics data
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9116207/
https://www.ncbi.nlm.nih.gov/pubmed/35583675
http://dx.doi.org/10.1093/gigascience/giac029
work_keys_str_mv AT marinisimone amrmetaakmerandmetafeatureapproachtoclassifyantimicrobialresistancefromhighthroughputshortreadmetagenomicsdata
AT olivamarco amrmetaakmerandmetafeatureapproachtoclassifyantimicrobialresistancefromhighthroughputshortreadmetagenomicsdata
AT slizovskiyilyab amrmetaakmerandmetafeatureapproachtoclassifyantimicrobialresistancefromhighthroughputshortreadmetagenomicsdata
AT dasrishabha amrmetaakmerandmetafeatureapproachtoclassifyantimicrobialresistancefromhighthroughputshortreadmetagenomicsdata
AT noyesnoellerobertson amrmetaakmerandmetafeatureapproachtoclassifyantimicrobialresistancefromhighthroughputshortreadmetagenomicsdata
AT kahvecitamer amrmetaakmerandmetafeatureapproachtoclassifyantimicrobialresistancefromhighthroughputshortreadmetagenomicsdata
AT boucherchristina amrmetaakmerandmetafeatureapproachtoclassifyantimicrobialresistancefromhighthroughputshortreadmetagenomicsdata
AT prosperimattia amrmetaakmerandmetafeatureapproachtoclassifyantimicrobialresistancefromhighthroughputshortreadmetagenomicsdata