Cargando…

TB-ML—a framework for comparing machine learning approaches to predict drug resistance of Mycobacterium tuberculosis

MOTIVATION: Machine learning (ML) has shown impressive performance in predicting antimicrobial resistance (AMR) from sequence data, including for Mycobacterium tuberculosis, the causative agent of tuberculosis. However, current ML development and publication practices make it difficult for researche...

Descripción completa

Detalles Bibliográficos
Autores principales: Libiseller-Egger, Julian, Wang, Linfeng, Deelder, Wouter, Campino, Susana, Clark, Taane G, Phelan, Jody E
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10074023/
https://www.ncbi.nlm.nih.gov/pubmed/37033466
http://dx.doi.org/10.1093/bioadv/vbad040
_version_ 1785019688895381504
author Libiseller-Egger, Julian
Wang, Linfeng
Deelder, Wouter
Campino, Susana
Clark, Taane G
Phelan, Jody E
author_facet Libiseller-Egger, Julian
Wang, Linfeng
Deelder, Wouter
Campino, Susana
Clark, Taane G
Phelan, Jody E
author_sort Libiseller-Egger, Julian
collection PubMed
description MOTIVATION: Machine learning (ML) has shown impressive performance in predicting antimicrobial resistance (AMR) from sequence data, including for Mycobacterium tuberculosis, the causative agent of tuberculosis. However, current ML development and publication practices make it difficult for researchers and clinicians to use, test or reproduce published models. RESULTS: We packaged a number of published and unpublished ML models for predicting AMR of M.tuberculosis into Docker containers. Similarly, the pipelines required for pre-processing genomic data into the formats required by the models were also packaged into separate containers. By following a minimal container I/O standard, we ensured as much interoperability as possible. We also created a command-line application, TB-ML, which can be used to easily combine pre-processing and prediction containers into complete pipelines ready for predicting resistance from novel, raw data with a single command. As long as there is adherence to this minimal standard for the container interface, containers produced by researchers holding new models can likewise be included in these pipelines, making benchmark comparisons of different models simple and facilitating faster uptake in the clinic. AVAILABILITY AND IMPLEMENTATION: TB-ML contains a simple Docker API written in Python and is available at https://github.com/jodyphelan/tb-ml. Example Docker containers for resistance prediction and corresponding data pre-processing as well as a tutorial on how to create new containers for TB-ML are available at https://tb-ml.github.io/tb-ml-containers/. CONTACT: jody.phelan@lshtm.ac.uk
format Online
Article
Text
id pubmed-10074023
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-100740232023-04-06 TB-ML—a framework for comparing machine learning approaches to predict drug resistance of Mycobacterium tuberculosis Libiseller-Egger, Julian Wang, Linfeng Deelder, Wouter Campino, Susana Clark, Taane G Phelan, Jody E Bioinform Adv Application Note MOTIVATION: Machine learning (ML) has shown impressive performance in predicting antimicrobial resistance (AMR) from sequence data, including for Mycobacterium tuberculosis, the causative agent of tuberculosis. However, current ML development and publication practices make it difficult for researchers and clinicians to use, test or reproduce published models. RESULTS: We packaged a number of published and unpublished ML models for predicting AMR of M.tuberculosis into Docker containers. Similarly, the pipelines required for pre-processing genomic data into the formats required by the models were also packaged into separate containers. By following a minimal container I/O standard, we ensured as much interoperability as possible. We also created a command-line application, TB-ML, which can be used to easily combine pre-processing and prediction containers into complete pipelines ready for predicting resistance from novel, raw data with a single command. As long as there is adherence to this minimal standard for the container interface, containers produced by researchers holding new models can likewise be included in these pipelines, making benchmark comparisons of different models simple and facilitating faster uptake in the clinic. AVAILABILITY AND IMPLEMENTATION: TB-ML contains a simple Docker API written in Python and is available at https://github.com/jodyphelan/tb-ml. Example Docker containers for resistance prediction and corresponding data pre-processing as well as a tutorial on how to create new containers for TB-ML are available at https://tb-ml.github.io/tb-ml-containers/. CONTACT: jody.phelan@lshtm.ac.uk Oxford University Press 2023-03-23 /pmc/articles/PMC10074023/ /pubmed/37033466 http://dx.doi.org/10.1093/bioadv/vbad040 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Application Note
Libiseller-Egger, Julian
Wang, Linfeng
Deelder, Wouter
Campino, Susana
Clark, Taane G
Phelan, Jody E
TB-ML—a framework for comparing machine learning approaches to predict drug resistance of Mycobacterium tuberculosis
title TB-ML—a framework for comparing machine learning approaches to predict drug resistance of Mycobacterium tuberculosis
title_full TB-ML—a framework for comparing machine learning approaches to predict drug resistance of Mycobacterium tuberculosis
title_fullStr TB-ML—a framework for comparing machine learning approaches to predict drug resistance of Mycobacterium tuberculosis
title_full_unstemmed TB-ML—a framework for comparing machine learning approaches to predict drug resistance of Mycobacterium tuberculosis
title_short TB-ML—a framework for comparing machine learning approaches to predict drug resistance of Mycobacterium tuberculosis
title_sort tb-ml—a framework for comparing machine learning approaches to predict drug resistance of mycobacterium tuberculosis
topic Application Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10074023/
https://www.ncbi.nlm.nih.gov/pubmed/37033466
http://dx.doi.org/10.1093/bioadv/vbad040
work_keys_str_mv AT libisellereggerjulian tbmlaframeworkforcomparingmachinelearningapproachestopredictdrugresistanceofmycobacteriumtuberculosis
AT wanglinfeng tbmlaframeworkforcomparingmachinelearningapproachestopredictdrugresistanceofmycobacteriumtuberculosis
AT deelderwouter tbmlaframeworkforcomparingmachinelearningapproachestopredictdrugresistanceofmycobacteriumtuberculosis
AT campinosusana tbmlaframeworkforcomparingmachinelearningapproachestopredictdrugresistanceofmycobacteriumtuberculosis
AT clarktaaneg tbmlaframeworkforcomparingmachinelearningapproachestopredictdrugresistanceofmycobacteriumtuberculosis
AT phelanjodye tbmlaframeworkforcomparingmachinelearningapproachestopredictdrugresistanceofmycobacteriumtuberculosis