Cargando…
TB-ML—a framework for comparing machine learning approaches to predict drug resistance of Mycobacterium tuberculosis
MOTIVATION: Machine learning (ML) has shown impressive performance in predicting antimicrobial resistance (AMR) from sequence data, including for Mycobacterium tuberculosis, the causative agent of tuberculosis. However, current ML development and publication practices make it difficult for researche...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10074023/ https://www.ncbi.nlm.nih.gov/pubmed/37033466 http://dx.doi.org/10.1093/bioadv/vbad040 |
_version_ | 1785019688895381504 |
---|---|
author | Libiseller-Egger, Julian Wang, Linfeng Deelder, Wouter Campino, Susana Clark, Taane G Phelan, Jody E |
author_facet | Libiseller-Egger, Julian Wang, Linfeng Deelder, Wouter Campino, Susana Clark, Taane G Phelan, Jody E |
author_sort | Libiseller-Egger, Julian |
collection | PubMed |
description | MOTIVATION: Machine learning (ML) has shown impressive performance in predicting antimicrobial resistance (AMR) from sequence data, including for Mycobacterium tuberculosis, the causative agent of tuberculosis. However, current ML development and publication practices make it difficult for researchers and clinicians to use, test or reproduce published models. RESULTS: We packaged a number of published and unpublished ML models for predicting AMR of M.tuberculosis into Docker containers. Similarly, the pipelines required for pre-processing genomic data into the formats required by the models were also packaged into separate containers. By following a minimal container I/O standard, we ensured as much interoperability as possible. We also created a command-line application, TB-ML, which can be used to easily combine pre-processing and prediction containers into complete pipelines ready for predicting resistance from novel, raw data with a single command. As long as there is adherence to this minimal standard for the container interface, containers produced by researchers holding new models can likewise be included in these pipelines, making benchmark comparisons of different models simple and facilitating faster uptake in the clinic. AVAILABILITY AND IMPLEMENTATION: TB-ML contains a simple Docker API written in Python and is available at https://github.com/jodyphelan/tb-ml. Example Docker containers for resistance prediction and corresponding data pre-processing as well as a tutorial on how to create new containers for TB-ML are available at https://tb-ml.github.io/tb-ml-containers/. CONTACT: jody.phelan@lshtm.ac.uk |
format | Online Article Text |
id | pubmed-10074023 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-100740232023-04-06 TB-ML—a framework for comparing machine learning approaches to predict drug resistance of Mycobacterium tuberculosis Libiseller-Egger, Julian Wang, Linfeng Deelder, Wouter Campino, Susana Clark, Taane G Phelan, Jody E Bioinform Adv Application Note MOTIVATION: Machine learning (ML) has shown impressive performance in predicting antimicrobial resistance (AMR) from sequence data, including for Mycobacterium tuberculosis, the causative agent of tuberculosis. However, current ML development and publication practices make it difficult for researchers and clinicians to use, test or reproduce published models. RESULTS: We packaged a number of published and unpublished ML models for predicting AMR of M.tuberculosis into Docker containers. Similarly, the pipelines required for pre-processing genomic data into the formats required by the models were also packaged into separate containers. By following a minimal container I/O standard, we ensured as much interoperability as possible. We also created a command-line application, TB-ML, which can be used to easily combine pre-processing and prediction containers into complete pipelines ready for predicting resistance from novel, raw data with a single command. As long as there is adherence to this minimal standard for the container interface, containers produced by researchers holding new models can likewise be included in these pipelines, making benchmark comparisons of different models simple and facilitating faster uptake in the clinic. AVAILABILITY AND IMPLEMENTATION: TB-ML contains a simple Docker API written in Python and is available at https://github.com/jodyphelan/tb-ml. Example Docker containers for resistance prediction and corresponding data pre-processing as well as a tutorial on how to create new containers for TB-ML are available at https://tb-ml.github.io/tb-ml-containers/. CONTACT: jody.phelan@lshtm.ac.uk Oxford University Press 2023-03-23 /pmc/articles/PMC10074023/ /pubmed/37033466 http://dx.doi.org/10.1093/bioadv/vbad040 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Application Note Libiseller-Egger, Julian Wang, Linfeng Deelder, Wouter Campino, Susana Clark, Taane G Phelan, Jody E TB-ML—a framework for comparing machine learning approaches to predict drug resistance of Mycobacterium tuberculosis |
title | TB-ML—a framework for comparing machine learning approaches to predict drug resistance of Mycobacterium tuberculosis |
title_full | TB-ML—a framework for comparing machine learning approaches to predict drug resistance of Mycobacterium tuberculosis |
title_fullStr | TB-ML—a framework for comparing machine learning approaches to predict drug resistance of Mycobacterium tuberculosis |
title_full_unstemmed | TB-ML—a framework for comparing machine learning approaches to predict drug resistance of Mycobacterium tuberculosis |
title_short | TB-ML—a framework for comparing machine learning approaches to predict drug resistance of Mycobacterium tuberculosis |
title_sort | tb-ml—a framework for comparing machine learning approaches to predict drug resistance of mycobacterium tuberculosis |
topic | Application Note |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10074023/ https://www.ncbi.nlm.nih.gov/pubmed/37033466 http://dx.doi.org/10.1093/bioadv/vbad040 |
work_keys_str_mv | AT libisellereggerjulian tbmlaframeworkforcomparingmachinelearningapproachestopredictdrugresistanceofmycobacteriumtuberculosis AT wanglinfeng tbmlaframeworkforcomparingmachinelearningapproachestopredictdrugresistanceofmycobacteriumtuberculosis AT deelderwouter tbmlaframeworkforcomparingmachinelearningapproachestopredictdrugresistanceofmycobacteriumtuberculosis AT campinosusana tbmlaframeworkforcomparingmachinelearningapproachestopredictdrugresistanceofmycobacteriumtuberculosis AT clarktaaneg tbmlaframeworkforcomparingmachinelearningapproachestopredictdrugresistanceofmycobacteriumtuberculosis AT phelanjodye tbmlaframeworkforcomparingmachinelearningapproachestopredictdrugresistanceofmycobacteriumtuberculosis |