Cargando…

mAML: an automated machine learning pipeline with a microbiome repository for human disease classification

Due to the concerted efforts to utilize the microbial features to improve disease prediction capabilities, automated machine learning (AutoML) systems aiming to get rid of the tediousness in manually performing ML tasks are in great demand. Here we developed mAML, an ML model-building pipeline, whic...

Descripción completa

Detalles Bibliográficos
Autores principales: Yang, Fenglong, Zou, Quan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7316531/
https://www.ncbi.nlm.nih.gov/pubmed/32588040
http://dx.doi.org/10.1093/database/baaa050
_version_ 1783550450693308416
author Yang, Fenglong
Zou, Quan
author_facet Yang, Fenglong
Zou, Quan
author_sort Yang, Fenglong
collection PubMed
description Due to the concerted efforts to utilize the microbial features to improve disease prediction capabilities, automated machine learning (AutoML) systems aiming to get rid of the tediousness in manually performing ML tasks are in great demand. Here we developed mAML, an ML model-building pipeline, which can automatically and rapidly generate optimized and interpretable models for personalized microbiome-based classification tasks in a reproducible way. The pipeline is deployed on a web-based platform, while the server is user-friendly and flexible and has been designed to be scalable according to the specific requirements. This pipeline exhibits high performance for 13 benchmark datasets including both binary and multi-class classification tasks. In addition, to facilitate the application of mAML and expand the human disease-related microbiome learning repository, we developed GMrepo ML repository (GMrepo Microbiome Learning repository) from the GMrepo database. The repository involves 120 microbiome-based classification tasks for 85 human-disease phenotypes referring to 12 429 metagenomic samples and 38 643 amplicon samples. The mAML pipeline and the GMrepo ML repository are expected to be important resources for researches in microbiology and algorithm developments. Database URL: http://lab.malab.cn/soft/mAML
format Online
Article
Text
id pubmed-7316531
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-73165312020-07-01 mAML: an automated machine learning pipeline with a microbiome repository for human disease classification Yang, Fenglong Zou, Quan Database (Oxford) Database Tool Due to the concerted efforts to utilize the microbial features to improve disease prediction capabilities, automated machine learning (AutoML) systems aiming to get rid of the tediousness in manually performing ML tasks are in great demand. Here we developed mAML, an ML model-building pipeline, which can automatically and rapidly generate optimized and interpretable models for personalized microbiome-based classification tasks in a reproducible way. The pipeline is deployed on a web-based platform, while the server is user-friendly and flexible and has been designed to be scalable according to the specific requirements. This pipeline exhibits high performance for 13 benchmark datasets including both binary and multi-class classification tasks. In addition, to facilitate the application of mAML and expand the human disease-related microbiome learning repository, we developed GMrepo ML repository (GMrepo Microbiome Learning repository) from the GMrepo database. The repository involves 120 microbiome-based classification tasks for 85 human-disease phenotypes referring to 12 429 metagenomic samples and 38 643 amplicon samples. The mAML pipeline and the GMrepo ML repository are expected to be important resources for researches in microbiology and algorithm developments. Database URL: http://lab.malab.cn/soft/mAML Oxford University Press 2020-06-25 /pmc/articles/PMC7316531/ /pubmed/32588040 http://dx.doi.org/10.1093/database/baaa050 Text en © The Author(s) 2020. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Database Tool
Yang, Fenglong
Zou, Quan
mAML: an automated machine learning pipeline with a microbiome repository for human disease classification
title mAML: an automated machine learning pipeline with a microbiome repository for human disease classification
title_full mAML: an automated machine learning pipeline with a microbiome repository for human disease classification
title_fullStr mAML: an automated machine learning pipeline with a microbiome repository for human disease classification
title_full_unstemmed mAML: an automated machine learning pipeline with a microbiome repository for human disease classification
title_short mAML: an automated machine learning pipeline with a microbiome repository for human disease classification
title_sort maml: an automated machine learning pipeline with a microbiome repository for human disease classification
topic Database Tool
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7316531/
https://www.ncbi.nlm.nih.gov/pubmed/32588040
http://dx.doi.org/10.1093/database/baaa050
work_keys_str_mv AT yangfenglong mamlanautomatedmachinelearningpipelinewithamicrobiomerepositoryforhumandiseaseclassification
AT zouquan mamlanautomatedmachinelearningpipelinewithamicrobiomerepositoryforhumandiseaseclassification