Cargando…

Faecal microbiome-based machine learning for multi-class disease diagnosis

Systemic characterisation of the human faecal microbiome provides the opportunity to develop non-invasive approaches in the diagnosis of a major human disease. However, shared microbial signatures across different diseases make accurate diagnosis challenging in single-disease models. Herein, we pres...

Descripción completa

Detalles Bibliográficos
Autores principales: Su, Qi, Liu, Qin, Lau, Raphaela Iris, Zhang, Jingwan, Xu, Zhilu, Yeoh, Yun Kit, Leung, Thomas W. H., Tang, Whitney, Zhang, Lin, Liang, Jessie Q. Y., Yau, Yuk Kam, Zheng, Jiaying, Liu, Chengyu, Zhang, Mengjing, Cheung, Chun Pan, Ching, Jessica Y. L., Tun, Hein M., Yu, Jun, Chan, Francis K. L., Ng, Siew C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9649010/
https://www.ncbi.nlm.nih.gov/pubmed/36357393
http://dx.doi.org/10.1038/s41467-022-34405-3
_version_ 1784827706927480832
author Su, Qi
Liu, Qin
Lau, Raphaela Iris
Zhang, Jingwan
Xu, Zhilu
Yeoh, Yun Kit
Leung, Thomas W. H.
Tang, Whitney
Zhang, Lin
Liang, Jessie Q. Y.
Yau, Yuk Kam
Zheng, Jiaying
Liu, Chengyu
Zhang, Mengjing
Cheung, Chun Pan
Ching, Jessica Y. L.
Tun, Hein M.
Yu, Jun
Chan, Francis K. L.
Ng, Siew C.
author_facet Su, Qi
Liu, Qin
Lau, Raphaela Iris
Zhang, Jingwan
Xu, Zhilu
Yeoh, Yun Kit
Leung, Thomas W. H.
Tang, Whitney
Zhang, Lin
Liang, Jessie Q. Y.
Yau, Yuk Kam
Zheng, Jiaying
Liu, Chengyu
Zhang, Mengjing
Cheung, Chun Pan
Ching, Jessica Y. L.
Tun, Hein M.
Yu, Jun
Chan, Francis K. L.
Ng, Siew C.
author_sort Su, Qi
collection PubMed
description Systemic characterisation of the human faecal microbiome provides the opportunity to develop non-invasive approaches in the diagnosis of a major human disease. However, shared microbial signatures across different diseases make accurate diagnosis challenging in single-disease models. Herein, we present a machine-learning multi-class model using faecal metagenomic dataset of 2,320 individuals with nine well-characterised phenotypes, including colorectal cancer, colorectal adenomas, Crohn’s disease, ulcerative colitis, irritable bowel syndrome, obesity, cardiovascular disease, post-acute COVID-19 syndrome and healthy individuals. Our processed data covers 325 microbial species derived from 14.3 terabytes of sequence. The trained model achieves an area under the receiver operating characteristic curve (AUROC) of 0.90 to 0.99 (Interquartile range, IQR, 0.91–0.94) in predicting different diseases in the independent test set, with a sensitivity of 0.81 to 0.95 (IQR, 0.87–0.93) at a specificity of 0.76 to 0.98 (IQR 0.83–0.95). Metagenomic analysis from public datasets of 1,597 samples across different populations observes comparable predictions with AUROC of 0.69 to 0.91 (IQR 0.79–0.87). Correlation of the top 50 microbial species with disease phenotypes identifies 363 significant associations (FDR < 0.05). This microbiome-based multi-disease model has potential clinical application in disease diagnostics and treatment response monitoring and warrants further exploration.
format Online
Article
Text
id pubmed-9649010
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-96490102022-11-14 Faecal microbiome-based machine learning for multi-class disease diagnosis Su, Qi Liu, Qin Lau, Raphaela Iris Zhang, Jingwan Xu, Zhilu Yeoh, Yun Kit Leung, Thomas W. H. Tang, Whitney Zhang, Lin Liang, Jessie Q. Y. Yau, Yuk Kam Zheng, Jiaying Liu, Chengyu Zhang, Mengjing Cheung, Chun Pan Ching, Jessica Y. L. Tun, Hein M. Yu, Jun Chan, Francis K. L. Ng, Siew C. Nat Commun Article Systemic characterisation of the human faecal microbiome provides the opportunity to develop non-invasive approaches in the diagnosis of a major human disease. However, shared microbial signatures across different diseases make accurate diagnosis challenging in single-disease models. Herein, we present a machine-learning multi-class model using faecal metagenomic dataset of 2,320 individuals with nine well-characterised phenotypes, including colorectal cancer, colorectal adenomas, Crohn’s disease, ulcerative colitis, irritable bowel syndrome, obesity, cardiovascular disease, post-acute COVID-19 syndrome and healthy individuals. Our processed data covers 325 microbial species derived from 14.3 terabytes of sequence. The trained model achieves an area under the receiver operating characteristic curve (AUROC) of 0.90 to 0.99 (Interquartile range, IQR, 0.91–0.94) in predicting different diseases in the independent test set, with a sensitivity of 0.81 to 0.95 (IQR, 0.87–0.93) at a specificity of 0.76 to 0.98 (IQR 0.83–0.95). Metagenomic analysis from public datasets of 1,597 samples across different populations observes comparable predictions with AUROC of 0.69 to 0.91 (IQR 0.79–0.87). Correlation of the top 50 microbial species with disease phenotypes identifies 363 significant associations (FDR < 0.05). This microbiome-based multi-disease model has potential clinical application in disease diagnostics and treatment response monitoring and warrants further exploration. Nature Publishing Group UK 2022-11-10 /pmc/articles/PMC9649010/ /pubmed/36357393 http://dx.doi.org/10.1038/s41467-022-34405-3 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Su, Qi
Liu, Qin
Lau, Raphaela Iris
Zhang, Jingwan
Xu, Zhilu
Yeoh, Yun Kit
Leung, Thomas W. H.
Tang, Whitney
Zhang, Lin
Liang, Jessie Q. Y.
Yau, Yuk Kam
Zheng, Jiaying
Liu, Chengyu
Zhang, Mengjing
Cheung, Chun Pan
Ching, Jessica Y. L.
Tun, Hein M.
Yu, Jun
Chan, Francis K. L.
Ng, Siew C.
Faecal microbiome-based machine learning for multi-class disease diagnosis
title Faecal microbiome-based machine learning for multi-class disease diagnosis
title_full Faecal microbiome-based machine learning for multi-class disease diagnosis
title_fullStr Faecal microbiome-based machine learning for multi-class disease diagnosis
title_full_unstemmed Faecal microbiome-based machine learning for multi-class disease diagnosis
title_short Faecal microbiome-based machine learning for multi-class disease diagnosis
title_sort faecal microbiome-based machine learning for multi-class disease diagnosis
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9649010/
https://www.ncbi.nlm.nih.gov/pubmed/36357393
http://dx.doi.org/10.1038/s41467-022-34405-3
work_keys_str_mv AT suqi faecalmicrobiomebasedmachinelearningformulticlassdiseasediagnosis
AT liuqin faecalmicrobiomebasedmachinelearningformulticlassdiseasediagnosis
AT lauraphaelairis faecalmicrobiomebasedmachinelearningformulticlassdiseasediagnosis
AT zhangjingwan faecalmicrobiomebasedmachinelearningformulticlassdiseasediagnosis
AT xuzhilu faecalmicrobiomebasedmachinelearningformulticlassdiseasediagnosis
AT yeohyunkit faecalmicrobiomebasedmachinelearningformulticlassdiseasediagnosis
AT leungthomaswh faecalmicrobiomebasedmachinelearningformulticlassdiseasediagnosis
AT tangwhitney faecalmicrobiomebasedmachinelearningformulticlassdiseasediagnosis
AT zhanglin faecalmicrobiomebasedmachinelearningformulticlassdiseasediagnosis
AT liangjessieqy faecalmicrobiomebasedmachinelearningformulticlassdiseasediagnosis
AT yauyukkam faecalmicrobiomebasedmachinelearningformulticlassdiseasediagnosis
AT zhengjiaying faecalmicrobiomebasedmachinelearningformulticlassdiseasediagnosis
AT liuchengyu faecalmicrobiomebasedmachinelearningformulticlassdiseasediagnosis
AT zhangmengjing faecalmicrobiomebasedmachinelearningformulticlassdiseasediagnosis
AT cheungchunpan faecalmicrobiomebasedmachinelearningformulticlassdiseasediagnosis
AT chingjessicayl faecalmicrobiomebasedmachinelearningformulticlassdiseasediagnosis
AT tunheinm faecalmicrobiomebasedmachinelearningformulticlassdiseasediagnosis
AT yujun faecalmicrobiomebasedmachinelearningformulticlassdiseasediagnosis
AT chanfranciskl faecalmicrobiomebasedmachinelearningformulticlassdiseasediagnosis
AT ngsiewc faecalmicrobiomebasedmachinelearningformulticlassdiseasediagnosis