Cargando…
Faecal microbiome-based machine learning for multi-class disease diagnosis
Systemic characterisation of the human faecal microbiome provides the opportunity to develop non-invasive approaches in the diagnosis of a major human disease. However, shared microbial signatures across different diseases make accurate diagnosis challenging in single-disease models. Herein, we pres...
Autores principales: | , , , , , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9649010/ https://www.ncbi.nlm.nih.gov/pubmed/36357393 http://dx.doi.org/10.1038/s41467-022-34405-3 |
_version_ | 1784827706927480832 |
---|---|
author | Su, Qi Liu, Qin Lau, Raphaela Iris Zhang, Jingwan Xu, Zhilu Yeoh, Yun Kit Leung, Thomas W. H. Tang, Whitney Zhang, Lin Liang, Jessie Q. Y. Yau, Yuk Kam Zheng, Jiaying Liu, Chengyu Zhang, Mengjing Cheung, Chun Pan Ching, Jessica Y. L. Tun, Hein M. Yu, Jun Chan, Francis K. L. Ng, Siew C. |
author_facet | Su, Qi Liu, Qin Lau, Raphaela Iris Zhang, Jingwan Xu, Zhilu Yeoh, Yun Kit Leung, Thomas W. H. Tang, Whitney Zhang, Lin Liang, Jessie Q. Y. Yau, Yuk Kam Zheng, Jiaying Liu, Chengyu Zhang, Mengjing Cheung, Chun Pan Ching, Jessica Y. L. Tun, Hein M. Yu, Jun Chan, Francis K. L. Ng, Siew C. |
author_sort | Su, Qi |
collection | PubMed |
description | Systemic characterisation of the human faecal microbiome provides the opportunity to develop non-invasive approaches in the diagnosis of a major human disease. However, shared microbial signatures across different diseases make accurate diagnosis challenging in single-disease models. Herein, we present a machine-learning multi-class model using faecal metagenomic dataset of 2,320 individuals with nine well-characterised phenotypes, including colorectal cancer, colorectal adenomas, Crohn’s disease, ulcerative colitis, irritable bowel syndrome, obesity, cardiovascular disease, post-acute COVID-19 syndrome and healthy individuals. Our processed data covers 325 microbial species derived from 14.3 terabytes of sequence. The trained model achieves an area under the receiver operating characteristic curve (AUROC) of 0.90 to 0.99 (Interquartile range, IQR, 0.91–0.94) in predicting different diseases in the independent test set, with a sensitivity of 0.81 to 0.95 (IQR, 0.87–0.93) at a specificity of 0.76 to 0.98 (IQR 0.83–0.95). Metagenomic analysis from public datasets of 1,597 samples across different populations observes comparable predictions with AUROC of 0.69 to 0.91 (IQR 0.79–0.87). Correlation of the top 50 microbial species with disease phenotypes identifies 363 significant associations (FDR < 0.05). This microbiome-based multi-disease model has potential clinical application in disease diagnostics and treatment response monitoring and warrants further exploration. |
format | Online Article Text |
id | pubmed-9649010 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-96490102022-11-14 Faecal microbiome-based machine learning for multi-class disease diagnosis Su, Qi Liu, Qin Lau, Raphaela Iris Zhang, Jingwan Xu, Zhilu Yeoh, Yun Kit Leung, Thomas W. H. Tang, Whitney Zhang, Lin Liang, Jessie Q. Y. Yau, Yuk Kam Zheng, Jiaying Liu, Chengyu Zhang, Mengjing Cheung, Chun Pan Ching, Jessica Y. L. Tun, Hein M. Yu, Jun Chan, Francis K. L. Ng, Siew C. Nat Commun Article Systemic characterisation of the human faecal microbiome provides the opportunity to develop non-invasive approaches in the diagnosis of a major human disease. However, shared microbial signatures across different diseases make accurate diagnosis challenging in single-disease models. Herein, we present a machine-learning multi-class model using faecal metagenomic dataset of 2,320 individuals with nine well-characterised phenotypes, including colorectal cancer, colorectal adenomas, Crohn’s disease, ulcerative colitis, irritable bowel syndrome, obesity, cardiovascular disease, post-acute COVID-19 syndrome and healthy individuals. Our processed data covers 325 microbial species derived from 14.3 terabytes of sequence. The trained model achieves an area under the receiver operating characteristic curve (AUROC) of 0.90 to 0.99 (Interquartile range, IQR, 0.91–0.94) in predicting different diseases in the independent test set, with a sensitivity of 0.81 to 0.95 (IQR, 0.87–0.93) at a specificity of 0.76 to 0.98 (IQR 0.83–0.95). Metagenomic analysis from public datasets of 1,597 samples across different populations observes comparable predictions with AUROC of 0.69 to 0.91 (IQR 0.79–0.87). Correlation of the top 50 microbial species with disease phenotypes identifies 363 significant associations (FDR < 0.05). This microbiome-based multi-disease model has potential clinical application in disease diagnostics and treatment response monitoring and warrants further exploration. Nature Publishing Group UK 2022-11-10 /pmc/articles/PMC9649010/ /pubmed/36357393 http://dx.doi.org/10.1038/s41467-022-34405-3 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Su, Qi Liu, Qin Lau, Raphaela Iris Zhang, Jingwan Xu, Zhilu Yeoh, Yun Kit Leung, Thomas W. H. Tang, Whitney Zhang, Lin Liang, Jessie Q. Y. Yau, Yuk Kam Zheng, Jiaying Liu, Chengyu Zhang, Mengjing Cheung, Chun Pan Ching, Jessica Y. L. Tun, Hein M. Yu, Jun Chan, Francis K. L. Ng, Siew C. Faecal microbiome-based machine learning for multi-class disease diagnosis |
title | Faecal microbiome-based machine learning for multi-class disease diagnosis |
title_full | Faecal microbiome-based machine learning for multi-class disease diagnosis |
title_fullStr | Faecal microbiome-based machine learning for multi-class disease diagnosis |
title_full_unstemmed | Faecal microbiome-based machine learning for multi-class disease diagnosis |
title_short | Faecal microbiome-based machine learning for multi-class disease diagnosis |
title_sort | faecal microbiome-based machine learning for multi-class disease diagnosis |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9649010/ https://www.ncbi.nlm.nih.gov/pubmed/36357393 http://dx.doi.org/10.1038/s41467-022-34405-3 |
work_keys_str_mv | AT suqi faecalmicrobiomebasedmachinelearningformulticlassdiseasediagnosis AT liuqin faecalmicrobiomebasedmachinelearningformulticlassdiseasediagnosis AT lauraphaelairis faecalmicrobiomebasedmachinelearningformulticlassdiseasediagnosis AT zhangjingwan faecalmicrobiomebasedmachinelearningformulticlassdiseasediagnosis AT xuzhilu faecalmicrobiomebasedmachinelearningformulticlassdiseasediagnosis AT yeohyunkit faecalmicrobiomebasedmachinelearningformulticlassdiseasediagnosis AT leungthomaswh faecalmicrobiomebasedmachinelearningformulticlassdiseasediagnosis AT tangwhitney faecalmicrobiomebasedmachinelearningformulticlassdiseasediagnosis AT zhanglin faecalmicrobiomebasedmachinelearningformulticlassdiseasediagnosis AT liangjessieqy faecalmicrobiomebasedmachinelearningformulticlassdiseasediagnosis AT yauyukkam faecalmicrobiomebasedmachinelearningformulticlassdiseasediagnosis AT zhengjiaying faecalmicrobiomebasedmachinelearningformulticlassdiseasediagnosis AT liuchengyu faecalmicrobiomebasedmachinelearningformulticlassdiseasediagnosis AT zhangmengjing faecalmicrobiomebasedmachinelearningformulticlassdiseasediagnosis AT cheungchunpan faecalmicrobiomebasedmachinelearningformulticlassdiseasediagnosis AT chingjessicayl faecalmicrobiomebasedmachinelearningformulticlassdiseasediagnosis AT tunheinm faecalmicrobiomebasedmachinelearningformulticlassdiseasediagnosis AT yujun faecalmicrobiomebasedmachinelearningformulticlassdiseasediagnosis AT chanfranciskl faecalmicrobiomebasedmachinelearningformulticlassdiseasediagnosis AT ngsiewc faecalmicrobiomebasedmachinelearningformulticlassdiseasediagnosis |