Cargando…
Automatic disease prediction from human gut metagenomic data using boosting GraphSAGE
BACKGROUND: The human microbiome plays a critical role in maintaining human health. Due to the recent advances in high-throughput sequencing technologies, the microbiome profiles present in the human body have become publicly available. Hence, many works have been done to analyze human microbiome pr...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10067187/ https://www.ncbi.nlm.nih.gov/pubmed/37003965 http://dx.doi.org/10.1186/s12859-023-05251-x |
_version_ | 1785018413038436352 |
---|---|
author | Syama, K. Jothi, J. Angel Arul Khanna, Namita |
author_facet | Syama, K. Jothi, J. Angel Arul Khanna, Namita |
author_sort | Syama, K. |
collection | PubMed |
description | BACKGROUND: The human microbiome plays a critical role in maintaining human health. Due to the recent advances in high-throughput sequencing technologies, the microbiome profiles present in the human body have become publicly available. Hence, many works have been done to analyze human microbiome profiles. These works have identified that different microbiome profiles are present in healthy and sick individuals for different diseases. Recently, several computational methods have utilized the microbiome profiles to automatically diagnose and classify the host phenotype. RESULTS: In this work, a novel deep learning framework based on boosting GraphSAGE is proposed for automatic prediction of diseases from metagenomic data. The proposed framework has two main components, (a). Metagenomic Disease graph (MD-graph) construction module, (b). Disease prediction Network (DP-Net) module. The graph construction module constructs a graph by considering each metagenomic sample as a node in the graph. The graph captures the relationship between the samples using a proximity measure. The DP-Net consists of a boosting GraphSAGE model which predicts the status of a sample as sick or healthy. The effectiveness of the proposed method is verified using real and synthetic datasets corresponding to diseases like inflammatory bowel disease and colorectal cancer. The proposed model achieved a highest AUC of 93%, Accuracy of 95%, F1-score of 95%, AUPRC of 95% for the real inflammatory bowel disease dataset and a best AUC of 90%, Accuracy of 91%, F1-score of 87% and AUPRC of 93% for the real colorectal cancer dataset. CONCLUSION: The proposed framework outperforms other machine learning and deep learning models in terms of classification accuracy, AUC, F1-score and AUPRC for both synthetic and real metagenomic data. |
format | Online Article Text |
id | pubmed-10067187 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-100671872023-04-03 Automatic disease prediction from human gut metagenomic data using boosting GraphSAGE Syama, K. Jothi, J. Angel Arul Khanna, Namita BMC Bioinformatics Research BACKGROUND: The human microbiome plays a critical role in maintaining human health. Due to the recent advances in high-throughput sequencing technologies, the microbiome profiles present in the human body have become publicly available. Hence, many works have been done to analyze human microbiome profiles. These works have identified that different microbiome profiles are present in healthy and sick individuals for different diseases. Recently, several computational methods have utilized the microbiome profiles to automatically diagnose and classify the host phenotype. RESULTS: In this work, a novel deep learning framework based on boosting GraphSAGE is proposed for automatic prediction of diseases from metagenomic data. The proposed framework has two main components, (a). Metagenomic Disease graph (MD-graph) construction module, (b). Disease prediction Network (DP-Net) module. The graph construction module constructs a graph by considering each metagenomic sample as a node in the graph. The graph captures the relationship between the samples using a proximity measure. The DP-Net consists of a boosting GraphSAGE model which predicts the status of a sample as sick or healthy. The effectiveness of the proposed method is verified using real and synthetic datasets corresponding to diseases like inflammatory bowel disease and colorectal cancer. The proposed model achieved a highest AUC of 93%, Accuracy of 95%, F1-score of 95%, AUPRC of 95% for the real inflammatory bowel disease dataset and a best AUC of 90%, Accuracy of 91%, F1-score of 87% and AUPRC of 93% for the real colorectal cancer dataset. CONCLUSION: The proposed framework outperforms other machine learning and deep learning models in terms of classification accuracy, AUC, F1-score and AUPRC for both synthetic and real metagenomic data. BioMed Central 2023-03-31 /pmc/articles/PMC10067187/ /pubmed/37003965 http://dx.doi.org/10.1186/s12859-023-05251-x Text en © The Author(s) 2023, corrected publication 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Syama, K. Jothi, J. Angel Arul Khanna, Namita Automatic disease prediction from human gut metagenomic data using boosting GraphSAGE |
title | Automatic disease prediction from human gut metagenomic data using boosting GraphSAGE |
title_full | Automatic disease prediction from human gut metagenomic data using boosting GraphSAGE |
title_fullStr | Automatic disease prediction from human gut metagenomic data using boosting GraphSAGE |
title_full_unstemmed | Automatic disease prediction from human gut metagenomic data using boosting GraphSAGE |
title_short | Automatic disease prediction from human gut metagenomic data using boosting GraphSAGE |
title_sort | automatic disease prediction from human gut metagenomic data using boosting graphsage |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10067187/ https://www.ncbi.nlm.nih.gov/pubmed/37003965 http://dx.doi.org/10.1186/s12859-023-05251-x |
work_keys_str_mv | AT syamak automaticdiseasepredictionfromhumangutmetagenomicdatausingboostinggraphsage AT jothijangelarul automaticdiseasepredictionfromhumangutmetagenomicdatausingboostinggraphsage AT khannanamita automaticdiseasepredictionfromhumangutmetagenomicdatausingboostinggraphsage |