Cargando…

Graph Embedding Deep Learning Guides Microbial Biomarkers' Identification

The microbiome-wide association studies are to figure out the relationship between microorganisms and humans, with the goal of discovering relevant biomarkers to guide disease diagnosis. However, the microbiome data is complex, with high noise and dimensions. Traditional machine learning methods are...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhu, Qiang, Jiang, Xingpeng, Zhu, Qing, Pan, Min, He, Tingting
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6883002/
https://www.ncbi.nlm.nih.gov/pubmed/31824573
http://dx.doi.org/10.3389/fgene.2019.01182
_version_ 1783474280838725632
author Zhu, Qiang
Jiang, Xingpeng
Zhu, Qing
Pan, Min
He, Tingting
author_facet Zhu, Qiang
Jiang, Xingpeng
Zhu, Qing
Pan, Min
He, Tingting
author_sort Zhu, Qiang
collection PubMed
description The microbiome-wide association studies are to figure out the relationship between microorganisms and humans, with the goal of discovering relevant biomarkers to guide disease diagnosis. However, the microbiome data is complex, with high noise and dimensions. Traditional machine learning methods are limited by the models' representation ability and cannot learn complex patterns from the data. Recently, deep learning has been widely applied to fields ranging from text processing to image recognition due to its efficient flexibility and high capacity. But the deep learning models must be trained with enough data in order to achieve good performance, which is impractical in reality. In addition, deep learning is considered as black box and hard to interpret. These factors make deep learning not widely used in microbiome-wide association studies. In this work, we construct a sparse microbial interaction network and embed this graph into deep model to alleviate the risk of overfitting and improve the performance. Further, we explore a Graph Embedding Deep Feedforward Network (GEDFN) to conduct feature selection and guide meaningful microbial markers' identification. Based on the experimental results, we verify the feasibility of combining the microbial graph model with the deep learning model, and demonstrate the feasibility of applying deep learning and feature selection on microbial data. Our main contributions are: firstly, we utilize different methods to construct a variety of microbial interaction networks and combine the network via graph embedding deep learning. Secondly, we introduce a feature selection method based on graph embedding and validate the biological meaning of microbial markers. The code is available at https://github.com/MicroAVA/GEDFN.git.
format Online
Article
Text
id pubmed-6883002
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-68830022019-12-10 Graph Embedding Deep Learning Guides Microbial Biomarkers' Identification Zhu, Qiang Jiang, Xingpeng Zhu, Qing Pan, Min He, Tingting Front Genet Genetics The microbiome-wide association studies are to figure out the relationship between microorganisms and humans, with the goal of discovering relevant biomarkers to guide disease diagnosis. However, the microbiome data is complex, with high noise and dimensions. Traditional machine learning methods are limited by the models' representation ability and cannot learn complex patterns from the data. Recently, deep learning has been widely applied to fields ranging from text processing to image recognition due to its efficient flexibility and high capacity. But the deep learning models must be trained with enough data in order to achieve good performance, which is impractical in reality. In addition, deep learning is considered as black box and hard to interpret. These factors make deep learning not widely used in microbiome-wide association studies. In this work, we construct a sparse microbial interaction network and embed this graph into deep model to alleviate the risk of overfitting and improve the performance. Further, we explore a Graph Embedding Deep Feedforward Network (GEDFN) to conduct feature selection and guide meaningful microbial markers' identification. Based on the experimental results, we verify the feasibility of combining the microbial graph model with the deep learning model, and demonstrate the feasibility of applying deep learning and feature selection on microbial data. Our main contributions are: firstly, we utilize different methods to construct a variety of microbial interaction networks and combine the network via graph embedding deep learning. Secondly, we introduce a feature selection method based on graph embedding and validate the biological meaning of microbial markers. The code is available at https://github.com/MicroAVA/GEDFN.git. Frontiers Media S.A. 2019-11-22 /pmc/articles/PMC6883002/ /pubmed/31824573 http://dx.doi.org/10.3389/fgene.2019.01182 Text en Copyright © 2019 Zhu, Jiang, Zhu, Pan and He http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Zhu, Qiang
Jiang, Xingpeng
Zhu, Qing
Pan, Min
He, Tingting
Graph Embedding Deep Learning Guides Microbial Biomarkers' Identification
title Graph Embedding Deep Learning Guides Microbial Biomarkers' Identification
title_full Graph Embedding Deep Learning Guides Microbial Biomarkers' Identification
title_fullStr Graph Embedding Deep Learning Guides Microbial Biomarkers' Identification
title_full_unstemmed Graph Embedding Deep Learning Guides Microbial Biomarkers' Identification
title_short Graph Embedding Deep Learning Guides Microbial Biomarkers' Identification
title_sort graph embedding deep learning guides microbial biomarkers' identification
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6883002/
https://www.ncbi.nlm.nih.gov/pubmed/31824573
http://dx.doi.org/10.3389/fgene.2019.01182
work_keys_str_mv AT zhuqiang graphembeddingdeeplearningguidesmicrobialbiomarkersidentification
AT jiangxingpeng graphembeddingdeeplearningguidesmicrobialbiomarkersidentification
AT zhuqing graphembeddingdeeplearningguidesmicrobialbiomarkersidentification
AT panmin graphembeddingdeeplearningguidesmicrobialbiomarkersidentification
AT hetingting graphembeddingdeeplearningguidesmicrobialbiomarkersidentification