Cargando…
Integrative Analysis for Identifying Co-Modules of Microbe-Disease Data by Matrix Tri-Factorization With Phylogenetic Information
Microbe-disease association relationship mining is drawing more and more attention due to its potential in capturing disease-related microbes. Hence, it is essential to develop new tools or algorithms to study the complex pathogenic mechanism of microbe-related diseases. However, previous research s...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7048008/ https://www.ncbi.nlm.nih.gov/pubmed/32153643 http://dx.doi.org/10.3389/fgene.2020.00083 |
_version_ | 1783502220897026048 |
---|---|
author | Ma, Yuanyuan Liu, Guoying Ma, Yingjun Chen, Qianjun |
author_facet | Ma, Yuanyuan Liu, Guoying Ma, Yingjun Chen, Qianjun |
author_sort | Ma, Yuanyuan |
collection | PubMed |
description | Microbe-disease association relationship mining is drawing more and more attention due to its potential in capturing disease-related microbes. Hence, it is essential to develop new tools or algorithms to study the complex pathogenic mechanism of microbe-related diseases. However, previous research studies mainly focused on the paradigm of “one disease, one microbe,” rarely investigated the cooperation and associations between microbes, diseases or microbe-disease co-modules from system level. In this study, we propose a novel two-level module identifying algorithm (MDNMF) based on nonnegative matrix tri-factorization which integrates two similarity matrices (disease and microbe similarity matrices) and one microbe-disease association matrix into the objective of MDNMF. MDNMF can identify the modules from different levels and reveal the connections between these modules. In order to improve the efficiency and effectiveness of MDNMF, we also introduce human symptoms-disease network and microbial phylogenetic distance into this model. Furthermore, we applied it to HMDAD dataset and compared it with two NMF-based methods to demonstrate its effectiveness. The experimental results show that MDNMF can obtain better performance in terms of enrichment index (EI) and the number of significantly enriched taxon sets. This demonstrates the potential of MDNMF in capturing microbial modules that have significantly biological function implications. |
format | Online Article Text |
id | pubmed-7048008 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-70480082020-03-09 Integrative Analysis for Identifying Co-Modules of Microbe-Disease Data by Matrix Tri-Factorization With Phylogenetic Information Ma, Yuanyuan Liu, Guoying Ma, Yingjun Chen, Qianjun Front Genet Genetics Microbe-disease association relationship mining is drawing more and more attention due to its potential in capturing disease-related microbes. Hence, it is essential to develop new tools or algorithms to study the complex pathogenic mechanism of microbe-related diseases. However, previous research studies mainly focused on the paradigm of “one disease, one microbe,” rarely investigated the cooperation and associations between microbes, diseases or microbe-disease co-modules from system level. In this study, we propose a novel two-level module identifying algorithm (MDNMF) based on nonnegative matrix tri-factorization which integrates two similarity matrices (disease and microbe similarity matrices) and one microbe-disease association matrix into the objective of MDNMF. MDNMF can identify the modules from different levels and reveal the connections between these modules. In order to improve the efficiency and effectiveness of MDNMF, we also introduce human symptoms-disease network and microbial phylogenetic distance into this model. Furthermore, we applied it to HMDAD dataset and compared it with two NMF-based methods to demonstrate its effectiveness. The experimental results show that MDNMF can obtain better performance in terms of enrichment index (EI) and the number of significantly enriched taxon sets. This demonstrates the potential of MDNMF in capturing microbial modules that have significantly biological function implications. Frontiers Media S.A. 2020-02-21 /pmc/articles/PMC7048008/ /pubmed/32153643 http://dx.doi.org/10.3389/fgene.2020.00083 Text en Copyright © 2020 Ma, Liu, Ma and Chen http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Ma, Yuanyuan Liu, Guoying Ma, Yingjun Chen, Qianjun Integrative Analysis for Identifying Co-Modules of Microbe-Disease Data by Matrix Tri-Factorization With Phylogenetic Information |
title | Integrative Analysis for Identifying Co-Modules of Microbe-Disease Data by Matrix Tri-Factorization With Phylogenetic Information |
title_full | Integrative Analysis for Identifying Co-Modules of Microbe-Disease Data by Matrix Tri-Factorization With Phylogenetic Information |
title_fullStr | Integrative Analysis for Identifying Co-Modules of Microbe-Disease Data by Matrix Tri-Factorization With Phylogenetic Information |
title_full_unstemmed | Integrative Analysis for Identifying Co-Modules of Microbe-Disease Data by Matrix Tri-Factorization With Phylogenetic Information |
title_short | Integrative Analysis for Identifying Co-Modules of Microbe-Disease Data by Matrix Tri-Factorization With Phylogenetic Information |
title_sort | integrative analysis for identifying co-modules of microbe-disease data by matrix tri-factorization with phylogenetic information |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7048008/ https://www.ncbi.nlm.nih.gov/pubmed/32153643 http://dx.doi.org/10.3389/fgene.2020.00083 |
work_keys_str_mv | AT mayuanyuan integrativeanalysisforidentifyingcomodulesofmicrobediseasedatabymatrixtrifactorizationwithphylogeneticinformation AT liuguoying integrativeanalysisforidentifyingcomodulesofmicrobediseasedatabymatrixtrifactorizationwithphylogeneticinformation AT mayingjun integrativeanalysisforidentifyingcomodulesofmicrobediseasedatabymatrixtrifactorizationwithphylogeneticinformation AT chenqianjun integrativeanalysisforidentifyingcomodulesofmicrobediseasedatabymatrixtrifactorizationwithphylogeneticinformation |