Cargando…

Integrative Analysis for Identifying Co-Modules of Microbe-Disease Data by Matrix Tri-Factorization With Phylogenetic Information

Microbe-disease association relationship mining is drawing more and more attention due to its potential in capturing disease-related microbes. Hence, it is essential to develop new tools or algorithms to study the complex pathogenic mechanism of microbe-related diseases. However, previous research s...

Descripción completa

Detalles Bibliográficos
Autores principales: Ma, Yuanyuan, Liu, Guoying, Ma, Yingjun, Chen, Qianjun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7048008/
https://www.ncbi.nlm.nih.gov/pubmed/32153643
http://dx.doi.org/10.3389/fgene.2020.00083
_version_ 1783502220897026048
author Ma, Yuanyuan
Liu, Guoying
Ma, Yingjun
Chen, Qianjun
author_facet Ma, Yuanyuan
Liu, Guoying
Ma, Yingjun
Chen, Qianjun
author_sort Ma, Yuanyuan
collection PubMed
description Microbe-disease association relationship mining is drawing more and more attention due to its potential in capturing disease-related microbes. Hence, it is essential to develop new tools or algorithms to study the complex pathogenic mechanism of microbe-related diseases. However, previous research studies mainly focused on the paradigm of “one disease, one microbe,” rarely investigated the cooperation and associations between microbes, diseases or microbe-disease co-modules from system level. In this study, we propose a novel two-level module identifying algorithm (MDNMF) based on nonnegative matrix tri-factorization which integrates two similarity matrices (disease and microbe similarity matrices) and one microbe-disease association matrix into the objective of MDNMF. MDNMF can identify the modules from different levels and reveal the connections between these modules. In order to improve the efficiency and effectiveness of MDNMF, we also introduce human symptoms-disease network and microbial phylogenetic distance into this model. Furthermore, we applied it to HMDAD dataset and compared it with two NMF-based methods to demonstrate its effectiveness. The experimental results show that MDNMF can obtain better performance in terms of enrichment index (EI) and the number of significantly enriched taxon sets. This demonstrates the potential of MDNMF in capturing microbial modules that have significantly biological function implications.
format Online
Article
Text
id pubmed-7048008
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-70480082020-03-09 Integrative Analysis for Identifying Co-Modules of Microbe-Disease Data by Matrix Tri-Factorization With Phylogenetic Information Ma, Yuanyuan Liu, Guoying Ma, Yingjun Chen, Qianjun Front Genet Genetics Microbe-disease association relationship mining is drawing more and more attention due to its potential in capturing disease-related microbes. Hence, it is essential to develop new tools or algorithms to study the complex pathogenic mechanism of microbe-related diseases. However, previous research studies mainly focused on the paradigm of “one disease, one microbe,” rarely investigated the cooperation and associations between microbes, diseases or microbe-disease co-modules from system level. In this study, we propose a novel two-level module identifying algorithm (MDNMF) based on nonnegative matrix tri-factorization which integrates two similarity matrices (disease and microbe similarity matrices) and one microbe-disease association matrix into the objective of MDNMF. MDNMF can identify the modules from different levels and reveal the connections between these modules. In order to improve the efficiency and effectiveness of MDNMF, we also introduce human symptoms-disease network and microbial phylogenetic distance into this model. Furthermore, we applied it to HMDAD dataset and compared it with two NMF-based methods to demonstrate its effectiveness. The experimental results show that MDNMF can obtain better performance in terms of enrichment index (EI) and the number of significantly enriched taxon sets. This demonstrates the potential of MDNMF in capturing microbial modules that have significantly biological function implications. Frontiers Media S.A. 2020-02-21 /pmc/articles/PMC7048008/ /pubmed/32153643 http://dx.doi.org/10.3389/fgene.2020.00083 Text en Copyright © 2020 Ma, Liu, Ma and Chen http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Ma, Yuanyuan
Liu, Guoying
Ma, Yingjun
Chen, Qianjun
Integrative Analysis for Identifying Co-Modules of Microbe-Disease Data by Matrix Tri-Factorization With Phylogenetic Information
title Integrative Analysis for Identifying Co-Modules of Microbe-Disease Data by Matrix Tri-Factorization With Phylogenetic Information
title_full Integrative Analysis for Identifying Co-Modules of Microbe-Disease Data by Matrix Tri-Factorization With Phylogenetic Information
title_fullStr Integrative Analysis for Identifying Co-Modules of Microbe-Disease Data by Matrix Tri-Factorization With Phylogenetic Information
title_full_unstemmed Integrative Analysis for Identifying Co-Modules of Microbe-Disease Data by Matrix Tri-Factorization With Phylogenetic Information
title_short Integrative Analysis for Identifying Co-Modules of Microbe-Disease Data by Matrix Tri-Factorization With Phylogenetic Information
title_sort integrative analysis for identifying co-modules of microbe-disease data by matrix tri-factorization with phylogenetic information
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7048008/
https://www.ncbi.nlm.nih.gov/pubmed/32153643
http://dx.doi.org/10.3389/fgene.2020.00083
work_keys_str_mv AT mayuanyuan integrativeanalysisforidentifyingcomodulesofmicrobediseasedatabymatrixtrifactorizationwithphylogeneticinformation
AT liuguoying integrativeanalysisforidentifyingcomodulesofmicrobediseasedatabymatrixtrifactorizationwithphylogeneticinformation
AT mayingjun integrativeanalysisforidentifyingcomodulesofmicrobediseasedatabymatrixtrifactorizationwithphylogeneticinformation
AT chenqianjun integrativeanalysisforidentifyingcomodulesofmicrobediseasedatabymatrixtrifactorizationwithphylogeneticinformation