Cargando…

BMCMDA: a novel model for predicting human microbe-disease associations via binary matrix completion

BACKGROUND: Human Microbiome Project reveals the significant mutualistic influence between human body and microbes living in it. Such an influence lead to an interesting phenomenon that many noninfectious diseases are closely associated with diverse microbes. However, the identification of microbe-n...

Descripción completa

Detalles Bibliográficos
Autores principales: Shi, Jian-Yu, Huang, Hua, Zhang, Yan-Ning, Cao, Jiang-Bo, Yiu, Siu-Ming
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6101089/
https://www.ncbi.nlm.nih.gov/pubmed/30367598
http://dx.doi.org/10.1186/s12859-018-2274-3
_version_ 1783348986168475648
author Shi, Jian-Yu
Huang, Hua
Zhang, Yan-Ning
Cao, Jiang-Bo
Yiu, Siu-Ming
author_facet Shi, Jian-Yu
Huang, Hua
Zhang, Yan-Ning
Cao, Jiang-Bo
Yiu, Siu-Ming
author_sort Shi, Jian-Yu
collection PubMed
description BACKGROUND: Human Microbiome Project reveals the significant mutualistic influence between human body and microbes living in it. Such an influence lead to an interesting phenomenon that many noninfectious diseases are closely associated with diverse microbes. However, the identification of microbe-noninfectious disease associations (MDAs) is still a challenging task, because of both the high cost and the limitation of microbe cultivation. Thus, there is a need to develop fast approaches to screen potential MDAs. The growing number of validated MDAs enables us to meet the demand in a new insight. Computational approaches, especially machine learning, are promising to predict MDA candidates rapidly among a large number of microbe-disease pairs with the advantage of no limitation on microbe cultivation. Nevertheless, a few computational efforts at predicting MDAs are made so far. RESULTS: In this paper, grouping a set of MDAs into a binary MDA matrix, we propose a novel predictive approach (BMCMDA) based on Binary Matrix Completion to predict potential MDAs. The proposed BMCMDA assumes that the incomplete observed MDA matrix is the summation of a latent parameterizing matrix and a noising matrix. It also assumes that the independently occurring subscripts of observed entries in the MDA matrix follows a binomial model. Adopting a standard mean-zero Gaussian distribution for the nosing matrix, we model the relationship between the parameterizing matrix and the MDA matrix under the observed microbe-disease pairs as a probit regression. With the recovered parameterizing matrix, BMCMDA deduces how likely a microbe would be associated with a particular disease. In the experiment under leave-one-out cross-validation, it exhibits the inspiring performance (AUC = 0.906, AUPR =0.526) and demonstrates its superiority by ~ 7% and ~ 5% improvements in terms of AUC and AUPR respectively in the comparison with the pioneering approach KATZHMDA. CONCLUSIONS: Our BMCMDA provides an effective approach for predicting MDAs and can be also extended to other similar predicting tasks of binary relationship (e.g. protein-protein interaction, drug-target interaction).
format Online
Article
Text
id pubmed-6101089
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-61010892018-08-27 BMCMDA: a novel model for predicting human microbe-disease associations via binary matrix completion Shi, Jian-Yu Huang, Hua Zhang, Yan-Ning Cao, Jiang-Bo Yiu, Siu-Ming BMC Bioinformatics Research BACKGROUND: Human Microbiome Project reveals the significant mutualistic influence between human body and microbes living in it. Such an influence lead to an interesting phenomenon that many noninfectious diseases are closely associated with diverse microbes. However, the identification of microbe-noninfectious disease associations (MDAs) is still a challenging task, because of both the high cost and the limitation of microbe cultivation. Thus, there is a need to develop fast approaches to screen potential MDAs. The growing number of validated MDAs enables us to meet the demand in a new insight. Computational approaches, especially machine learning, are promising to predict MDA candidates rapidly among a large number of microbe-disease pairs with the advantage of no limitation on microbe cultivation. Nevertheless, a few computational efforts at predicting MDAs are made so far. RESULTS: In this paper, grouping a set of MDAs into a binary MDA matrix, we propose a novel predictive approach (BMCMDA) based on Binary Matrix Completion to predict potential MDAs. The proposed BMCMDA assumes that the incomplete observed MDA matrix is the summation of a latent parameterizing matrix and a noising matrix. It also assumes that the independently occurring subscripts of observed entries in the MDA matrix follows a binomial model. Adopting a standard mean-zero Gaussian distribution for the nosing matrix, we model the relationship between the parameterizing matrix and the MDA matrix under the observed microbe-disease pairs as a probit regression. With the recovered parameterizing matrix, BMCMDA deduces how likely a microbe would be associated with a particular disease. In the experiment under leave-one-out cross-validation, it exhibits the inspiring performance (AUC = 0.906, AUPR =0.526) and demonstrates its superiority by ~ 7% and ~ 5% improvements in terms of AUC and AUPR respectively in the comparison with the pioneering approach KATZHMDA. CONCLUSIONS: Our BMCMDA provides an effective approach for predicting MDAs and can be also extended to other similar predicting tasks of binary relationship (e.g. protein-protein interaction, drug-target interaction). BioMed Central 2018-08-13 /pmc/articles/PMC6101089/ /pubmed/30367598 http://dx.doi.org/10.1186/s12859-018-2274-3 Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Shi, Jian-Yu
Huang, Hua
Zhang, Yan-Ning
Cao, Jiang-Bo
Yiu, Siu-Ming
BMCMDA: a novel model for predicting human microbe-disease associations via binary matrix completion
title BMCMDA: a novel model for predicting human microbe-disease associations via binary matrix completion
title_full BMCMDA: a novel model for predicting human microbe-disease associations via binary matrix completion
title_fullStr BMCMDA: a novel model for predicting human microbe-disease associations via binary matrix completion
title_full_unstemmed BMCMDA: a novel model for predicting human microbe-disease associations via binary matrix completion
title_short BMCMDA: a novel model for predicting human microbe-disease associations via binary matrix completion
title_sort bmcmda: a novel model for predicting human microbe-disease associations via binary matrix completion
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6101089/
https://www.ncbi.nlm.nih.gov/pubmed/30367598
http://dx.doi.org/10.1186/s12859-018-2274-3
work_keys_str_mv AT shijianyu bmcmdaanovelmodelforpredictinghumanmicrobediseaseassociationsviabinarymatrixcompletion
AT huanghua bmcmdaanovelmodelforpredictinghumanmicrobediseaseassociationsviabinarymatrixcompletion
AT zhangyanning bmcmdaanovelmodelforpredictinghumanmicrobediseaseassociationsviabinarymatrixcompletion
AT caojiangbo bmcmdaanovelmodelforpredictinghumanmicrobediseaseassociationsviabinarymatrixcompletion
AT yiusiuming bmcmdaanovelmodelforpredictinghumanmicrobediseaseassociationsviabinarymatrixcompletion