Cargando…
BMCMDA: a novel model for predicting human microbe-disease associations via binary matrix completion
BACKGROUND: Human Microbiome Project reveals the significant mutualistic influence between human body and microbes living in it. Such an influence lead to an interesting phenomenon that many noninfectious diseases are closely associated with diverse microbes. However, the identification of microbe-n...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6101089/ https://www.ncbi.nlm.nih.gov/pubmed/30367598 http://dx.doi.org/10.1186/s12859-018-2274-3 |
_version_ | 1783348986168475648 |
---|---|
author | Shi, Jian-Yu Huang, Hua Zhang, Yan-Ning Cao, Jiang-Bo Yiu, Siu-Ming |
author_facet | Shi, Jian-Yu Huang, Hua Zhang, Yan-Ning Cao, Jiang-Bo Yiu, Siu-Ming |
author_sort | Shi, Jian-Yu |
collection | PubMed |
description | BACKGROUND: Human Microbiome Project reveals the significant mutualistic influence between human body and microbes living in it. Such an influence lead to an interesting phenomenon that many noninfectious diseases are closely associated with diverse microbes. However, the identification of microbe-noninfectious disease associations (MDAs) is still a challenging task, because of both the high cost and the limitation of microbe cultivation. Thus, there is a need to develop fast approaches to screen potential MDAs. The growing number of validated MDAs enables us to meet the demand in a new insight. Computational approaches, especially machine learning, are promising to predict MDA candidates rapidly among a large number of microbe-disease pairs with the advantage of no limitation on microbe cultivation. Nevertheless, a few computational efforts at predicting MDAs are made so far. RESULTS: In this paper, grouping a set of MDAs into a binary MDA matrix, we propose a novel predictive approach (BMCMDA) based on Binary Matrix Completion to predict potential MDAs. The proposed BMCMDA assumes that the incomplete observed MDA matrix is the summation of a latent parameterizing matrix and a noising matrix. It also assumes that the independently occurring subscripts of observed entries in the MDA matrix follows a binomial model. Adopting a standard mean-zero Gaussian distribution for the nosing matrix, we model the relationship between the parameterizing matrix and the MDA matrix under the observed microbe-disease pairs as a probit regression. With the recovered parameterizing matrix, BMCMDA deduces how likely a microbe would be associated with a particular disease. In the experiment under leave-one-out cross-validation, it exhibits the inspiring performance (AUC = 0.906, AUPR =0.526) and demonstrates its superiority by ~ 7% and ~ 5% improvements in terms of AUC and AUPR respectively in the comparison with the pioneering approach KATZHMDA. CONCLUSIONS: Our BMCMDA provides an effective approach for predicting MDAs and can be also extended to other similar predicting tasks of binary relationship (e.g. protein-protein interaction, drug-target interaction). |
format | Online Article Text |
id | pubmed-6101089 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-61010892018-08-27 BMCMDA: a novel model for predicting human microbe-disease associations via binary matrix completion Shi, Jian-Yu Huang, Hua Zhang, Yan-Ning Cao, Jiang-Bo Yiu, Siu-Ming BMC Bioinformatics Research BACKGROUND: Human Microbiome Project reveals the significant mutualistic influence between human body and microbes living in it. Such an influence lead to an interesting phenomenon that many noninfectious diseases are closely associated with diverse microbes. However, the identification of microbe-noninfectious disease associations (MDAs) is still a challenging task, because of both the high cost and the limitation of microbe cultivation. Thus, there is a need to develop fast approaches to screen potential MDAs. The growing number of validated MDAs enables us to meet the demand in a new insight. Computational approaches, especially machine learning, are promising to predict MDA candidates rapidly among a large number of microbe-disease pairs with the advantage of no limitation on microbe cultivation. Nevertheless, a few computational efforts at predicting MDAs are made so far. RESULTS: In this paper, grouping a set of MDAs into a binary MDA matrix, we propose a novel predictive approach (BMCMDA) based on Binary Matrix Completion to predict potential MDAs. The proposed BMCMDA assumes that the incomplete observed MDA matrix is the summation of a latent parameterizing matrix and a noising matrix. It also assumes that the independently occurring subscripts of observed entries in the MDA matrix follows a binomial model. Adopting a standard mean-zero Gaussian distribution for the nosing matrix, we model the relationship between the parameterizing matrix and the MDA matrix under the observed microbe-disease pairs as a probit regression. With the recovered parameterizing matrix, BMCMDA deduces how likely a microbe would be associated with a particular disease. In the experiment under leave-one-out cross-validation, it exhibits the inspiring performance (AUC = 0.906, AUPR =0.526) and demonstrates its superiority by ~ 7% and ~ 5% improvements in terms of AUC and AUPR respectively in the comparison with the pioneering approach KATZHMDA. CONCLUSIONS: Our BMCMDA provides an effective approach for predicting MDAs and can be also extended to other similar predicting tasks of binary relationship (e.g. protein-protein interaction, drug-target interaction). BioMed Central 2018-08-13 /pmc/articles/PMC6101089/ /pubmed/30367598 http://dx.doi.org/10.1186/s12859-018-2274-3 Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Shi, Jian-Yu Huang, Hua Zhang, Yan-Ning Cao, Jiang-Bo Yiu, Siu-Ming BMCMDA: a novel model for predicting human microbe-disease associations via binary matrix completion |
title | BMCMDA: a novel model for predicting human microbe-disease associations via binary matrix completion |
title_full | BMCMDA: a novel model for predicting human microbe-disease associations via binary matrix completion |
title_fullStr | BMCMDA: a novel model for predicting human microbe-disease associations via binary matrix completion |
title_full_unstemmed | BMCMDA: a novel model for predicting human microbe-disease associations via binary matrix completion |
title_short | BMCMDA: a novel model for predicting human microbe-disease associations via binary matrix completion |
title_sort | bmcmda: a novel model for predicting human microbe-disease associations via binary matrix completion |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6101089/ https://www.ncbi.nlm.nih.gov/pubmed/30367598 http://dx.doi.org/10.1186/s12859-018-2274-3 |
work_keys_str_mv | AT shijianyu bmcmdaanovelmodelforpredictinghumanmicrobediseaseassociationsviabinarymatrixcompletion AT huanghua bmcmdaanovelmodelforpredictinghumanmicrobediseaseassociationsviabinarymatrixcompletion AT zhangyanning bmcmdaanovelmodelforpredictinghumanmicrobediseaseassociationsviabinarymatrixcompletion AT caojiangbo bmcmdaanovelmodelforpredictinghumanmicrobediseaseassociationsviabinarymatrixcompletion AT yiusiuming bmcmdaanovelmodelforpredictinghumanmicrobediseaseassociationsviabinarymatrixcompletion |