Cargando…
HPOAnnotator: improving large-scale prediction of HPO annotations by low-rank approximation with HPO semantic similarities and multiple PPI networks
BACKGROUND: As a standardized vocabulary of phenotypic abnormalities associated with human diseases, the Human Phenotype Ontology (HPO) has been widely used by researchers to annotate phenotypes of genes/proteins. For saving the cost and time spent on experiments, many computational approaches have...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6927106/ https://www.ncbi.nlm.nih.gov/pubmed/31865916 http://dx.doi.org/10.1186/s12920-019-0625-1 |
_version_ | 1783482240710213632 |
---|---|
author | Gao, Junning Liu, Lizhi Yao, Shuwei Huang, Xiaodi Mamitsuka, Hiroshi Zhu, Shanfeng |
author_facet | Gao, Junning Liu, Lizhi Yao, Shuwei Huang, Xiaodi Mamitsuka, Hiroshi Zhu, Shanfeng |
author_sort | Gao, Junning |
collection | PubMed |
description | BACKGROUND: As a standardized vocabulary of phenotypic abnormalities associated with human diseases, the Human Phenotype Ontology (HPO) has been widely used by researchers to annotate phenotypes of genes/proteins. For saving the cost and time spent on experiments, many computational approaches have been proposed. They are able to alleviate the problem to some extent, but their performances are still far from satisfactory. METHOD: For inferring large-scale protein-phenotype associations, we propose HPOAnnotator that incorporates multiple Protein-Protein Interaction (PPI) information and the hierarchical structure of HPO. Specifically, we use a dual graph to regularize Non-negative Matrix Factorization (NMF) in a way that the information from different sources can be seamlessly integrated. In essence, HPOAnnotator solves the sparsity problem of a protein-phenotype association matrix by using a low-rank approximation. RESULTS: By combining the hierarchical structure of HPO and co-annotations of proteins, our model can well capture the HPO semantic similarities. Moreover, graph Laplacian regularizations are imposed in the latent space so as to utilize multiple PPI networks. The performance of HPOAnnotator has been validated under cross-validation and independent test. Experimental results have shown that HPOAnnotator outperforms the competing methods significantly. CONCLUSIONS: Through extensive comparisons with the state-of-the-art methods, we conclude that the proposed HPOAnnotator is able to achieve the superior performance as a result of using a low-rank approximation with a graph regularization. It is promising in that our approach can be considered as a starting point to study more efficient matrix factorization-based algorithms. |
format | Online Article Text |
id | pubmed-6927106 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-69271062019-12-30 HPOAnnotator: improving large-scale prediction of HPO annotations by low-rank approximation with HPO semantic similarities and multiple PPI networks Gao, Junning Liu, Lizhi Yao, Shuwei Huang, Xiaodi Mamitsuka, Hiroshi Zhu, Shanfeng BMC Med Genomics Research BACKGROUND: As a standardized vocabulary of phenotypic abnormalities associated with human diseases, the Human Phenotype Ontology (HPO) has been widely used by researchers to annotate phenotypes of genes/proteins. For saving the cost and time spent on experiments, many computational approaches have been proposed. They are able to alleviate the problem to some extent, but their performances are still far from satisfactory. METHOD: For inferring large-scale protein-phenotype associations, we propose HPOAnnotator that incorporates multiple Protein-Protein Interaction (PPI) information and the hierarchical structure of HPO. Specifically, we use a dual graph to regularize Non-negative Matrix Factorization (NMF) in a way that the information from different sources can be seamlessly integrated. In essence, HPOAnnotator solves the sparsity problem of a protein-phenotype association matrix by using a low-rank approximation. RESULTS: By combining the hierarchical structure of HPO and co-annotations of proteins, our model can well capture the HPO semantic similarities. Moreover, graph Laplacian regularizations are imposed in the latent space so as to utilize multiple PPI networks. The performance of HPOAnnotator has been validated under cross-validation and independent test. Experimental results have shown that HPOAnnotator outperforms the competing methods significantly. CONCLUSIONS: Through extensive comparisons with the state-of-the-art methods, we conclude that the proposed HPOAnnotator is able to achieve the superior performance as a result of using a low-rank approximation with a graph regularization. It is promising in that our approach can be considered as a starting point to study more efficient matrix factorization-based algorithms. BioMed Central 2019-12-23 /pmc/articles/PMC6927106/ /pubmed/31865916 http://dx.doi.org/10.1186/s12920-019-0625-1 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Gao, Junning Liu, Lizhi Yao, Shuwei Huang, Xiaodi Mamitsuka, Hiroshi Zhu, Shanfeng HPOAnnotator: improving large-scale prediction of HPO annotations by low-rank approximation with HPO semantic similarities and multiple PPI networks |
title | HPOAnnotator: improving large-scale prediction of HPO annotations by low-rank approximation with HPO semantic similarities and multiple PPI networks |
title_full | HPOAnnotator: improving large-scale prediction of HPO annotations by low-rank approximation with HPO semantic similarities and multiple PPI networks |
title_fullStr | HPOAnnotator: improving large-scale prediction of HPO annotations by low-rank approximation with HPO semantic similarities and multiple PPI networks |
title_full_unstemmed | HPOAnnotator: improving large-scale prediction of HPO annotations by low-rank approximation with HPO semantic similarities and multiple PPI networks |
title_short | HPOAnnotator: improving large-scale prediction of HPO annotations by low-rank approximation with HPO semantic similarities and multiple PPI networks |
title_sort | hpoannotator: improving large-scale prediction of hpo annotations by low-rank approximation with hpo semantic similarities and multiple ppi networks |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6927106/ https://www.ncbi.nlm.nih.gov/pubmed/31865916 http://dx.doi.org/10.1186/s12920-019-0625-1 |
work_keys_str_mv | AT gaojunning hpoannotatorimprovinglargescalepredictionofhpoannotationsbylowrankapproximationwithhposemanticsimilaritiesandmultipleppinetworks AT liulizhi hpoannotatorimprovinglargescalepredictionofhpoannotationsbylowrankapproximationwithhposemanticsimilaritiesandmultipleppinetworks AT yaoshuwei hpoannotatorimprovinglargescalepredictionofhpoannotationsbylowrankapproximationwithhposemanticsimilaritiesandmultipleppinetworks AT huangxiaodi hpoannotatorimprovinglargescalepredictionofhpoannotationsbylowrankapproximationwithhposemanticsimilaritiesandmultipleppinetworks AT mamitsukahiroshi hpoannotatorimprovinglargescalepredictionofhpoannotationsbylowrankapproximationwithhposemanticsimilaritiesandmultipleppinetworks AT zhushanfeng hpoannotatorimprovinglargescalepredictionofhpoannotationsbylowrankapproximationwithhposemanticsimilaritiesandmultipleppinetworks |