Cargando…

HPOAnnotator: improving large-scale prediction of HPO annotations by low-rank approximation with HPO semantic similarities and multiple PPI networks

BACKGROUND: As a standardized vocabulary of phenotypic abnormalities associated with human diseases, the Human Phenotype Ontology (HPO) has been widely used by researchers to annotate phenotypes of genes/proteins. For saving the cost and time spent on experiments, many computational approaches have...

Descripción completa

Detalles Bibliográficos
Autores principales: Gao, Junning, Liu, Lizhi, Yao, Shuwei, Huang, Xiaodi, Mamitsuka, Hiroshi, Zhu, Shanfeng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6927106/
https://www.ncbi.nlm.nih.gov/pubmed/31865916
http://dx.doi.org/10.1186/s12920-019-0625-1
_version_ 1783482240710213632
author Gao, Junning
Liu, Lizhi
Yao, Shuwei
Huang, Xiaodi
Mamitsuka, Hiroshi
Zhu, Shanfeng
author_facet Gao, Junning
Liu, Lizhi
Yao, Shuwei
Huang, Xiaodi
Mamitsuka, Hiroshi
Zhu, Shanfeng
author_sort Gao, Junning
collection PubMed
description BACKGROUND: As a standardized vocabulary of phenotypic abnormalities associated with human diseases, the Human Phenotype Ontology (HPO) has been widely used by researchers to annotate phenotypes of genes/proteins. For saving the cost and time spent on experiments, many computational approaches have been proposed. They are able to alleviate the problem to some extent, but their performances are still far from satisfactory. METHOD: For inferring large-scale protein-phenotype associations, we propose HPOAnnotator that incorporates multiple Protein-Protein Interaction (PPI) information and the hierarchical structure of HPO. Specifically, we use a dual graph to regularize Non-negative Matrix Factorization (NMF) in a way that the information from different sources can be seamlessly integrated. In essence, HPOAnnotator solves the sparsity problem of a protein-phenotype association matrix by using a low-rank approximation. RESULTS: By combining the hierarchical structure of HPO and co-annotations of proteins, our model can well capture the HPO semantic similarities. Moreover, graph Laplacian regularizations are imposed in the latent space so as to utilize multiple PPI networks. The performance of HPOAnnotator has been validated under cross-validation and independent test. Experimental results have shown that HPOAnnotator outperforms the competing methods significantly. CONCLUSIONS: Through extensive comparisons with the state-of-the-art methods, we conclude that the proposed HPOAnnotator is able to achieve the superior performance as a result of using a low-rank approximation with a graph regularization. It is promising in that our approach can be considered as a starting point to study more efficient matrix factorization-based algorithms.
format Online
Article
Text
id pubmed-6927106
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-69271062019-12-30 HPOAnnotator: improving large-scale prediction of HPO annotations by low-rank approximation with HPO semantic similarities and multiple PPI networks Gao, Junning Liu, Lizhi Yao, Shuwei Huang, Xiaodi Mamitsuka, Hiroshi Zhu, Shanfeng BMC Med Genomics Research BACKGROUND: As a standardized vocabulary of phenotypic abnormalities associated with human diseases, the Human Phenotype Ontology (HPO) has been widely used by researchers to annotate phenotypes of genes/proteins. For saving the cost and time spent on experiments, many computational approaches have been proposed. They are able to alleviate the problem to some extent, but their performances are still far from satisfactory. METHOD: For inferring large-scale protein-phenotype associations, we propose HPOAnnotator that incorporates multiple Protein-Protein Interaction (PPI) information and the hierarchical structure of HPO. Specifically, we use a dual graph to regularize Non-negative Matrix Factorization (NMF) in a way that the information from different sources can be seamlessly integrated. In essence, HPOAnnotator solves the sparsity problem of a protein-phenotype association matrix by using a low-rank approximation. RESULTS: By combining the hierarchical structure of HPO and co-annotations of proteins, our model can well capture the HPO semantic similarities. Moreover, graph Laplacian regularizations are imposed in the latent space so as to utilize multiple PPI networks. The performance of HPOAnnotator has been validated under cross-validation and independent test. Experimental results have shown that HPOAnnotator outperforms the competing methods significantly. CONCLUSIONS: Through extensive comparisons with the state-of-the-art methods, we conclude that the proposed HPOAnnotator is able to achieve the superior performance as a result of using a low-rank approximation with a graph regularization. It is promising in that our approach can be considered as a starting point to study more efficient matrix factorization-based algorithms. BioMed Central 2019-12-23 /pmc/articles/PMC6927106/ /pubmed/31865916 http://dx.doi.org/10.1186/s12920-019-0625-1 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Gao, Junning
Liu, Lizhi
Yao, Shuwei
Huang, Xiaodi
Mamitsuka, Hiroshi
Zhu, Shanfeng
HPOAnnotator: improving large-scale prediction of HPO annotations by low-rank approximation with HPO semantic similarities and multiple PPI networks
title HPOAnnotator: improving large-scale prediction of HPO annotations by low-rank approximation with HPO semantic similarities and multiple PPI networks
title_full HPOAnnotator: improving large-scale prediction of HPO annotations by low-rank approximation with HPO semantic similarities and multiple PPI networks
title_fullStr HPOAnnotator: improving large-scale prediction of HPO annotations by low-rank approximation with HPO semantic similarities and multiple PPI networks
title_full_unstemmed HPOAnnotator: improving large-scale prediction of HPO annotations by low-rank approximation with HPO semantic similarities and multiple PPI networks
title_short HPOAnnotator: improving large-scale prediction of HPO annotations by low-rank approximation with HPO semantic similarities and multiple PPI networks
title_sort hpoannotator: improving large-scale prediction of hpo annotations by low-rank approximation with hpo semantic similarities and multiple ppi networks
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6927106/
https://www.ncbi.nlm.nih.gov/pubmed/31865916
http://dx.doi.org/10.1186/s12920-019-0625-1
work_keys_str_mv AT gaojunning hpoannotatorimprovinglargescalepredictionofhpoannotationsbylowrankapproximationwithhposemanticsimilaritiesandmultipleppinetworks
AT liulizhi hpoannotatorimprovinglargescalepredictionofhpoannotationsbylowrankapproximationwithhposemanticsimilaritiesandmultipleppinetworks
AT yaoshuwei hpoannotatorimprovinglargescalepredictionofhpoannotationsbylowrankapproximationwithhposemanticsimilaritiesandmultipleppinetworks
AT huangxiaodi hpoannotatorimprovinglargescalepredictionofhpoannotationsbylowrankapproximationwithhposemanticsimilaritiesandmultipleppinetworks
AT mamitsukahiroshi hpoannotatorimprovinglargescalepredictionofhpoannotationsbylowrankapproximationwithhposemanticsimilaritiesandmultipleppinetworks
AT zhushanfeng hpoannotatorimprovinglargescalepredictionofhpoannotationsbylowrankapproximationwithhposemanticsimilaritiesandmultipleppinetworks