Cargando…

PHENOstruct: Prediction of human phenotype ontology terms using heterogeneous data sources

The human phenotype ontology (HPO) was recently developed as a standardized vocabulary for describing the phenotype abnormalities associated with human diseases. At present, only a small fraction of human protein coding genes have HPO annotations. But, researchers believe that a large portion of cur...

Descripción completa

Detalles Bibliográficos
Autores principales: Kahanda, Indika, Funk, Christopher, Verspoor, Karin, Ben-Hur, Asa
Formato: Online Artículo Texto
Lenguaje:English
Publicado: F1000Research 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4722686/
https://www.ncbi.nlm.nih.gov/pubmed/26834980
http://dx.doi.org/10.12688/f1000research.6670.1
Descripción
Sumario:The human phenotype ontology (HPO) was recently developed as a standardized vocabulary for describing the phenotype abnormalities associated with human diseases. At present, only a small fraction of human protein coding genes have HPO annotations. But, researchers believe that a large portion of currently unannotated genes are related to disease phenotypes. Therefore, it is important to predict gene-HPO term associations using accurate computational methods. In this work we demonstrate the performance advantage of the structured SVM approach which was shown to be highly effective for Gene Ontology term prediction in comparison to several baseline methods. Furthermore, we highlight a collection of informative data sources suitable for the problem of predicting gene-HPO associations, including large scale literature mining data.