Cargando…

A Comprehensive Evaluation of Disease Phenotype Networks for Gene Prioritization

Identification of disease-causing genes is a fundamental challenge for human health studies. The phenotypic similarity among diseases may reflect the interactions at the molecular level, and phenotype comparison can be used to predict disease candidate genes. Online Mendelian Inheritance in Man (OMI...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Jianhua, Lin, Xiaoyan, Teng, Yueyang, Qi, Shouliang, Xiao, Dayu, Zhang, Jianying, Kang, Yan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4944959/
https://www.ncbi.nlm.nih.gov/pubmed/27415759
http://dx.doi.org/10.1371/journal.pone.0159457
_version_ 1782442840344756224
author Li, Jianhua
Lin, Xiaoyan
Teng, Yueyang
Qi, Shouliang
Xiao, Dayu
Zhang, Jianying
Kang, Yan
author_facet Li, Jianhua
Lin, Xiaoyan
Teng, Yueyang
Qi, Shouliang
Xiao, Dayu
Zhang, Jianying
Kang, Yan
author_sort Li, Jianhua
collection PubMed
description Identification of disease-causing genes is a fundamental challenge for human health studies. The phenotypic similarity among diseases may reflect the interactions at the molecular level, and phenotype comparison can be used to predict disease candidate genes. Online Mendelian Inheritance in Man (OMIM) is a database of human genetic diseases and related genes that has become an authoritative source of disease phenotypes. However, disease phenotypes have been described by free text; thus, standardization of phenotypic descriptions is needed before diseases can be compared. Several disease phenotype networks have been established in OMIM using different standardization methods. Two of these networks are important for phenotypic similarity analysis: the first and most commonly used network (mimMiner) is standardized by medical subject heading, and the other network (resnikHPO) is the first to be standardized by human phenotype ontology. This paper comprehensively evaluates for the first time the accuracy of these two networks in gene prioritization based on protein–protein interactions using large-scale, leave-one-out cross-validation experiments. The results show that both networks can effectively prioritize disease-causing genes, and the approach that relates two diseases using a logistic function improves prioritization performance. Tanimoto, one of four methods for normalizing resnikHPO, generates a symmetric network and it performs similarly to mimMiner. Furthermore, an integration of these two networks outperforms either network alone in gene prioritization, indicating that these two disease networks are complementary.
format Online
Article
Text
id pubmed-4944959
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-49449592016-08-08 A Comprehensive Evaluation of Disease Phenotype Networks for Gene Prioritization Li, Jianhua Lin, Xiaoyan Teng, Yueyang Qi, Shouliang Xiao, Dayu Zhang, Jianying Kang, Yan PLoS One Research Article Identification of disease-causing genes is a fundamental challenge for human health studies. The phenotypic similarity among diseases may reflect the interactions at the molecular level, and phenotype comparison can be used to predict disease candidate genes. Online Mendelian Inheritance in Man (OMIM) is a database of human genetic diseases and related genes that has become an authoritative source of disease phenotypes. However, disease phenotypes have been described by free text; thus, standardization of phenotypic descriptions is needed before diseases can be compared. Several disease phenotype networks have been established in OMIM using different standardization methods. Two of these networks are important for phenotypic similarity analysis: the first and most commonly used network (mimMiner) is standardized by medical subject heading, and the other network (resnikHPO) is the first to be standardized by human phenotype ontology. This paper comprehensively evaluates for the first time the accuracy of these two networks in gene prioritization based on protein–protein interactions using large-scale, leave-one-out cross-validation experiments. The results show that both networks can effectively prioritize disease-causing genes, and the approach that relates two diseases using a logistic function improves prioritization performance. Tanimoto, one of four methods for normalizing resnikHPO, generates a symmetric network and it performs similarly to mimMiner. Furthermore, an integration of these two networks outperforms either network alone in gene prioritization, indicating that these two disease networks are complementary. Public Library of Science 2016-07-14 /pmc/articles/PMC4944959/ /pubmed/27415759 http://dx.doi.org/10.1371/journal.pone.0159457 Text en © 2016 Li et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Li, Jianhua
Lin, Xiaoyan
Teng, Yueyang
Qi, Shouliang
Xiao, Dayu
Zhang, Jianying
Kang, Yan
A Comprehensive Evaluation of Disease Phenotype Networks for Gene Prioritization
title A Comprehensive Evaluation of Disease Phenotype Networks for Gene Prioritization
title_full A Comprehensive Evaluation of Disease Phenotype Networks for Gene Prioritization
title_fullStr A Comprehensive Evaluation of Disease Phenotype Networks for Gene Prioritization
title_full_unstemmed A Comprehensive Evaluation of Disease Phenotype Networks for Gene Prioritization
title_short A Comprehensive Evaluation of Disease Phenotype Networks for Gene Prioritization
title_sort comprehensive evaluation of disease phenotype networks for gene prioritization
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4944959/
https://www.ncbi.nlm.nih.gov/pubmed/27415759
http://dx.doi.org/10.1371/journal.pone.0159457
work_keys_str_mv AT lijianhua acomprehensiveevaluationofdiseasephenotypenetworksforgeneprioritization
AT linxiaoyan acomprehensiveevaluationofdiseasephenotypenetworksforgeneprioritization
AT tengyueyang acomprehensiveevaluationofdiseasephenotypenetworksforgeneprioritization
AT qishouliang acomprehensiveevaluationofdiseasephenotypenetworksforgeneprioritization
AT xiaodayu acomprehensiveevaluationofdiseasephenotypenetworksforgeneprioritization
AT zhangjianying acomprehensiveevaluationofdiseasephenotypenetworksforgeneprioritization
AT kangyan acomprehensiveevaluationofdiseasephenotypenetworksforgeneprioritization
AT lijianhua comprehensiveevaluationofdiseasephenotypenetworksforgeneprioritization
AT linxiaoyan comprehensiveevaluationofdiseasephenotypenetworksforgeneprioritization
AT tengyueyang comprehensiveevaluationofdiseasephenotypenetworksforgeneprioritization
AT qishouliang comprehensiveevaluationofdiseasephenotypenetworksforgeneprioritization
AT xiaodayu comprehensiveevaluationofdiseasephenotypenetworksforgeneprioritization
AT zhangjianying comprehensiveevaluationofdiseasephenotypenetworksforgeneprioritization
AT kangyan comprehensiveevaluationofdiseasephenotypenetworksforgeneprioritization