Cargando…

Gene-Focused Networks Underlying Phenotypic Convergence in a Systematically Phenotyped Cohort With Heterogeneous Intellectual Disability

The broad spectrum of intellectual disability (ID) patients’ clinical manifestations, the heterogeneity of ID genetic variation, and the diversity of the phenotypic variation represent major challenges for ID diagnosis. By exploiting a manually curated systematic phenotyping cohort of 3803 patients...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Yan, Zhu, Li-Na, Ma, Xiu-Wei, Yang, Fang, Xu, Xi-Lin, Yang, Yao, Yang, Xiao, Peng, Wei, Zhang, Wan-Qiao, Liang, Jin-Yu, Zhu, Wei-Dong, Jiang, Tai-Jiao, Zhang, Xin-Lei, Feng, Zhi-Chun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7019181/
https://www.ncbi.nlm.nih.gov/pubmed/32117926
http://dx.doi.org/10.3389/fbioe.2020.00045
_version_ 1783497467416805376
author Wang, Yan
Zhu, Li-Na
Ma, Xiu-Wei
Yang, Fang
Xu, Xi-Lin
Yang, Yao
Yang, Xiao
Peng, Wei
Zhang, Wan-Qiao
Liang, Jin-Yu
Zhu, Wei-Dong
Jiang, Tai-Jiao
Zhang, Xin-Lei
Feng, Zhi-Chun
author_facet Wang, Yan
Zhu, Li-Na
Ma, Xiu-Wei
Yang, Fang
Xu, Xi-Lin
Yang, Yao
Yang, Xiao
Peng, Wei
Zhang, Wan-Qiao
Liang, Jin-Yu
Zhu, Wei-Dong
Jiang, Tai-Jiao
Zhang, Xin-Lei
Feng, Zhi-Chun
author_sort Wang, Yan
collection PubMed
description The broad spectrum of intellectual disability (ID) patients’ clinical manifestations, the heterogeneity of ID genetic variation, and the diversity of the phenotypic variation represent major challenges for ID diagnosis. By exploiting a manually curated systematic phenotyping cohort of 3803 patients harboring ID, we identified 704 pathogenic genes, 3848 pathogenic sites, and 2075 standard phenotypes for underlying molecular perturbations and their phenotypic impact. We found the positive correlation between the number of phenotypes and that of patients that revealed their extreme heterogeneities, and the relative contribution of multiple determinants to the heterogeneity of ID phenotypes. Nevertheless, despite the extreme heterogeneity in phenotypes, the ID genes had a specific bias of mutation types, and the top 44 genes that ranked by the number of patients accounted for 39.9% of total patients. More interesting, enriched co-occurrent phenotypes and co-occurrent phenotype networks for each gene had the potential for prioritizing ID genes, further exhibited the convergences of ID phenotypes. Then we established a predictor called IDpred using machine learning methods for ID pathogenic genes prediction. Using10-fold cross-validation, our evaluation shows remarkable AUC values for IDpred (auc = 0.978), demonstrating the robustness and reliability of our tool. Besides, we built the most comprehensive database of ID phenotyped cohort to date: IDminer http://218.4.234.74:3100/IDminer/, which included the curated ID data and integrated IDpred tool for both clinical and experimental researchers. The IDminer serves as an important resource and user-friendly interface to help researchers investigate ID data, and provide important implications for the diagnosis and pathogenesis of developmental disorders of cognition.
format Online
Article
Text
id pubmed-7019181
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-70191812020-02-28 Gene-Focused Networks Underlying Phenotypic Convergence in a Systematically Phenotyped Cohort With Heterogeneous Intellectual Disability Wang, Yan Zhu, Li-Na Ma, Xiu-Wei Yang, Fang Xu, Xi-Lin Yang, Yao Yang, Xiao Peng, Wei Zhang, Wan-Qiao Liang, Jin-Yu Zhu, Wei-Dong Jiang, Tai-Jiao Zhang, Xin-Lei Feng, Zhi-Chun Front Bioeng Biotechnol Bioengineering and Biotechnology The broad spectrum of intellectual disability (ID) patients’ clinical manifestations, the heterogeneity of ID genetic variation, and the diversity of the phenotypic variation represent major challenges for ID diagnosis. By exploiting a manually curated systematic phenotyping cohort of 3803 patients harboring ID, we identified 704 pathogenic genes, 3848 pathogenic sites, and 2075 standard phenotypes for underlying molecular perturbations and their phenotypic impact. We found the positive correlation between the number of phenotypes and that of patients that revealed their extreme heterogeneities, and the relative contribution of multiple determinants to the heterogeneity of ID phenotypes. Nevertheless, despite the extreme heterogeneity in phenotypes, the ID genes had a specific bias of mutation types, and the top 44 genes that ranked by the number of patients accounted for 39.9% of total patients. More interesting, enriched co-occurrent phenotypes and co-occurrent phenotype networks for each gene had the potential for prioritizing ID genes, further exhibited the convergences of ID phenotypes. Then we established a predictor called IDpred using machine learning methods for ID pathogenic genes prediction. Using10-fold cross-validation, our evaluation shows remarkable AUC values for IDpred (auc = 0.978), demonstrating the robustness and reliability of our tool. Besides, we built the most comprehensive database of ID phenotyped cohort to date: IDminer http://218.4.234.74:3100/IDminer/, which included the curated ID data and integrated IDpred tool for both clinical and experimental researchers. The IDminer serves as an important resource and user-friendly interface to help researchers investigate ID data, and provide important implications for the diagnosis and pathogenesis of developmental disorders of cognition. Frontiers Media S.A. 2020-02-07 /pmc/articles/PMC7019181/ /pubmed/32117926 http://dx.doi.org/10.3389/fbioe.2020.00045 Text en Copyright © 2020 Wang, Zhu, Ma, Yang, Xu, Yang, Yang, Peng, Zhang, Liang, Zhu, Jiang, Zhang and Feng. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Bioengineering and Biotechnology
Wang, Yan
Zhu, Li-Na
Ma, Xiu-Wei
Yang, Fang
Xu, Xi-Lin
Yang, Yao
Yang, Xiao
Peng, Wei
Zhang, Wan-Qiao
Liang, Jin-Yu
Zhu, Wei-Dong
Jiang, Tai-Jiao
Zhang, Xin-Lei
Feng, Zhi-Chun
Gene-Focused Networks Underlying Phenotypic Convergence in a Systematically Phenotyped Cohort With Heterogeneous Intellectual Disability
title Gene-Focused Networks Underlying Phenotypic Convergence in a Systematically Phenotyped Cohort With Heterogeneous Intellectual Disability
title_full Gene-Focused Networks Underlying Phenotypic Convergence in a Systematically Phenotyped Cohort With Heterogeneous Intellectual Disability
title_fullStr Gene-Focused Networks Underlying Phenotypic Convergence in a Systematically Phenotyped Cohort With Heterogeneous Intellectual Disability
title_full_unstemmed Gene-Focused Networks Underlying Phenotypic Convergence in a Systematically Phenotyped Cohort With Heterogeneous Intellectual Disability
title_short Gene-Focused Networks Underlying Phenotypic Convergence in a Systematically Phenotyped Cohort With Heterogeneous Intellectual Disability
title_sort gene-focused networks underlying phenotypic convergence in a systematically phenotyped cohort with heterogeneous intellectual disability
topic Bioengineering and Biotechnology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7019181/
https://www.ncbi.nlm.nih.gov/pubmed/32117926
http://dx.doi.org/10.3389/fbioe.2020.00045
work_keys_str_mv AT wangyan genefocusednetworksunderlyingphenotypicconvergenceinasystematicallyphenotypedcohortwithheterogeneousintellectualdisability
AT zhulina genefocusednetworksunderlyingphenotypicconvergenceinasystematicallyphenotypedcohortwithheterogeneousintellectualdisability
AT maxiuwei genefocusednetworksunderlyingphenotypicconvergenceinasystematicallyphenotypedcohortwithheterogeneousintellectualdisability
AT yangfang genefocusednetworksunderlyingphenotypicconvergenceinasystematicallyphenotypedcohortwithheterogeneousintellectualdisability
AT xuxilin genefocusednetworksunderlyingphenotypicconvergenceinasystematicallyphenotypedcohortwithheterogeneousintellectualdisability
AT yangyao genefocusednetworksunderlyingphenotypicconvergenceinasystematicallyphenotypedcohortwithheterogeneousintellectualdisability
AT yangxiao genefocusednetworksunderlyingphenotypicconvergenceinasystematicallyphenotypedcohortwithheterogeneousintellectualdisability
AT pengwei genefocusednetworksunderlyingphenotypicconvergenceinasystematicallyphenotypedcohortwithheterogeneousintellectualdisability
AT zhangwanqiao genefocusednetworksunderlyingphenotypicconvergenceinasystematicallyphenotypedcohortwithheterogeneousintellectualdisability
AT liangjinyu genefocusednetworksunderlyingphenotypicconvergenceinasystematicallyphenotypedcohortwithheterogeneousintellectualdisability
AT zhuweidong genefocusednetworksunderlyingphenotypicconvergenceinasystematicallyphenotypedcohortwithheterogeneousintellectualdisability
AT jiangtaijiao genefocusednetworksunderlyingphenotypicconvergenceinasystematicallyphenotypedcohortwithheterogeneousintellectualdisability
AT zhangxinlei genefocusednetworksunderlyingphenotypicconvergenceinasystematicallyphenotypedcohortwithheterogeneousintellectualdisability
AT fengzhichun genefocusednetworksunderlyingphenotypicconvergenceinasystematicallyphenotypedcohortwithheterogeneousintellectualdisability