Cargando…
Gene-Focused Networks Underlying Phenotypic Convergence in a Systematically Phenotyped Cohort With Heterogeneous Intellectual Disability
The broad spectrum of intellectual disability (ID) patients’ clinical manifestations, the heterogeneity of ID genetic variation, and the diversity of the phenotypic variation represent major challenges for ID diagnosis. By exploiting a manually curated systematic phenotyping cohort of 3803 patients...
Autores principales: | , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7019181/ https://www.ncbi.nlm.nih.gov/pubmed/32117926 http://dx.doi.org/10.3389/fbioe.2020.00045 |
_version_ | 1783497467416805376 |
---|---|
author | Wang, Yan Zhu, Li-Na Ma, Xiu-Wei Yang, Fang Xu, Xi-Lin Yang, Yao Yang, Xiao Peng, Wei Zhang, Wan-Qiao Liang, Jin-Yu Zhu, Wei-Dong Jiang, Tai-Jiao Zhang, Xin-Lei Feng, Zhi-Chun |
author_facet | Wang, Yan Zhu, Li-Na Ma, Xiu-Wei Yang, Fang Xu, Xi-Lin Yang, Yao Yang, Xiao Peng, Wei Zhang, Wan-Qiao Liang, Jin-Yu Zhu, Wei-Dong Jiang, Tai-Jiao Zhang, Xin-Lei Feng, Zhi-Chun |
author_sort | Wang, Yan |
collection | PubMed |
description | The broad spectrum of intellectual disability (ID) patients’ clinical manifestations, the heterogeneity of ID genetic variation, and the diversity of the phenotypic variation represent major challenges for ID diagnosis. By exploiting a manually curated systematic phenotyping cohort of 3803 patients harboring ID, we identified 704 pathogenic genes, 3848 pathogenic sites, and 2075 standard phenotypes for underlying molecular perturbations and their phenotypic impact. We found the positive correlation between the number of phenotypes and that of patients that revealed their extreme heterogeneities, and the relative contribution of multiple determinants to the heterogeneity of ID phenotypes. Nevertheless, despite the extreme heterogeneity in phenotypes, the ID genes had a specific bias of mutation types, and the top 44 genes that ranked by the number of patients accounted for 39.9% of total patients. More interesting, enriched co-occurrent phenotypes and co-occurrent phenotype networks for each gene had the potential for prioritizing ID genes, further exhibited the convergences of ID phenotypes. Then we established a predictor called IDpred using machine learning methods for ID pathogenic genes prediction. Using10-fold cross-validation, our evaluation shows remarkable AUC values for IDpred (auc = 0.978), demonstrating the robustness and reliability of our tool. Besides, we built the most comprehensive database of ID phenotyped cohort to date: IDminer http://218.4.234.74:3100/IDminer/, which included the curated ID data and integrated IDpred tool for both clinical and experimental researchers. The IDminer serves as an important resource and user-friendly interface to help researchers investigate ID data, and provide important implications for the diagnosis and pathogenesis of developmental disorders of cognition. |
format | Online Article Text |
id | pubmed-7019181 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-70191812020-02-28 Gene-Focused Networks Underlying Phenotypic Convergence in a Systematically Phenotyped Cohort With Heterogeneous Intellectual Disability Wang, Yan Zhu, Li-Na Ma, Xiu-Wei Yang, Fang Xu, Xi-Lin Yang, Yao Yang, Xiao Peng, Wei Zhang, Wan-Qiao Liang, Jin-Yu Zhu, Wei-Dong Jiang, Tai-Jiao Zhang, Xin-Lei Feng, Zhi-Chun Front Bioeng Biotechnol Bioengineering and Biotechnology The broad spectrum of intellectual disability (ID) patients’ clinical manifestations, the heterogeneity of ID genetic variation, and the diversity of the phenotypic variation represent major challenges for ID diagnosis. By exploiting a manually curated systematic phenotyping cohort of 3803 patients harboring ID, we identified 704 pathogenic genes, 3848 pathogenic sites, and 2075 standard phenotypes for underlying molecular perturbations and their phenotypic impact. We found the positive correlation between the number of phenotypes and that of patients that revealed their extreme heterogeneities, and the relative contribution of multiple determinants to the heterogeneity of ID phenotypes. Nevertheless, despite the extreme heterogeneity in phenotypes, the ID genes had a specific bias of mutation types, and the top 44 genes that ranked by the number of patients accounted for 39.9% of total patients. More interesting, enriched co-occurrent phenotypes and co-occurrent phenotype networks for each gene had the potential for prioritizing ID genes, further exhibited the convergences of ID phenotypes. Then we established a predictor called IDpred using machine learning methods for ID pathogenic genes prediction. Using10-fold cross-validation, our evaluation shows remarkable AUC values for IDpred (auc = 0.978), demonstrating the robustness and reliability of our tool. Besides, we built the most comprehensive database of ID phenotyped cohort to date: IDminer http://218.4.234.74:3100/IDminer/, which included the curated ID data and integrated IDpred tool for both clinical and experimental researchers. The IDminer serves as an important resource and user-friendly interface to help researchers investigate ID data, and provide important implications for the diagnosis and pathogenesis of developmental disorders of cognition. Frontiers Media S.A. 2020-02-07 /pmc/articles/PMC7019181/ /pubmed/32117926 http://dx.doi.org/10.3389/fbioe.2020.00045 Text en Copyright © 2020 Wang, Zhu, Ma, Yang, Xu, Yang, Yang, Peng, Zhang, Liang, Zhu, Jiang, Zhang and Feng. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Bioengineering and Biotechnology Wang, Yan Zhu, Li-Na Ma, Xiu-Wei Yang, Fang Xu, Xi-Lin Yang, Yao Yang, Xiao Peng, Wei Zhang, Wan-Qiao Liang, Jin-Yu Zhu, Wei-Dong Jiang, Tai-Jiao Zhang, Xin-Lei Feng, Zhi-Chun Gene-Focused Networks Underlying Phenotypic Convergence in a Systematically Phenotyped Cohort With Heterogeneous Intellectual Disability |
title | Gene-Focused Networks Underlying Phenotypic Convergence in a Systematically Phenotyped Cohort With Heterogeneous Intellectual Disability |
title_full | Gene-Focused Networks Underlying Phenotypic Convergence in a Systematically Phenotyped Cohort With Heterogeneous Intellectual Disability |
title_fullStr | Gene-Focused Networks Underlying Phenotypic Convergence in a Systematically Phenotyped Cohort With Heterogeneous Intellectual Disability |
title_full_unstemmed | Gene-Focused Networks Underlying Phenotypic Convergence in a Systematically Phenotyped Cohort With Heterogeneous Intellectual Disability |
title_short | Gene-Focused Networks Underlying Phenotypic Convergence in a Systematically Phenotyped Cohort With Heterogeneous Intellectual Disability |
title_sort | gene-focused networks underlying phenotypic convergence in a systematically phenotyped cohort with heterogeneous intellectual disability |
topic | Bioengineering and Biotechnology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7019181/ https://www.ncbi.nlm.nih.gov/pubmed/32117926 http://dx.doi.org/10.3389/fbioe.2020.00045 |
work_keys_str_mv | AT wangyan genefocusednetworksunderlyingphenotypicconvergenceinasystematicallyphenotypedcohortwithheterogeneousintellectualdisability AT zhulina genefocusednetworksunderlyingphenotypicconvergenceinasystematicallyphenotypedcohortwithheterogeneousintellectualdisability AT maxiuwei genefocusednetworksunderlyingphenotypicconvergenceinasystematicallyphenotypedcohortwithheterogeneousintellectualdisability AT yangfang genefocusednetworksunderlyingphenotypicconvergenceinasystematicallyphenotypedcohortwithheterogeneousintellectualdisability AT xuxilin genefocusednetworksunderlyingphenotypicconvergenceinasystematicallyphenotypedcohortwithheterogeneousintellectualdisability AT yangyao genefocusednetworksunderlyingphenotypicconvergenceinasystematicallyphenotypedcohortwithheterogeneousintellectualdisability AT yangxiao genefocusednetworksunderlyingphenotypicconvergenceinasystematicallyphenotypedcohortwithheterogeneousintellectualdisability AT pengwei genefocusednetworksunderlyingphenotypicconvergenceinasystematicallyphenotypedcohortwithheterogeneousintellectualdisability AT zhangwanqiao genefocusednetworksunderlyingphenotypicconvergenceinasystematicallyphenotypedcohortwithheterogeneousintellectualdisability AT liangjinyu genefocusednetworksunderlyingphenotypicconvergenceinasystematicallyphenotypedcohortwithheterogeneousintellectualdisability AT zhuweidong genefocusednetworksunderlyingphenotypicconvergenceinasystematicallyphenotypedcohortwithheterogeneousintellectualdisability AT jiangtaijiao genefocusednetworksunderlyingphenotypicconvergenceinasystematicallyphenotypedcohortwithheterogeneousintellectualdisability AT zhangxinlei genefocusednetworksunderlyingphenotypicconvergenceinasystematicallyphenotypedcohortwithheterogeneousintellectualdisability AT fengzhichun genefocusednetworksunderlyingphenotypicconvergenceinasystematicallyphenotypedcohortwithheterogeneousintellectualdisability |