Cargando…
Unified inference of missense variant effects and gene constraints in the human genome
A challenge in medical genomics is to identify variants and genes associated with severe genetic disorders. Based on the premise that severe, early-onset disorders often result in a reduction of evolutionary fitness, several statistical methods have been developed to predict pathogenic variants or c...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7384676/ https://www.ncbi.nlm.nih.gov/pubmed/32667917 http://dx.doi.org/10.1371/journal.pgen.1008922 |
_version_ | 1783563647129223168 |
---|---|
author | Huang, Yi-Fei |
author_facet | Huang, Yi-Fei |
author_sort | Huang, Yi-Fei |
collection | PubMed |
description | A challenge in medical genomics is to identify variants and genes associated with severe genetic disorders. Based on the premise that severe, early-onset disorders often result in a reduction of evolutionary fitness, several statistical methods have been developed to predict pathogenic variants or constrained genes based on the signatures of negative selection in human populations. However, we currently lack a statistical framework to jointly predict deleterious variants and constrained genes from both variant-level features and gene-level selective constraints. Here we present such a unified approach, UNEECON, based on deep learning and population genetics. UNEECON treats the contributions of variant-level features and gene-level constraints as a variant-level fixed effect and a gene-level random effect, respectively. The sum of the fixed and random effects is then combined with an evolutionary model to infer the strength of negative selection at both variant and gene levels. Compared with previously published methods, UNEECON shows improved performance in predicting missense variants and protein-coding genes associated with autosomal dominant disorders, and feature importance analysis suggests that both gene-level selective constraints and variant-level predictors are important for accurate variant prioritization. Furthermore, based on UNEECON, we observe a low correlation between gene-level intolerance to missense mutations and that to loss-of-function mutations, which can be partially explained by the prevalence of disordered protein regions that are highly tolerant to missense mutations. Finally, we show that genes intolerant to both missense and loss-of-function mutations play key roles in the central nervous system and the autism spectrum disorders. Overall, UNEECON is a promising framework for both variant and gene prioritization. |
format | Online Article Text |
id | pubmed-7384676 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-73846762020-08-05 Unified inference of missense variant effects and gene constraints in the human genome Huang, Yi-Fei PLoS Genet Research Article A challenge in medical genomics is to identify variants and genes associated with severe genetic disorders. Based on the premise that severe, early-onset disorders often result in a reduction of evolutionary fitness, several statistical methods have been developed to predict pathogenic variants or constrained genes based on the signatures of negative selection in human populations. However, we currently lack a statistical framework to jointly predict deleterious variants and constrained genes from both variant-level features and gene-level selective constraints. Here we present such a unified approach, UNEECON, based on deep learning and population genetics. UNEECON treats the contributions of variant-level features and gene-level constraints as a variant-level fixed effect and a gene-level random effect, respectively. The sum of the fixed and random effects is then combined with an evolutionary model to infer the strength of negative selection at both variant and gene levels. Compared with previously published methods, UNEECON shows improved performance in predicting missense variants and protein-coding genes associated with autosomal dominant disorders, and feature importance analysis suggests that both gene-level selective constraints and variant-level predictors are important for accurate variant prioritization. Furthermore, based on UNEECON, we observe a low correlation between gene-level intolerance to missense mutations and that to loss-of-function mutations, which can be partially explained by the prevalence of disordered protein regions that are highly tolerant to missense mutations. Finally, we show that genes intolerant to both missense and loss-of-function mutations play key roles in the central nervous system and the autism spectrum disorders. Overall, UNEECON is a promising framework for both variant and gene prioritization. Public Library of Science 2020-07-15 /pmc/articles/PMC7384676/ /pubmed/32667917 http://dx.doi.org/10.1371/journal.pgen.1008922 Text en © 2020 Yi-Fei Huang http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Huang, Yi-Fei Unified inference of missense variant effects and gene constraints in the human genome |
title | Unified inference of missense variant effects and gene constraints in the human genome |
title_full | Unified inference of missense variant effects and gene constraints in the human genome |
title_fullStr | Unified inference of missense variant effects and gene constraints in the human genome |
title_full_unstemmed | Unified inference of missense variant effects and gene constraints in the human genome |
title_short | Unified inference of missense variant effects and gene constraints in the human genome |
title_sort | unified inference of missense variant effects and gene constraints in the human genome |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7384676/ https://www.ncbi.nlm.nih.gov/pubmed/32667917 http://dx.doi.org/10.1371/journal.pgen.1008922 |
work_keys_str_mv | AT huangyifei unifiedinferenceofmissensevarianteffectsandgeneconstraintsinthehumangenome |