Cargando…
Aggregation of population‐based genetic variation over protein domain homologues and its potential use in genetic diagnostics
Whole exomes of patients with a genetic disorder are nowadays routinely sequenced but interpretation of the identified genetic variants remains a major challenge. The increased availability of population‐based human genetic variation has given rise to measures of genetic tolerance that have been use...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley and Sons Inc.
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5656839/ https://www.ncbi.nlm.nih.gov/pubmed/28815929 http://dx.doi.org/10.1002/humu.23313 |
_version_ | 1783273776419438592 |
---|---|
author | Wiel, Laurens Venselaar, Hanka Veltman, Joris A. Vriend, Gert Gilissen, Christian |
author_facet | Wiel, Laurens Venselaar, Hanka Veltman, Joris A. Vriend, Gert Gilissen, Christian |
author_sort | Wiel, Laurens |
collection | PubMed |
description | Whole exomes of patients with a genetic disorder are nowadays routinely sequenced but interpretation of the identified genetic variants remains a major challenge. The increased availability of population‐based human genetic variation has given rise to measures of genetic tolerance that have been used, for example, to predict disease‐causing genes in neurodevelopmental disorders. Here, we investigated whether combining variant information from homologous protein domains can improve variant interpretation. For this purpose, we developed a framework that maps population variation and known pathogenic mutations onto 2,750 “meta‐domains.” These meta‐domains consist of 30,853 homologous Pfam protein domain instances that cover 36% of all human protein coding sequences. We find that genetic tolerance is consistent across protein domain homologues, and that patterns of genetic tolerance faithfully mimic patterns of evolutionary conservation. Furthermore, for a significant fraction (68%) of the meta‐domains high‐frequency population variation re‐occurs at the same positions across domain homologues more often than expected. In addition, we observe that the presence of pathogenic missense variants at an aligned homologous domain position is often paired with the absence of population variation and vice versa. The use of these meta‐domains can improve the interpretation of genetic variation. |
format | Online Article Text |
id | pubmed-5656839 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | John Wiley and Sons Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-56568392017-11-01 Aggregation of population‐based genetic variation over protein domain homologues and its potential use in genetic diagnostics Wiel, Laurens Venselaar, Hanka Veltman, Joris A. Vriend, Gert Gilissen, Christian Hum Mutat Informatics Whole exomes of patients with a genetic disorder are nowadays routinely sequenced but interpretation of the identified genetic variants remains a major challenge. The increased availability of population‐based human genetic variation has given rise to measures of genetic tolerance that have been used, for example, to predict disease‐causing genes in neurodevelopmental disorders. Here, we investigated whether combining variant information from homologous protein domains can improve variant interpretation. For this purpose, we developed a framework that maps population variation and known pathogenic mutations onto 2,750 “meta‐domains.” These meta‐domains consist of 30,853 homologous Pfam protein domain instances that cover 36% of all human protein coding sequences. We find that genetic tolerance is consistent across protein domain homologues, and that patterns of genetic tolerance faithfully mimic patterns of evolutionary conservation. Furthermore, for a significant fraction (68%) of the meta‐domains high‐frequency population variation re‐occurs at the same positions across domain homologues more often than expected. In addition, we observe that the presence of pathogenic missense variants at an aligned homologous domain position is often paired with the absence of population variation and vice versa. The use of these meta‐domains can improve the interpretation of genetic variation. John Wiley and Sons Inc. 2017-08-31 2017-11 /pmc/articles/PMC5656839/ /pubmed/28815929 http://dx.doi.org/10.1002/humu.23313 Text en © 2017 The Authors. Human Mutation published by Wiley Periodicals, Inc. This is an open access article under the terms of the Creative Commons Attribution‐NonCommercial (http://creativecommons.org/licenses/by-nc/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes. |
spellingShingle | Informatics Wiel, Laurens Venselaar, Hanka Veltman, Joris A. Vriend, Gert Gilissen, Christian Aggregation of population‐based genetic variation over protein domain homologues and its potential use in genetic diagnostics |
title | Aggregation of population‐based genetic variation over protein domain homologues and its potential use in genetic diagnostics |
title_full | Aggregation of population‐based genetic variation over protein domain homologues and its potential use in genetic diagnostics |
title_fullStr | Aggregation of population‐based genetic variation over protein domain homologues and its potential use in genetic diagnostics |
title_full_unstemmed | Aggregation of population‐based genetic variation over protein domain homologues and its potential use in genetic diagnostics |
title_short | Aggregation of population‐based genetic variation over protein domain homologues and its potential use in genetic diagnostics |
title_sort | aggregation of population‐based genetic variation over protein domain homologues and its potential use in genetic diagnostics |
topic | Informatics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5656839/ https://www.ncbi.nlm.nih.gov/pubmed/28815929 http://dx.doi.org/10.1002/humu.23313 |
work_keys_str_mv | AT wiellaurens aggregationofpopulationbasedgeneticvariationoverproteindomainhomologuesanditspotentialuseingeneticdiagnostics AT venselaarhanka aggregationofpopulationbasedgeneticvariationoverproteindomainhomologuesanditspotentialuseingeneticdiagnostics AT veltmanjorisa aggregationofpopulationbasedgeneticvariationoverproteindomainhomologuesanditspotentialuseingeneticdiagnostics AT vriendgert aggregationofpopulationbasedgeneticvariationoverproteindomainhomologuesanditspotentialuseingeneticdiagnostics AT gilissenchristian aggregationofpopulationbasedgeneticvariationoverproteindomainhomologuesanditspotentialuseingeneticdiagnostics |