Cargando…

Aggregation of population‐based genetic variation over protein domain homologues and its potential use in genetic diagnostics

Whole exomes of patients with a genetic disorder are nowadays routinely sequenced but interpretation of the identified genetic variants remains a major challenge. The increased availability of population‐based human genetic variation has given rise to measures of genetic tolerance that have been use...

Descripción completa

Detalles Bibliográficos
Autores principales: Wiel, Laurens, Venselaar, Hanka, Veltman, Joris A., Vriend, Gert, Gilissen, Christian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5656839/
https://www.ncbi.nlm.nih.gov/pubmed/28815929
http://dx.doi.org/10.1002/humu.23313
_version_ 1783273776419438592
author Wiel, Laurens
Venselaar, Hanka
Veltman, Joris A.
Vriend, Gert
Gilissen, Christian
author_facet Wiel, Laurens
Venselaar, Hanka
Veltman, Joris A.
Vriend, Gert
Gilissen, Christian
author_sort Wiel, Laurens
collection PubMed
description Whole exomes of patients with a genetic disorder are nowadays routinely sequenced but interpretation of the identified genetic variants remains a major challenge. The increased availability of population‐based human genetic variation has given rise to measures of genetic tolerance that have been used, for example, to predict disease‐causing genes in neurodevelopmental disorders. Here, we investigated whether combining variant information from homologous protein domains can improve variant interpretation. For this purpose, we developed a framework that maps population variation and known pathogenic mutations onto 2,750 “meta‐domains.” These meta‐domains consist of 30,853 homologous Pfam protein domain instances that cover 36% of all human protein coding sequences. We find that genetic tolerance is consistent across protein domain homologues, and that patterns of genetic tolerance faithfully mimic patterns of evolutionary conservation. Furthermore, for a significant fraction (68%) of the meta‐domains high‐frequency population variation re‐occurs at the same positions across domain homologues more often than expected. In addition, we observe that the presence of pathogenic missense variants at an aligned homologous domain position is often paired with the absence of population variation and vice versa. The use of these meta‐domains can improve the interpretation of genetic variation.
format Online
Article
Text
id pubmed-5656839
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-56568392017-11-01 Aggregation of population‐based genetic variation over protein domain homologues and its potential use in genetic diagnostics Wiel, Laurens Venselaar, Hanka Veltman, Joris A. Vriend, Gert Gilissen, Christian Hum Mutat Informatics Whole exomes of patients with a genetic disorder are nowadays routinely sequenced but interpretation of the identified genetic variants remains a major challenge. The increased availability of population‐based human genetic variation has given rise to measures of genetic tolerance that have been used, for example, to predict disease‐causing genes in neurodevelopmental disorders. Here, we investigated whether combining variant information from homologous protein domains can improve variant interpretation. For this purpose, we developed a framework that maps population variation and known pathogenic mutations onto 2,750 “meta‐domains.” These meta‐domains consist of 30,853 homologous Pfam protein domain instances that cover 36% of all human protein coding sequences. We find that genetic tolerance is consistent across protein domain homologues, and that patterns of genetic tolerance faithfully mimic patterns of evolutionary conservation. Furthermore, for a significant fraction (68%) of the meta‐domains high‐frequency population variation re‐occurs at the same positions across domain homologues more often than expected. In addition, we observe that the presence of pathogenic missense variants at an aligned homologous domain position is often paired with the absence of population variation and vice versa. The use of these meta‐domains can improve the interpretation of genetic variation. John Wiley and Sons Inc. 2017-08-31 2017-11 /pmc/articles/PMC5656839/ /pubmed/28815929 http://dx.doi.org/10.1002/humu.23313 Text en © 2017 The Authors. Human Mutation published by Wiley Periodicals, Inc. This is an open access article under the terms of the Creative Commons Attribution‐NonCommercial (http://creativecommons.org/licenses/by-nc/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes.
spellingShingle Informatics
Wiel, Laurens
Venselaar, Hanka
Veltman, Joris A.
Vriend, Gert
Gilissen, Christian
Aggregation of population‐based genetic variation over protein domain homologues and its potential use in genetic diagnostics
title Aggregation of population‐based genetic variation over protein domain homologues and its potential use in genetic diagnostics
title_full Aggregation of population‐based genetic variation over protein domain homologues and its potential use in genetic diagnostics
title_fullStr Aggregation of population‐based genetic variation over protein domain homologues and its potential use in genetic diagnostics
title_full_unstemmed Aggregation of population‐based genetic variation over protein domain homologues and its potential use in genetic diagnostics
title_short Aggregation of population‐based genetic variation over protein domain homologues and its potential use in genetic diagnostics
title_sort aggregation of population‐based genetic variation over protein domain homologues and its potential use in genetic diagnostics
topic Informatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5656839/
https://www.ncbi.nlm.nih.gov/pubmed/28815929
http://dx.doi.org/10.1002/humu.23313
work_keys_str_mv AT wiellaurens aggregationofpopulationbasedgeneticvariationoverproteindomainhomologuesanditspotentialuseingeneticdiagnostics
AT venselaarhanka aggregationofpopulationbasedgeneticvariationoverproteindomainhomologuesanditspotentialuseingeneticdiagnostics
AT veltmanjorisa aggregationofpopulationbasedgeneticvariationoverproteindomainhomologuesanditspotentialuseingeneticdiagnostics
AT vriendgert aggregationofpopulationbasedgeneticvariationoverproteindomainhomologuesanditspotentialuseingeneticdiagnostics
AT gilissenchristian aggregationofpopulationbasedgeneticvariationoverproteindomainhomologuesanditspotentialuseingeneticdiagnostics