Cargando…
Gene family information facilitates variant interpretation and identification of disease-associated genes in neurodevelopmental disorders
BACKGROUND: Classifying pathogenicity of missense variants represents a major challenge in clinical practice during the diagnoses of rare and genetic heterogeneous neurodevelopmental disorders (NDDs). While orthologous gene conservation is commonly employed in variant annotation, approximately 80% o...
Autores principales: | , , , , , , , , , , , , , , , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7079346/ https://www.ncbi.nlm.nih.gov/pubmed/32183904 http://dx.doi.org/10.1186/s13073-020-00725-6 |
_version_ | 1783507802089586688 |
---|---|
author | Lal, Dennis May, Patrick Perez-Palma, Eduardo Samocha, Kaitlin E. Kosmicki, Jack A. Robinson, Elise B. Møller, Rikke S. Krause, Roland Nürnberg, Peter Weckhuysen, Sarah De Jonghe, Peter Guerrini, Renzo Niestroj, Lisa M. Du, Juliana Marini, Carla Ware, James S. Kurki, Mitja Gormley, Padhraig Tang, Sha Wu, Sitao Biskup, Saskia Poduri, Annapurna Neubauer, Bernd A. Koeleman, Bobby P. C. Helbig, Katherine L. Weber, Yvonne G. Helbig, Ingo Majithia, Amit R. Palotie, Aarno Daly, Mark J. |
author_facet | Lal, Dennis May, Patrick Perez-Palma, Eduardo Samocha, Kaitlin E. Kosmicki, Jack A. Robinson, Elise B. Møller, Rikke S. Krause, Roland Nürnberg, Peter Weckhuysen, Sarah De Jonghe, Peter Guerrini, Renzo Niestroj, Lisa M. Du, Juliana Marini, Carla Ware, James S. Kurki, Mitja Gormley, Padhraig Tang, Sha Wu, Sitao Biskup, Saskia Poduri, Annapurna Neubauer, Bernd A. Koeleman, Bobby P. C. Helbig, Katherine L. Weber, Yvonne G. Helbig, Ingo Majithia, Amit R. Palotie, Aarno Daly, Mark J. |
author_sort | Lal, Dennis |
collection | PubMed |
description | BACKGROUND: Classifying pathogenicity of missense variants represents a major challenge in clinical practice during the diagnoses of rare and genetic heterogeneous neurodevelopmental disorders (NDDs). While orthologous gene conservation is commonly employed in variant annotation, approximately 80% of known disease-associated genes belong to gene families. The use of gene family information for disease gene discovery and variant interpretation has not yet been investigated on a genome-wide scale. We empirically evaluate whether paralog-conserved or non-conserved sites in human gene families are important in NDDs. METHODS: Gene family information was collected from Ensembl. Paralog-conserved sites were defined based on paralog sequence alignments; 10,068 NDD patients and 2078 controls were statistically evaluated for de novo variant burden in gene families. RESULTS: We demonstrate that disease-associated missense variants are enriched at paralog-conserved sites across all disease groups and inheritance models tested. We developed a gene family de novo enrichment framework that identified 43 exome-wide enriched gene families including 98 de novo variant carrying genes in NDD patients of which 28 represent novel candidate genes for NDD which are brain expressed and under evolutionary constraint. CONCLUSION: This study represents the first method to incorporate gene family information into a statistical framework to interpret variant data for NDDs and to discover new NDD-associated genes. |
format | Online Article Text |
id | pubmed-7079346 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-70793462020-03-23 Gene family information facilitates variant interpretation and identification of disease-associated genes in neurodevelopmental disorders Lal, Dennis May, Patrick Perez-Palma, Eduardo Samocha, Kaitlin E. Kosmicki, Jack A. Robinson, Elise B. Møller, Rikke S. Krause, Roland Nürnberg, Peter Weckhuysen, Sarah De Jonghe, Peter Guerrini, Renzo Niestroj, Lisa M. Du, Juliana Marini, Carla Ware, James S. Kurki, Mitja Gormley, Padhraig Tang, Sha Wu, Sitao Biskup, Saskia Poduri, Annapurna Neubauer, Bernd A. Koeleman, Bobby P. C. Helbig, Katherine L. Weber, Yvonne G. Helbig, Ingo Majithia, Amit R. Palotie, Aarno Daly, Mark J. Genome Med Research BACKGROUND: Classifying pathogenicity of missense variants represents a major challenge in clinical practice during the diagnoses of rare and genetic heterogeneous neurodevelopmental disorders (NDDs). While orthologous gene conservation is commonly employed in variant annotation, approximately 80% of known disease-associated genes belong to gene families. The use of gene family information for disease gene discovery and variant interpretation has not yet been investigated on a genome-wide scale. We empirically evaluate whether paralog-conserved or non-conserved sites in human gene families are important in NDDs. METHODS: Gene family information was collected from Ensembl. Paralog-conserved sites were defined based on paralog sequence alignments; 10,068 NDD patients and 2078 controls were statistically evaluated for de novo variant burden in gene families. RESULTS: We demonstrate that disease-associated missense variants are enriched at paralog-conserved sites across all disease groups and inheritance models tested. We developed a gene family de novo enrichment framework that identified 43 exome-wide enriched gene families including 98 de novo variant carrying genes in NDD patients of which 28 represent novel candidate genes for NDD which are brain expressed and under evolutionary constraint. CONCLUSION: This study represents the first method to incorporate gene family information into a statistical framework to interpret variant data for NDDs and to discover new NDD-associated genes. BioMed Central 2020-03-17 /pmc/articles/PMC7079346/ /pubmed/32183904 http://dx.doi.org/10.1186/s13073-020-00725-6 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Lal, Dennis May, Patrick Perez-Palma, Eduardo Samocha, Kaitlin E. Kosmicki, Jack A. Robinson, Elise B. Møller, Rikke S. Krause, Roland Nürnberg, Peter Weckhuysen, Sarah De Jonghe, Peter Guerrini, Renzo Niestroj, Lisa M. Du, Juliana Marini, Carla Ware, James S. Kurki, Mitja Gormley, Padhraig Tang, Sha Wu, Sitao Biskup, Saskia Poduri, Annapurna Neubauer, Bernd A. Koeleman, Bobby P. C. Helbig, Katherine L. Weber, Yvonne G. Helbig, Ingo Majithia, Amit R. Palotie, Aarno Daly, Mark J. Gene family information facilitates variant interpretation and identification of disease-associated genes in neurodevelopmental disorders |
title | Gene family information facilitates variant interpretation and identification of disease-associated genes in neurodevelopmental disorders |
title_full | Gene family information facilitates variant interpretation and identification of disease-associated genes in neurodevelopmental disorders |
title_fullStr | Gene family information facilitates variant interpretation and identification of disease-associated genes in neurodevelopmental disorders |
title_full_unstemmed | Gene family information facilitates variant interpretation and identification of disease-associated genes in neurodevelopmental disorders |
title_short | Gene family information facilitates variant interpretation and identification of disease-associated genes in neurodevelopmental disorders |
title_sort | gene family information facilitates variant interpretation and identification of disease-associated genes in neurodevelopmental disorders |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7079346/ https://www.ncbi.nlm.nih.gov/pubmed/32183904 http://dx.doi.org/10.1186/s13073-020-00725-6 |
work_keys_str_mv | AT laldennis genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders AT maypatrick genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders AT perezpalmaeduardo genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders AT samochakaitline genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders AT kosmickijacka genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders AT robinsoneliseb genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders AT møllerrikkes genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders AT krauseroland genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders AT nurnbergpeter genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders AT weckhuysensarah genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders AT dejonghepeter genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders AT guerrinirenzo genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders AT niestrojlisam genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders AT dujuliana genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders AT marinicarla genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders AT genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders AT warejamess genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders AT kurkimitja genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders AT gormleypadhraig genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders AT tangsha genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders AT wusitao genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders AT biskupsaskia genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders AT poduriannapurna genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders AT neubauerbernda genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders AT koelemanbobbypc genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders AT helbigkatherinel genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders AT weberyvonneg genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders AT helbigingo genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders AT majithiaamitr genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders AT palotieaarno genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders AT dalymarkj genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders |