Cargando…

Gene family information facilitates variant interpretation and identification of disease-associated genes in neurodevelopmental disorders

BACKGROUND: Classifying pathogenicity of missense variants represents a major challenge in clinical practice during the diagnoses of rare and genetic heterogeneous neurodevelopmental disorders (NDDs). While orthologous gene conservation is commonly employed in variant annotation, approximately 80% o...

Descripción completa

Detalles Bibliográficos
Autores principales: Lal, Dennis, May, Patrick, Perez-Palma, Eduardo, Samocha, Kaitlin E., Kosmicki, Jack A., Robinson, Elise B., Møller, Rikke S., Krause, Roland, Nürnberg, Peter, Weckhuysen, Sarah, De Jonghe, Peter, Guerrini, Renzo, Niestroj, Lisa M., Du, Juliana, Marini, Carla, Ware, James S., Kurki, Mitja, Gormley, Padhraig, Tang, Sha, Wu, Sitao, Biskup, Saskia, Poduri, Annapurna, Neubauer, Bernd A., Koeleman, Bobby P. C., Helbig, Katherine L., Weber, Yvonne G., Helbig, Ingo, Majithia, Amit R., Palotie, Aarno, Daly, Mark J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7079346/
https://www.ncbi.nlm.nih.gov/pubmed/32183904
http://dx.doi.org/10.1186/s13073-020-00725-6
_version_ 1783507802089586688
author Lal, Dennis
May, Patrick
Perez-Palma, Eduardo
Samocha, Kaitlin E.
Kosmicki, Jack A.
Robinson, Elise B.
Møller, Rikke S.
Krause, Roland
Nürnberg, Peter
Weckhuysen, Sarah
De Jonghe, Peter
Guerrini, Renzo
Niestroj, Lisa M.
Du, Juliana
Marini, Carla
Ware, James S.
Kurki, Mitja
Gormley, Padhraig
Tang, Sha
Wu, Sitao
Biskup, Saskia
Poduri, Annapurna
Neubauer, Bernd A.
Koeleman, Bobby P. C.
Helbig, Katherine L.
Weber, Yvonne G.
Helbig, Ingo
Majithia, Amit R.
Palotie, Aarno
Daly, Mark J.
author_facet Lal, Dennis
May, Patrick
Perez-Palma, Eduardo
Samocha, Kaitlin E.
Kosmicki, Jack A.
Robinson, Elise B.
Møller, Rikke S.
Krause, Roland
Nürnberg, Peter
Weckhuysen, Sarah
De Jonghe, Peter
Guerrini, Renzo
Niestroj, Lisa M.
Du, Juliana
Marini, Carla
Ware, James S.
Kurki, Mitja
Gormley, Padhraig
Tang, Sha
Wu, Sitao
Biskup, Saskia
Poduri, Annapurna
Neubauer, Bernd A.
Koeleman, Bobby P. C.
Helbig, Katherine L.
Weber, Yvonne G.
Helbig, Ingo
Majithia, Amit R.
Palotie, Aarno
Daly, Mark J.
author_sort Lal, Dennis
collection PubMed
description BACKGROUND: Classifying pathogenicity of missense variants represents a major challenge in clinical practice during the diagnoses of rare and genetic heterogeneous neurodevelopmental disorders (NDDs). While orthologous gene conservation is commonly employed in variant annotation, approximately 80% of known disease-associated genes belong to gene families. The use of gene family information for disease gene discovery and variant interpretation has not yet been investigated on a genome-wide scale. We empirically evaluate whether paralog-conserved or non-conserved sites in human gene families are important in NDDs. METHODS: Gene family information was collected from Ensembl. Paralog-conserved sites were defined based on paralog sequence alignments; 10,068 NDD patients and 2078 controls were statistically evaluated for de novo variant burden in gene families. RESULTS: We demonstrate that disease-associated missense variants are enriched at paralog-conserved sites across all disease groups and inheritance models tested. We developed a gene family de novo enrichment framework that identified 43 exome-wide enriched gene families including 98 de novo variant carrying genes in NDD patients of which 28 represent novel candidate genes for NDD which are brain expressed and under evolutionary constraint. CONCLUSION: This study represents the first method to incorporate gene family information into a statistical framework to interpret variant data for NDDs and to discover new NDD-associated genes.
format Online
Article
Text
id pubmed-7079346
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-70793462020-03-23 Gene family information facilitates variant interpretation and identification of disease-associated genes in neurodevelopmental disorders Lal, Dennis May, Patrick Perez-Palma, Eduardo Samocha, Kaitlin E. Kosmicki, Jack A. Robinson, Elise B. Møller, Rikke S. Krause, Roland Nürnberg, Peter Weckhuysen, Sarah De Jonghe, Peter Guerrini, Renzo Niestroj, Lisa M. Du, Juliana Marini, Carla Ware, James S. Kurki, Mitja Gormley, Padhraig Tang, Sha Wu, Sitao Biskup, Saskia Poduri, Annapurna Neubauer, Bernd A. Koeleman, Bobby P. C. Helbig, Katherine L. Weber, Yvonne G. Helbig, Ingo Majithia, Amit R. Palotie, Aarno Daly, Mark J. Genome Med Research BACKGROUND: Classifying pathogenicity of missense variants represents a major challenge in clinical practice during the diagnoses of rare and genetic heterogeneous neurodevelopmental disorders (NDDs). While orthologous gene conservation is commonly employed in variant annotation, approximately 80% of known disease-associated genes belong to gene families. The use of gene family information for disease gene discovery and variant interpretation has not yet been investigated on a genome-wide scale. We empirically evaluate whether paralog-conserved or non-conserved sites in human gene families are important in NDDs. METHODS: Gene family information was collected from Ensembl. Paralog-conserved sites were defined based on paralog sequence alignments; 10,068 NDD patients and 2078 controls were statistically evaluated for de novo variant burden in gene families. RESULTS: We demonstrate that disease-associated missense variants are enriched at paralog-conserved sites across all disease groups and inheritance models tested. We developed a gene family de novo enrichment framework that identified 43 exome-wide enriched gene families including 98 de novo variant carrying genes in NDD patients of which 28 represent novel candidate genes for NDD which are brain expressed and under evolutionary constraint. CONCLUSION: This study represents the first method to incorporate gene family information into a statistical framework to interpret variant data for NDDs and to discover new NDD-associated genes. BioMed Central 2020-03-17 /pmc/articles/PMC7079346/ /pubmed/32183904 http://dx.doi.org/10.1186/s13073-020-00725-6 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Lal, Dennis
May, Patrick
Perez-Palma, Eduardo
Samocha, Kaitlin E.
Kosmicki, Jack A.
Robinson, Elise B.
Møller, Rikke S.
Krause, Roland
Nürnberg, Peter
Weckhuysen, Sarah
De Jonghe, Peter
Guerrini, Renzo
Niestroj, Lisa M.
Du, Juliana
Marini, Carla
Ware, James S.
Kurki, Mitja
Gormley, Padhraig
Tang, Sha
Wu, Sitao
Biskup, Saskia
Poduri, Annapurna
Neubauer, Bernd A.
Koeleman, Bobby P. C.
Helbig, Katherine L.
Weber, Yvonne G.
Helbig, Ingo
Majithia, Amit R.
Palotie, Aarno
Daly, Mark J.
Gene family information facilitates variant interpretation and identification of disease-associated genes in neurodevelopmental disorders
title Gene family information facilitates variant interpretation and identification of disease-associated genes in neurodevelopmental disorders
title_full Gene family information facilitates variant interpretation and identification of disease-associated genes in neurodevelopmental disorders
title_fullStr Gene family information facilitates variant interpretation and identification of disease-associated genes in neurodevelopmental disorders
title_full_unstemmed Gene family information facilitates variant interpretation and identification of disease-associated genes in neurodevelopmental disorders
title_short Gene family information facilitates variant interpretation and identification of disease-associated genes in neurodevelopmental disorders
title_sort gene family information facilitates variant interpretation and identification of disease-associated genes in neurodevelopmental disorders
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7079346/
https://www.ncbi.nlm.nih.gov/pubmed/32183904
http://dx.doi.org/10.1186/s13073-020-00725-6
work_keys_str_mv AT laldennis genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders
AT maypatrick genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders
AT perezpalmaeduardo genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders
AT samochakaitline genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders
AT kosmickijacka genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders
AT robinsoneliseb genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders
AT møllerrikkes genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders
AT krauseroland genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders
AT nurnbergpeter genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders
AT weckhuysensarah genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders
AT dejonghepeter genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders
AT guerrinirenzo genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders
AT niestrojlisam genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders
AT dujuliana genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders
AT marinicarla genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders
AT genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders
AT warejamess genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders
AT kurkimitja genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders
AT gormleypadhraig genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders
AT tangsha genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders
AT wusitao genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders
AT biskupsaskia genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders
AT poduriannapurna genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders
AT neubauerbernda genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders
AT koelemanbobbypc genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders
AT helbigkatherinel genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders
AT weberyvonneg genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders
AT helbigingo genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders
AT majithiaamitr genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders
AT palotieaarno genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders
AT dalymarkj genefamilyinformationfacilitatesvariantinterpretationandidentificationofdiseaseassociatedgenesinneurodevelopmentaldisorders