Cargando…

Finding Diagnostically Useful Patterns in Quantitative Phenotypic Data

Trio-based whole-exome sequence (WES) data have established confident genetic diagnoses in ∼40% of previously undiagnosed individuals recruited to the Deciphering Developmental Disorders (DDD) study. Here we aim to use the breadth of phenotypic information recorded in DDD to augment diagnosis and di...

Descripción completa

Detalles Bibliográficos
Autores principales: Aitken, Stuart, Firth, Helen V., McRae, Jeremy, Halachev, Mihail, Kini, Usha, Parker, Michael J., Lees, Melissa M., Lachlan, Katherine, Sarkar, Ajoy, Joss, Shelagh, Splitt, Miranda, McKee, Shane, Németh, Andrea H., Scott, Richard H., Wright, Caroline F., Marsh, Joseph A., Hurles, Matthew E., FitzPatrick, David R.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6848993/
https://www.ncbi.nlm.nih.gov/pubmed/31607427
http://dx.doi.org/10.1016/j.ajhg.2019.09.015
_version_ 1783469114649477120
author Aitken, Stuart
Firth, Helen V.
McRae, Jeremy
Halachev, Mihail
Kini, Usha
Parker, Michael J.
Lees, Melissa M.
Lachlan, Katherine
Sarkar, Ajoy
Joss, Shelagh
Splitt, Miranda
McKee, Shane
Németh, Andrea H.
Scott, Richard H.
Wright, Caroline F.
Marsh, Joseph A.
Hurles, Matthew E.
FitzPatrick, David R.
author_facet Aitken, Stuart
Firth, Helen V.
McRae, Jeremy
Halachev, Mihail
Kini, Usha
Parker, Michael J.
Lees, Melissa M.
Lachlan, Katherine
Sarkar, Ajoy
Joss, Shelagh
Splitt, Miranda
McKee, Shane
Németh, Andrea H.
Scott, Richard H.
Wright, Caroline F.
Marsh, Joseph A.
Hurles, Matthew E.
FitzPatrick, David R.
author_sort Aitken, Stuart
collection PubMed
description Trio-based whole-exome sequence (WES) data have established confident genetic diagnoses in ∼40% of previously undiagnosed individuals recruited to the Deciphering Developmental Disorders (DDD) study. Here we aim to use the breadth of phenotypic information recorded in DDD to augment diagnosis and disease variant discovery in probands. Median Euclidean distances (mEuD) were employed as a simple measure of similarity of quantitative phenotypic data within sets of ≥10 individuals with plausibly causative de novo mutations (DNM) in 28 different developmental disorder genes. 13/28 (46.4%) showed significant similarity for growth or developmental milestone metrics, 10/28 (35.7%) showed similarity in HPO term usage, and 12/28 (43%) showed no phenotypic similarity. Pairwise comparisons of individuals with high-impact inherited variants to the 32 individuals with causative DNM in ANKRD11 using only growth z-scores highlighted 5 likely causative inherited variants and two unrecognized DNM resulting in an 18% diagnostic uplift for this gene. Using an independent approach, naive Bayes classification of growth and developmental data produced reasonably discriminative models for the 24 DNM genes with sufficiently complete data. An unsupervised naive Bayes classification of 6,993 probands with WES data and sufficient phenotypic information defined 23 in silico syndromes (ISSs) and was used to test a “phenotype first” approach to the discovery of causative genotypes using WES variants strictly filtered on allele frequency, mutation consequence, and evidence of constraint in humans. This highlighted heterozygous de novo nonsynonymous variants in SPTBN2 as causative in three DDD probands.
format Online
Article
Text
id pubmed-6848993
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-68489932020-05-07 Finding Diagnostically Useful Patterns in Quantitative Phenotypic Data Aitken, Stuart Firth, Helen V. McRae, Jeremy Halachev, Mihail Kini, Usha Parker, Michael J. Lees, Melissa M. Lachlan, Katherine Sarkar, Ajoy Joss, Shelagh Splitt, Miranda McKee, Shane Németh, Andrea H. Scott, Richard H. Wright, Caroline F. Marsh, Joseph A. Hurles, Matthew E. FitzPatrick, David R. Am J Hum Genet Article Trio-based whole-exome sequence (WES) data have established confident genetic diagnoses in ∼40% of previously undiagnosed individuals recruited to the Deciphering Developmental Disorders (DDD) study. Here we aim to use the breadth of phenotypic information recorded in DDD to augment diagnosis and disease variant discovery in probands. Median Euclidean distances (mEuD) were employed as a simple measure of similarity of quantitative phenotypic data within sets of ≥10 individuals with plausibly causative de novo mutations (DNM) in 28 different developmental disorder genes. 13/28 (46.4%) showed significant similarity for growth or developmental milestone metrics, 10/28 (35.7%) showed similarity in HPO term usage, and 12/28 (43%) showed no phenotypic similarity. Pairwise comparisons of individuals with high-impact inherited variants to the 32 individuals with causative DNM in ANKRD11 using only growth z-scores highlighted 5 likely causative inherited variants and two unrecognized DNM resulting in an 18% diagnostic uplift for this gene. Using an independent approach, naive Bayes classification of growth and developmental data produced reasonably discriminative models for the 24 DNM genes with sufficiently complete data. An unsupervised naive Bayes classification of 6,993 probands with WES data and sufficient phenotypic information defined 23 in silico syndromes (ISSs) and was used to test a “phenotype first” approach to the discovery of causative genotypes using WES variants strictly filtered on allele frequency, mutation consequence, and evidence of constraint in humans. This highlighted heterozygous de novo nonsynonymous variants in SPTBN2 as causative in three DDD probands. Elsevier 2019-11-07 2019-10-10 /pmc/articles/PMC6848993/ /pubmed/31607427 http://dx.doi.org/10.1016/j.ajhg.2019.09.015 Text en © 2019 The Authors http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Article
Aitken, Stuart
Firth, Helen V.
McRae, Jeremy
Halachev, Mihail
Kini, Usha
Parker, Michael J.
Lees, Melissa M.
Lachlan, Katherine
Sarkar, Ajoy
Joss, Shelagh
Splitt, Miranda
McKee, Shane
Németh, Andrea H.
Scott, Richard H.
Wright, Caroline F.
Marsh, Joseph A.
Hurles, Matthew E.
FitzPatrick, David R.
Finding Diagnostically Useful Patterns in Quantitative Phenotypic Data
title Finding Diagnostically Useful Patterns in Quantitative Phenotypic Data
title_full Finding Diagnostically Useful Patterns in Quantitative Phenotypic Data
title_fullStr Finding Diagnostically Useful Patterns in Quantitative Phenotypic Data
title_full_unstemmed Finding Diagnostically Useful Patterns in Quantitative Phenotypic Data
title_short Finding Diagnostically Useful Patterns in Quantitative Phenotypic Data
title_sort finding diagnostically useful patterns in quantitative phenotypic data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6848993/
https://www.ncbi.nlm.nih.gov/pubmed/31607427
http://dx.doi.org/10.1016/j.ajhg.2019.09.015
work_keys_str_mv AT aitkenstuart findingdiagnosticallyusefulpatternsinquantitativephenotypicdata
AT firthhelenv findingdiagnosticallyusefulpatternsinquantitativephenotypicdata
AT mcraejeremy findingdiagnosticallyusefulpatternsinquantitativephenotypicdata
AT halachevmihail findingdiagnosticallyusefulpatternsinquantitativephenotypicdata
AT kiniusha findingdiagnosticallyusefulpatternsinquantitativephenotypicdata
AT parkermichaelj findingdiagnosticallyusefulpatternsinquantitativephenotypicdata
AT leesmelissam findingdiagnosticallyusefulpatternsinquantitativephenotypicdata
AT lachlankatherine findingdiagnosticallyusefulpatternsinquantitativephenotypicdata
AT sarkarajoy findingdiagnosticallyusefulpatternsinquantitativephenotypicdata
AT jossshelagh findingdiagnosticallyusefulpatternsinquantitativephenotypicdata
AT splittmiranda findingdiagnosticallyusefulpatternsinquantitativephenotypicdata
AT mckeeshane findingdiagnosticallyusefulpatternsinquantitativephenotypicdata
AT nemethandreah findingdiagnosticallyusefulpatternsinquantitativephenotypicdata
AT scottrichardh findingdiagnosticallyusefulpatternsinquantitativephenotypicdata
AT wrightcarolinef findingdiagnosticallyusefulpatternsinquantitativephenotypicdata
AT marshjosepha findingdiagnosticallyusefulpatternsinquantitativephenotypicdata
AT hurlesmatthewe findingdiagnosticallyusefulpatternsinquantitativephenotypicdata
AT fitzpatrickdavidr findingdiagnosticallyusefulpatternsinquantitativephenotypicdata
AT findingdiagnosticallyusefulpatternsinquantitativephenotypicdata