Cargando…

Hypothesis-free phenotype prediction within a genetics-first framework

Cohort-wide sequencing studies have revealed that the largest category of variants is those deemed ‘rare’, even for the subset located in coding regions (99% of known coding variants are seen in less than 1% of the population. Associative methods give some understanding how rare genetic variants inf...

Descripción completa

Detalles Bibliográficos
Autores principales: Lu, Chang, Zaucha, Jan, Gam, Rihab, Fang, Hai, Ben Smithers, Oates, Matt E., Bernabe-Rubio, Miguel, Williams, James, Zelenka, Natalie, Pandurangan, Arun Prasad, Tandon, Himani, Shihab, Hashem, Kalaivani, Raju, Sung, Minkyung, Sardar, Adam J., Tzovoras, Bastian Greshake, Danovi, Davide, Gough, Julian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9938118/
https://www.ncbi.nlm.nih.gov/pubmed/36808136
http://dx.doi.org/10.1038/s41467-023-36634-6
_version_ 1784890567566557184
author Lu, Chang
Zaucha, Jan
Gam, Rihab
Fang, Hai
Ben Smithers
Oates, Matt E.
Bernabe-Rubio, Miguel
Williams, James
Zelenka, Natalie
Pandurangan, Arun Prasad
Tandon, Himani
Shihab, Hashem
Kalaivani, Raju
Sung, Minkyung
Sardar, Adam J.
Tzovoras, Bastian Greshake
Danovi, Davide
Gough, Julian
author_facet Lu, Chang
Zaucha, Jan
Gam, Rihab
Fang, Hai
Ben Smithers
Oates, Matt E.
Bernabe-Rubio, Miguel
Williams, James
Zelenka, Natalie
Pandurangan, Arun Prasad
Tandon, Himani
Shihab, Hashem
Kalaivani, Raju
Sung, Minkyung
Sardar, Adam J.
Tzovoras, Bastian Greshake
Danovi, Davide
Gough, Julian
author_sort Lu, Chang
collection PubMed
description Cohort-wide sequencing studies have revealed that the largest category of variants is those deemed ‘rare’, even for the subset located in coding regions (99% of known coding variants are seen in less than 1% of the population. Associative methods give some understanding how rare genetic variants influence disease and organism-level phenotypes. But here we show that additional discoveries can be made through a knowledge-based approach using protein domains and ontologies (function and phenotype) that considers all coding variants regardless of allele frequency. We describe an ab initio, genetics-first method making molecular knowledge-based interpretations for exome-wide non-synonymous variants for phenotypes at the organism and cellular level. By using this reverse approach, we identify plausible genetic causes for developmental disorders that have eluded other established methods and present molecular hypotheses for the causal genetics of 40 phenotypes generated from a direct-to-consumer genotype cohort. This system offers a chance to extract further discovery from genetic data after standard tools have been applied.
format Online
Article
Text
id pubmed-9938118
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-99381182023-02-19 Hypothesis-free phenotype prediction within a genetics-first framework Lu, Chang Zaucha, Jan Gam, Rihab Fang, Hai Ben Smithers Oates, Matt E. Bernabe-Rubio, Miguel Williams, James Zelenka, Natalie Pandurangan, Arun Prasad Tandon, Himani Shihab, Hashem Kalaivani, Raju Sung, Minkyung Sardar, Adam J. Tzovoras, Bastian Greshake Danovi, Davide Gough, Julian Nat Commun Article Cohort-wide sequencing studies have revealed that the largest category of variants is those deemed ‘rare’, even for the subset located in coding regions (99% of known coding variants are seen in less than 1% of the population. Associative methods give some understanding how rare genetic variants influence disease and organism-level phenotypes. But here we show that additional discoveries can be made through a knowledge-based approach using protein domains and ontologies (function and phenotype) that considers all coding variants regardless of allele frequency. We describe an ab initio, genetics-first method making molecular knowledge-based interpretations for exome-wide non-synonymous variants for phenotypes at the organism and cellular level. By using this reverse approach, we identify plausible genetic causes for developmental disorders that have eluded other established methods and present molecular hypotheses for the causal genetics of 40 phenotypes generated from a direct-to-consumer genotype cohort. This system offers a chance to extract further discovery from genetic data after standard tools have been applied. Nature Publishing Group UK 2023-02-17 /pmc/articles/PMC9938118/ /pubmed/36808136 http://dx.doi.org/10.1038/s41467-023-36634-6 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Lu, Chang
Zaucha, Jan
Gam, Rihab
Fang, Hai
Ben Smithers
Oates, Matt E.
Bernabe-Rubio, Miguel
Williams, James
Zelenka, Natalie
Pandurangan, Arun Prasad
Tandon, Himani
Shihab, Hashem
Kalaivani, Raju
Sung, Minkyung
Sardar, Adam J.
Tzovoras, Bastian Greshake
Danovi, Davide
Gough, Julian
Hypothesis-free phenotype prediction within a genetics-first framework
title Hypothesis-free phenotype prediction within a genetics-first framework
title_full Hypothesis-free phenotype prediction within a genetics-first framework
title_fullStr Hypothesis-free phenotype prediction within a genetics-first framework
title_full_unstemmed Hypothesis-free phenotype prediction within a genetics-first framework
title_short Hypothesis-free phenotype prediction within a genetics-first framework
title_sort hypothesis-free phenotype prediction within a genetics-first framework
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9938118/
https://www.ncbi.nlm.nih.gov/pubmed/36808136
http://dx.doi.org/10.1038/s41467-023-36634-6
work_keys_str_mv AT luchang hypothesisfreephenotypepredictionwithinageneticsfirstframework
AT zauchajan hypothesisfreephenotypepredictionwithinageneticsfirstframework
AT gamrihab hypothesisfreephenotypepredictionwithinageneticsfirstframework
AT fanghai hypothesisfreephenotypepredictionwithinageneticsfirstframework
AT bensmithers hypothesisfreephenotypepredictionwithinageneticsfirstframework
AT oatesmatte hypothesisfreephenotypepredictionwithinageneticsfirstframework
AT bernaberubiomiguel hypothesisfreephenotypepredictionwithinageneticsfirstframework
AT williamsjames hypothesisfreephenotypepredictionwithinageneticsfirstframework
AT zelenkanatalie hypothesisfreephenotypepredictionwithinageneticsfirstframework
AT panduranganarunprasad hypothesisfreephenotypepredictionwithinageneticsfirstframework
AT tandonhimani hypothesisfreephenotypepredictionwithinageneticsfirstframework
AT shihabhashem hypothesisfreephenotypepredictionwithinageneticsfirstframework
AT kalaivaniraju hypothesisfreephenotypepredictionwithinageneticsfirstframework
AT sungminkyung hypothesisfreephenotypepredictionwithinageneticsfirstframework
AT sardaradamj hypothesisfreephenotypepredictionwithinageneticsfirstframework
AT tzovorasbastiangreshake hypothesisfreephenotypepredictionwithinageneticsfirstframework
AT danovidavide hypothesisfreephenotypepredictionwithinageneticsfirstframework
AT goughjulian hypothesisfreephenotypepredictionwithinageneticsfirstframework