Cargando…
Hypothesis-free phenotype prediction within a genetics-first framework
Cohort-wide sequencing studies have revealed that the largest category of variants is those deemed ‘rare’, even for the subset located in coding regions (99% of known coding variants are seen in less than 1% of the population. Associative methods give some understanding how rare genetic variants inf...
Autores principales: | , , , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9938118/ https://www.ncbi.nlm.nih.gov/pubmed/36808136 http://dx.doi.org/10.1038/s41467-023-36634-6 |
_version_ | 1784890567566557184 |
---|---|
author | Lu, Chang Zaucha, Jan Gam, Rihab Fang, Hai Ben Smithers Oates, Matt E. Bernabe-Rubio, Miguel Williams, James Zelenka, Natalie Pandurangan, Arun Prasad Tandon, Himani Shihab, Hashem Kalaivani, Raju Sung, Minkyung Sardar, Adam J. Tzovoras, Bastian Greshake Danovi, Davide Gough, Julian |
author_facet | Lu, Chang Zaucha, Jan Gam, Rihab Fang, Hai Ben Smithers Oates, Matt E. Bernabe-Rubio, Miguel Williams, James Zelenka, Natalie Pandurangan, Arun Prasad Tandon, Himani Shihab, Hashem Kalaivani, Raju Sung, Minkyung Sardar, Adam J. Tzovoras, Bastian Greshake Danovi, Davide Gough, Julian |
author_sort | Lu, Chang |
collection | PubMed |
description | Cohort-wide sequencing studies have revealed that the largest category of variants is those deemed ‘rare’, even for the subset located in coding regions (99% of known coding variants are seen in less than 1% of the population. Associative methods give some understanding how rare genetic variants influence disease and organism-level phenotypes. But here we show that additional discoveries can be made through a knowledge-based approach using protein domains and ontologies (function and phenotype) that considers all coding variants regardless of allele frequency. We describe an ab initio, genetics-first method making molecular knowledge-based interpretations for exome-wide non-synonymous variants for phenotypes at the organism and cellular level. By using this reverse approach, we identify plausible genetic causes for developmental disorders that have eluded other established methods and present molecular hypotheses for the causal genetics of 40 phenotypes generated from a direct-to-consumer genotype cohort. This system offers a chance to extract further discovery from genetic data after standard tools have been applied. |
format | Online Article Text |
id | pubmed-9938118 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-99381182023-02-19 Hypothesis-free phenotype prediction within a genetics-first framework Lu, Chang Zaucha, Jan Gam, Rihab Fang, Hai Ben Smithers Oates, Matt E. Bernabe-Rubio, Miguel Williams, James Zelenka, Natalie Pandurangan, Arun Prasad Tandon, Himani Shihab, Hashem Kalaivani, Raju Sung, Minkyung Sardar, Adam J. Tzovoras, Bastian Greshake Danovi, Davide Gough, Julian Nat Commun Article Cohort-wide sequencing studies have revealed that the largest category of variants is those deemed ‘rare’, even for the subset located in coding regions (99% of known coding variants are seen in less than 1% of the population. Associative methods give some understanding how rare genetic variants influence disease and organism-level phenotypes. But here we show that additional discoveries can be made through a knowledge-based approach using protein domains and ontologies (function and phenotype) that considers all coding variants regardless of allele frequency. We describe an ab initio, genetics-first method making molecular knowledge-based interpretations for exome-wide non-synonymous variants for phenotypes at the organism and cellular level. By using this reverse approach, we identify plausible genetic causes for developmental disorders that have eluded other established methods and present molecular hypotheses for the causal genetics of 40 phenotypes generated from a direct-to-consumer genotype cohort. This system offers a chance to extract further discovery from genetic data after standard tools have been applied. Nature Publishing Group UK 2023-02-17 /pmc/articles/PMC9938118/ /pubmed/36808136 http://dx.doi.org/10.1038/s41467-023-36634-6 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Lu, Chang Zaucha, Jan Gam, Rihab Fang, Hai Ben Smithers Oates, Matt E. Bernabe-Rubio, Miguel Williams, James Zelenka, Natalie Pandurangan, Arun Prasad Tandon, Himani Shihab, Hashem Kalaivani, Raju Sung, Minkyung Sardar, Adam J. Tzovoras, Bastian Greshake Danovi, Davide Gough, Julian Hypothesis-free phenotype prediction within a genetics-first framework |
title | Hypothesis-free phenotype prediction within a genetics-first framework |
title_full | Hypothesis-free phenotype prediction within a genetics-first framework |
title_fullStr | Hypothesis-free phenotype prediction within a genetics-first framework |
title_full_unstemmed | Hypothesis-free phenotype prediction within a genetics-first framework |
title_short | Hypothesis-free phenotype prediction within a genetics-first framework |
title_sort | hypothesis-free phenotype prediction within a genetics-first framework |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9938118/ https://www.ncbi.nlm.nih.gov/pubmed/36808136 http://dx.doi.org/10.1038/s41467-023-36634-6 |
work_keys_str_mv | AT luchang hypothesisfreephenotypepredictionwithinageneticsfirstframework AT zauchajan hypothesisfreephenotypepredictionwithinageneticsfirstframework AT gamrihab hypothesisfreephenotypepredictionwithinageneticsfirstframework AT fanghai hypothesisfreephenotypepredictionwithinageneticsfirstframework AT bensmithers hypothesisfreephenotypepredictionwithinageneticsfirstframework AT oatesmatte hypothesisfreephenotypepredictionwithinageneticsfirstframework AT bernaberubiomiguel hypothesisfreephenotypepredictionwithinageneticsfirstframework AT williamsjames hypothesisfreephenotypepredictionwithinageneticsfirstframework AT zelenkanatalie hypothesisfreephenotypepredictionwithinageneticsfirstframework AT panduranganarunprasad hypothesisfreephenotypepredictionwithinageneticsfirstframework AT tandonhimani hypothesisfreephenotypepredictionwithinageneticsfirstframework AT shihabhashem hypothesisfreephenotypepredictionwithinageneticsfirstframework AT kalaivaniraju hypothesisfreephenotypepredictionwithinageneticsfirstframework AT sungminkyung hypothesisfreephenotypepredictionwithinageneticsfirstframework AT sardaradamj hypothesisfreephenotypepredictionwithinageneticsfirstframework AT tzovorasbastiangreshake hypothesisfreephenotypepredictionwithinageneticsfirstframework AT danovidavide hypothesisfreephenotypepredictionwithinageneticsfirstframework AT goughjulian hypothesisfreephenotypepredictionwithinageneticsfirstframework |