Cargando…
Estimating clinical risk in gene regions from population sequencing cohort data
While pathogenic variants significantly increase disease risk in many genes, it is still challenging to estimate the clinical impact of rare missense variants more generally. Even in genes such as BRCA2 or PALB2, large cohort studies find no significant association between breast cancer and rare ger...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9882564/ https://www.ncbi.nlm.nih.gov/pubmed/36711752 http://dx.doi.org/10.1101/2023.01.06.23284281 |
_version_ | 1784879318360391680 |
---|---|
author | Fife, James D. Cassa, Christopher A. |
author_facet | Fife, James D. Cassa, Christopher A. |
author_sort | Fife, James D. |
collection | PubMed |
description | While pathogenic variants significantly increase disease risk in many genes, it is still challenging to estimate the clinical impact of rare missense variants more generally. Even in genes such as BRCA2 or PALB2, large cohort studies find no significant association between breast cancer and rare germline missense variants collectively. Here we introduce REGatta, a method to improve the estimation of clinical risk in gene segments. We define gene regions using the density of pathogenic diagnostic reports, and then calculate the relative risk in each of these regions using 109,581 exome sequences from women in the UK Biobank. We apply this method in seven established breast cancer genes, and identify regions in each gene with statistically significant differences in breast cancer incidence for rare missense carriers. Even in genes with no significant difference at the gene level, this approach significantly separates rare missense variant carriers at higher or lower risk (BRCA2 regional model OR=1.46 [1.12, 1.79], p=0.0036 vs. BRCA2 gene model OR=0.96 [0.85,1.07] p=0.4171). We find high concordance between these regional risk estimates and high-throughput functional assays of variant impact. We compare with existing methods and the use of protein domains (Pfam) as regions, and find REGatta better identifies individuals at elevated or reduced risk. These regions provide useful priors which can potentially be used to improve risk assessment and clinical management. |
format | Online Article Text |
id | pubmed-9882564 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Cold Spring Harbor Laboratory |
record_format | MEDLINE/PubMed |
spelling | pubmed-98825642023-01-28 Estimating clinical risk in gene regions from population sequencing cohort data Fife, James D. Cassa, Christopher A. medRxiv Article While pathogenic variants significantly increase disease risk in many genes, it is still challenging to estimate the clinical impact of rare missense variants more generally. Even in genes such as BRCA2 or PALB2, large cohort studies find no significant association between breast cancer and rare germline missense variants collectively. Here we introduce REGatta, a method to improve the estimation of clinical risk in gene segments. We define gene regions using the density of pathogenic diagnostic reports, and then calculate the relative risk in each of these regions using 109,581 exome sequences from women in the UK Biobank. We apply this method in seven established breast cancer genes, and identify regions in each gene with statistically significant differences in breast cancer incidence for rare missense carriers. Even in genes with no significant difference at the gene level, this approach significantly separates rare missense variant carriers at higher or lower risk (BRCA2 regional model OR=1.46 [1.12, 1.79], p=0.0036 vs. BRCA2 gene model OR=0.96 [0.85,1.07] p=0.4171). We find high concordance between these regional risk estimates and high-throughput functional assays of variant impact. We compare with existing methods and the use of protein domains (Pfam) as regions, and find REGatta better identifies individuals at elevated or reduced risk. These regions provide useful priors which can potentially be used to improve risk assessment and clinical management. Cold Spring Harbor Laboratory 2023-01-09 /pmc/articles/PMC9882564/ /pubmed/36711752 http://dx.doi.org/10.1101/2023.01.06.23284281 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator. |
spellingShingle | Article Fife, James D. Cassa, Christopher A. Estimating clinical risk in gene regions from population sequencing cohort data |
title | Estimating clinical risk in gene regions from population sequencing cohort data |
title_full | Estimating clinical risk in gene regions from population sequencing cohort data |
title_fullStr | Estimating clinical risk in gene regions from population sequencing cohort data |
title_full_unstemmed | Estimating clinical risk in gene regions from population sequencing cohort data |
title_short | Estimating clinical risk in gene regions from population sequencing cohort data |
title_sort | estimating clinical risk in gene regions from population sequencing cohort data |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9882564/ https://www.ncbi.nlm.nih.gov/pubmed/36711752 http://dx.doi.org/10.1101/2023.01.06.23284281 |
work_keys_str_mv | AT fifejamesd estimatingclinicalriskingeneregionsfrompopulationsequencingcohortdata AT cassachristophera estimatingclinicalriskingeneregionsfrompopulationsequencingcohortdata |