Cargando…

Estimating clinical risk in gene regions from population sequencing cohort data

While pathogenic variants significantly increase disease risk in many genes, it is still challenging to estimate the clinical impact of rare missense variants more generally. Even in genes such as BRCA2 or PALB2, large cohort studies find no significant association between breast cancer and rare ger...

Descripción completa

Detalles Bibliográficos
Autores principales: Fife, James D., Cassa, Christopher A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9882564/
https://www.ncbi.nlm.nih.gov/pubmed/36711752
http://dx.doi.org/10.1101/2023.01.06.23284281
_version_ 1784879318360391680
author Fife, James D.
Cassa, Christopher A.
author_facet Fife, James D.
Cassa, Christopher A.
author_sort Fife, James D.
collection PubMed
description While pathogenic variants significantly increase disease risk in many genes, it is still challenging to estimate the clinical impact of rare missense variants more generally. Even in genes such as BRCA2 or PALB2, large cohort studies find no significant association between breast cancer and rare germline missense variants collectively. Here we introduce REGatta, a method to improve the estimation of clinical risk in gene segments. We define gene regions using the density of pathogenic diagnostic reports, and then calculate the relative risk in each of these regions using 109,581 exome sequences from women in the UK Biobank. We apply this method in seven established breast cancer genes, and identify regions in each gene with statistically significant differences in breast cancer incidence for rare missense carriers. Even in genes with no significant difference at the gene level, this approach significantly separates rare missense variant carriers at higher or lower risk (BRCA2 regional model OR=1.46 [1.12, 1.79], p=0.0036 vs. BRCA2 gene model OR=0.96 [0.85,1.07] p=0.4171). We find high concordance between these regional risk estimates and high-throughput functional assays of variant impact. We compare with existing methods and the use of protein domains (Pfam) as regions, and find REGatta better identifies individuals at elevated or reduced risk. These regions provide useful priors which can potentially be used to improve risk assessment and clinical management.
format Online
Article
Text
id pubmed-9882564
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-98825642023-01-28 Estimating clinical risk in gene regions from population sequencing cohort data Fife, James D. Cassa, Christopher A. medRxiv Article While pathogenic variants significantly increase disease risk in many genes, it is still challenging to estimate the clinical impact of rare missense variants more generally. Even in genes such as BRCA2 or PALB2, large cohort studies find no significant association between breast cancer and rare germline missense variants collectively. Here we introduce REGatta, a method to improve the estimation of clinical risk in gene segments. We define gene regions using the density of pathogenic diagnostic reports, and then calculate the relative risk in each of these regions using 109,581 exome sequences from women in the UK Biobank. We apply this method in seven established breast cancer genes, and identify regions in each gene with statistically significant differences in breast cancer incidence for rare missense carriers. Even in genes with no significant difference at the gene level, this approach significantly separates rare missense variant carriers at higher or lower risk (BRCA2 regional model OR=1.46 [1.12, 1.79], p=0.0036 vs. BRCA2 gene model OR=0.96 [0.85,1.07] p=0.4171). We find high concordance between these regional risk estimates and high-throughput functional assays of variant impact. We compare with existing methods and the use of protein domains (Pfam) as regions, and find REGatta better identifies individuals at elevated or reduced risk. These regions provide useful priors which can potentially be used to improve risk assessment and clinical management. Cold Spring Harbor Laboratory 2023-01-09 /pmc/articles/PMC9882564/ /pubmed/36711752 http://dx.doi.org/10.1101/2023.01.06.23284281 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator.
spellingShingle Article
Fife, James D.
Cassa, Christopher A.
Estimating clinical risk in gene regions from population sequencing cohort data
title Estimating clinical risk in gene regions from population sequencing cohort data
title_full Estimating clinical risk in gene regions from population sequencing cohort data
title_fullStr Estimating clinical risk in gene regions from population sequencing cohort data
title_full_unstemmed Estimating clinical risk in gene regions from population sequencing cohort data
title_short Estimating clinical risk in gene regions from population sequencing cohort data
title_sort estimating clinical risk in gene regions from population sequencing cohort data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9882564/
https://www.ncbi.nlm.nih.gov/pubmed/36711752
http://dx.doi.org/10.1101/2023.01.06.23284281
work_keys_str_mv AT fifejamesd estimatingclinicalriskingeneregionsfrompopulationsequencingcohortdata
AT cassachristophera estimatingclinicalriskingeneregionsfrompopulationsequencingcohortdata