Cargando…

Inferring compound heterozygosity from large-scale exome sequencing data

Recessive diseases arise when both the maternal and the paternal copies of a gene are impacted by a damaging genetic variant in the affected individual. When a patient carries two different potentially causal variants in a gene for a given disorder, accurate diagnosis requires determining that these...

Descripción completa

Detalles Bibliográficos
Autores principales: Guo, Michael H., Francioli, Laurent C., Stenton, Sarah L., Goodrich, Julia K., Watts, Nicholas A., Singer-Berk, Moriel, Groopman, Emily, Darnowsky, Philip W., Solomonson, Matthew, Baxter, Samantha, Tiao, Grace, Neale, Benjamin M., Hirschhorn, Joel N., Rehm, Heidi L., Daly, Mark J., O’Donnell-Luria, Anne, Karczewski, Konrad J., MacArthur, Daniel G., Samocha, Kaitlin E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10055215/
https://www.ncbi.nlm.nih.gov/pubmed/36993580
http://dx.doi.org/10.1101/2023.03.19.533370
_version_ 1785015839644188672
author Guo, Michael H.
Francioli, Laurent C.
Stenton, Sarah L.
Goodrich, Julia K.
Watts, Nicholas A.
Singer-Berk, Moriel
Groopman, Emily
Darnowsky, Philip W.
Solomonson, Matthew
Baxter, Samantha
Tiao, Grace
Neale, Benjamin M.
Hirschhorn, Joel N.
Rehm, Heidi L.
Daly, Mark J.
O’Donnell-Luria, Anne
Karczewski, Konrad J.
MacArthur, Daniel G.
Samocha, Kaitlin E.
author_facet Guo, Michael H.
Francioli, Laurent C.
Stenton, Sarah L.
Goodrich, Julia K.
Watts, Nicholas A.
Singer-Berk, Moriel
Groopman, Emily
Darnowsky, Philip W.
Solomonson, Matthew
Baxter, Samantha
Tiao, Grace
Neale, Benjamin M.
Hirschhorn, Joel N.
Rehm, Heidi L.
Daly, Mark J.
O’Donnell-Luria, Anne
Karczewski, Konrad J.
MacArthur, Daniel G.
Samocha, Kaitlin E.
author_sort Guo, Michael H.
collection PubMed
description Recessive diseases arise when both the maternal and the paternal copies of a gene are impacted by a damaging genetic variant in the affected individual. When a patient carries two different potentially causal variants in a gene for a given disorder, accurate diagnosis requires determining that these two variants occur on different copies of the chromosome (i.e., are in trans) rather than on the same copy (i.e. in cis). However, current approaches for determining phase, beyond parental testing, are limited in clinical settings. We developed a strategy for inferring phase for rare variant pairs within genes, leveraging genotypes observed in exome sequencing data from the Genome Aggregation Database (gnomAD v2, n=125,748). When applied to trio data where phase can be determined by transmission, our approach estimates phase with 95.7% accuracy and remains accurate even for very rare variants (allele frequency < 1×10(−4)). We also correctly phase 95.9% of variant pairs in a set of 293 patients with Mendelian conditions carrying presumed causal compound heterozygous variants. We provide a public resource of phasing estimates from gnomAD, including phasing estimates for coding variants across the genome and counts per gene of rare variants in trans, that can aid interpretation of rare co-occurring variants in the context of recessive disease.
format Online
Article
Text
id pubmed-10055215
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-100552152023-03-30 Inferring compound heterozygosity from large-scale exome sequencing data Guo, Michael H. Francioli, Laurent C. Stenton, Sarah L. Goodrich, Julia K. Watts, Nicholas A. Singer-Berk, Moriel Groopman, Emily Darnowsky, Philip W. Solomonson, Matthew Baxter, Samantha Tiao, Grace Neale, Benjamin M. Hirschhorn, Joel N. Rehm, Heidi L. Daly, Mark J. O’Donnell-Luria, Anne Karczewski, Konrad J. MacArthur, Daniel G. Samocha, Kaitlin E. bioRxiv Article Recessive diseases arise when both the maternal and the paternal copies of a gene are impacted by a damaging genetic variant in the affected individual. When a patient carries two different potentially causal variants in a gene for a given disorder, accurate diagnosis requires determining that these two variants occur on different copies of the chromosome (i.e., are in trans) rather than on the same copy (i.e. in cis). However, current approaches for determining phase, beyond parental testing, are limited in clinical settings. We developed a strategy for inferring phase for rare variant pairs within genes, leveraging genotypes observed in exome sequencing data from the Genome Aggregation Database (gnomAD v2, n=125,748). When applied to trio data where phase can be determined by transmission, our approach estimates phase with 95.7% accuracy and remains accurate even for very rare variants (allele frequency < 1×10(−4)). We also correctly phase 95.9% of variant pairs in a set of 293 patients with Mendelian conditions carrying presumed causal compound heterozygous variants. We provide a public resource of phasing estimates from gnomAD, including phasing estimates for coding variants across the genome and counts per gene of rare variants in trans, that can aid interpretation of rare co-occurring variants in the context of recessive disease. Cold Spring Harbor Laboratory 2023-08-21 /pmc/articles/PMC10055215/ /pubmed/36993580 http://dx.doi.org/10.1101/2023.03.19.533370 Text en https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/) , which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format, so long as attribution is given to the creator. The license allows for commercial use.
spellingShingle Article
Guo, Michael H.
Francioli, Laurent C.
Stenton, Sarah L.
Goodrich, Julia K.
Watts, Nicholas A.
Singer-Berk, Moriel
Groopman, Emily
Darnowsky, Philip W.
Solomonson, Matthew
Baxter, Samantha
Tiao, Grace
Neale, Benjamin M.
Hirschhorn, Joel N.
Rehm, Heidi L.
Daly, Mark J.
O’Donnell-Luria, Anne
Karczewski, Konrad J.
MacArthur, Daniel G.
Samocha, Kaitlin E.
Inferring compound heterozygosity from large-scale exome sequencing data
title Inferring compound heterozygosity from large-scale exome sequencing data
title_full Inferring compound heterozygosity from large-scale exome sequencing data
title_fullStr Inferring compound heterozygosity from large-scale exome sequencing data
title_full_unstemmed Inferring compound heterozygosity from large-scale exome sequencing data
title_short Inferring compound heterozygosity from large-scale exome sequencing data
title_sort inferring compound heterozygosity from large-scale exome sequencing data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10055215/
https://www.ncbi.nlm.nih.gov/pubmed/36993580
http://dx.doi.org/10.1101/2023.03.19.533370
work_keys_str_mv AT guomichaelh inferringcompoundheterozygosityfromlargescaleexomesequencingdata
AT franciolilaurentc inferringcompoundheterozygosityfromlargescaleexomesequencingdata
AT stentonsarahl inferringcompoundheterozygosityfromlargescaleexomesequencingdata
AT goodrichjuliak inferringcompoundheterozygosityfromlargescaleexomesequencingdata
AT wattsnicholasa inferringcompoundheterozygosityfromlargescaleexomesequencingdata
AT singerberkmoriel inferringcompoundheterozygosityfromlargescaleexomesequencingdata
AT groopmanemily inferringcompoundheterozygosityfromlargescaleexomesequencingdata
AT darnowskyphilipw inferringcompoundheterozygosityfromlargescaleexomesequencingdata
AT solomonsonmatthew inferringcompoundheterozygosityfromlargescaleexomesequencingdata
AT baxtersamantha inferringcompoundheterozygosityfromlargescaleexomesequencingdata
AT inferringcompoundheterozygosityfromlargescaleexomesequencingdata
AT tiaograce inferringcompoundheterozygosityfromlargescaleexomesequencingdata
AT nealebenjaminm inferringcompoundheterozygosityfromlargescaleexomesequencingdata
AT hirschhornjoeln inferringcompoundheterozygosityfromlargescaleexomesequencingdata
AT rehmheidil inferringcompoundheterozygosityfromlargescaleexomesequencingdata
AT dalymarkj inferringcompoundheterozygosityfromlargescaleexomesequencingdata
AT odonnellluriaanne inferringcompoundheterozygosityfromlargescaleexomesequencingdata
AT karczewskikonradj inferringcompoundheterozygosityfromlargescaleexomesequencingdata
AT macarthurdanielg inferringcompoundheterozygosityfromlargescaleexomesequencingdata
AT samochakaitline inferringcompoundheterozygosityfromlargescaleexomesequencingdata