Cargando…

Identification of putative causal loci in whole-genome sequencing data via knockoff statistics

The analysis of whole-genome sequencing studies is challenging due to the large number of rare variants in noncoding regions and the lack of natural units for testing. We propose a statistical method to detect and localize rare and common risk variants in whole-genome sequencing studies based on a r...

Descripción completa

Detalles Bibliográficos
Autores principales: He, Zihuai, Liu, Linxi, Wang, Chen, Le Guen, Yann, Lee, Justin, Gogarten, Stephanie, Lu, Fred, Montgomery, Stephen, Tang, Hua, Silverman, Edwin K., Cho, Michael H., Greicius, Michael, Ionita-Laza, Iuliana
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8149672/
https://www.ncbi.nlm.nih.gov/pubmed/34035245
http://dx.doi.org/10.1038/s41467-021-22889-4
_version_ 1783697996237504512
author He, Zihuai
Liu, Linxi
Wang, Chen
Le Guen, Yann
Lee, Justin
Gogarten, Stephanie
Lu, Fred
Montgomery, Stephen
Tang, Hua
Silverman, Edwin K.
Cho, Michael H.
Greicius, Michael
Ionita-Laza, Iuliana
author_facet He, Zihuai
Liu, Linxi
Wang, Chen
Le Guen, Yann
Lee, Justin
Gogarten, Stephanie
Lu, Fred
Montgomery, Stephen
Tang, Hua
Silverman, Edwin K.
Cho, Michael H.
Greicius, Michael
Ionita-Laza, Iuliana
author_sort He, Zihuai
collection PubMed
description The analysis of whole-genome sequencing studies is challenging due to the large number of rare variants in noncoding regions and the lack of natural units for testing. We propose a statistical method to detect and localize rare and common risk variants in whole-genome sequencing studies based on a recently developed knockoff framework. It can (1) prioritize causal variants over associations due to linkage disequilibrium thereby improving interpretability; (2) help distinguish the signal due to rare variants from shadow effects of significant common variants nearby; (3) integrate multiple knockoffs for improved power, stability, and reproducibility; and (4) flexibly incorporate state-of-the-art and future association tests to achieve the benefits proposed here. In applications to whole-genome sequencing data from the Alzheimer’s Disease Sequencing Project (ADSP) and COPDGene samples from NHLBI Trans-Omics for Precision Medicine (TOPMed) Program we show that our method compared with conventional association tests can lead to substantially more discoveries.
format Online
Article
Text
id pubmed-8149672
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-81496722021-06-01 Identification of putative causal loci in whole-genome sequencing data via knockoff statistics He, Zihuai Liu, Linxi Wang, Chen Le Guen, Yann Lee, Justin Gogarten, Stephanie Lu, Fred Montgomery, Stephen Tang, Hua Silverman, Edwin K. Cho, Michael H. Greicius, Michael Ionita-Laza, Iuliana Nat Commun Article The analysis of whole-genome sequencing studies is challenging due to the large number of rare variants in noncoding regions and the lack of natural units for testing. We propose a statistical method to detect and localize rare and common risk variants in whole-genome sequencing studies based on a recently developed knockoff framework. It can (1) prioritize causal variants over associations due to linkage disequilibrium thereby improving interpretability; (2) help distinguish the signal due to rare variants from shadow effects of significant common variants nearby; (3) integrate multiple knockoffs for improved power, stability, and reproducibility; and (4) flexibly incorporate state-of-the-art and future association tests to achieve the benefits proposed here. In applications to whole-genome sequencing data from the Alzheimer’s Disease Sequencing Project (ADSP) and COPDGene samples from NHLBI Trans-Omics for Precision Medicine (TOPMed) Program we show that our method compared with conventional association tests can lead to substantially more discoveries. Nature Publishing Group UK 2021-05-25 /pmc/articles/PMC8149672/ /pubmed/34035245 http://dx.doi.org/10.1038/s41467-021-22889-4 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
He, Zihuai
Liu, Linxi
Wang, Chen
Le Guen, Yann
Lee, Justin
Gogarten, Stephanie
Lu, Fred
Montgomery, Stephen
Tang, Hua
Silverman, Edwin K.
Cho, Michael H.
Greicius, Michael
Ionita-Laza, Iuliana
Identification of putative causal loci in whole-genome sequencing data via knockoff statistics
title Identification of putative causal loci in whole-genome sequencing data via knockoff statistics
title_full Identification of putative causal loci in whole-genome sequencing data via knockoff statistics
title_fullStr Identification of putative causal loci in whole-genome sequencing data via knockoff statistics
title_full_unstemmed Identification of putative causal loci in whole-genome sequencing data via knockoff statistics
title_short Identification of putative causal loci in whole-genome sequencing data via knockoff statistics
title_sort identification of putative causal loci in whole-genome sequencing data via knockoff statistics
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8149672/
https://www.ncbi.nlm.nih.gov/pubmed/34035245
http://dx.doi.org/10.1038/s41467-021-22889-4
work_keys_str_mv AT hezihuai identificationofputativecausallociinwholegenomesequencingdataviaknockoffstatistics
AT liulinxi identificationofputativecausallociinwholegenomesequencingdataviaknockoffstatistics
AT wangchen identificationofputativecausallociinwholegenomesequencingdataviaknockoffstatistics
AT leguenyann identificationofputativecausallociinwholegenomesequencingdataviaknockoffstatistics
AT leejustin identificationofputativecausallociinwholegenomesequencingdataviaknockoffstatistics
AT gogartenstephanie identificationofputativecausallociinwholegenomesequencingdataviaknockoffstatistics
AT lufred identificationofputativecausallociinwholegenomesequencingdataviaknockoffstatistics
AT montgomerystephen identificationofputativecausallociinwholegenomesequencingdataviaknockoffstatistics
AT tanghua identificationofputativecausallociinwholegenomesequencingdataviaknockoffstatistics
AT silvermanedwink identificationofputativecausallociinwholegenomesequencingdataviaknockoffstatistics
AT chomichaelh identificationofputativecausallociinwholegenomesequencingdataviaknockoffstatistics
AT greiciusmichael identificationofputativecausallociinwholegenomesequencingdataviaknockoffstatistics
AT ionitalazaiuliana identificationofputativecausallociinwholegenomesequencingdataviaknockoffstatistics