Cargando…
Identification of putative causal loci in whole-genome sequencing data via knockoff statistics
The analysis of whole-genome sequencing studies is challenging due to the large number of rare variants in noncoding regions and the lack of natural units for testing. We propose a statistical method to detect and localize rare and common risk variants in whole-genome sequencing studies based on a r...
Autores principales: | , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8149672/ https://www.ncbi.nlm.nih.gov/pubmed/34035245 http://dx.doi.org/10.1038/s41467-021-22889-4 |
_version_ | 1783697996237504512 |
---|---|
author | He, Zihuai Liu, Linxi Wang, Chen Le Guen, Yann Lee, Justin Gogarten, Stephanie Lu, Fred Montgomery, Stephen Tang, Hua Silverman, Edwin K. Cho, Michael H. Greicius, Michael Ionita-Laza, Iuliana |
author_facet | He, Zihuai Liu, Linxi Wang, Chen Le Guen, Yann Lee, Justin Gogarten, Stephanie Lu, Fred Montgomery, Stephen Tang, Hua Silverman, Edwin K. Cho, Michael H. Greicius, Michael Ionita-Laza, Iuliana |
author_sort | He, Zihuai |
collection | PubMed |
description | The analysis of whole-genome sequencing studies is challenging due to the large number of rare variants in noncoding regions and the lack of natural units for testing. We propose a statistical method to detect and localize rare and common risk variants in whole-genome sequencing studies based on a recently developed knockoff framework. It can (1) prioritize causal variants over associations due to linkage disequilibrium thereby improving interpretability; (2) help distinguish the signal due to rare variants from shadow effects of significant common variants nearby; (3) integrate multiple knockoffs for improved power, stability, and reproducibility; and (4) flexibly incorporate state-of-the-art and future association tests to achieve the benefits proposed here. In applications to whole-genome sequencing data from the Alzheimer’s Disease Sequencing Project (ADSP) and COPDGene samples from NHLBI Trans-Omics for Precision Medicine (TOPMed) Program we show that our method compared with conventional association tests can lead to substantially more discoveries. |
format | Online Article Text |
id | pubmed-8149672 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-81496722021-06-01 Identification of putative causal loci in whole-genome sequencing data via knockoff statistics He, Zihuai Liu, Linxi Wang, Chen Le Guen, Yann Lee, Justin Gogarten, Stephanie Lu, Fred Montgomery, Stephen Tang, Hua Silverman, Edwin K. Cho, Michael H. Greicius, Michael Ionita-Laza, Iuliana Nat Commun Article The analysis of whole-genome sequencing studies is challenging due to the large number of rare variants in noncoding regions and the lack of natural units for testing. We propose a statistical method to detect and localize rare and common risk variants in whole-genome sequencing studies based on a recently developed knockoff framework. It can (1) prioritize causal variants over associations due to linkage disequilibrium thereby improving interpretability; (2) help distinguish the signal due to rare variants from shadow effects of significant common variants nearby; (3) integrate multiple knockoffs for improved power, stability, and reproducibility; and (4) flexibly incorporate state-of-the-art and future association tests to achieve the benefits proposed here. In applications to whole-genome sequencing data from the Alzheimer’s Disease Sequencing Project (ADSP) and COPDGene samples from NHLBI Trans-Omics for Precision Medicine (TOPMed) Program we show that our method compared with conventional association tests can lead to substantially more discoveries. Nature Publishing Group UK 2021-05-25 /pmc/articles/PMC8149672/ /pubmed/34035245 http://dx.doi.org/10.1038/s41467-021-22889-4 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article He, Zihuai Liu, Linxi Wang, Chen Le Guen, Yann Lee, Justin Gogarten, Stephanie Lu, Fred Montgomery, Stephen Tang, Hua Silverman, Edwin K. Cho, Michael H. Greicius, Michael Ionita-Laza, Iuliana Identification of putative causal loci in whole-genome sequencing data via knockoff statistics |
title | Identification of putative causal loci in whole-genome sequencing data via knockoff statistics |
title_full | Identification of putative causal loci in whole-genome sequencing data via knockoff statistics |
title_fullStr | Identification of putative causal loci in whole-genome sequencing data via knockoff statistics |
title_full_unstemmed | Identification of putative causal loci in whole-genome sequencing data via knockoff statistics |
title_short | Identification of putative causal loci in whole-genome sequencing data via knockoff statistics |
title_sort | identification of putative causal loci in whole-genome sequencing data via knockoff statistics |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8149672/ https://www.ncbi.nlm.nih.gov/pubmed/34035245 http://dx.doi.org/10.1038/s41467-021-22889-4 |
work_keys_str_mv | AT hezihuai identificationofputativecausallociinwholegenomesequencingdataviaknockoffstatistics AT liulinxi identificationofputativecausallociinwholegenomesequencingdataviaknockoffstatistics AT wangchen identificationofputativecausallociinwholegenomesequencingdataviaknockoffstatistics AT leguenyann identificationofputativecausallociinwholegenomesequencingdataviaknockoffstatistics AT leejustin identificationofputativecausallociinwholegenomesequencingdataviaknockoffstatistics AT gogartenstephanie identificationofputativecausallociinwholegenomesequencingdataviaknockoffstatistics AT lufred identificationofputativecausallociinwholegenomesequencingdataviaknockoffstatistics AT montgomerystephen identificationofputativecausallociinwholegenomesequencingdataviaknockoffstatistics AT tanghua identificationofputativecausallociinwholegenomesequencingdataviaknockoffstatistics AT silvermanedwink identificationofputativecausallociinwholegenomesequencingdataviaknockoffstatistics AT chomichaelh identificationofputativecausallociinwholegenomesequencingdataviaknockoffstatistics AT greiciusmichael identificationofputativecausallociinwholegenomesequencingdataviaknockoffstatistics AT ionitalazaiuliana identificationofputativecausallociinwholegenomesequencingdataviaknockoffstatistics |