Cargando…

Optimized high-throughput screening of non-coding variants identified from genome-wide association studies

The vast majority of disease-associated single nucleotide polymorphisms (SNP) identified from genome-wide association studies (GWAS) are localized in non-coding regions. A significant fraction of these variants impact transcription factors binding to enhancer elements and alter gene expression. To f...

Descripción completa

Detalles Bibliográficos
Autores principales: Morova, Tunc, Ding, Yi, Huang, Chia-Chi F, Sar, Funda, Schwarz, Tommer, Giambartolomei, Claudia, Baca, Sylvan C, Grishin, Dennis, Hach, Faraz, Gusev, Alexander, Freedman, Matthew L, Pasaniuc, Bogdan, Lack, Nathan A
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9943666/
https://www.ncbi.nlm.nih.gov/pubmed/36546757
http://dx.doi.org/10.1093/nar/gkac1198
_version_ 1784891757293469696
author Morova, Tunc
Ding, Yi
Huang, Chia-Chi F
Sar, Funda
Schwarz, Tommer
Giambartolomei, Claudia
Baca, Sylvan C
Grishin, Dennis
Hach, Faraz
Gusev, Alexander
Freedman, Matthew L
Pasaniuc, Bogdan
Lack, Nathan A
author_facet Morova, Tunc
Ding, Yi
Huang, Chia-Chi F
Sar, Funda
Schwarz, Tommer
Giambartolomei, Claudia
Baca, Sylvan C
Grishin, Dennis
Hach, Faraz
Gusev, Alexander
Freedman, Matthew L
Pasaniuc, Bogdan
Lack, Nathan A
author_sort Morova, Tunc
collection PubMed
description The vast majority of disease-associated single nucleotide polymorphisms (SNP) identified from genome-wide association studies (GWAS) are localized in non-coding regions. A significant fraction of these variants impact transcription factors binding to enhancer elements and alter gene expression. To functionally interrogate the activity of such variants we developed snpSTARRseq, a high-throughput experimental method that can interrogate the functional impact of hundreds to thousands of non-coding variants on enhancer activity. snpSTARRseq dramatically improves signal-to-noise by utilizing a novel sequencing and bioinformatic approach that increases both insert size and the number of variants tested per loci. Using this strategy, we interrogated known prostate cancer (PCa) risk-associated loci and demonstrated that 35% of them harbor SNPs that significantly altered enhancer activity. Combining these results with chromosomal looping data we could identify interacting genes and provide a mechanism of action for 20 PCa GWAS risk regions. When benchmarked to orthogonal methods, snpSTARRseq showed a strong correlation with in vivo experimental allelic-imbalance studies whereas there was no correlation with predictive in silico approaches. Overall, snpSTARRseq provides an integrated experimental and computational framework to functionally test non-coding genetic variants.
format Online
Article
Text
id pubmed-9943666
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-99436662023-02-22 Optimized high-throughput screening of non-coding variants identified from genome-wide association studies Morova, Tunc Ding, Yi Huang, Chia-Chi F Sar, Funda Schwarz, Tommer Giambartolomei, Claudia Baca, Sylvan C Grishin, Dennis Hach, Faraz Gusev, Alexander Freedman, Matthew L Pasaniuc, Bogdan Lack, Nathan A Nucleic Acids Res Methods Online The vast majority of disease-associated single nucleotide polymorphisms (SNP) identified from genome-wide association studies (GWAS) are localized in non-coding regions. A significant fraction of these variants impact transcription factors binding to enhancer elements and alter gene expression. To functionally interrogate the activity of such variants we developed snpSTARRseq, a high-throughput experimental method that can interrogate the functional impact of hundreds to thousands of non-coding variants on enhancer activity. snpSTARRseq dramatically improves signal-to-noise by utilizing a novel sequencing and bioinformatic approach that increases both insert size and the number of variants tested per loci. Using this strategy, we interrogated known prostate cancer (PCa) risk-associated loci and demonstrated that 35% of them harbor SNPs that significantly altered enhancer activity. Combining these results with chromosomal looping data we could identify interacting genes and provide a mechanism of action for 20 PCa GWAS risk regions. When benchmarked to orthogonal methods, snpSTARRseq showed a strong correlation with in vivo experimental allelic-imbalance studies whereas there was no correlation with predictive in silico approaches. Overall, snpSTARRseq provides an integrated experimental and computational framework to functionally test non-coding genetic variants. Oxford University Press 2022-12-22 /pmc/articles/PMC9943666/ /pubmed/36546757 http://dx.doi.org/10.1093/nar/gkac1198 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of Nucleic Acids Research. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Methods Online
Morova, Tunc
Ding, Yi
Huang, Chia-Chi F
Sar, Funda
Schwarz, Tommer
Giambartolomei, Claudia
Baca, Sylvan C
Grishin, Dennis
Hach, Faraz
Gusev, Alexander
Freedman, Matthew L
Pasaniuc, Bogdan
Lack, Nathan A
Optimized high-throughput screening of non-coding variants identified from genome-wide association studies
title Optimized high-throughput screening of non-coding variants identified from genome-wide association studies
title_full Optimized high-throughput screening of non-coding variants identified from genome-wide association studies
title_fullStr Optimized high-throughput screening of non-coding variants identified from genome-wide association studies
title_full_unstemmed Optimized high-throughput screening of non-coding variants identified from genome-wide association studies
title_short Optimized high-throughput screening of non-coding variants identified from genome-wide association studies
title_sort optimized high-throughput screening of non-coding variants identified from genome-wide association studies
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9943666/
https://www.ncbi.nlm.nih.gov/pubmed/36546757
http://dx.doi.org/10.1093/nar/gkac1198
work_keys_str_mv AT morovatunc optimizedhighthroughputscreeningofnoncodingvariantsidentifiedfromgenomewideassociationstudies
AT dingyi optimizedhighthroughputscreeningofnoncodingvariantsidentifiedfromgenomewideassociationstudies
AT huangchiachif optimizedhighthroughputscreeningofnoncodingvariantsidentifiedfromgenomewideassociationstudies
AT sarfunda optimizedhighthroughputscreeningofnoncodingvariantsidentifiedfromgenomewideassociationstudies
AT schwarztommer optimizedhighthroughputscreeningofnoncodingvariantsidentifiedfromgenomewideassociationstudies
AT giambartolomeiclaudia optimizedhighthroughputscreeningofnoncodingvariantsidentifiedfromgenomewideassociationstudies
AT bacasylvanc optimizedhighthroughputscreeningofnoncodingvariantsidentifiedfromgenomewideassociationstudies
AT grishindennis optimizedhighthroughputscreeningofnoncodingvariantsidentifiedfromgenomewideassociationstudies
AT hachfaraz optimizedhighthroughputscreeningofnoncodingvariantsidentifiedfromgenomewideassociationstudies
AT gusevalexander optimizedhighthroughputscreeningofnoncodingvariantsidentifiedfromgenomewideassociationstudies
AT freedmanmatthewl optimizedhighthroughputscreeningofnoncodingvariantsidentifiedfromgenomewideassociationstudies
AT pasaniucbogdan optimizedhighthroughputscreeningofnoncodingvariantsidentifiedfromgenomewideassociationstudies
AT lacknathana optimizedhighthroughputscreeningofnoncodingvariantsidentifiedfromgenomewideassociationstudies