Cargando…
Bioinformatics pipeline to guide late‐onset Alzheimer's disease (LOAD) post‐GWAS studies: Prioritizing transcription regulatory variants within LOAD‐associated regions
INTRODUCTION: As new late‐onset Alzheimer's disease (LOAD) genetic risk loci are identified and brain cell–type specific omics data becomes available, there is an unmet need for a bioinformatics framework to prioritize genes and variants for testing in single‐cell molecular profiling experiment...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley and Sons Inc.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8864953/ https://www.ncbi.nlm.nih.gov/pubmed/35229021 http://dx.doi.org/10.1002/trc2.12244 |
_version_ | 1784655556589387776 |
---|---|
author | Lutz, Michael W. Chiba‐Falek, Ornit |
author_facet | Lutz, Michael W. Chiba‐Falek, Ornit |
author_sort | Lutz, Michael W. |
collection | PubMed |
description | INTRODUCTION: As new late‐onset Alzheimer's disease (LOAD) genetic risk loci are identified and brain cell–type specific omics data becomes available, there is an unmet need for a bioinformatics framework to prioritize genes and variants for testing in single‐cell molecular profiling experiments and validation using disease models and gene editing technologies. Prior work has characterized and prioritized active enhancers located in LOAD‐genome‐wide association study (GWAS) regions and their potential interactions with candidate genes. The current study extends this work by focusing on single nucleotide polymorphisms (SNPs) within these LOAD enhancers and their impact on altering transcription factor (TF) binding. The proposed bioinformatics pipeline progresses from SNPs located in LOAD‐GWAS regions to a filtered set of candidate regulatory SNPs that have a predicted strong effect on TF binding. METHODS: Active enhancers within LOAD‐associated regions were identified and SNPs located in the enhancers were catalogued. SNPs that disrupt TF binding sites were prioritized and the respective TFs were filtered to include only those that were expressed in brain tissues relevant to LOAD. The TFs binding to the corresponding sequence was further confirmed by ChIP‐seq signals. Finally, the high‐priority candidate SNPs were evaluated as expression quantitative trait loci (eQTLs) in disease‐relevant tissues. RESULTS: We catalogued 61 strong enhancers in LOAD‐GWAS regions encompassing 326 SNPs and 104 TF binding sites. Seventy‐seven and 78 of the TFs were expressed in brain and monocytes, respectively, out of which 19 TF‐binding sites showed ChIP‐seq signals. Eleven SNPs were found to interrupt with TF binding out of which three SNPs were also significant eQTL. DISCUSSION: This study provides a framework to catalogue noncoding variations in enhancers located in LOAD‐GWAS loci and characterize their likelihood to perturb TF binding. The approach integrates multiple data types to characterize and prioritize SNPs for putative regulatory function using single‐cell multi‐omics assays and gene editing. |
format | Online Article Text |
id | pubmed-8864953 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | John Wiley and Sons Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-88649532022-02-27 Bioinformatics pipeline to guide late‐onset Alzheimer's disease (LOAD) post‐GWAS studies: Prioritizing transcription regulatory variants within LOAD‐associated regions Lutz, Michael W. Chiba‐Falek, Ornit Alzheimers Dement (N Y) Research Articles INTRODUCTION: As new late‐onset Alzheimer's disease (LOAD) genetic risk loci are identified and brain cell–type specific omics data becomes available, there is an unmet need for a bioinformatics framework to prioritize genes and variants for testing in single‐cell molecular profiling experiments and validation using disease models and gene editing technologies. Prior work has characterized and prioritized active enhancers located in LOAD‐genome‐wide association study (GWAS) regions and their potential interactions with candidate genes. The current study extends this work by focusing on single nucleotide polymorphisms (SNPs) within these LOAD enhancers and their impact on altering transcription factor (TF) binding. The proposed bioinformatics pipeline progresses from SNPs located in LOAD‐GWAS regions to a filtered set of candidate regulatory SNPs that have a predicted strong effect on TF binding. METHODS: Active enhancers within LOAD‐associated regions were identified and SNPs located in the enhancers were catalogued. SNPs that disrupt TF binding sites were prioritized and the respective TFs were filtered to include only those that were expressed in brain tissues relevant to LOAD. The TFs binding to the corresponding sequence was further confirmed by ChIP‐seq signals. Finally, the high‐priority candidate SNPs were evaluated as expression quantitative trait loci (eQTLs) in disease‐relevant tissues. RESULTS: We catalogued 61 strong enhancers in LOAD‐GWAS regions encompassing 326 SNPs and 104 TF binding sites. Seventy‐seven and 78 of the TFs were expressed in brain and monocytes, respectively, out of which 19 TF‐binding sites showed ChIP‐seq signals. Eleven SNPs were found to interrupt with TF binding out of which three SNPs were also significant eQTL. DISCUSSION: This study provides a framework to catalogue noncoding variations in enhancers located in LOAD‐GWAS loci and characterize their likelihood to perturb TF binding. The approach integrates multiple data types to characterize and prioritize SNPs for putative regulatory function using single‐cell multi‐omics assays and gene editing. John Wiley and Sons Inc. 2022-02-23 /pmc/articles/PMC8864953/ /pubmed/35229021 http://dx.doi.org/10.1002/trc2.12244 Text en © 2022 The Authors. Alzheimer's & Dementia: Diagnosis, Assessment & Disease Monitoring published by Wiley Periodicals, LLC on behalf of Alzheimer's Association https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc-nd/4.0/ (https://creativecommons.org/licenses/by-nc-nd/4.0/) License, which permits use and distribution in any medium, provided the original work is properly cited, the use is non‐commercial and no modifications or adaptations are made. |
spellingShingle | Research Articles Lutz, Michael W. Chiba‐Falek, Ornit Bioinformatics pipeline to guide late‐onset Alzheimer's disease (LOAD) post‐GWAS studies: Prioritizing transcription regulatory variants within LOAD‐associated regions |
title | Bioinformatics pipeline to guide late‐onset Alzheimer's disease (LOAD) post‐GWAS studies: Prioritizing transcription regulatory variants within LOAD‐associated regions |
title_full | Bioinformatics pipeline to guide late‐onset Alzheimer's disease (LOAD) post‐GWAS studies: Prioritizing transcription regulatory variants within LOAD‐associated regions |
title_fullStr | Bioinformatics pipeline to guide late‐onset Alzheimer's disease (LOAD) post‐GWAS studies: Prioritizing transcription regulatory variants within LOAD‐associated regions |
title_full_unstemmed | Bioinformatics pipeline to guide late‐onset Alzheimer's disease (LOAD) post‐GWAS studies: Prioritizing transcription regulatory variants within LOAD‐associated regions |
title_short | Bioinformatics pipeline to guide late‐onset Alzheimer's disease (LOAD) post‐GWAS studies: Prioritizing transcription regulatory variants within LOAD‐associated regions |
title_sort | bioinformatics pipeline to guide late‐onset alzheimer's disease (load) post‐gwas studies: prioritizing transcription regulatory variants within load‐associated regions |
topic | Research Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8864953/ https://www.ncbi.nlm.nih.gov/pubmed/35229021 http://dx.doi.org/10.1002/trc2.12244 |
work_keys_str_mv | AT lutzmichaelw bioinformaticspipelinetoguidelateonsetalzheimersdiseaseloadpostgwasstudiesprioritizingtranscriptionregulatoryvariantswithinloadassociatedregions AT chibafalekornit bioinformaticspipelinetoguidelateonsetalzheimersdiseaseloadpostgwasstudiesprioritizingtranscriptionregulatoryvariantswithinloadassociatedregions |