Cargando…
Prioritizing Cardiovascular Disease-Associated Variants Altering NKX2–5 Binding through an Integrative Computational Approach
Cardiovascular diseases (CVDs) are the leading cause of death worldwide and are heavily influenced by genetic factors. Genome-wide association studies (GWAS) have mapped > 90% of CVD-associated variants within the non-coding genome, which can alter the function of regulatory proteins, like transc...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10491373/ https://www.ncbi.nlm.nih.gov/pubmed/37693486 http://dx.doi.org/10.1101/2023.09.01.23294951 |
_version_ | 1785104047308537856 |
---|---|
author | Peña-Martínez, Edwin G. Pomales-Matos, Diego A. Rivera-Madera, Alejandro Messon-Bird, Jean L. Medina-Feliciano, Joshua G. Sanabria-Alberto, Leandro Barreiro-Rosario, Adriana C. Rodriguez-Rios, Jessica M. Rodríguez-Martínez, José A. |
author_facet | Peña-Martínez, Edwin G. Pomales-Matos, Diego A. Rivera-Madera, Alejandro Messon-Bird, Jean L. Medina-Feliciano, Joshua G. Sanabria-Alberto, Leandro Barreiro-Rosario, Adriana C. Rodriguez-Rios, Jessica M. Rodríguez-Martínez, José A. |
author_sort | Peña-Martínez, Edwin G. |
collection | PubMed |
description | Cardiovascular diseases (CVDs) are the leading cause of death worldwide and are heavily influenced by genetic factors. Genome-wide association studies (GWAS) have mapped > 90% of CVD-associated variants within the non-coding genome, which can alter the function of regulatory proteins, like transcription factors (TFs). However, due to the overwhelming number of GWAS single nucleotide polymorphisms (SNPs) (>500,000), prioritizing variants for in vitro analysis remains challenging. In this work, we implemented a computational approach that considers support vector machine (SVM)-based TF binding site classification and cardiac expression quantitative trait loci (eQTL) analysis to identify and prioritize potential CVD-causing SNPs. We identified 1,535 CVD-associated SNPs that occur within human heart footprints/enhancers and 9,309 variants in linkage disequilibrium (LD) with differential gene expression profiles in cardiac tissue. Using hiPSC-CM ChIP-seq data from NKX2–5 and TBX5, two cardiac TFs essential for proper heart development, we trained a large-scale gapped k-mer SVM (LS-GKM-SVM) predictive model that can identify binding sites altered by CVD-associated SNPs. The computational predictive model was tested by scoring human heart footprints and enhancers in vitro through electrophoretic mobility shift assay (EMSA). Three variants (rs59310144, rs6715570, and rs61872084) were prioritized for in vitro validation based on their eQTL in cardiac tissue and LS-GKM-SVM prediction to alter NKX2–5 DNA binding. All three variants altered NKX2–5 DNA binding. In summary, we present a bioinformatic approach that considers tissue-specific eQTL analysis and SVM-based TF binding site classification to prioritize CVD-associated variants for in vitro experimental analysis. |
format | Online Article Text |
id | pubmed-10491373 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Cold Spring Harbor Laboratory |
record_format | MEDLINE/PubMed |
spelling | pubmed-104913732023-09-09 Prioritizing Cardiovascular Disease-Associated Variants Altering NKX2–5 Binding through an Integrative Computational Approach Peña-Martínez, Edwin G. Pomales-Matos, Diego A. Rivera-Madera, Alejandro Messon-Bird, Jean L. Medina-Feliciano, Joshua G. Sanabria-Alberto, Leandro Barreiro-Rosario, Adriana C. Rodriguez-Rios, Jessica M. Rodríguez-Martínez, José A. medRxiv Article Cardiovascular diseases (CVDs) are the leading cause of death worldwide and are heavily influenced by genetic factors. Genome-wide association studies (GWAS) have mapped > 90% of CVD-associated variants within the non-coding genome, which can alter the function of regulatory proteins, like transcription factors (TFs). However, due to the overwhelming number of GWAS single nucleotide polymorphisms (SNPs) (>500,000), prioritizing variants for in vitro analysis remains challenging. In this work, we implemented a computational approach that considers support vector machine (SVM)-based TF binding site classification and cardiac expression quantitative trait loci (eQTL) analysis to identify and prioritize potential CVD-causing SNPs. We identified 1,535 CVD-associated SNPs that occur within human heart footprints/enhancers and 9,309 variants in linkage disequilibrium (LD) with differential gene expression profiles in cardiac tissue. Using hiPSC-CM ChIP-seq data from NKX2–5 and TBX5, two cardiac TFs essential for proper heart development, we trained a large-scale gapped k-mer SVM (LS-GKM-SVM) predictive model that can identify binding sites altered by CVD-associated SNPs. The computational predictive model was tested by scoring human heart footprints and enhancers in vitro through electrophoretic mobility shift assay (EMSA). Three variants (rs59310144, rs6715570, and rs61872084) were prioritized for in vitro validation based on their eQTL in cardiac tissue and LS-GKM-SVM prediction to alter NKX2–5 DNA binding. All three variants altered NKX2–5 DNA binding. In summary, we present a bioinformatic approach that considers tissue-specific eQTL analysis and SVM-based TF binding site classification to prioritize CVD-associated variants for in vitro experimental analysis. Cold Spring Harbor Laboratory 2023-09-02 /pmc/articles/PMC10491373/ /pubmed/37693486 http://dx.doi.org/10.1101/2023.09.01.23294951 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator. |
spellingShingle | Article Peña-Martínez, Edwin G. Pomales-Matos, Diego A. Rivera-Madera, Alejandro Messon-Bird, Jean L. Medina-Feliciano, Joshua G. Sanabria-Alberto, Leandro Barreiro-Rosario, Adriana C. Rodriguez-Rios, Jessica M. Rodríguez-Martínez, José A. Prioritizing Cardiovascular Disease-Associated Variants Altering NKX2–5 Binding through an Integrative Computational Approach |
title | Prioritizing Cardiovascular Disease-Associated Variants Altering NKX2–5 Binding through an Integrative Computational Approach |
title_full | Prioritizing Cardiovascular Disease-Associated Variants Altering NKX2–5 Binding through an Integrative Computational Approach |
title_fullStr | Prioritizing Cardiovascular Disease-Associated Variants Altering NKX2–5 Binding through an Integrative Computational Approach |
title_full_unstemmed | Prioritizing Cardiovascular Disease-Associated Variants Altering NKX2–5 Binding through an Integrative Computational Approach |
title_short | Prioritizing Cardiovascular Disease-Associated Variants Altering NKX2–5 Binding through an Integrative Computational Approach |
title_sort | prioritizing cardiovascular disease-associated variants altering nkx2–5 binding through an integrative computational approach |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10491373/ https://www.ncbi.nlm.nih.gov/pubmed/37693486 http://dx.doi.org/10.1101/2023.09.01.23294951 |
work_keys_str_mv | AT penamartinezedwing prioritizingcardiovasculardiseaseassociatedvariantsalteringnkx25bindingthroughanintegrativecomputationalapproach AT pomalesmatosdiegoa prioritizingcardiovasculardiseaseassociatedvariantsalteringnkx25bindingthroughanintegrativecomputationalapproach AT riveramaderaalejandro prioritizingcardiovasculardiseaseassociatedvariantsalteringnkx25bindingthroughanintegrativecomputationalapproach AT messonbirdjeanl prioritizingcardiovasculardiseaseassociatedvariantsalteringnkx25bindingthroughanintegrativecomputationalapproach AT medinafelicianojoshuag prioritizingcardiovasculardiseaseassociatedvariantsalteringnkx25bindingthroughanintegrativecomputationalapproach AT sanabriaalbertoleandro prioritizingcardiovasculardiseaseassociatedvariantsalteringnkx25bindingthroughanintegrativecomputationalapproach AT barreirorosarioadrianac prioritizingcardiovasculardiseaseassociatedvariantsalteringnkx25bindingthroughanintegrativecomputationalapproach AT rodriguezriosjessicam prioritizingcardiovasculardiseaseassociatedvariantsalteringnkx25bindingthroughanintegrativecomputationalapproach AT rodriguezmartinezjosea prioritizingcardiovasculardiseaseassociatedvariantsalteringnkx25bindingthroughanintegrativecomputationalapproach |