Cargando…

Prioritizing Cardiovascular Disease-Associated Variants Altering NKX2–5 Binding through an Integrative Computational Approach

Cardiovascular diseases (CVDs) are the leading cause of death worldwide and are heavily influenced by genetic factors. Genome-wide association studies (GWAS) have mapped > 90% of CVD-associated variants within the non-coding genome, which can alter the function of regulatory proteins, like transc...

Descripción completa

Detalles Bibliográficos
Autores principales: Peña-Martínez, Edwin G., Pomales-Matos, Diego A., Rivera-Madera, Alejandro, Messon-Bird, Jean L., Medina-Feliciano, Joshua G., Sanabria-Alberto, Leandro, Barreiro-Rosario, Adriana C., Rodriguez-Rios, Jessica M., Rodríguez-Martínez, José A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10491373/
https://www.ncbi.nlm.nih.gov/pubmed/37693486
http://dx.doi.org/10.1101/2023.09.01.23294951
_version_ 1785104047308537856
author Peña-Martínez, Edwin G.
Pomales-Matos, Diego A.
Rivera-Madera, Alejandro
Messon-Bird, Jean L.
Medina-Feliciano, Joshua G.
Sanabria-Alberto, Leandro
Barreiro-Rosario, Adriana C.
Rodriguez-Rios, Jessica M.
Rodríguez-Martínez, José A.
author_facet Peña-Martínez, Edwin G.
Pomales-Matos, Diego A.
Rivera-Madera, Alejandro
Messon-Bird, Jean L.
Medina-Feliciano, Joshua G.
Sanabria-Alberto, Leandro
Barreiro-Rosario, Adriana C.
Rodriguez-Rios, Jessica M.
Rodríguez-Martínez, José A.
author_sort Peña-Martínez, Edwin G.
collection PubMed
description Cardiovascular diseases (CVDs) are the leading cause of death worldwide and are heavily influenced by genetic factors. Genome-wide association studies (GWAS) have mapped > 90% of CVD-associated variants within the non-coding genome, which can alter the function of regulatory proteins, like transcription factors (TFs). However, due to the overwhelming number of GWAS single nucleotide polymorphisms (SNPs) (>500,000), prioritizing variants for in vitro analysis remains challenging. In this work, we implemented a computational approach that considers support vector machine (SVM)-based TF binding site classification and cardiac expression quantitative trait loci (eQTL) analysis to identify and prioritize potential CVD-causing SNPs. We identified 1,535 CVD-associated SNPs that occur within human heart footprints/enhancers and 9,309 variants in linkage disequilibrium (LD) with differential gene expression profiles in cardiac tissue. Using hiPSC-CM ChIP-seq data from NKX2–5 and TBX5, two cardiac TFs essential for proper heart development, we trained a large-scale gapped k-mer SVM (LS-GKM-SVM) predictive model that can identify binding sites altered by CVD-associated SNPs. The computational predictive model was tested by scoring human heart footprints and enhancers in vitro through electrophoretic mobility shift assay (EMSA). Three variants (rs59310144, rs6715570, and rs61872084) were prioritized for in vitro validation based on their eQTL in cardiac tissue and LS-GKM-SVM prediction to alter NKX2–5 DNA binding. All three variants altered NKX2–5 DNA binding. In summary, we present a bioinformatic approach that considers tissue-specific eQTL analysis and SVM-based TF binding site classification to prioritize CVD-associated variants for in vitro experimental analysis.
format Online
Article
Text
id pubmed-10491373
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-104913732023-09-09 Prioritizing Cardiovascular Disease-Associated Variants Altering NKX2–5 Binding through an Integrative Computational Approach Peña-Martínez, Edwin G. Pomales-Matos, Diego A. Rivera-Madera, Alejandro Messon-Bird, Jean L. Medina-Feliciano, Joshua G. Sanabria-Alberto, Leandro Barreiro-Rosario, Adriana C. Rodriguez-Rios, Jessica M. Rodríguez-Martínez, José A. medRxiv Article Cardiovascular diseases (CVDs) are the leading cause of death worldwide and are heavily influenced by genetic factors. Genome-wide association studies (GWAS) have mapped > 90% of CVD-associated variants within the non-coding genome, which can alter the function of regulatory proteins, like transcription factors (TFs). However, due to the overwhelming number of GWAS single nucleotide polymorphisms (SNPs) (>500,000), prioritizing variants for in vitro analysis remains challenging. In this work, we implemented a computational approach that considers support vector machine (SVM)-based TF binding site classification and cardiac expression quantitative trait loci (eQTL) analysis to identify and prioritize potential CVD-causing SNPs. We identified 1,535 CVD-associated SNPs that occur within human heart footprints/enhancers and 9,309 variants in linkage disequilibrium (LD) with differential gene expression profiles in cardiac tissue. Using hiPSC-CM ChIP-seq data from NKX2–5 and TBX5, two cardiac TFs essential for proper heart development, we trained a large-scale gapped k-mer SVM (LS-GKM-SVM) predictive model that can identify binding sites altered by CVD-associated SNPs. The computational predictive model was tested by scoring human heart footprints and enhancers in vitro through electrophoretic mobility shift assay (EMSA). Three variants (rs59310144, rs6715570, and rs61872084) were prioritized for in vitro validation based on their eQTL in cardiac tissue and LS-GKM-SVM prediction to alter NKX2–5 DNA binding. All three variants altered NKX2–5 DNA binding. In summary, we present a bioinformatic approach that considers tissue-specific eQTL analysis and SVM-based TF binding site classification to prioritize CVD-associated variants for in vitro experimental analysis. Cold Spring Harbor Laboratory 2023-09-02 /pmc/articles/PMC10491373/ /pubmed/37693486 http://dx.doi.org/10.1101/2023.09.01.23294951 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator.
spellingShingle Article
Peña-Martínez, Edwin G.
Pomales-Matos, Diego A.
Rivera-Madera, Alejandro
Messon-Bird, Jean L.
Medina-Feliciano, Joshua G.
Sanabria-Alberto, Leandro
Barreiro-Rosario, Adriana C.
Rodriguez-Rios, Jessica M.
Rodríguez-Martínez, José A.
Prioritizing Cardiovascular Disease-Associated Variants Altering NKX2–5 Binding through an Integrative Computational Approach
title Prioritizing Cardiovascular Disease-Associated Variants Altering NKX2–5 Binding through an Integrative Computational Approach
title_full Prioritizing Cardiovascular Disease-Associated Variants Altering NKX2–5 Binding through an Integrative Computational Approach
title_fullStr Prioritizing Cardiovascular Disease-Associated Variants Altering NKX2–5 Binding through an Integrative Computational Approach
title_full_unstemmed Prioritizing Cardiovascular Disease-Associated Variants Altering NKX2–5 Binding through an Integrative Computational Approach
title_short Prioritizing Cardiovascular Disease-Associated Variants Altering NKX2–5 Binding through an Integrative Computational Approach
title_sort prioritizing cardiovascular disease-associated variants altering nkx2–5 binding through an integrative computational approach
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10491373/
https://www.ncbi.nlm.nih.gov/pubmed/37693486
http://dx.doi.org/10.1101/2023.09.01.23294951
work_keys_str_mv AT penamartinezedwing prioritizingcardiovasculardiseaseassociatedvariantsalteringnkx25bindingthroughanintegrativecomputationalapproach
AT pomalesmatosdiegoa prioritizingcardiovasculardiseaseassociatedvariantsalteringnkx25bindingthroughanintegrativecomputationalapproach
AT riveramaderaalejandro prioritizingcardiovasculardiseaseassociatedvariantsalteringnkx25bindingthroughanintegrativecomputationalapproach
AT messonbirdjeanl prioritizingcardiovasculardiseaseassociatedvariantsalteringnkx25bindingthroughanintegrativecomputationalapproach
AT medinafelicianojoshuag prioritizingcardiovasculardiseaseassociatedvariantsalteringnkx25bindingthroughanintegrativecomputationalapproach
AT sanabriaalbertoleandro prioritizingcardiovasculardiseaseassociatedvariantsalteringnkx25bindingthroughanintegrativecomputationalapproach
AT barreirorosarioadrianac prioritizingcardiovasculardiseaseassociatedvariantsalteringnkx25bindingthroughanintegrativecomputationalapproach
AT rodriguezriosjessicam prioritizingcardiovasculardiseaseassociatedvariantsalteringnkx25bindingthroughanintegrativecomputationalapproach
AT rodriguezmartinezjosea prioritizingcardiovasculardiseaseassociatedvariantsalteringnkx25bindingthroughanintegrativecomputationalapproach