Cargando…

Gene expression models based on transcription factor binding events confer insight into functional cis-regulatory variants

MOTIVATION: Deciphering the functional roles of cis-regulatory variants is a critical challenge in genome analysis and interpretation. It has been hypothesized that altered transcription factor (TF) binding events are a central mechanism by which cis-regulatory variants impact gene expression levels...

Descripción completa

Detalles Bibliográficos
Autores principales: Shi, Wenqiang, Fornes, Oriol, Wasserman, Wyeth W
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6662294/
https://www.ncbi.nlm.nih.gov/pubmed/30541050
http://dx.doi.org/10.1093/bioinformatics/bty992
Descripción
Sumario:MOTIVATION: Deciphering the functional roles of cis-regulatory variants is a critical challenge in genome analysis and interpretation. It has been hypothesized that altered transcription factor (TF) binding events are a central mechanism by which cis-regulatory variants impact gene expression levels. However, we lack a computational framework to understand and quantify such mechanistic contributions. RESULTS: We present TF2Exp, a gene-based framework to predict the impact of altered TF-binding events on gene expression levels. Using data from lymphoblastoid cell lines, TF2Exp models were applied successfully to predict the expression levels of 3196 genes. Alterations within DNase I hypersensitive, CTCF-bound and tissue-specific TF-bound regions were the greatest contributing features to the models. TF2Exp models performed as well as models based on common variants, both in cross-validation and external validation. Combining TF alteration and common variant features can further improve model performance. Unlike variant-based models, TF2Exp models have the unique advantage to evaluate the functional impact of variants in linkage disequilibrium and uncommon variants. We find that adding TF-binding events altered only by uncommon variants could increase the number of predictable genes (R(2) > 0.05). Taken together, TF2Exp represents a key step towards interpreting the functional roles of cis-regulatory variants in the human genome. AVAILABILITY AND IMPLEMENTATION: The code and model training results are publicly available at https://github.com/wqshi/TF2Exp. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.