Cargando…

xcore: an R package for inference of gene expression regulators

BACKGROUND: Elucidating the Transcription Factors (TFs) that drive the gene expression changes in a given experiment is a common question asked by researchers. The existing methods rely on the predicted Transcription Factor Binding Site (TFBS) to model the changes in the motif activity. Such methods...

Descripción completa

Detalles Bibliográficos
Autores principales: Migdał, Maciej, Arakawa, Takahiro, Takizawa, Satoshi, Furuno, Masaaki, Suzuki, Harukazu, Arner, Erik, Winata, Cecilia Lanny, Kaczkowski, Bogumił
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9832628/
https://www.ncbi.nlm.nih.gov/pubmed/36631751
http://dx.doi.org/10.1186/s12859-022-05084-0
Descripción
Sumario:BACKGROUND: Elucidating the Transcription Factors (TFs) that drive the gene expression changes in a given experiment is a common question asked by researchers. The existing methods rely on the predicted Transcription Factor Binding Site (TFBS) to model the changes in the motif activity. Such methods only work for TFs that have a motif and assume the TF binding profile is the same in all cell types. RESULTS: Given the wealth of the ChIP-seq data available for a wide range of the TFs in various cell types, we propose that gene expression modeling can be done using ChIP-seq “signatures” directly, effectively skipping the motif finding and TFBS prediction steps. We present xcore, an R package that allows TF activity modeling based on ChIP-seq signatures and the user's gene expression data. We also provide xcoredata a companion data package that provides a collection of preprocessed ChIP-seq signatures. We demonstrate that xcore leads to biologically relevant predictions using transforming growth factor beta induced epithelial-mesenchymal transition time-courses, rinderpest infection time-courses, and embryonic stem cells differentiated to cardiomyocytes time-course profiled with Cap Analysis Gene Expression. CONCLUSIONS: xcore provides a simple analytical framework for gene expression modeling using linear models that can be easily incorporated into differential expression analysis pipelines. Taking advantage of public ChIP-seq databases, xcore can identify meaningful molecular signatures and relevant ChIP-seq experiments. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-022-05084-0.