Cargando…

xcore: an R package for inference of gene expression regulators

BACKGROUND: Elucidating the Transcription Factors (TFs) that drive the gene expression changes in a given experiment is a common question asked by researchers. The existing methods rely on the predicted Transcription Factor Binding Site (TFBS) to model the changes in the motif activity. Such methods...

Descripción completa

Detalles Bibliográficos
Autores principales: Migdał, Maciej, Arakawa, Takahiro, Takizawa, Satoshi, Furuno, Masaaki, Suzuki, Harukazu, Arner, Erik, Winata, Cecilia Lanny, Kaczkowski, Bogumił
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9832628/
https://www.ncbi.nlm.nih.gov/pubmed/36631751
http://dx.doi.org/10.1186/s12859-022-05084-0
_version_ 1784868092823732224
author Migdał, Maciej
Arakawa, Takahiro
Takizawa, Satoshi
Furuno, Masaaki
Suzuki, Harukazu
Arner, Erik
Winata, Cecilia Lanny
Kaczkowski, Bogumił
author_facet Migdał, Maciej
Arakawa, Takahiro
Takizawa, Satoshi
Furuno, Masaaki
Suzuki, Harukazu
Arner, Erik
Winata, Cecilia Lanny
Kaczkowski, Bogumił
author_sort Migdał, Maciej
collection PubMed
description BACKGROUND: Elucidating the Transcription Factors (TFs) that drive the gene expression changes in a given experiment is a common question asked by researchers. The existing methods rely on the predicted Transcription Factor Binding Site (TFBS) to model the changes in the motif activity. Such methods only work for TFs that have a motif and assume the TF binding profile is the same in all cell types. RESULTS: Given the wealth of the ChIP-seq data available for a wide range of the TFs in various cell types, we propose that gene expression modeling can be done using ChIP-seq “signatures” directly, effectively skipping the motif finding and TFBS prediction steps. We present xcore, an R package that allows TF activity modeling based on ChIP-seq signatures and the user's gene expression data. We also provide xcoredata a companion data package that provides a collection of preprocessed ChIP-seq signatures. We demonstrate that xcore leads to biologically relevant predictions using transforming growth factor beta induced epithelial-mesenchymal transition time-courses, rinderpest infection time-courses, and embryonic stem cells differentiated to cardiomyocytes time-course profiled with Cap Analysis Gene Expression. CONCLUSIONS: xcore provides a simple analytical framework for gene expression modeling using linear models that can be easily incorporated into differential expression analysis pipelines. Taking advantage of public ChIP-seq databases, xcore can identify meaningful molecular signatures and relevant ChIP-seq experiments. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-022-05084-0.
format Online
Article
Text
id pubmed-9832628
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-98326282023-01-12 xcore: an R package for inference of gene expression regulators Migdał, Maciej Arakawa, Takahiro Takizawa, Satoshi Furuno, Masaaki Suzuki, Harukazu Arner, Erik Winata, Cecilia Lanny Kaczkowski, Bogumił BMC Bioinformatics Software BACKGROUND: Elucidating the Transcription Factors (TFs) that drive the gene expression changes in a given experiment is a common question asked by researchers. The existing methods rely on the predicted Transcription Factor Binding Site (TFBS) to model the changes in the motif activity. Such methods only work for TFs that have a motif and assume the TF binding profile is the same in all cell types. RESULTS: Given the wealth of the ChIP-seq data available for a wide range of the TFs in various cell types, we propose that gene expression modeling can be done using ChIP-seq “signatures” directly, effectively skipping the motif finding and TFBS prediction steps. We present xcore, an R package that allows TF activity modeling based on ChIP-seq signatures and the user's gene expression data. We also provide xcoredata a companion data package that provides a collection of preprocessed ChIP-seq signatures. We demonstrate that xcore leads to biologically relevant predictions using transforming growth factor beta induced epithelial-mesenchymal transition time-courses, rinderpest infection time-courses, and embryonic stem cells differentiated to cardiomyocytes time-course profiled with Cap Analysis Gene Expression. CONCLUSIONS: xcore provides a simple analytical framework for gene expression modeling using linear models that can be easily incorporated into differential expression analysis pipelines. Taking advantage of public ChIP-seq databases, xcore can identify meaningful molecular signatures and relevant ChIP-seq experiments. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-022-05084-0. BioMed Central 2023-01-11 /pmc/articles/PMC9832628/ /pubmed/36631751 http://dx.doi.org/10.1186/s12859-022-05084-0 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Software
Migdał, Maciej
Arakawa, Takahiro
Takizawa, Satoshi
Furuno, Masaaki
Suzuki, Harukazu
Arner, Erik
Winata, Cecilia Lanny
Kaczkowski, Bogumił
xcore: an R package for inference of gene expression regulators
title xcore: an R package for inference of gene expression regulators
title_full xcore: an R package for inference of gene expression regulators
title_fullStr xcore: an R package for inference of gene expression regulators
title_full_unstemmed xcore: an R package for inference of gene expression regulators
title_short xcore: an R package for inference of gene expression regulators
title_sort xcore: an r package for inference of gene expression regulators
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9832628/
https://www.ncbi.nlm.nih.gov/pubmed/36631751
http://dx.doi.org/10.1186/s12859-022-05084-0
work_keys_str_mv AT migdałmaciej xcoreanrpackageforinferenceofgeneexpressionregulators
AT arakawatakahiro xcoreanrpackageforinferenceofgeneexpressionregulators
AT takizawasatoshi xcoreanrpackageforinferenceofgeneexpressionregulators
AT furunomasaaki xcoreanrpackageforinferenceofgeneexpressionregulators
AT suzukiharukazu xcoreanrpackageforinferenceofgeneexpressionregulators
AT arnererik xcoreanrpackageforinferenceofgeneexpressionregulators
AT winatacecilialanny xcoreanrpackageforinferenceofgeneexpressionregulators
AT kaczkowskibogumił xcoreanrpackageforinferenceofgeneexpressionregulators