Cargando…

TF-Prioritizer: a Java pipeline to prioritize condition-specific transcription factors

BACKGROUND: Eukaryotic gene expression is controlled by cis-regulatory elements (CREs), including promoters and enhancers, which are bound by transcription factors (TFs). Differential expression of TFs and their binding affinity at putative CREs determine tissue- and developmental-specific transcrip...

Descripción completa

Detalles Bibliográficos
Autores principales: Hoffmann, Markus, Trummer, Nico, Schwartz, Leon, Jankowski, Jakub, Lee, Hye Kyung, Willruth, Lina-Liv, Lazareva, Olga, Yuan, Kevin, Baumgarten, Nina, Schmidt, Florian, Baumbach, Jan, Schulz, Marcel H, Blumenthal, David B, Hennighausen, Lothar, List, Markus
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10155229/
https://www.ncbi.nlm.nih.gov/pubmed/37132521
http://dx.doi.org/10.1093/gigascience/giad026
_version_ 1785036287959367680
author Hoffmann, Markus
Trummer, Nico
Schwartz, Leon
Jankowski, Jakub
Lee, Hye Kyung
Willruth, Lina-Liv
Lazareva, Olga
Yuan, Kevin
Baumgarten, Nina
Schmidt, Florian
Baumbach, Jan
Schulz, Marcel H
Blumenthal, David B
Hennighausen, Lothar
List, Markus
author_facet Hoffmann, Markus
Trummer, Nico
Schwartz, Leon
Jankowski, Jakub
Lee, Hye Kyung
Willruth, Lina-Liv
Lazareva, Olga
Yuan, Kevin
Baumgarten, Nina
Schmidt, Florian
Baumbach, Jan
Schulz, Marcel H
Blumenthal, David B
Hennighausen, Lothar
List, Markus
author_sort Hoffmann, Markus
collection PubMed
description BACKGROUND: Eukaryotic gene expression is controlled by cis-regulatory elements (CREs), including promoters and enhancers, which are bound by transcription factors (TFs). Differential expression of TFs and their binding affinity at putative CREs determine tissue- and developmental-specific transcriptional activity. Consolidating genomic datasets can offer further insights into the accessibility of CREs, TF activity, and, thus, gene regulation. However, the integration and analysis of multimodal datasets are hampered by considerable technical challenges. While methods for highlighting differential TF activity from combined chromatin state data (e.g., chromatin immunoprecipitation [ChIP], ATAC, or DNase sequencing) and RNA sequencing data exist, they do not offer convenient usability, have limited support for large-scale data processing, and provide only minimal functionality for visually interpreting results. RESULTS: We developed TF-Prioritizer, an automated pipeline that prioritizes condition-specific TFs from multimodal data and generates an interactive web report. We demonstrated its potential by identifying known TFs along with their target genes, as well as previously unreported TFs active in lactating mouse mammary glands. Additionally, we studied a variety of ENCODE datasets for cell lines K562 and MCF-7, including 12 histone modification ChIP sequencing as well as ATAC and DNase sequencing datasets, where we observe and discuss assay-specific differences. CONCLUSION: TF-Prioritizer accepts ATAC, DNase, or ChIP sequencing and RNA sequencing data as input and identifies TFs with differential activity, thus offering an understanding of genome-wide gene regulation, potential pathogenesis, and therapeutic targets in biomedical research.
format Online
Article
Text
id pubmed-10155229
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-101552292023-05-04 TF-Prioritizer: a Java pipeline to prioritize condition-specific transcription factors Hoffmann, Markus Trummer, Nico Schwartz, Leon Jankowski, Jakub Lee, Hye Kyung Willruth, Lina-Liv Lazareva, Olga Yuan, Kevin Baumgarten, Nina Schmidt, Florian Baumbach, Jan Schulz, Marcel H Blumenthal, David B Hennighausen, Lothar List, Markus Gigascience Technical Note BACKGROUND: Eukaryotic gene expression is controlled by cis-regulatory elements (CREs), including promoters and enhancers, which are bound by transcription factors (TFs). Differential expression of TFs and their binding affinity at putative CREs determine tissue- and developmental-specific transcriptional activity. Consolidating genomic datasets can offer further insights into the accessibility of CREs, TF activity, and, thus, gene regulation. However, the integration and analysis of multimodal datasets are hampered by considerable technical challenges. While methods for highlighting differential TF activity from combined chromatin state data (e.g., chromatin immunoprecipitation [ChIP], ATAC, or DNase sequencing) and RNA sequencing data exist, they do not offer convenient usability, have limited support for large-scale data processing, and provide only minimal functionality for visually interpreting results. RESULTS: We developed TF-Prioritizer, an automated pipeline that prioritizes condition-specific TFs from multimodal data and generates an interactive web report. We demonstrated its potential by identifying known TFs along with their target genes, as well as previously unreported TFs active in lactating mouse mammary glands. Additionally, we studied a variety of ENCODE datasets for cell lines K562 and MCF-7, including 12 histone modification ChIP sequencing as well as ATAC and DNase sequencing datasets, where we observe and discuss assay-specific differences. CONCLUSION: TF-Prioritizer accepts ATAC, DNase, or ChIP sequencing and RNA sequencing data as input and identifies TFs with differential activity, thus offering an understanding of genome-wide gene regulation, potential pathogenesis, and therapeutic targets in biomedical research. Oxford University Press 2023-05-03 /pmc/articles/PMC10155229/ /pubmed/37132521 http://dx.doi.org/10.1093/gigascience/giad026 Text en © The Author(s) 2023. Published by Oxford University Press GigaScience. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Technical Note
Hoffmann, Markus
Trummer, Nico
Schwartz, Leon
Jankowski, Jakub
Lee, Hye Kyung
Willruth, Lina-Liv
Lazareva, Olga
Yuan, Kevin
Baumgarten, Nina
Schmidt, Florian
Baumbach, Jan
Schulz, Marcel H
Blumenthal, David B
Hennighausen, Lothar
List, Markus
TF-Prioritizer: a Java pipeline to prioritize condition-specific transcription factors
title TF-Prioritizer: a Java pipeline to prioritize condition-specific transcription factors
title_full TF-Prioritizer: a Java pipeline to prioritize condition-specific transcription factors
title_fullStr TF-Prioritizer: a Java pipeline to prioritize condition-specific transcription factors
title_full_unstemmed TF-Prioritizer: a Java pipeline to prioritize condition-specific transcription factors
title_short TF-Prioritizer: a Java pipeline to prioritize condition-specific transcription factors
title_sort tf-prioritizer: a java pipeline to prioritize condition-specific transcription factors
topic Technical Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10155229/
https://www.ncbi.nlm.nih.gov/pubmed/37132521
http://dx.doi.org/10.1093/gigascience/giad026
work_keys_str_mv AT hoffmannmarkus tfprioritizerajavapipelinetoprioritizeconditionspecifictranscriptionfactors
AT trummernico tfprioritizerajavapipelinetoprioritizeconditionspecifictranscriptionfactors
AT schwartzleon tfprioritizerajavapipelinetoprioritizeconditionspecifictranscriptionfactors
AT jankowskijakub tfprioritizerajavapipelinetoprioritizeconditionspecifictranscriptionfactors
AT leehyekyung tfprioritizerajavapipelinetoprioritizeconditionspecifictranscriptionfactors
AT willruthlinaliv tfprioritizerajavapipelinetoprioritizeconditionspecifictranscriptionfactors
AT lazarevaolga tfprioritizerajavapipelinetoprioritizeconditionspecifictranscriptionfactors
AT yuankevin tfprioritizerajavapipelinetoprioritizeconditionspecifictranscriptionfactors
AT baumgartennina tfprioritizerajavapipelinetoprioritizeconditionspecifictranscriptionfactors
AT schmidtflorian tfprioritizerajavapipelinetoprioritizeconditionspecifictranscriptionfactors
AT baumbachjan tfprioritizerajavapipelinetoprioritizeconditionspecifictranscriptionfactors
AT schulzmarcelh tfprioritizerajavapipelinetoprioritizeconditionspecifictranscriptionfactors
AT blumenthaldavidb tfprioritizerajavapipelinetoprioritizeconditionspecifictranscriptionfactors
AT hennighausenlothar tfprioritizerajavapipelinetoprioritizeconditionspecifictranscriptionfactors
AT listmarkus tfprioritizerajavapipelinetoprioritizeconditionspecifictranscriptionfactors