Cargando…

MAGIC: A tool for predicting transcription factors and cofactors driving gene sets using ENCODE data

Transcriptomic profiling is an immensely powerful hypothesis generating tool. However, accurately predicting the transcription factors (TFs) and cofactors that drive transcriptomic differences between samples is challenging. A number of algorithms draw on ChIP-seq tracks to define TFs and cofactors...

Descripción completa

Detalles Bibliográficos
Autor principal: Roopra, Avtar
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7162552/
https://www.ncbi.nlm.nih.gov/pubmed/32251445
http://dx.doi.org/10.1371/journal.pcbi.1007800
_version_ 1783523055153184768
author Roopra, Avtar
author_facet Roopra, Avtar
author_sort Roopra, Avtar
collection PubMed
description Transcriptomic profiling is an immensely powerful hypothesis generating tool. However, accurately predicting the transcription factors (TFs) and cofactors that drive transcriptomic differences between samples is challenging. A number of algorithms draw on ChIP-seq tracks to define TFs and cofactors behind gene changes. These approaches assign TFs and cofactors to genes via a binary designation of ‘target’, or ‘non-target’ followed by Fisher Exact Tests to assess enrichment of TFs and cofactors. ENCODE archives 2314 ChIP-seq tracks of 684 TFs and cofactors assayed across a 117 human cell lines under a multitude of growth and maintenance conditions. The algorithm presented herein, Mining Algorithm for GenetIc Controllers (MAGIC), uses ENCODE ChIP-seq data to look for statistical enrichment of TFs and cofactors in gene bodies and flanking regions in gene lists without an a priori binary classification of genes as targets or non-targets. When compared to other TF mining resources, MAGIC displayed favourable performance in predicting TFs and cofactors that drive gene changes in 4 settings: 1) A cell line expressing or lacking single TF, 2) Breast tumors divided along PAM50 designations 3) Whole brain samples from WT mice or mice lacking a single TF in a particular neuronal subtype 4) Single cell RNAseq analysis of neurons divided by Immediate Early Gene expression levels. In summary, MAGIC is a standalone application that produces meaningful predictions of TFs and cofactors in transcriptomic experiments.
format Online
Article
Text
id pubmed-7162552
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-71625522020-04-24 MAGIC: A tool for predicting transcription factors and cofactors driving gene sets using ENCODE data Roopra, Avtar PLoS Comput Biol Research Article Transcriptomic profiling is an immensely powerful hypothesis generating tool. However, accurately predicting the transcription factors (TFs) and cofactors that drive transcriptomic differences between samples is challenging. A number of algorithms draw on ChIP-seq tracks to define TFs and cofactors behind gene changes. These approaches assign TFs and cofactors to genes via a binary designation of ‘target’, or ‘non-target’ followed by Fisher Exact Tests to assess enrichment of TFs and cofactors. ENCODE archives 2314 ChIP-seq tracks of 684 TFs and cofactors assayed across a 117 human cell lines under a multitude of growth and maintenance conditions. The algorithm presented herein, Mining Algorithm for GenetIc Controllers (MAGIC), uses ENCODE ChIP-seq data to look for statistical enrichment of TFs and cofactors in gene bodies and flanking regions in gene lists without an a priori binary classification of genes as targets or non-targets. When compared to other TF mining resources, MAGIC displayed favourable performance in predicting TFs and cofactors that drive gene changes in 4 settings: 1) A cell line expressing or lacking single TF, 2) Breast tumors divided along PAM50 designations 3) Whole brain samples from WT mice or mice lacking a single TF in a particular neuronal subtype 4) Single cell RNAseq analysis of neurons divided by Immediate Early Gene expression levels. In summary, MAGIC is a standalone application that produces meaningful predictions of TFs and cofactors in transcriptomic experiments. Public Library of Science 2020-04-06 /pmc/articles/PMC7162552/ /pubmed/32251445 http://dx.doi.org/10.1371/journal.pcbi.1007800 Text en © 2020 Avtar Roopra http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Roopra, Avtar
MAGIC: A tool for predicting transcription factors and cofactors driving gene sets using ENCODE data
title MAGIC: A tool for predicting transcription factors and cofactors driving gene sets using ENCODE data
title_full MAGIC: A tool for predicting transcription factors and cofactors driving gene sets using ENCODE data
title_fullStr MAGIC: A tool for predicting transcription factors and cofactors driving gene sets using ENCODE data
title_full_unstemmed MAGIC: A tool for predicting transcription factors and cofactors driving gene sets using ENCODE data
title_short MAGIC: A tool for predicting transcription factors and cofactors driving gene sets using ENCODE data
title_sort magic: a tool for predicting transcription factors and cofactors driving gene sets using encode data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7162552/
https://www.ncbi.nlm.nih.gov/pubmed/32251445
http://dx.doi.org/10.1371/journal.pcbi.1007800
work_keys_str_mv AT roopraavtar magicatoolforpredictingtranscriptionfactorsandcofactorsdrivinggenesetsusingencodedata