Cargando…
MAGIC: A tool for predicting transcription factors and cofactors driving gene sets using ENCODE data
Transcriptomic profiling is an immensely powerful hypothesis generating tool. However, accurately predicting the transcription factors (TFs) and cofactors that drive transcriptomic differences between samples is challenging. A number of algorithms draw on ChIP-seq tracks to define TFs and cofactors...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7162552/ https://www.ncbi.nlm.nih.gov/pubmed/32251445 http://dx.doi.org/10.1371/journal.pcbi.1007800 |
_version_ | 1783523055153184768 |
---|---|
author | Roopra, Avtar |
author_facet | Roopra, Avtar |
author_sort | Roopra, Avtar |
collection | PubMed |
description | Transcriptomic profiling is an immensely powerful hypothesis generating tool. However, accurately predicting the transcription factors (TFs) and cofactors that drive transcriptomic differences between samples is challenging. A number of algorithms draw on ChIP-seq tracks to define TFs and cofactors behind gene changes. These approaches assign TFs and cofactors to genes via a binary designation of ‘target’, or ‘non-target’ followed by Fisher Exact Tests to assess enrichment of TFs and cofactors. ENCODE archives 2314 ChIP-seq tracks of 684 TFs and cofactors assayed across a 117 human cell lines under a multitude of growth and maintenance conditions. The algorithm presented herein, Mining Algorithm for GenetIc Controllers (MAGIC), uses ENCODE ChIP-seq data to look for statistical enrichment of TFs and cofactors in gene bodies and flanking regions in gene lists without an a priori binary classification of genes as targets or non-targets. When compared to other TF mining resources, MAGIC displayed favourable performance in predicting TFs and cofactors that drive gene changes in 4 settings: 1) A cell line expressing or lacking single TF, 2) Breast tumors divided along PAM50 designations 3) Whole brain samples from WT mice or mice lacking a single TF in a particular neuronal subtype 4) Single cell RNAseq analysis of neurons divided by Immediate Early Gene expression levels. In summary, MAGIC is a standalone application that produces meaningful predictions of TFs and cofactors in transcriptomic experiments. |
format | Online Article Text |
id | pubmed-7162552 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-71625522020-04-24 MAGIC: A tool for predicting transcription factors and cofactors driving gene sets using ENCODE data Roopra, Avtar PLoS Comput Biol Research Article Transcriptomic profiling is an immensely powerful hypothesis generating tool. However, accurately predicting the transcription factors (TFs) and cofactors that drive transcriptomic differences between samples is challenging. A number of algorithms draw on ChIP-seq tracks to define TFs and cofactors behind gene changes. These approaches assign TFs and cofactors to genes via a binary designation of ‘target’, or ‘non-target’ followed by Fisher Exact Tests to assess enrichment of TFs and cofactors. ENCODE archives 2314 ChIP-seq tracks of 684 TFs and cofactors assayed across a 117 human cell lines under a multitude of growth and maintenance conditions. The algorithm presented herein, Mining Algorithm for GenetIc Controllers (MAGIC), uses ENCODE ChIP-seq data to look for statistical enrichment of TFs and cofactors in gene bodies and flanking regions in gene lists without an a priori binary classification of genes as targets or non-targets. When compared to other TF mining resources, MAGIC displayed favourable performance in predicting TFs and cofactors that drive gene changes in 4 settings: 1) A cell line expressing or lacking single TF, 2) Breast tumors divided along PAM50 designations 3) Whole brain samples from WT mice or mice lacking a single TF in a particular neuronal subtype 4) Single cell RNAseq analysis of neurons divided by Immediate Early Gene expression levels. In summary, MAGIC is a standalone application that produces meaningful predictions of TFs and cofactors in transcriptomic experiments. Public Library of Science 2020-04-06 /pmc/articles/PMC7162552/ /pubmed/32251445 http://dx.doi.org/10.1371/journal.pcbi.1007800 Text en © 2020 Avtar Roopra http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Roopra, Avtar MAGIC: A tool for predicting transcription factors and cofactors driving gene sets using ENCODE data |
title | MAGIC: A tool for predicting transcription factors and cofactors driving gene sets using ENCODE data |
title_full | MAGIC: A tool for predicting transcription factors and cofactors driving gene sets using ENCODE data |
title_fullStr | MAGIC: A tool for predicting transcription factors and cofactors driving gene sets using ENCODE data |
title_full_unstemmed | MAGIC: A tool for predicting transcription factors and cofactors driving gene sets using ENCODE data |
title_short | MAGIC: A tool for predicting transcription factors and cofactors driving gene sets using ENCODE data |
title_sort | magic: a tool for predicting transcription factors and cofactors driving gene sets using encode data |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7162552/ https://www.ncbi.nlm.nih.gov/pubmed/32251445 http://dx.doi.org/10.1371/journal.pcbi.1007800 |
work_keys_str_mv | AT roopraavtar magicatoolforpredictingtranscriptionfactorsandcofactorsdrivinggenesetsusingencodedata |