Cargando…

Predicting which genes will respond to transcription factor perturbations

The ability to predict which genes will respond to the perturbation of a transcription factor serves as a benchmark for our systems-level understanding of transcriptional regulatory networks. In previous work, machine learning models have been trained to predict static gene expression levels in a bi...

Descripción completa

Detalles Bibliográficos
Autores principales: Kang, Yiming, Jung, Wooseok J, Brent, Michael R
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9339286/
https://www.ncbi.nlm.nih.gov/pubmed/35666184
http://dx.doi.org/10.1093/g3journal/jkac144
_version_ 1784760156611936256
author Kang, Yiming
Jung, Wooseok J
Brent, Michael R
author_facet Kang, Yiming
Jung, Wooseok J
Brent, Michael R
author_sort Kang, Yiming
collection PubMed
description The ability to predict which genes will respond to the perturbation of a transcription factor serves as a benchmark for our systems-level understanding of transcriptional regulatory networks. In previous work, machine learning models have been trained to predict static gene expression levels in a biological sample by using data from the same or similar samples, including data on their transcription factor binding locations, histone marks, or DNA sequence. We report on a different challenge—training machine learning models to predict which genes will respond to the perturbation of a transcription factor without using any data from the perturbed cells. We find that existing transcription factor location data (ChIP-seq) from human cells have very little detectable utility for predicting which genes will respond to perturbation of a transcription factor. Features of genes, including their preperturbation expression level and expression variation, are very useful for predicting responses to perturbation of any transcription factor. This shows that some genes are poised to respond to transcription factor perturbations and others are resistant, shedding light on why it has been so difficult to predict responses from binding locations. Certain histone marks, including H3K4me1 and H3K4me3, have some predictive power when located downstream of the transcription start site. However, the predictive power of histone marks is much less than that of gene expression level and expression variation. Sequence-based or epigenetic properties of genes strongly influence their tendency to respond to direct transcription factor perturbations, partially explaining the oft-noted difficulty of predicting responsiveness from transcription factor binding location data. These molecular features are largely reflected in and summarized by the gene’s expression level and expression variation. Code is available at https://github.com/BrentLab/TFPertRespExplainer.
format Online
Article
Text
id pubmed-9339286
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-93392862022-08-01 Predicting which genes will respond to transcription factor perturbations Kang, Yiming Jung, Wooseok J Brent, Michael R G3 (Bethesda) Investigation The ability to predict which genes will respond to the perturbation of a transcription factor serves as a benchmark for our systems-level understanding of transcriptional regulatory networks. In previous work, machine learning models have been trained to predict static gene expression levels in a biological sample by using data from the same or similar samples, including data on their transcription factor binding locations, histone marks, or DNA sequence. We report on a different challenge—training machine learning models to predict which genes will respond to the perturbation of a transcription factor without using any data from the perturbed cells. We find that existing transcription factor location data (ChIP-seq) from human cells have very little detectable utility for predicting which genes will respond to perturbation of a transcription factor. Features of genes, including their preperturbation expression level and expression variation, are very useful for predicting responses to perturbation of any transcription factor. This shows that some genes are poised to respond to transcription factor perturbations and others are resistant, shedding light on why it has been so difficult to predict responses from binding locations. Certain histone marks, including H3K4me1 and H3K4me3, have some predictive power when located downstream of the transcription start site. However, the predictive power of histone marks is much less than that of gene expression level and expression variation. Sequence-based or epigenetic properties of genes strongly influence their tendency to respond to direct transcription factor perturbations, partially explaining the oft-noted difficulty of predicting responsiveness from transcription factor binding location data. These molecular features are largely reflected in and summarized by the gene’s expression level and expression variation. Code is available at https://github.com/BrentLab/TFPertRespExplainer. Oxford University Press 2022-06-06 /pmc/articles/PMC9339286/ /pubmed/35666184 http://dx.doi.org/10.1093/g3journal/jkac144 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of Genetics Society of America. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Investigation
Kang, Yiming
Jung, Wooseok J
Brent, Michael R
Predicting which genes will respond to transcription factor perturbations
title Predicting which genes will respond to transcription factor perturbations
title_full Predicting which genes will respond to transcription factor perturbations
title_fullStr Predicting which genes will respond to transcription factor perturbations
title_full_unstemmed Predicting which genes will respond to transcription factor perturbations
title_short Predicting which genes will respond to transcription factor perturbations
title_sort predicting which genes will respond to transcription factor perturbations
topic Investigation
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9339286/
https://www.ncbi.nlm.nih.gov/pubmed/35666184
http://dx.doi.org/10.1093/g3journal/jkac144
work_keys_str_mv AT kangyiming predictingwhichgeneswillrespondtotranscriptionfactorperturbations
AT jungwooseokj predictingwhichgeneswillrespondtotranscriptionfactorperturbations
AT brentmichaelr predictingwhichgeneswillrespondtotranscriptionfactorperturbations