Cargando…

Inferring TF activities and activity regulators from gene expression data with constraints from TF perturbation data

MOTIVATION: The activity of a transcription factor (TF) in a sample of cells is the extent to which it is exerting its regulatory potential. Many methods of inferring TF activity from gene expression data have been described, but due to the lack of appropriate large-scale datasets, systematic and ob...

Descripción completa

Detalles Bibliográficos
Autores principales: Ma, Cynthia Z, Brent, Michael R
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8189679/
https://www.ncbi.nlm.nih.gov/pubmed/33135076
http://dx.doi.org/10.1093/bioinformatics/btaa947
Descripción
Sumario:MOTIVATION: The activity of a transcription factor (TF) in a sample of cells is the extent to which it is exerting its regulatory potential. Many methods of inferring TF activity from gene expression data have been described, but due to the lack of appropriate large-scale datasets, systematic and objective validation has not been possible until now. RESULTS: We systematically evaluate and optimize the approach to TF activity inference in which a gene expression matrix is factored into a condition-independent matrix of control strengths and a condition-dependent matrix of TF activity levels. We find that expression data in which the activities of individual TFs have been perturbed are both necessary and sufficient for obtaining good performance. To a considerable extent, control strengths inferred using expression data from one growth condition carry over to other conditions, so the control strength matrices derived here can be used by others. Finally, we apply these methods to gain insight into the upstream factors that regulate the activities of yeast TFs Gcr2, Gln3, Gcn4 and Msn2. AVAILABILITY AND IMPLEMENTATION: Evaluation code and data are available at https://doi.org/10.5281/zenodo.4050573. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.