Cargando…

Clusters of acidic and hydrophobic residues can predict acidic transcriptional activation domains from protein sequence

Transcription factors activate gene expression in development, homeostasis, and stress with DNA binding domains and activation domains. Although there exist excellent computational models for predicting DNA binding domains from protein sequence, models for predicting activation domains from protein...

Descripción completa

Detalles Bibliográficos
Autores principales: Kotha, Sanjana R, Staller, Max Valentín
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10550315/
https://www.ncbi.nlm.nih.gov/pubmed/37462277
http://dx.doi.org/10.1093/genetics/iyad131
_version_ 1785115506473172992
author Kotha, Sanjana R
Staller, Max Valentín
author_facet Kotha, Sanjana R
Staller, Max Valentín
author_sort Kotha, Sanjana R
collection PubMed
description Transcription factors activate gene expression in development, homeostasis, and stress with DNA binding domains and activation domains. Although there exist excellent computational models for predicting DNA binding domains from protein sequence, models for predicting activation domains from protein sequence have lagged, particularly in metazoans. We recently developed a simple and accurate predictor of acidic activation domains on human transcription factors. Here, we show how the accuracy of this human predictor arises from the clustering of aromatic, leucine, and acidic residues, which together are necessary for acidic activation domain function. When we combine our predictor with the predictions of convolutional neural network (CNN) models trained in yeast, the intersection is more accurate than individual models, emphasizing that each approach carries orthogonal information. We synthesize these findings into a new set of activation domain predictions on human transcription factors.
format Online
Article
Text
id pubmed-10550315
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-105503152023-10-05 Clusters of acidic and hydrophobic residues can predict acidic transcriptional activation domains from protein sequence Kotha, Sanjana R Staller, Max Valentín Genetics Investigation Transcription factors activate gene expression in development, homeostasis, and stress with DNA binding domains and activation domains. Although there exist excellent computational models for predicting DNA binding domains from protein sequence, models for predicting activation domains from protein sequence have lagged, particularly in metazoans. We recently developed a simple and accurate predictor of acidic activation domains on human transcription factors. Here, we show how the accuracy of this human predictor arises from the clustering of aromatic, leucine, and acidic residues, which together are necessary for acidic activation domain function. When we combine our predictor with the predictions of convolutional neural network (CNN) models trained in yeast, the intersection is more accurate than individual models, emphasizing that each approach carries orthogonal information. We synthesize these findings into a new set of activation domain predictions on human transcription factors. Oxford University Press 2023-07-18 /pmc/articles/PMC10550315/ /pubmed/37462277 http://dx.doi.org/10.1093/genetics/iyad131 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of The Genetics Society of America. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Investigation
Kotha, Sanjana R
Staller, Max Valentín
Clusters of acidic and hydrophobic residues can predict acidic transcriptional activation domains from protein sequence
title Clusters of acidic and hydrophobic residues can predict acidic transcriptional activation domains from protein sequence
title_full Clusters of acidic and hydrophobic residues can predict acidic transcriptional activation domains from protein sequence
title_fullStr Clusters of acidic and hydrophobic residues can predict acidic transcriptional activation domains from protein sequence
title_full_unstemmed Clusters of acidic and hydrophobic residues can predict acidic transcriptional activation domains from protein sequence
title_short Clusters of acidic and hydrophobic residues can predict acidic transcriptional activation domains from protein sequence
title_sort clusters of acidic and hydrophobic residues can predict acidic transcriptional activation domains from protein sequence
topic Investigation
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10550315/
https://www.ncbi.nlm.nih.gov/pubmed/37462277
http://dx.doi.org/10.1093/genetics/iyad131
work_keys_str_mv AT kothasanjanar clustersofacidicandhydrophobicresiduescanpredictacidictranscriptionalactivationdomainsfromproteinsequence
AT stallermaxvalentin clustersofacidicandhydrophobicresiduescanpredictacidictranscriptionalactivationdomainsfromproteinsequence