Cargando…
Predictive modeling of moonlighting DNA-binding proteins
Moonlighting proteins are multifunctional, single-polypeptide chains capable of performing multiple autonomous functions. Most moonlighting proteins have been discovered through work unrelated to their multifunctionality. We believe that prediction of moonlighting proteins from first principles, tha...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9716651/ https://www.ncbi.nlm.nih.gov/pubmed/36474806 http://dx.doi.org/10.1093/nargab/lqac091 |
_version_ | 1784842737814601728 |
---|---|
author | Varghese, Dana Mary Nussinov, Ruth Ahmad, Shandar |
author_facet | Varghese, Dana Mary Nussinov, Ruth Ahmad, Shandar |
author_sort | Varghese, Dana Mary |
collection | PubMed |
description | Moonlighting proteins are multifunctional, single-polypeptide chains capable of performing multiple autonomous functions. Most moonlighting proteins have been discovered through work unrelated to their multifunctionality. We believe that prediction of moonlighting proteins from first principles, that is, using sequence, predicted structure, evolutionary profiles, and global gene expression profiles, for only one functional class of proteins in a single organism at a time will significantly advance our understanding of multifunctional proteins. In this work, we investigated human moonlighting DNA-binding proteins (mDBPs) in terms of properties that distinguish them from other (non-moonlighting) proteins with the same DNA-binding protein (DBP) function. Following a careful and comprehensive analysis of discriminatory features, a machine learning model was developed to assess the predictability of mDBPs from other DBPs (oDBPs). We observed that mDBPs can be discriminated from oDBPs with high accuracy of 74% AUC of ROC using these first principles features. A number of novel predicted mDBPs were found to have literature support for their being moonlighting and others are proposed as candidates, for which the moonlighting function is currently unknown. We believe that this work will help in deciphering and annotating novel moonlighting DBPs and scale up other functions. The source codes and data sets used for this work are freely available at https://zenodo.org/record/7299265#.Y2pO3ctBxPY |
format | Online Article Text |
id | pubmed-9716651 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-97166512022-12-05 Predictive modeling of moonlighting DNA-binding proteins Varghese, Dana Mary Nussinov, Ruth Ahmad, Shandar NAR Genom Bioinform Standard Article Moonlighting proteins are multifunctional, single-polypeptide chains capable of performing multiple autonomous functions. Most moonlighting proteins have been discovered through work unrelated to their multifunctionality. We believe that prediction of moonlighting proteins from first principles, that is, using sequence, predicted structure, evolutionary profiles, and global gene expression profiles, for only one functional class of proteins in a single organism at a time will significantly advance our understanding of multifunctional proteins. In this work, we investigated human moonlighting DNA-binding proteins (mDBPs) in terms of properties that distinguish them from other (non-moonlighting) proteins with the same DNA-binding protein (DBP) function. Following a careful and comprehensive analysis of discriminatory features, a machine learning model was developed to assess the predictability of mDBPs from other DBPs (oDBPs). We observed that mDBPs can be discriminated from oDBPs with high accuracy of 74% AUC of ROC using these first principles features. A number of novel predicted mDBPs were found to have literature support for their being moonlighting and others are proposed as candidates, for which the moonlighting function is currently unknown. We believe that this work will help in deciphering and annotating novel moonlighting DBPs and scale up other functions. The source codes and data sets used for this work are freely available at https://zenodo.org/record/7299265#.Y2pO3ctBxPY Oxford University Press 2022-12-02 /pmc/articles/PMC9716651/ /pubmed/36474806 http://dx.doi.org/10.1093/nargab/lqac091 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Standard Article Varghese, Dana Mary Nussinov, Ruth Ahmad, Shandar Predictive modeling of moonlighting DNA-binding proteins |
title | Predictive modeling of moonlighting DNA-binding proteins |
title_full | Predictive modeling of moonlighting DNA-binding proteins |
title_fullStr | Predictive modeling of moonlighting DNA-binding proteins |
title_full_unstemmed | Predictive modeling of moonlighting DNA-binding proteins |
title_short | Predictive modeling of moonlighting DNA-binding proteins |
title_sort | predictive modeling of moonlighting dna-binding proteins |
topic | Standard Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9716651/ https://www.ncbi.nlm.nih.gov/pubmed/36474806 http://dx.doi.org/10.1093/nargab/lqac091 |
work_keys_str_mv | AT varghesedanamary predictivemodelingofmoonlightingdnabindingproteins AT nussinovruth predictivemodelingofmoonlightingdnabindingproteins AT ahmadshandar predictivemodelingofmoonlightingdnabindingproteins |