Cargando…

Predictive modeling of moonlighting DNA-binding proteins

Moonlighting proteins are multifunctional, single-polypeptide chains capable of performing multiple autonomous functions. Most moonlighting proteins have been discovered through work unrelated to their multifunctionality. We believe that prediction of moonlighting proteins from first principles, tha...

Descripción completa

Detalles Bibliográficos
Autores principales: Varghese, Dana Mary, Nussinov, Ruth, Ahmad, Shandar
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9716651/
https://www.ncbi.nlm.nih.gov/pubmed/36474806
http://dx.doi.org/10.1093/nargab/lqac091
_version_ 1784842737814601728
author Varghese, Dana Mary
Nussinov, Ruth
Ahmad, Shandar
author_facet Varghese, Dana Mary
Nussinov, Ruth
Ahmad, Shandar
author_sort Varghese, Dana Mary
collection PubMed
description Moonlighting proteins are multifunctional, single-polypeptide chains capable of performing multiple autonomous functions. Most moonlighting proteins have been discovered through work unrelated to their multifunctionality. We believe that prediction of moonlighting proteins from first principles, that is, using sequence, predicted structure, evolutionary profiles, and global gene expression profiles, for only one functional class of proteins in a single organism at a time will significantly advance our understanding of multifunctional proteins. In this work, we investigated human moonlighting DNA-binding proteins (mDBPs) in terms of properties that distinguish them from other (non-moonlighting) proteins with the same DNA-binding protein (DBP) function. Following a careful and comprehensive analysis of discriminatory features, a machine learning model was developed to assess the predictability of mDBPs from other DBPs (oDBPs). We observed that mDBPs can be discriminated from oDBPs with high accuracy of 74% AUC of ROC using these first principles features. A number of novel predicted mDBPs were found to have literature support for their being moonlighting and others are proposed as candidates, for which the moonlighting function is currently unknown. We believe that this work will help in deciphering and annotating novel moonlighting DBPs and scale up other functions. The source codes and data sets used for this work are freely available at https://zenodo.org/record/7299265#.Y2pO3ctBxPY
format Online
Article
Text
id pubmed-9716651
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-97166512022-12-05 Predictive modeling of moonlighting DNA-binding proteins Varghese, Dana Mary Nussinov, Ruth Ahmad, Shandar NAR Genom Bioinform Standard Article Moonlighting proteins are multifunctional, single-polypeptide chains capable of performing multiple autonomous functions. Most moonlighting proteins have been discovered through work unrelated to their multifunctionality. We believe that prediction of moonlighting proteins from first principles, that is, using sequence, predicted structure, evolutionary profiles, and global gene expression profiles, for only one functional class of proteins in a single organism at a time will significantly advance our understanding of multifunctional proteins. In this work, we investigated human moonlighting DNA-binding proteins (mDBPs) in terms of properties that distinguish them from other (non-moonlighting) proteins with the same DNA-binding protein (DBP) function. Following a careful and comprehensive analysis of discriminatory features, a machine learning model was developed to assess the predictability of mDBPs from other DBPs (oDBPs). We observed that mDBPs can be discriminated from oDBPs with high accuracy of 74% AUC of ROC using these first principles features. A number of novel predicted mDBPs were found to have literature support for their being moonlighting and others are proposed as candidates, for which the moonlighting function is currently unknown. We believe that this work will help in deciphering and annotating novel moonlighting DBPs and scale up other functions. The source codes and data sets used for this work are freely available at https://zenodo.org/record/7299265#.Y2pO3ctBxPY Oxford University Press 2022-12-02 /pmc/articles/PMC9716651/ /pubmed/36474806 http://dx.doi.org/10.1093/nargab/lqac091 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Standard Article
Varghese, Dana Mary
Nussinov, Ruth
Ahmad, Shandar
Predictive modeling of moonlighting DNA-binding proteins
title Predictive modeling of moonlighting DNA-binding proteins
title_full Predictive modeling of moonlighting DNA-binding proteins
title_fullStr Predictive modeling of moonlighting DNA-binding proteins
title_full_unstemmed Predictive modeling of moonlighting DNA-binding proteins
title_short Predictive modeling of moonlighting DNA-binding proteins
title_sort predictive modeling of moonlighting dna-binding proteins
topic Standard Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9716651/
https://www.ncbi.nlm.nih.gov/pubmed/36474806
http://dx.doi.org/10.1093/nargab/lqac091
work_keys_str_mv AT varghesedanamary predictivemodelingofmoonlightingdnabindingproteins
AT nussinovruth predictivemodelingofmoonlightingdnabindingproteins
AT ahmadshandar predictivemodelingofmoonlightingdnabindingproteins