Cargando…

Sharing DNA-binding information across structurally similar proteins enables accurate specificity determination

We are now in an era where protein–DNA interactions have been experimentally assayed for thousands of DNA-binding proteins. In order to infer DNA-binding specificities from these data, numerous sophisticated computational methods have been developed. These approaches typically infer DNA-binding spec...

Descripción completa

Detalles Bibliográficos
Autores principales: Wetzel, Joshua L, Singh, Mona
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7028011/
https://www.ncbi.nlm.nih.gov/pubmed/31777934
http://dx.doi.org/10.1093/nar/gkz1087
_version_ 1783498938876166144
author Wetzel, Joshua L
Singh, Mona
author_facet Wetzel, Joshua L
Singh, Mona
author_sort Wetzel, Joshua L
collection PubMed
description We are now in an era where protein–DNA interactions have been experimentally assayed for thousands of DNA-binding proteins. In order to infer DNA-binding specificities from these data, numerous sophisticated computational methods have been developed. These approaches typically infer DNA-binding specificities by considering interactions for each protein independently, ignoring related and potentially valuable interaction information across other proteins that bind DNA via the same structural domain. Here we introduce a framework for inferring DNA-binding specificities by considering protein–DNA interactions for entire groups of structurally similar proteins simultaneously. We devise both constrained optimization and label propagation algorithms for this task, each balancing observations at the individual protein level against dataset-wide consistency of interaction preferences. We test our approaches on two large, independent Cys(2)His(2) zinc finger protein–DNA interaction datasets. We demonstrate that jointly inferring specificities within each dataset individually dramatically improves accuracy, leading to increased agreement both between these two datasets and with a fixed external standard. Overall, our results suggest that sharing protein–DNA interaction information across structurally similar proteins is a powerful means to enable accurate inference of DNA-binding specificities.
format Online
Article
Text
id pubmed-7028011
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-70280112020-02-25 Sharing DNA-binding information across structurally similar proteins enables accurate specificity determination Wetzel, Joshua L Singh, Mona Nucleic Acids Res Methods Online We are now in an era where protein–DNA interactions have been experimentally assayed for thousands of DNA-binding proteins. In order to infer DNA-binding specificities from these data, numerous sophisticated computational methods have been developed. These approaches typically infer DNA-binding specificities by considering interactions for each protein independently, ignoring related and potentially valuable interaction information across other proteins that bind DNA via the same structural domain. Here we introduce a framework for inferring DNA-binding specificities by considering protein–DNA interactions for entire groups of structurally similar proteins simultaneously. We devise both constrained optimization and label propagation algorithms for this task, each balancing observations at the individual protein level against dataset-wide consistency of interaction preferences. We test our approaches on two large, independent Cys(2)His(2) zinc finger protein–DNA interaction datasets. We demonstrate that jointly inferring specificities within each dataset individually dramatically improves accuracy, leading to increased agreement both between these two datasets and with a fixed external standard. Overall, our results suggest that sharing protein–DNA interaction information across structurally similar proteins is a powerful means to enable accurate inference of DNA-binding specificities. Oxford University Press 2020-01-24 2019-11-28 /pmc/articles/PMC7028011/ /pubmed/31777934 http://dx.doi.org/10.1093/nar/gkz1087 Text en © The Author(s) 2019. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
Wetzel, Joshua L
Singh, Mona
Sharing DNA-binding information across structurally similar proteins enables accurate specificity determination
title Sharing DNA-binding information across structurally similar proteins enables accurate specificity determination
title_full Sharing DNA-binding information across structurally similar proteins enables accurate specificity determination
title_fullStr Sharing DNA-binding information across structurally similar proteins enables accurate specificity determination
title_full_unstemmed Sharing DNA-binding information across structurally similar proteins enables accurate specificity determination
title_short Sharing DNA-binding information across structurally similar proteins enables accurate specificity determination
title_sort sharing dna-binding information across structurally similar proteins enables accurate specificity determination
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7028011/
https://www.ncbi.nlm.nih.gov/pubmed/31777934
http://dx.doi.org/10.1093/nar/gkz1087
work_keys_str_mv AT wetzeljoshual sharingdnabindinginformationacrossstructurallysimilarproteinsenablesaccuratespecificitydetermination
AT singhmona sharingdnabindinginformationacrossstructurallysimilarproteinsenablesaccuratespecificitydetermination