Cargando…

Quantitative evaluation of protein–DNA interactions using an optimized knowledge-based potential

Computational evaluation of protein–DNA interaction is important for the identification of DNA-binding sites and genome annotation. It could validate the predicted binding motifs by sequence-based approaches through the calculation of the binding affinity between a protein and DNA. Such an evaluatio...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Zhijie, Mao, Fenglou, Guo, Jun-tao, Yan, Bo, Wang, Peng, Qu, Youxing, Xu, Ying
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2005
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC548349/
https://www.ncbi.nlm.nih.gov/pubmed/15673715
http://dx.doi.org/10.1093/nar/gki204
_version_ 1782122346992107520
author Liu, Zhijie
Mao, Fenglou
Guo, Jun-tao
Yan, Bo
Wang, Peng
Qu, Youxing
Xu, Ying
author_facet Liu, Zhijie
Mao, Fenglou
Guo, Jun-tao
Yan, Bo
Wang, Peng
Qu, Youxing
Xu, Ying
author_sort Liu, Zhijie
collection PubMed
description Computational evaluation of protein–DNA interaction is important for the identification of DNA-binding sites and genome annotation. It could validate the predicted binding motifs by sequence-based approaches through the calculation of the binding affinity between a protein and DNA. Such an evaluation should take into account structural information to deal with the complicated effects from DNA structural deformation, distance-dependent multi-body interactions and solvation contributions. In this paper, we present a knowledge-based potential built on interactions between protein residues and DNA tri-nucleotides. The potential, which explicitly considers the distance-dependent two-body, three-body and four-body interactions between protein residues and DNA nucleotides, has been optimized in terms of a Z-score. We have applied this knowledge-based potential to evaluate the binding affinities of zinc-finger protein–DNA complexes. The predicted binding affinities are in good agreement with the experimental data (with a correlation coefficient of 0.950). On a larger test set containing 48 protein–DNA complexes with known experimental binding free energies, our potential has achieved a high correlation coefficient of 0.800, when compared with the experimental data. We have also used this potential to identify binding motifs in DNA sequences of transcription factors (TF). The TFs in 79.4% of the known TF–DNA complexes have accurately found their native binding sequences from a large pool of DNA sequences. When tested in a genome-scale search for TF-binding motifs of the cyclic AMP regulatory protein (CRP) of Escherichia coli, this potential ranks all known binding motifs of CRP in the top 15% of all candidate sequences.
format Text
id pubmed-548349
institution National Center for Biotechnology Information
language English
publishDate 2005
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-5483492005-02-10 Quantitative evaluation of protein–DNA interactions using an optimized knowledge-based potential Liu, Zhijie Mao, Fenglou Guo, Jun-tao Yan, Bo Wang, Peng Qu, Youxing Xu, Ying Nucleic Acids Res Article Computational evaluation of protein–DNA interaction is important for the identification of DNA-binding sites and genome annotation. It could validate the predicted binding motifs by sequence-based approaches through the calculation of the binding affinity between a protein and DNA. Such an evaluation should take into account structural information to deal with the complicated effects from DNA structural deformation, distance-dependent multi-body interactions and solvation contributions. In this paper, we present a knowledge-based potential built on interactions between protein residues and DNA tri-nucleotides. The potential, which explicitly considers the distance-dependent two-body, three-body and four-body interactions between protein residues and DNA nucleotides, has been optimized in terms of a Z-score. We have applied this knowledge-based potential to evaluate the binding affinities of zinc-finger protein–DNA complexes. The predicted binding affinities are in good agreement with the experimental data (with a correlation coefficient of 0.950). On a larger test set containing 48 protein–DNA complexes with known experimental binding free energies, our potential has achieved a high correlation coefficient of 0.800, when compared with the experimental data. We have also used this potential to identify binding motifs in DNA sequences of transcription factors (TF). The TFs in 79.4% of the known TF–DNA complexes have accurately found their native binding sequences from a large pool of DNA sequences. When tested in a genome-scale search for TF-binding motifs of the cyclic AMP regulatory protein (CRP) of Escherichia coli, this potential ranks all known binding motifs of CRP in the top 15% of all candidate sequences. Oxford University Press 2005 2005-01-26 /pmc/articles/PMC548349/ /pubmed/15673715 http://dx.doi.org/10.1093/nar/gki204 Text en © The Author 2005. Published by Oxford University Press. All rights reserved
spellingShingle Article
Liu, Zhijie
Mao, Fenglou
Guo, Jun-tao
Yan, Bo
Wang, Peng
Qu, Youxing
Xu, Ying
Quantitative evaluation of protein–DNA interactions using an optimized knowledge-based potential
title Quantitative evaluation of protein–DNA interactions using an optimized knowledge-based potential
title_full Quantitative evaluation of protein–DNA interactions using an optimized knowledge-based potential
title_fullStr Quantitative evaluation of protein–DNA interactions using an optimized knowledge-based potential
title_full_unstemmed Quantitative evaluation of protein–DNA interactions using an optimized knowledge-based potential
title_short Quantitative evaluation of protein–DNA interactions using an optimized knowledge-based potential
title_sort quantitative evaluation of protein–dna interactions using an optimized knowledge-based potential
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC548349/
https://www.ncbi.nlm.nih.gov/pubmed/15673715
http://dx.doi.org/10.1093/nar/gki204
work_keys_str_mv AT liuzhijie quantitativeevaluationofproteindnainteractionsusinganoptimizedknowledgebasedpotential
AT maofenglou quantitativeevaluationofproteindnainteractionsusinganoptimizedknowledgebasedpotential
AT guojuntao quantitativeevaluationofproteindnainteractionsusinganoptimizedknowledgebasedpotential
AT yanbo quantitativeevaluationofproteindnainteractionsusinganoptimizedknowledgebasedpotential
AT wangpeng quantitativeevaluationofproteindnainteractionsusinganoptimizedknowledgebasedpotential
AT quyouxing quantitativeevaluationofproteindnainteractionsusinganoptimizedknowledgebasedpotential
AT xuying quantitativeevaluationofproteindnainteractionsusinganoptimizedknowledgebasedpotential