Cargando…

Identification of Nucleotide-Binding Sites in Protein Structures: A Novel Approach Based on Nucleotide Modularity

Nucleotides are involved in several cellular processes, ranging from the transmission of genetic information, to energy transfer and storage. Both sequence and structure based methods have been developed to predict the location of nucleotide-binding sites in proteins. Here we propose a novel methodo...

Descripción completa

Detalles Bibliográficos
Autores principales: Parca, Luca, Gherardini, Pier Federico, Truglio, Mauro, Mangone, Iolanda, Ferrè, Fabrizio, Helmer-Citterich, Manuela, Ausiello, Gabriele
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3507729/
https://www.ncbi.nlm.nih.gov/pubmed/23209685
http://dx.doi.org/10.1371/journal.pone.0050240
_version_ 1782251119537291264
author Parca, Luca
Gherardini, Pier Federico
Truglio, Mauro
Mangone, Iolanda
Ferrè, Fabrizio
Helmer-Citterich, Manuela
Ausiello, Gabriele
author_facet Parca, Luca
Gherardini, Pier Federico
Truglio, Mauro
Mangone, Iolanda
Ferrè, Fabrizio
Helmer-Citterich, Manuela
Ausiello, Gabriele
author_sort Parca, Luca
collection PubMed
description Nucleotides are involved in several cellular processes, ranging from the transmission of genetic information, to energy transfer and storage. Both sequence and structure based methods have been developed to predict the location of nucleotide-binding sites in proteins. Here we propose a novel methodology that leverages the observation that nucleotide-binding sites have a modular structure. Nucleotides are composed of identifiable fragments, i.e. the phosphate, the nucleobase and the carbohydrate moieties. These fragments are bound by specific structural motifs that recur in proteins of different fold. Moreover these motifs behave as modules and are found in different combinations across fold space. Our method predicts binding sites for each nucleotide fragment by comparing a query protein with a database of templates extracted from proteins of known structure. Whenever a similarity is found the fragment bound by the template is transferred on the query protein, thus identifying a putative binding site. Predictions falling inside the surface of the protein are discarded, and the remaining ones are scored using clustering and conservation. The method is able to rank as first a correct prediction in the 48%, 48% and 68% of the analyzed proteins for the nucleobase, carbohydrate and phosphate respectively, while considering the first five predictions the performances change to 71%, 65% and 86% respectively. Furthermore we attempted to reconstruct the full structure of the binding site, starting from the predicted positions of the fragments. We calculated that in the 59% of the analyzed proteins the method ranks as first a reconstructed binding site or a part of it. Finally we tested the reliability of our method in a real world case in which it has to predict nucleotide-binding sites in unbound proteins. We analyzed proteins whose structure has been solved with and without the nucleotide and observed only little variations in the method performance.
format Online
Article
Text
id pubmed-3507729
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-35077292012-12-03 Identification of Nucleotide-Binding Sites in Protein Structures: A Novel Approach Based on Nucleotide Modularity Parca, Luca Gherardini, Pier Federico Truglio, Mauro Mangone, Iolanda Ferrè, Fabrizio Helmer-Citterich, Manuela Ausiello, Gabriele PLoS One Research Article Nucleotides are involved in several cellular processes, ranging from the transmission of genetic information, to energy transfer and storage. Both sequence and structure based methods have been developed to predict the location of nucleotide-binding sites in proteins. Here we propose a novel methodology that leverages the observation that nucleotide-binding sites have a modular structure. Nucleotides are composed of identifiable fragments, i.e. the phosphate, the nucleobase and the carbohydrate moieties. These fragments are bound by specific structural motifs that recur in proteins of different fold. Moreover these motifs behave as modules and are found in different combinations across fold space. Our method predicts binding sites for each nucleotide fragment by comparing a query protein with a database of templates extracted from proteins of known structure. Whenever a similarity is found the fragment bound by the template is transferred on the query protein, thus identifying a putative binding site. Predictions falling inside the surface of the protein are discarded, and the remaining ones are scored using clustering and conservation. The method is able to rank as first a correct prediction in the 48%, 48% and 68% of the analyzed proteins for the nucleobase, carbohydrate and phosphate respectively, while considering the first five predictions the performances change to 71%, 65% and 86% respectively. Furthermore we attempted to reconstruct the full structure of the binding site, starting from the predicted positions of the fragments. We calculated that in the 59% of the analyzed proteins the method ranks as first a reconstructed binding site or a part of it. Finally we tested the reliability of our method in a real world case in which it has to predict nucleotide-binding sites in unbound proteins. We analyzed proteins whose structure has been solved with and without the nucleotide and observed only little variations in the method performance. Public Library of Science 2012-11-27 /pmc/articles/PMC3507729/ /pubmed/23209685 http://dx.doi.org/10.1371/journal.pone.0050240 Text en © 2012 Parca et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Parca, Luca
Gherardini, Pier Federico
Truglio, Mauro
Mangone, Iolanda
Ferrè, Fabrizio
Helmer-Citterich, Manuela
Ausiello, Gabriele
Identification of Nucleotide-Binding Sites in Protein Structures: A Novel Approach Based on Nucleotide Modularity
title Identification of Nucleotide-Binding Sites in Protein Structures: A Novel Approach Based on Nucleotide Modularity
title_full Identification of Nucleotide-Binding Sites in Protein Structures: A Novel Approach Based on Nucleotide Modularity
title_fullStr Identification of Nucleotide-Binding Sites in Protein Structures: A Novel Approach Based on Nucleotide Modularity
title_full_unstemmed Identification of Nucleotide-Binding Sites in Protein Structures: A Novel Approach Based on Nucleotide Modularity
title_short Identification of Nucleotide-Binding Sites in Protein Structures: A Novel Approach Based on Nucleotide Modularity
title_sort identification of nucleotide-binding sites in protein structures: a novel approach based on nucleotide modularity
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3507729/
https://www.ncbi.nlm.nih.gov/pubmed/23209685
http://dx.doi.org/10.1371/journal.pone.0050240
work_keys_str_mv AT parcaluca identificationofnucleotidebindingsitesinproteinstructuresanovelapproachbasedonnucleotidemodularity
AT gherardinipierfederico identificationofnucleotidebindingsitesinproteinstructuresanovelapproachbasedonnucleotidemodularity
AT trugliomauro identificationofnucleotidebindingsitesinproteinstructuresanovelapproachbasedonnucleotidemodularity
AT mangoneiolanda identificationofnucleotidebindingsitesinproteinstructuresanovelapproachbasedonnucleotidemodularity
AT ferrefabrizio identificationofnucleotidebindingsitesinproteinstructuresanovelapproachbasedonnucleotidemodularity
AT helmercitterichmanuela identificationofnucleotidebindingsitesinproteinstructuresanovelapproachbasedonnucleotidemodularity
AT ausiellogabriele identificationofnucleotidebindingsitesinproteinstructuresanovelapproachbasedonnucleotidemodularity